Showing 21-24 of 24 packages
Comprehensive document processing pipeline for Node.js - PDF, DOCX, HTML, Markdown parsing with intelligent chunking, table/image extraction, and OCR
PDF Parser
Basic PDF Xref parser
PDF file parser that converts PDF binaries to text based JSON, powered by porting a fork of PDF.JS to Node.js