Anyparser TypeScript SDK for RAG/ETL Pipelines - File Content Extraction. Supports extraction from a range of sources, including PDF, Microsoft Office documents, images (OCR), audio (audio to text), and websites.
crawler ocr microsoft-word web-crawler text-extraction artificial-intelligence knowledgebase ms-office microsoft-office etl-pipeline rag pdf-extraction n8n-nodes langchain retrieval-augmented-generation graph-rag cache-augmented-generation anyparser
Updated Feb 26, 2025 - TypeScript