天枢 - 企业级 AI 一站式数据预处理平台 | PDF/Office转Markdown | 支持MCP协议AI助手集成 | Vue3+FastAPI全栈方案 | 文档解析 | 多模态信息提取
-
Updated
Dec 16, 2025 - Python
天枢 - 企业级 AI 一站式数据预处理平台 | PDF/Office转Markdown | 支持MCP协议AI助手集成 | Vue3+FastAPI全栈方案 | 文档解析 | 多模态信息提取
A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
📱 A React app to preview and edit Markdown✍. You can also export it as HTML.
[Required for large models] Office to Markdown service implementation, based on Microsoft Markitdown.
Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.
High-performance Python Excel processing library with advanced conversion capabilities
A URL Fetch Gemini Processor to be used with Gemini's genai-processors
DocuGenius 是一个专业的 VSCode 插件,专门为使用 AI 编程工具的产品经理设计。它能够将你的 Word、Excel、PowerPoint 和 PDF 文件转换为 AI 友好的 Markdown 格式,让 Trae AI、CodeBuddy、Qoder等 AI 编程工具能够直接理解和处理你的业务文档。
📄 Professional MCP server for converting 29+ file formats to Markdown - Perfect for Claude Desktop and AI workflows!
simplified and containerized version of MarkItDown running as a FastAPI service, with a RESTful API for file-to-Markdown conversion.
CV Matcher is a Python-based application that helps analyze resumes and match them against job descriptions. It provides both CLI and server-based interfaces for resume analysis.
Simple FastAPI wrapper for Document-to-Markdown conversion using Microsoft's MarkItDown library.
convert a file to a markdown file
📄 Extract detailed text, tables, and layout data from machine-generated PDFs with ease using pdfplumber, built on pdfminer.six for reliable results.
📄 Convert 29+ file formats to clean Markdown using the Model Context Protocol for seamless integration with AI workflows.
AI-powered document processing tool with smart extraction, OCR, and intelligent content analysis
PDF extraction samples comparing Azure Document Intelligence (layout model) 🏢 vs Markitdown ✍️vs Apache Tika
A collection of Model Context Protocol (MCP) servers enabling AI assistants to securely access and interact with local files, Gmail, and web content, facilitating integration with MCP-compatible applications.
Add a description, image, and links to the markitdown topic page so that developers can more easily learn about it.
To associate your repository with the markitdown topic, visit your repo's landing page and select "manage topics."