Skip to content

opendatalab/MinerU-Ecosystem

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

216 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MinerU-Ecosystem

The official ecosystem toolkit for MinerU Open API

Empowering developers and AI agents with seamless document parsing capabilities.

License MinerU Online

中文文档


📖 Overview

MinerU-Ecosystem provides a full suite of tools, SDKs, and integrations built on top of the MinerU Open API. Whether you're building production pipelines, integrating with LangChain for RAG, or enabling AI agents to parse documents on the fly — this repository has you covered.

MinerU is an open-source, high-quality document extraction tool that converts unstructured documents (PDFs, images, Office files, etc.) into machine-readable Markdown and JSON.

🏗️ Repository Structure

MinerU-Ecosystem/
├── cli/                  # Command-line tool for document parsing
├── sdk/                  # Multi-language SDKs
│   ├── python/           #   Python SDK
│   ├── go/               #   Go SDK
│   └── typescript/       #   TypeScript SDK
├── langchain_mineru/     # LangChain document loader integration
├── mcp/                  # Model Context Protocol server (Python)
└── skills/               # AI agent skills (Claude Code, OpenClaw, etc.)

🔑 Supported APIs

All components in this repository support both API modes:

Comparison 🎯 Precision Extract API ⚡ Quick Parse API (Agent-Oriented)
Auth ✅ Token required ❌ Not required (IP rate-limited)
Model Versions pipeline (default) / vlm (recommended) / MinerU-HTML Fixed lightweight pipeline model
Table / Formula Recognition ✅ Supported (configurable) ❌ Disabled (speed-first)
File Size Limit ≤ 200 MB ≤ 10 MB
Page Limit ≤ 600 pages ≤ 20 pages
Batch Support ✅ Supported (≤ 200 files) ❌ Single file only
Output Formats Markdown, JSON, Zip; optional export to DOCX / HTML / LaTeX Markdown only

🚀 Quick Start

CLI (cli/)

A fast command-line tool for parsing documents directly from your terminal. Supports both Standard API and Quick Parse API.

Installation

Windows (PowerShell)

irm https://cdn-mineru.openxlab.org.cn/open-api-cli/install.ps1 | iex

Linux / macOS (Shell)

curl -fsSL https://cdn-mineru.openxlab.org.cn/open-api-cli/install.sh | sh

Usage

1. Flash Extract (no login, fast, Markdown only)

Great for quick previews. No Token needed. Limit: 10 MB / 20 pages per file.

mineru-open-api flash-extract report.pdf

2. Precision Extract (login required)

Supports large documents (200 MB / 600 pages), preserves layout and resources, multiple output formats.

# First-time setup: configure Token (or set MINERU_TOKEN env var)
mineru-open-api auth

# Extract and print Markdown to stdout
mineru-open-api extract paper.pdf

# Extract and save all resources (images/tables) to a directory
mineru-open-api extract report.pdf -o ./output/

# Export to other formats
mineru-open-api extract report.pdf -f docx,latex,html -o ./results/

3. Web Crawl

Convert web pages into high-quality Markdown.

mineru-open-api crawl https://www.example.com

4. Batch Processing

# Batch process all PDFs in the current directory
mineru-open-api extract *.pdf -o ./results/

# Batch process from a file list
mineru-open-api extract --list filelist.txt -o ./results/

Python SDK

Installation

pip install mineru-open-sdk

Usage

1. Flash Extract (no login, Markdown only)

Great for quick previews. No Token needed.

from mineru import MinerU

# Flash mode requires no Token
client = MinerU()
result = client.flash_extract("https://cdn-mineru.openxlab.org.cn/demo/example.pdf")

print(result.markdown)

2. Precision Extract (login required)

Supports large files, rich assets (images/tables), and multiple output formats.

from mineru import MinerU

# Get a free Token from https://mineru.net
client = MinerU("your-api-token")
result = client.extract("https://cdn-mineru.openxlab.org.cn/demo/example.pdf")

print(result.markdown)
print(result.images)  # Get the list of extracted images

Multi-language SDKs are also available: Go | TypeScript. See the sdk/ directory for details.

AI Agent Skills (skills/)

Pre-built skill for AI coding agents, enabling document extraction directly within agent workflows. The skill is wrapper by the mineru-open-api CLI and provides:

Skills Download

  • OpenClawView skill details on ClawHub
  • CDN Link — One-click download skill package
  • Other AI agents like zeroclaw that also support skill/tool interfaces

MCP Server (mcp/)

A Model Context Protocol server implementation in Python, allowing MCP-compatible AI clients (such as Claude) to use MinerU's document parsing as a tool.

Configuration

Using uvx (recommended — always runs the latest version):

{
  "mcpServers": {
    "mineru": {
      "command": "uvx",
      "args": ["mineru-open-mcp"],
      "env": {
        "MINERU_API_TOKEN": "your_key_here"
      }
    }
  }
}

LangChain Integration (langchain_mineru/)

A LangChain Document Loader that turns PDFs and documents into LangChain-compatible Document objects with one line of code — ready to plug into RAG pipelines.

Installation

pip install langchain-mineru

Usage

from langchain_mineru import MinerULoader

loader = MinerULoader(source="demo.pdf")
docs = loader.load()

print(docs[0].page_content[:500])
print(docs[0].metadata)

Default is mode="flash" (no API token required). Switch to mode="precision" for higher fidelity with token auth.

Two parsing modes are available:

See the full documentation and RAG pipeline examples in langchain_mineru/.

📚 Documentation

Resource Link
MinerU Open API Docs mineru.net/apiManage/docs
MinerU Online Demo mineru.net/OpenSourceTools/Extractor
MinerU Open Source Project github.com/opendatalab/MinerU

📄 License

This project is licensed under the Apache License 2.0.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors