A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装，同时附带本地的文本处理(提升PDF在RAG中的召回率)。

Python 227 13 Updated Feb 19, 2025

fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,403 541 Updated May 16, 2024

raphael-baena / DTLR

Handwritten Text Recognition and Character Detection

Python 130 13 Updated Nov 6, 2024

ZZZHANG-jx / DocRes

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 377 39 Updated Jan 28, 2025

chatmcp / mcp-server-chatsum

Query and Summarize your chat messages.

TypeScript 450 40 Updated Dec 4, 2024

PFCCLab / PPOCRLabel

PPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.

Python 158 45 Updated Feb 15, 2025

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,394 1,045 Updated Sep 26, 2024

dailenson / SDT

This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR 2023)

Python 1,106 92 Updated Nov 26, 2024

SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

177 6 Updated Dec 9, 2024

shannanyinxiang / UPOCR

Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)

Python 46 4 Updated Jun 6, 2024

BobH233 / BITSuperGPT-client

一个能够在本地共享Plus账号，避免OpenAI标记降智前提下，使用原生网页版ChatGPT Plus的项目

JavaScript 81 2 Updated Feb 12, 2025

shannanyinxiang / PageNet

Official implementation of PageNet (IJCV 2022)

Python 80 11 Updated Oct 31, 2022

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 861 63 Updated Jan 16, 2025

serengil / deepface-react-ui

Deep Face Recognition UI With ReactJS

JavaScript 90 17 Updated Jan 4, 2025

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 17,848 2,476 Updated Feb 20, 2025

ZhaoJ9014 / face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Python 3,487 758 Updated Dec 23, 2022

lumina-ai-inc / chunkr

Vision model based document ingestion

Rust 1,664 85 Updated Feb 21, 2025

ohayonguy / PMRF

[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

Python 598 36 Updated Feb 5, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,938 607 Updated Feb 10, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 488 38 Updated Feb 23, 2025

Zeyi-Lin / HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

shaohua.zhang BeyondYourself

Lists (32)

GPT

✨ Inspiration

LLM相关工具

OCR相关库

SD

Sora类

TV

其他工具

分割

分类

办公工具

动作捕捉

印章生成

多模态

大模型

大模型部署

学习集

提词相关

教程

数字人

数据集

杂货铺

标注工具

框架

检测

模型库

消除工具库

深度学习相关工具库

算法刷

表格类

语音TTS

面试

Starred repositories

digital-human

table-extraction

table-detection

mobilenet-v2

table-recognition