Skip to content
View evanfebrianto's full-sized avatar
😃
😃

Block or report evanfebrianto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 7,611 728 Updated Mar 28, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 21,427 2,718 Updated Apr 26, 2025

Medium article "Clean Architecture with Python"

Python 56 11 Updated Jan 25, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 27,569 5,633 Updated Apr 12, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 24,536 1,542 Updated Apr 24, 2025

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

Jupyter Notebook 3,941 476 Updated Feb 12, 2025

A very quick project that transforms research papers into engaging three-person discussions, offering an intuitive and thought-provoking listening experience. Perfect for podcast enthusiasts seekin…

Python 565 69 Updated Dec 9, 2024

Get your documents ready for gen AI

Python 28,335 1,735 Updated Apr 25, 2025

PPOCRLabelv2 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.

Python 192 50 Updated Mar 21, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 32,231 2,564 Updated Apr 25, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 34,319 3,167 Updated Apr 26, 2025

CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code

Go 13,489 1,739 Updated Apr 16, 2025

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 422 50 Updated Jan 28, 2025

An open-source RAG-based tool for chatting with your documents.

Python 22,088 1,743 Updated Apr 15, 2025

A feature-rich command-line audio/video downloader

Python 109,340 8,585 Updated Apr 25, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 51,359 1,447 Updated Apr 25, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,304 1,397 Updated Mar 3, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,243 1,128 Updated Apr 24, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,491 654 Updated Feb 10, 2025

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,148 2,114 Updated Apr 23, 2025

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 135 11 Updated Jan 11, 2025

A Repo For Document AI

Python 2,800 154 Updated Apr 10, 2025

🙌 OpenHands: Code Less, Make More

Python 53,456 5,970 Updated Apr 26, 2025

Chat first code editor. To download the packaged app:

TypeScript 5,407 368 Updated Nov 14, 2024

Investment Research for Everyone, Everywhere.

Python 41,096 3,655 Updated Apr 25, 2025

Data processing with ML, LLM and Vision LLM

Python 4,489 452 Updated Apr 20, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 56,177 6,164 Updated Apr 26, 2025

virtual home staging code

Python 16 2 Updated Apr 5, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 39,590 5,023 Updated Aug 16, 2024

Inference and training library for high-quality TTS models.

Python 5,213 550 Updated Dec 10, 2024
Next