nikolaospapachristou

Starred repositories

Jupyter Notebook 2,493 520 Updated Jan 30, 2025

Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques

129 27 Updated Oct 27, 2024

📚 Process PDFs, Word documents and more with spaCy

Python 361 17 Updated Dec 24, 2024

A system that tries to resolve all issues on a GitHub repo with OpenHands.

Python 98 22 Updated Nov 18, 2024

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Jupyter Notebook 3,383 252 Updated Mar 15, 2024

AutoChain: Build lightweight, extensible, and testable LLM Agents

Python 1,825 98 Updated May 23, 2024

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

Python 1,762 204 Updated Nov 4, 2024

AI programming assistant

Python 365 42 Updated May 5, 2024

A framework for generative software.

Python 104 15 Updated Jan 21, 2025

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 25,615 3,463 Updated Jan 30, 2025

Build resilient language agents as graphs.

Python 8,499 1,372 Updated Jan 30, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,502 5,636 Updated Jan 30, 2025

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 22,868 3,451 Updated Jan 30, 2025

Chat with PDF files with source highlights

Python 125 13 Updated Dec 6, 2024

Python library to extract tabular data from images and scanned PDFs

Python 270 34 Updated Jul 30, 2024

Document Layout Analysis resources repos for development with PdfPig.

C# 599 67 Updated Oct 1, 2023

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,446 267 Updated Jun 24, 2024

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,320 563 Updated Jan 30, 2025

img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing

Python 641 85 Updated Jan 28, 2025

Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase

Python 5,306 417 Updated Jan 24, 2025

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 4,727 308 Updated Jan 30, 2025

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,923 830 Updated Jan 30, 2025

PyTorch deep learning models for document classification

Python 593 126 Updated Jul 21, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,649 2,589 Updated Jan 7, 2025

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,037 7,964 Updated Jan 29, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 25,346 3,232 Updated Sep 24, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 64,149 9,663 Updated Jan 17, 2025

Provides a simple and efficient way to interact with the LLMWhisperer API

Python 10 1 Updated Nov 5, 2024

Python tool for converting files and office documents to Markdown.

Python 35,902 1,604 Updated Jan 24, 2025