nikolaospapachristou

💭

I may be slow to respond.

Nikolaos Papachristou nikolaospapachristou

💭

I may be slow to respond.

Quality, Data Scientist, Senior Manager

96 followers · 1.7k following

https://scholar.google.co.uk/citations?hl=en&user=hjlvCIQAAAAJ

Achievements

Starred repositories

patchy631 / ai-engineering-hub

Jupyter Notebook 2,493 520 Updated Jan 30, 2025

youssefHosni / Hands-On-LLM-Fine-Tuning

Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques

129 27 Updated Oct 27, 2024

explosion / spacy-layout

📚 Process PDFs, Word documents and more with spaCy

Python 361 17 Updated Dec 24, 2024

All-Hands-AI / openhands-resolver

A system that tries to resolve all issues on a github repo with OpenHands.

Python 98 22 Updated Nov 18, 2024

promptslab / Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Jupyter Notebook 3,383 252 Updated Mar 15, 2024

Forethought-Technologies / AutoChain

AutoChain: Build lightweight, extensible, and testable LLM Agents

Python 1,825 98 Updated May 23, 2024

melih-unsal / DemoGPT

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

Python 1,762 204 Updated Nov 4, 2024

ennucore / clippinator

AI programming assistant

Python 365 42 Updated May 5, 2024

agentic-ai / enact

A framework for generative software.

Python 104 15 Updated Jan 21, 2025

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 25,615 3,463 Updated Jan 30, 2025

langchain-ai / langgraph

Build resilient language agents as graphs.

Python 8,499 1,372 Updated Jan 30, 2025

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,502 5,636 Updated Jan 30, 2025

microsoft / semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 22,868 3,451 Updated Jan 30, 2025

aws-samples / patient-matching-of-clinical-trials-using-generative-ai

HTML 14 4 Updated Nov 23, 2024

denser-org / denser-chat

Chat with PDF files with source highlights

Python 125 13 Updated Dec 6, 2024

ExtractTable / ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

Python 270 34 Updated Jul 30, 2024

BobLd / DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

C# 599 67 Updated Oct 1, 2023

microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,446 267 Updated Jun 24, 2024

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,320 563 Updated Jan 30, 2025

xavctn / img2table

img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing

Python 641 85 Updated Jan 28, 2025

cyclotruc / gitingest

Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase

Python 5,306 417 Updated Jan 24, 2025

comet-ml / opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 4,727 308 Updated Jan 30, 2025

Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,923 830 Updated Jan 30, 2025

castorini / hedwig

PyTorch deep learning models for document classification

Python 593 126 Updated Jul 21, 2023

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,649 2,589 Updated Jan 7, 2025

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,037 7,964 Updated Jan 29, 2025