Starred repositories
A curated list of awesome .cursorrules files
A Python library for reading and writing PDF, powered by QPDF
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
Flexpilot - Open-Source, Native and a True GitHub Copilot Alternative for VS Code
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at sc…
Vision infrastructure to turn complex documents into RAG/LLM-ready data
Proxy server to bypass Cloudflare protection
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.
Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-paragraphs. Full support for scans and images.
Pixeltable - AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Sycamore is an LLM-powered search and analytics platform for unstructured data.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.
A machine learning software for extracting information from scholarly documents
Implementation of Nougat Neural Optical Understanding for Academic Documents
Versatile agents for long running, research intensive tasks.
OpenUI lets you describe UI using your imagination, then see it rendered live.
Developer APIs to Accelerate LLM Projects