Stars
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Open-source no-code web data extraction platform. Turn websites to APIs & spreadsheets with no-code robots in minutes.
A system for agentic LLM-powered data processing and ETL
Twitter Automation Framework without using Twitter's official API.
Implementation of a discord channel scraper to generate datasets.
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
Command-line interface (CLI) program for downloading 10-K, 10-K/A, 10-Q, 10-Q/A filings from the SEC EDGAR database.
A Collection of BM25 Algorithms in Python
Retrieval and Retrieval-augmented LLMs
Finetune Llama 3.3, DeepSeek-R1, Mistral, Phi-4 & Gemma 2 LLMs 2-5x faster with 70% less memory
2024! X / Twitter API scrapper with authorization support. Allows you to scrape search results, User's profiles (followers/following), Tweets (favoriters/retweeters) and more.
Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.
The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
A modular graph-based Retrieval-Augmented Generation (RAG) system
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,Agents.
Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se…
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
converts url content into JSON with a simple prefix
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
Helper tools to analyze the " Financial Statement Data Sets" from the U.S. securities and exchange commission (sec.gov)
Projects from my TailwindCSS course
Top2Vec learns jointly embedded topic, document and word vectors.
Modern JavaScript Tutorial