Stars
This project is a real-time visualization of a network recognizing digits from the user's input.
This repository contains the Hugging Face Agents Course.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
A Chrome extension for asking questions over websites
Fully open reproduction of DeepSeek-R1
🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.
Convert PDF to markdown + JSON quickly with high accuracy
A Python program that turns an LLM, running on Ollama, into an automated researcher, which will, with a single query, determine focus areas to investigate, do web searches, and scrape content from vari…
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A community-maintained Python framework for creating mathematical animations.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
This project implements a state-of-the-art solution for de-identifying audio data using Fully Homomorphic Encryption (FHE). It allows for privacy-preserving processing of voice data by securely ano…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Open source PII detection and anonymization tool: easy-to-use, configurable, and extensible
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Large Action Model framework to develop AI Web Agents
An open-source RAG-based tool for chatting with your documents.
Real-time face swap and one-click video deepfake with only a single image
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
LLM Frontend for Power Users.
Official inference repo for FLUX.1 models
A modular graph-based Retrieval-Augmented Generation (RAG) system
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claud…