Stars
Janus-Series: Unified Multimodal Understanding and Generation Models
An annotated implementation of the Transformer paper.
Lightweight and portable LLM sandbox runtime (code interpreter) Python library.
Make websites accessible for AI agents
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Fully local web research and report writing assistant
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
A Python package that makes it easy for developers to create AI apps powered by various AI providers.
Search-o1: Agentic Search-Enhanced Large Reasoning Models
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
A 15TB Collection of Physics Simulation Datasets
Build multimodal language agents for fast prototyping and production
[arXiv 2025] Official PyTorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
Code/data for MARG (multi-agent review generation)
PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simple tasks to complex challenges. It provides a low-code soluti…
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Welcome to ResearchAgent! A personal research assistant powered by GPT-3.5/GPT-4. You can ask follow-up questions and get source details for your answers.
A suite of image and video neural tokenizers
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
A library for building and serving multi-node distributed faiss indices.
The code and models for the paper: Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Recipes to scale inference-time compute of open models
🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.
Fruits-360: A dataset of images containing fruits and vegetables