Stars
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Whisper.net: speech-to-text made simple using Whisper models
TypeScript source generator to provide strongly typed SignalR clients by analyzing C# type definitions.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Controllable and fast Text-to-Speech for over 7000 languages!
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Official inference repo for FLUX.1 models
SignalR development tools inspired by SwaggerUI.
Generative AI extensions for onnxruntime
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
Foundational model for human-like, expressive TTS
CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A self-hosted dashboard that puts all your feeds in one place
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.
State-of-the-art 2D and 3D Face Analysis Project
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.