Skip to content
View MithrilMan's full-sized avatar

Block or report MithrilMan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 5,891 428 Updated Feb 6, 2025

Whisper.net. Speech to text made simple using Whisper Models

C# 644 98 Updated Feb 3, 2025

Recurrent neural network for audio noise reduction

C 4,314 921 Updated Feb 8, 2025

TypeScript source generator to provide strongly typed SignalR clients by analyzing C# type definitions.

C# 102 11 Updated Feb 3, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,458 402 Updated Jan 29, 2025

Official repository for LTX-Video

Python 2,762 232 Updated Jan 3, 2025

SOTA Open Source TTS

Python 18,961 1,434 Updated Feb 3, 2025

Diffusion-based Portrait and Animal Animation

Python 641 56 Updated Jan 13, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 7,512 535 Updated Dec 27, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,534 173 Updated Nov 7, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,465 1,267 Updated Feb 9, 2025

All algorithms implemented in C#.

C# 7,332 1,544 Updated Dec 6, 2024

GRadient-INformed MoE

261 18 Updated Sep 25, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,426 593 Updated Feb 9, 2025

Official inference repo for FLUX.1 models

Python 20,024 1,396 Updated Feb 6, 2025

SignalR development tools inspired by SwaggerUI.

C# 51 3 Updated Jan 27, 2025

Generative AI extensions for onnxruntime

C++ 608 150 Updated Feb 9, 2025

🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

C++ 46,976 2,037 Updated Feb 9, 2025

Foundational model for human-like, expressive TTS

Python 4,020 675 Updated Jul 30, 2024

CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner

Python 512 33 Updated Dec 24, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 24,388 2,483 Updated Feb 7, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 70,930 8,387 Updated Feb 10, 2025

A self-hosted dashboard that puts all your feeds in one place

Go 10,315 365 Updated Feb 9, 2025

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,074 216 Updated Nov 27, 2024

This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.

Python 4,496 344 Updated Feb 8, 2025

ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.

C# 36,061 10,216 Updated Feb 9, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 24,195 5,486 Updated Dec 5, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 66,412 7,102 Updated Feb 9, 2025

RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.

C# 1,750 334 Updated Feb 7, 2025
Next