Stars
OBLITERATE THE CHAINS THAT BIND YOU
⚡ Accelerate chat and IDE workflows with a proxy for llama.cpp, managing slots and cached context for efficient, low-latency interactions.
ComfyUI nodes for Wan 2.2 SVI 2 Pro with Keyframe control via First/Last Frame and seamless video stitching.
NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.
KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.
One-click LLM server with TurboQuant Llama CPP engine
The agent that grows with you
Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and protects hot slots from being overwritten. Accelerates long pr…
Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click
A zero-allocation, header-only C++ BPE tokenizer for Qwen, built for maximum inference throughput.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
TRACER: replace 90%+ of your LLM classification calls with a traditional ML model. Formal parity guarantees. Self-improving.
Turbo Lossless - 1.33x Smaller, 2.93x Faster, Decode with 1 ADD operation
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
A free, open source, and extensible speech-to-text application that works completely offline.
Open-source, secure environment with real-world tools for enterprise-grade agents.
NDI -> Daydream Real-Time video AI
Lightweight ComfyUI wrapper for IndexTTS 2 (voice cloning + emotion control). The nodes call the original IndexTTS2 inference and keep behavior faithful to the repo.
ComfyUI custom nodes for speech, voice cloning, and voice design based on Qwen3-TTS models
Audio Reactivity Nodes for ComfyUI 🔊 Create AI generated audio-driven animations
An open-source desktop app for generating videos with LTX models
LoRA Pilot is the ultimate Docker image for all Stable Diffusion LoRA trainers. Includes kohya_ss, diffusion pipes, and TensorBoard for training, plus ComfyUI and InvokeAI for validation. Features sha…
PainterI2VAdvanced is a drop-in replacement for the standard Wan2.2 I2V conditioning node that solves the critical color drift problem at high motion amplitude
Portable ComfyUI installer for Windows, macOS and Linux 🔹 Nvidia GPU support 🔹 Pixaroma Community Edition