Skip to content
View xenshinu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report xenshinu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Optimized primitives for collective multi-GPU communication

C++ 3,454 858 Updated Jan 27, 2025

Universal LLM Deployment Engine with ML Compilation

Python 19,952 1,663 Updated Feb 12, 2025

Tile primitives for speedy kernels

Cuda 2,014 110 Updated Feb 13, 2025

the only google you'll need

JavaScript 29 4 Updated Feb 9, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,710 4,229 Updated Feb 13, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,461 466 Updated Feb 12, 2025

A list of papers about distributed consensus.

2,541 214 Updated Aug 8, 2024
10 1 Updated Oct 28, 2024

Making Long-Context LLM Inference 10x Faster and 10x Cheaper

Python 454 48 Updated Feb 13, 2025

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 144 9 Updated Oct 15, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,797 189 Updated Nov 14, 2024

An open-source RAG-based tool for chatting with your documents.

Python 20,987 1,642 Updated Feb 5, 2025

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Python 802 115 Updated Oct 2, 2024

Real time transcription with OpenAI Whisper.

Python 2,558 426 Updated Jun 1, 2024

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 5,789 458 Updated Feb 2, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,437 300 Updated Jan 7, 2025

Converts text to speech in realtime

Python 2,499 241 Updated Feb 9, 2025

OpenAI Assistants API quickstart with Next.js.

TypeScript 1,719 470 Updated Jul 29, 2024

real time face swap and one-click video deepfake with only a single image

Python 43,821 6,407 Updated Feb 12, 2025

A versatile pairwise aligner for genomic and spliced nucleotide sequences

C 18 7 Updated Jan 21, 2025

Puzzles for learning Triton

Jupyter Notebook 1,382 101 Updated Nov 18, 2024

A large scale non-linear optimization library

C++ 3,971 1,053 Updated Feb 11, 2025

A massively parallel, high-level programming language

Rust 18,266 456 Updated Feb 3, 2025

An optimization-based multi-sensor state estimator

C++ 3,674 1,432 Updated May 23, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,375 1,445 Updated Jan 19, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,200 1,066 Updated Feb 11, 2025

📐 Jekyll theme for building a personal site, blog, project documentation, or portfolio.

HTML 12,688 26,088 Updated Feb 8, 2025

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Python 4,128 599 Updated Feb 5, 2025

[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering

HTML 963 63 Updated Jan 8, 2025

Library for faster pinned CPU <-> GPU transfer in Pytorch

Python 684 39 Updated Feb 21, 2020
Next