Skip to content
Change the repository type filter

All

    Repositories list

    • skypilot

      Public
      SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
      Python
      774000Updated Sep 12, 2025Sep 12, 2025
    • gpt-oss

      Public
      Inference gpt-oss in one file of pure C
      Python
      2.4k100Updated Sep 11, 2025Sep 11, 2025
    • Helm Chart Repository
      Makefile
      0000Updated Sep 10, 2025Sep 10, 2025
    • 0000Updated Sep 5, 2025Sep 5, 2025
    • LMCache

      Public
      Redis for LLMs
      Python
      572001Updated Sep 1, 2025Sep 1, 2025
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      920000Updated Aug 1, 2025Aug 1, 2025
    • Fast and memory-efficient exact attention
      Python
      2k000Updated Jul 8, 2025Jul 8, 2025
    • 자동화를 위한 git pre-commit hook
      Shell
      0000Updated Jun 17, 2025Jun 17, 2025
    • Python
      1201Updated Jun 11, 2025Jun 11, 2025
    • tt-umd

      Public
      User-Mode Driver for Tenstorrent hardware
      C++
      15000Updated Jun 10, 2025Jun 10, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      268400Updated Jun 4, 2025Jun 4, 2025
    • Starter Dockerfiles for onboarding and custom image creation in MoAI
      Dockerfile
      0000Updated Jun 1, 2025Jun 1, 2025
    • Cuda
      6000Updated May 26, 2025May 26, 2025
    • React LogViewer
      TypeScript
      27000Updated May 14, 2025May 14, 2025
    • Python
      2000Updated Mar 19, 2025Mar 19, 2025
    • FlashMLA

      Public
      C++
      899000Updated Feb 26, 2025Feb 26, 2025
    • Tutorial code for moreh docs
      Python
      4300Updated Feb 14, 2025Feb 14, 2025
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      2.2k000Updated Nov 25, 2024Nov 25, 2024
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      2k000Updated Nov 25, 2024Nov 25, 2024
    • diffusers

      Public
      🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
      Python
      6.3k001Updated Nov 19, 2024Nov 19, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k200Updated Nov 19, 2024Nov 19, 2024
    • PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
      Python
      5k200Updated Nov 19, 2024Nov 19, 2024
    • [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
      Python
      56000Updated Jun 13, 2024Jun 13, 2024
    • HTML
      0000Updated Jun 4, 2024Jun 4, 2024
    • ComfyUI

      Public
      The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
      Python
      9.8k000Updated May 31, 2024May 31, 2024
    • A validation and profiling tool for AI infrastructure
      Python
      74000Updated Mar 1, 2024Mar 1, 2024
    • Python
      0200Updated Feb 28, 2024Feb 28, 2024
    • AutoAWQ

      Public
      AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
      Python
      288000Updated Jan 19, 2024Jan 19, 2024
    • C++
      22000Updated Jan 19, 2024Jan 19, 2024
    • perfetto

      Public
      Performance instrumentation and tracing for Android, Linux and Chrome (read-only mirror of https://android.googlesource.com/platform/external/perfetto/)
      C++
      580000Updated Dec 14, 2023Dec 14, 2023