-
smolagents Public
Forked from huggingface/smolagents🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Python Apache License 2.0 UpdatedJan 6, 2025 -
OpenHands Public
Forked from All-Hands-AI/OpenHands🙌 OpenHands: Code Less, Make More
Python MIT License UpdatedDec 18, 2024 -
letta Public
Forked from letta-ai/lettaLetta (formerly MemGPT) is a framework for creating LLM services with memory.
Python Apache License 2.0 UpdatedDec 9, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedOct 27, 2024 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedOct 11, 2024 -
nim-anywhere Public
Forked from NVIDIA/nim-anywhereAccelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
Python Apache License 2.0 UpdatedAug 18, 2024 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedAug 17, 2024 -
nim-deploy Public
Forked from NVIDIA/nim-deployA collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
Jupyter Notebook Apache License 2.0 UpdatedAug 16, 2024 -
torchtune Public
Forked from pytorch/torchtuneA Native-PyTorch Library for LLM Fine-tuning
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 11, 2024 -
jupyterlab-nvdashboard Public
Forked from rapidsai/jupyterlab-nvdashboardA JupyterLab extension for displaying dashboards of GPU usage.
TypeScript BSD 3-Clause "New" or "Revised" License UpdatedAug 6, 2024 -
accelerate Public
Forked from huggingface/accelerate🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedAug 6, 2024 -
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedAug 6, 2024 -
dcgm-exporter Public
Forked from NVIDIA/dcgm-exporterNVIDIA GPU metrics exporter for Prometheus leveraging DCGM
Go Apache License 2.0 UpdatedAug 6, 2024 -
gpu-operator Public
Forked from NVIDIA/gpu-operatorNVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Go Apache License 2.0 UpdatedAug 6, 2024 -
dspy Public
Forked from stanfordnlp/dspyDSPy: The framework for programming—not prompting—foundation models
Python MIT License UpdatedAug 6, 2024 -
mlflow Public
Forked from mlflow/mlflowOpen source platform for the machine learning lifecycle
Python Apache License 2.0 UpdatedAug 6, 2024 -
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-OptimizerTensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…
Python Other UpdatedAug 5, 2024 -
ray Public
Forked from ray-project/rayRay is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 UpdatedJul 16, 2024 -
graphrag Public
Forked from microsoft/graphragA modular graph-based Retrieval-Augmented Generation (RAG) system
Python MIT License UpdatedJul 3, 2024 -
DeepSpeedExamples Public
Forked from microsoft/DeepSpeedExamplesExample models using DeepSpeed
Python Apache License 2.0 UpdatedJun 26, 2024 -
kuberay Public
Forked from ray-project/kuberayA toolkit to run Ray applications on Kubernetes
Go Apache License 2.0 UpdatedMay 28, 2024 -
DeepSpeed-MII Public
Forked from microsoft/DeepSpeed-MIIMII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Python Apache License 2.0 UpdatedApr 8, 2024 -
dbrx Public
Forked from databricks/dbrxCode examples and resources for DBRX, a large language model developed by Databricks
Python Other UpdatedMar 27, 2024 -
foundation-model-stack Public
Forked from foundation-model-stack/foundation-model-stackPython Apache License 2.0 UpdatedMar 26, 2024 -
diffusers Public
Forked from huggingface/diffusers🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 UpdatedMar 26, 2024 -
ignite Public
Forked from pytorch/igniteHigh-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 26, 2024 -
DALI Public
Forked from NVIDIA/DALIA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
C++ Apache License 2.0 UpdatedMar 26, 2024 -
optimum Public
Forked from huggingface/optimum🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Python Apache License 2.0 UpdatedMar 26, 2024 -
streaming Public
Forked from mosaicml/streamingA Data Streaming Library for Efficient Neural Network Training
Python Apache License 2.0 UpdatedMar 26, 2024 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMar 26, 2024