Skip to content
View spaparaju's full-sized avatar
🎯
Focus
🎯
Focus

Block or report spaparaju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.

    Python Apache License 2.0 Updated Jan 6, 2025
  • 🙌 OpenHands: Code Less, Make More

    Python MIT License Updated Dec 18, 2024
  • letta Public

    Forked from letta-ai/letta

    Letta (formerly MemGPT) is a framework for creating LLM services with memory.

    Python Apache License 2.0 Updated Dec 9, 2024
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Oct 27, 2024
  • triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++ MIT License Updated Oct 11, 2024
  • Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench

    Python Apache License 2.0 Updated Aug 18, 2024
  • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++ Apache License 2.0 Updated Aug 17, 2024
  • nim-deploy Public

    Forked from NVIDIA/nim-deploy

    A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.

    Jupyter Notebook Apache License 2.0 Updated Aug 16, 2024
  • torchtune Public

    Forked from pytorch/torchtune

    A Native-PyTorch Library for LLM Fine-tuning

    Python BSD 3-Clause "New" or "Revised" License Updated Aug 11, 2024
  • A JupyterLab extension for displaying dashboards of GPU usage.

    TypeScript BSD 3-Clause "New" or "Revised" License Updated Aug 6, 2024
  • 🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

    Python Apache License 2.0 Updated Aug 6, 2024
  • DeepSpeed Public

    Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python Apache License 2.0 Updated Aug 6, 2024
  • NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

    Go Apache License 2.0 Updated Aug 6, 2024
  • NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

    Go Apache License 2.0 Updated Aug 6, 2024
  • dspy Public

    Forked from stanfordnlp/dspy

    DSPy: The framework for programming—not prompting—foundation models

    Python MIT License Updated Aug 6, 2024
  • mlflow Public

    Forked from mlflow/mlflow

    Open source platform for the machine learning lifecycle

    Python Apache License 2.0 Updated Aug 6, 2024
  • TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

    Python Other Updated Aug 5, 2024
  • ray Public

    Forked from ray-project/ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

    Python Apache License 2.0 Updated Jul 16, 2024
  • graphrag Public

    Forked from microsoft/graphrag

    A modular graph-based Retrieval-Augmented Generation (RAG) system

    Python MIT License Updated Jul 3, 2024
  • Example models using DeepSpeed

    Python Apache License 2.0 Updated Jun 26, 2024
  • kuberay Public

    Forked from ray-project/kuberay

    A toolkit to run Ray applications on Kubernetes

    Go Apache License 2.0 Updated May 28, 2024
  • MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

    Python Apache License 2.0 Updated Apr 8, 2024
  • dbrx Public

    Forked from databricks/dbrx

    Code examples and resources for DBRX, a large language model developed by Databricks

    Python Other Updated Mar 27, 2024
  • Python Apache License 2.0 Updated Mar 26, 2024
  • diffusers Public

    Forked from huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

    Python Apache License 2.0 Updated Mar 26, 2024
  • ignite Public

    Forked from pytorch/ignite

    High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

    Python BSD 3-Clause "New" or "Revised" License Updated Mar 26, 2024
  • DALI Public

    Forked from NVIDIA/DALI

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

    C++ Apache License 2.0 Updated Mar 26, 2024
  • optimum Public

    Forked from huggingface/optimum

    🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

    Python Apache License 2.0 Updated Mar 26, 2024
  • streaming Public

    Forked from mosaicml/streaming

    A Data Streaming Library for Efficient Neural Network Training

    Python Apache License 2.0 Updated Mar 26, 2024
  • Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python Other Updated Mar 26, 2024