LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,656 500 Updated Feb 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 10,545 1,032 Updated Feb 23, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,828 2,006 Updated Oct 15, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 713 92 Updated Feb 3, 2025

LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.

C++ 281 20 Updated Feb 23, 2025

Official inference repo for FLUX.1 models

Python 20,370 1,431 Updated Feb 6, 2025

Development repository for the Triton language and compiler

MLIR 14,563 1,808 Updated Feb 23, 2025

The easiest way to use Agentic RAG in any enterprise

TypeScript 4,108 463 Updated Jan 22, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,260 539 Updated Feb 23, 2025

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 458 59 Updated Feb 21, 2025

[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models

Python 55 6 Updated Sep 22, 2024

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,529 453 Updated Feb 23, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,180 1,450 Updated Dec 25, 2024

A vector search SQLite extension that runs anywhere!

C 4,901 175 Updated Jan 24, 2025

An open-source framework for machine learning and other computations on decentralized data.

Python 2,349 589 Updated Feb 21, 2025

A framework for Privacy Preserving Machine Learning

Python 1,577 284 Updated Nov 23, 2024

Federated Learning Simulator (FLSim) is a flexible, standalone core library that simulates FL settings with a minimal, easy-to-use API. FLSim is domain-agnostic and accommodates many use cases such…

Python 255 58 Updated Aug 26, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,494 1,112 Updated Feb 21, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,437 1,744 Updated Feb 19, 2025

Python 365 58 Updated Dec 12, 2024

LLM inference in C/C++

C++ 75,057 10,846 Updated Feb 23, 2025

C++ 60 7 Updated Dec 2, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,250 71 Updated Feb 14, 2025

Meaningful control of data in distributed systems.

Rust 1,346 114 Updated Feb 22, 2025

Making transparency normal!

Go 24 9 Updated Dec 18, 2023

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,637 5,676 Updated Feb 23, 2025