Skip to content
View tarjintor's full-sized avatar

Block or report tarjintor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,421 350 Updated Mar 9, 2025

Distribute and run LLMs with a single file.

C++ 21,911 1,150 Updated Mar 10, 2025

LeetCode Training and Evaluation Dataset

Python 4 Updated Mar 6, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,235 785 Updated Mar 1, 2025

A monitor of resources

C++ 23,521 713 Updated Feb 13, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,884 518 Updated Mar 7, 2025

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 306 23 Updated Sep 30, 2024

Fully open reproduction of DeepSeek-R1

Python 22,498 2,018 Updated Mar 10, 2025

run DeepSeek-R1 GGUFs on KTransformers

Python 174 13 Updated Mar 3, 2025

LLM inference in C/C++

C++ 76,206 11,023 Updated Mar 10, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,960 6,553 Updated Dec 9, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,575 836 Updated Mar 7, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,089 3,647 Updated Mar 10, 2025

Tracking Ray Enhancement Proposals

50 29 Updated Feb 26, 2025

A Lightweight Recommendation System

Python 8,683 669 Updated Nov 8, 2023
Java 202 26 Updated Jun 15, 2023

Open source platform for the machine learning lifecycle

Python 19,725 4,384 Updated Mar 10, 2025

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,285 383 Updated Nov 26, 2024

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,166 418 Updated Jun 28, 2023

PyTorch on Kubernetes

Jsonnet 308 143 Updated Dec 1, 2021

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,888 6,090 Updated Mar 10, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,104 3,453 Updated Mar 10, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,945 5,748 Updated Mar 10, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,941 28,231 Updated Mar 10, 2025

CUDA integration for Python, plus shiny features

Python 1,906 291 Updated Feb 7, 2025

The fastai deep learning library

Jupyter Notebook 26,734 7,584 Updated Feb 28, 2025

Code from various chapters in OSTEP (http://www.ostep.org)

C 3,638 1,374 Updated Nov 9, 2023

MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.

C++ 11,191 3,999 Updated Jan 22, 2025
Next