Stars
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
Distribute and run LLMs with a single file.
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Fully open reproduction of DeepSeek-R1
run DeepSeek-R1 GGUFs on KTransformers
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Open source platform for the machine learning lifecycle
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
High-speed download of LLaMA, Facebook's 65B parameter GPT model
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
CUDA integration for Python, plus shiny features
Code from various chapters in OSTEP (http://www.ostep.org)
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.