llm
SGLang is a fast serving framework for large language models and vision language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Minimal library to train LLMs on TPU in JAX with pjit().
Supercharge huggingface transformers with model parallelism.
DSPy: The framework for programming—not prompting—language models
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Access large language models from the command-line
An implementation of the Llama architecture, to instruct and delight
Entropy Based Sampling and Parallel CoT Decoding
Open-source framework for the research and development of foundation models.