Superpipe - optimized LLM pipelines for structured data
-
Updated
Jun 18, 2024 - Python
Superpipe - optimized LLM pipelines for structured data
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.
Nadir is a Python package designed to dynamically choose the best llm for your prompt by balancing complexity and cost and response time.
Research into neural network quantization with focus on domain-specific calibration and hardware-aware optimization. Committed to rigorous methodology and reproducible results.
DocuMerge for LLMs 🚀 Easily prepare GitHub repositories for LLM analysis!
Add a description, image, and links to the llm-optimization topic page so that developers can more easily learn about it.
To associate your repository with the llm-optimization topic, visit your repo's landing page and select "manage topics."