Skip to content

A curated list about how to build a LLM application including Input augment, model augment, RAG-system, serving, evaluation and software UI

License

Notifications You must be signed in to change notification settings

lumiere-ml/Awesome-LLM-Application

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Awesome-LLM-Application

Table of Content


Input UI


Blogs

Trending Projects

Companys

Input Augment


Blogs

title content updated time
使用 Langchain 的 LLM 的对话记忆 25/06/2023

Model Augment

Blogs

title content domain updated time
Calculate GPU Requirements for Your LLM Training hardware 06/12/2023
Large Language Models - The Hardware Connection hardware 03/10/2023

RAG System


Blogs

blog content updated time
The architecture of today’s LLM applications 30/10/2023
Building RAG-based LLM Applications for Production 25/10/2023

papers

Frameworks

companys

LLM Serving


Blogs

title content domain updated time
A guide to LLM inference and performance hardware 07/11/2023
A Comprehensive Guide to Selecting and Estimating GPUs for Serving ML Models hardware 05/07/2023
7 Frameworks for Serving LLMs 31/7/2023
[Optimized large language model (LLM) serving
大语言模型的模型量化(INT8/INT4)技术 quant 6/7/2023
NVIDIA HPC Application Performance hardware /

Trending Frameworks

papers

title content domain conference data
ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models 25/1/2024
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness 27/05/2023
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Efficient memory management for large language model serving with pagedattention

Frameworks

LLM Evaluate


others

blogs

About

A curated list about how to build a LLM application including Input augment, model augment, RAG-system, serving, evaluation and software UI

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published