- Seoul National University
- Seoul, Korea
- https://leewonbeom.github.io
Stars
📖 A curated list of Awesome LLM/VLM Inference Papers with code, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Large Language Model (LLM) Systems Paper List
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization (ISCA'24)
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
A curated list for Efficient Large Language Models
✨✨ Latest Advances on Multimodal Large Language Models
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
A GPU performance profiling tool for PyTorch models
💯 Curated coding interview preparation materials for busy software engineers
😎 Awesome lists about all kinds of interesting topics