
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,132 211 Updated Jan 8, 2025

Large Language Model (LLM) Systems Paper List

721 26 Updated Jan 8, 2025

Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization (ISCA'24)

Python 13 1 Updated Jul 4, 2024

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)

Python 97 21 Updated Jul 10, 2024

A curated list for Efficient Large Language Models

Python 1,377 103 Updated Dec 30, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,415 850 Updated Jan 6, 2025

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,519 938 Updated Nov 15, 2024

A GPU performance profiling tool for PyTorch models

Python 499 46 Updated Jul 13, 2021

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 120,605 14,884 Updated Oct 8, 2024

😎 Awesome lists about all kinds of interesting topics

340,868 28,213 Updated Dec 12, 2024

Advice for writing LaTeX documents

TeX 1,150 122 Updated Nov 14, 2024