Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 552 66 Updated Sep 11, 2024

SpRegTiling / sparse-register-tiling

C++ 9 3 Updated Mar 2, 2024

apuaaChen / vectorSparse

Cuda 32 12 Updated Aug 24, 2022

etlundquist / rankfm

Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data

Python 175 38 Updated Aug 14, 2024

YXJ-123 / MGopt-APP

3 Updated May 9, 2023

nDIRECT / nDIRECT

A direct convolution library targeting ARM multi-core CPUs.

C 12 3 Updated Nov 27, 2024

merrymercy / awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,647 318 Updated Oct 19, 2024

google-deepmind / alphatensor

Python 2,785 254 Updated Apr 22, 2024

facebookincubator / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,675 381 Updated Aug 22, 2025

buaa-hipo / TCStencil

Cuda 9 2 Updated Apr 21, 2022

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 93,292 10,502 Updated Sep 14, 2025

dmlc / dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,065 3,050 Updated Jul 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

coder(anonymous) AnonymousYWL

Achievements

Achievements

Block or report AnonymousYWL

Stars

fmtlib / fmt

intel / intel-extension-for-pytorch

hyungyokim / LIA_AMXGPU

PKU-SEC-Lab / HybriMoE

IntelLabs / Hardware-Aware-Automated-Machine-Learning

neuralmagic / AutoFP8

microsoft / T-MAC

xdit-project / xDiT

vllm-project / vllm

openvinotoolkit / openvino

kvcache-ai / ktransformers

deepseek-ai / DeepGEMM

deepseek-ai / FlashMLA

ggml-org / llama.cpp

microsoft / BitBLAS

microsoft / SparTA

HPC4AI / MeAtten

intel / intel-extension-for-transformers

hahnyuan / LLM-Viewer