Stars
Reverse Engineering: Decompiling Binary Code with Large Language Models
CPU inference for the DeepSeek family of large language models in pure C++
FlashInfer: Kernel Library for LLM Serving
Curated collection of papers on MoE model inference
JAX bindings for the flash-attention3 kernels
Fast and memory-efficient exact attention
Custom Linux scheduler for concurrency fuzzing written in Java with hello-ebpf
FlagGems is an operator library for large language models implemented in the Triton language.
My learning notes and code for MLSys.
Perceptual video quality assessment based on multi-method fusion.
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
zzhbrr / mlsys-arxiv-daily
Forked from Vincentqyw/cv-arxiv-daily: 🎓 Automatically update MLSys papers daily using GitHub Actions (updated every 12 hours)
Implementation of Alphafold 3 from Google Deepmind in Pytorch
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Port of OpenAI's Whisper model in C/C++
Building blocks for foundation models.
A visualized debugging framework to aid in understanding the Linux kernel.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Beamer template for Shanghai Jiao Tong University