Skip to content
View pengxin99's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report pengxin99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

LLM inference in C/C++

C++ 86,522 13,048 Updated Sep 15, 2025

程序员延寿指南 | A programmer's guide to live longer

34,343 2,354 Updated May 19, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,713 70 Updated Sep 11, 2025

Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package.

1,625 171 Updated Nov 25, 2024

how to optimize some algorithm in cuda.

Cuda 2,465 223 Updated Sep 15, 2025

OpenAI Triton backend for Intel® GPUs

MLIR 206 71 Updated Sep 15, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,355 336 Updated Sep 15, 2025
C++ 132 100 Updated Sep 11, 2025

A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..

Python 195 9 Updated Jan 14, 2025

Float 16/32 Converter

C 9 1 Updated May 30, 2019

Run Generative AI models with simple C++/Python API and using OpenVINO Runtime

C++ 334 285 Updated Sep 15, 2025

📚 Solutions to Introduction to Algorithms Third Edition

Markdown 4,952 1,276 Updated Apr 9, 2025

This collection of samples demonstrates best practices to achieve optimal video quality and performance on Intel GPUs for content delivery networks. Check out our demo, recommended command lines an…

Shell 109 32 Updated Apr 21, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,337 1,817 Updated Sep 15, 2025

A toolkit showing GPU's all-round capability in video processing

C 190 42 Updated Aug 7, 2023

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,170 215 Updated Oct 8, 2024
C++ 129 24 Updated Aug 26, 2025

Apply cnocr to achieve conversion from image to excel file

Python 45 12 Updated Jun 21, 2021

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 2,297 203 Updated Jul 17, 2024
Python 616 65 Updated Jun 4, 2024

Code repository of all OpenGL chapters from the book and its accompanying website https://learnopengl.com

C++ 11,919 2,898 Updated Aug 6, 2024

NVIDIA Linux open GPU kernel module source

C 16,195 1,484 Updated Sep 10, 2025

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

LLVM 1,366 802 Updated Sep 15, 2025

Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

C++ 1,354 239 Updated Jun 10, 2024
C++ 181 95 Updated Sep 11, 2025

oneAPI Level Zero Specification Headers and Loader

C++ 278 117 Updated Aug 27, 2025

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,490 282 Updated Sep 15, 2025

Examples for the usage of "pybind11"

C++ 662 94 Updated Jun 11, 2021

Seamless operability between C++11 and Python

C++ 17,260 2,217 Updated Sep 15, 2025

Intel® Video Processing Library (Intel® VPL) API, dispatcher, and examples

C++ 321 96 Updated Aug 6, 2025
Next