saeedmaleki
15 results for source starred repositories

FlashInfer: Kernel Library for LLM Serving

Cuda 2,267 236 Updated Mar 4, 2025

Grok open release

Python 50,201 8,366 Updated Aug 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,205 6,022 Updated Mar 4, 2025

AICI: Prompts as (Wasm) Programs

Rust 2,004 83 Updated Jan 22, 2025

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 306 45 Updated Mar 4, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,603 1,126 Updated Mar 4, 2025

Updated C version of the Test Suite for Vectorising Compilers

C 56 27 Updated Mar 14, 2024

NCCL Profiling Kit

Python 127 12 Updated Jul 1, 2024

TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches

Python 70 10 Updated Jul 25, 2023

Python 135 11 Updated Jul 22, 2024

Microsoft Collective Communication Library

C++ 339 31 Updated Sep 20, 2023

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,262 254 Updated Mar 2, 2025

Development repository for the Triton language and compiler

MLIR 14,705 1,834 Updated Mar 4, 2025

Python 3 1 Updated Jun 22, 2023

Synthesizer for optimal collective communication algorithms

Python 104 25 Updated Apr 8, 2024