Skip to content
View LopezCastroRoberto's full-sized avatar

Highlights

  • Pro

Organizations

@IST-DASLab

Block or report LopezCastroRoberto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. IST-DASLab/qutlass IST-DASLab/qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    C++ 165 17

  2. IST-DASLab/Sparse-Marlin IST-DASLab/Sparse-Marlin Public

    Boosting 4-bit inference kernels with 2:4 Sparsity

    Cuda 93 5

  3. UDC-GAC/venom UDC-GAC/venom Public

    A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores

    Python 57 7

  4. UDC-GAC/openCNN UDC-GAC/openCNN Public

    A Winograd Minimal Filter Implementation in CUDA

    Cuda 28 2

  5. vllm-project/vllm vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 69.8k 13.3k

  6. flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Python 4.9k 698