Skip to content
View tlemo's full-sized avatar

Highlights

  • Pro

Block or report tlemo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
17 stars written in Cuda
Clear filter

A massively parallel, optimal functional runtime in Rust

Cuda 10,873 419 Updated Nov 21, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 2,375 247 Updated Mar 13, 2025

cuGraph - RAPIDS Graph Analytics Library

Cuda 1,904 316 Updated Mar 13, 2025

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,734 450 Updated Oct 9, 2023

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 852 203 Updated Mar 14, 2025

Fast CUDA matrix multiplication from scratch

Cuda 661 90 Updated Dec 28, 2023

CUDA-accelerated GIS and spatiotemporal algorithms

Cuda 646 158 Updated Mar 13, 2025

CUDA Kernel Benchmarking Library

Cuda 588 72 Updated Mar 12, 2025

CUDA Data Parallel Primitives Library

Cuda 428 97 Updated Nov 9, 2018

Spiking Neural Networks in C++ with strong GPU acceleration through CUDA

Cuda 126 25 Updated Jul 3, 2020

NVIDIA tools guide

Cuda 113 5 Updated Jan 7, 2025

CUDA kernel author's tools

Cuda 110 8 Updated Apr 24, 2022

[ARCHIVED] GPU String Manipulation --> Moved to cudf

Cuda 46 38 Updated Sep 24, 2019

Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser

Cuda 13 Updated Nov 17, 2020

A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs

Cuda 11 2 Updated Dec 17, 2024

Demonstration of various ways to implement an N-body simulation using CUDA

Cuda 1 Updated Jul 19, 2024