
Starred repositories
3
stars
written in Cuda
Clear filter
PyTorch 1.0 implementation of the approximate Earth Mover's Distance
gevtushenko / llm.c
Forked from karpathy/llm.cLLM training in simple, raw C/CUDA
A C++ allocator based on cudaMallocManaged