Fast inference engine for Transformer models
Updated Apr 8, 2025 - C++
Fast implementation of BERT inference directly on NVIDIA (CUDA, cuBLAS) and Intel MKL
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Fast Unit Root Tests and OLS regression in C++ with wrappers for R and Python
Least squares adjustment software
A C++ library for the numerical renormalization group (NRG)
Numerical computing library for linear algebra and task-based parallelism.
This repository houses the Statslabs.Matrix Linear Algebra Library for use while learning C++ from Bjarne Stroustrup's book 'The C++ Programming Language (4th Edition)'
Implemented LeNet for MNIST digit recognition and carried out optimizations using pthreads, OpenBLAS, etc.
LU factorization with ScaLAPACK