Port of OpenAI's Whisper model in C/C++
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android app: [MNN-LLM-Android](./project/android/apps/MnnLlmApp/README.md)
Transformer-related optimizations, including BERT and GPT
LightSeq: A High Performance Library for Sequence Processing and Generation
🍅🍅🍅 YOLOv5-Lite: evolved from yolov5; the model is only 900+ KB (int8) or 1.7 MB (fp16) and reaches 15 FPS on the Raspberry Pi 4B
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.
An open-source implementation of a sequence-to-sequence-based speech processing engine
Whisper Dart is a cross-platform library for Dart and Flutter that converts audio to text (speech-to-text) by running inference on OpenAI models
A fast implementation of BERT inference built directly on NVIDIA libraries (CUDA, cuBLAS) and Intel MKL
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
Running BERT without Padding
A high-performance inference system for large language models, designed for production environments.
A Fast Neural Machine Translation System developed in C++.
SCXML interpreter and transformer/compiler written in C/C++ with bindings to Java, C#, Python and Lua
Explore LLM model deployment based on AXera's AI chips
LM inference server implementation based on *.cpp.
An Implementation of Transformer (Attention Is All You Need) in DyNet
A C++ implementation of the Swin Transformer
Deployment of LSTR, a Transformer-based end-to-end real-time lane detection model, using ONNX Runtime; includes both C++ and Python versions of the program