Streaming on-device speech recognition for Android — NEON-accelerated FastConformer (32M params), cache-aware streaming with 80 ms look-ahead, no cloud. Powered by the VoxRT runtime.
-
Updated
Jul 2, 2026 - Kotlin
Streaming on-device speech recognition for Android — NEON-accelerated FastConformer (32M params), cache-aware streaming with 80 ms look-ahead, no cloud. Powered by the VoxRT runtime.
On device streaming voice activity detection (Silero VAD v5) for Android. ~424 KB native binary, NEON-accelerated arm64-v8a, RTF ~3% on Snapdragon 662.
Streaming on-device speech recognition for iOS — NEON-accelerated, encrypted FastConformer (32M params), RTF 0.08–0.10 on iPhone 13 Pro Max. Built on the VoxRT custom Rust inference runtime. SwiftPM distribution.
On device streaming voice activity detection (Silero VAD v5) for iOS. Custom Rust inference runtime, NEON-accelerated arm64, RTF ~1.85% on iPhone.
On-device wake-phrase detection for Linux aarch64 — one glibc-2.17 .so behind Python (PyPI wheel), Node.js (npm), Go (go get), C/C++ (tarball + CMake), Rust. RTF 0.053 on Raspberry Pi Zero 2 W. Same runtime as voxrt-wake-word-{android,ios}.
Pre-compiled ASR model weights for the VoxRT on-device runtime. Encrypted .vxrt v2 format. streaming-medium-pc: FastConformer 32M, CTC + RNN-T, CC-BY-4.0 (NVIDIA NeMo).
Pre-compiled Silero v5 VAD weights in .vxrt format for the VoxRT inference runtime. AES-256-GCM encrypted, ~1.2 MB, MIT.
Always-on wake-phrase detection for iOS on the VoxRT custom on-device inference runtime — Swift Package, 16 kHz mono PCM in, threshold-crossing events out. Custom phrases at voxrt.com.
Always-on wake-phrase detection for Android on the VoxRT custom on-device inference runtime — Kotlin library, 16 kHz mono PCM in, threshold-crossing events out. Custom phrases at voxrt.com.
On-device wake-phrase detection in the browser — WebAssembly + SIMD128, ~275 KB total (runtime + model), no server. Same runtime as voxrt-wake-word-{android,ios,linux}. Free demo tier of the VoxRT wake-word family.
Wake-phrase model weights (.vxrt, ~100 KB, AES-GCM encrypted) for the VoxRT custom on-device inference runtime. Paired with voxrt-wake-word-{android,ios,linux}. Custom phrases at voxrt.com.
Detect wake phrases directly in the browser using WebAssembly. Runs locally without servers or plugins to ensure privacy and speed.
Distribute pre-compiled VoxRT wake-word model weights for on-device inference using the compact .vxrt format.
Run Silero v5 voice activity detection on iOS using a custom, high-performance Rust inference runtime for low-latency streaming.
Stream on-device speech recognition on Android using the custom VoxRT inference runtime with NeMo FastConformer support.
Add a description, image, and links to the voxrt topic page so that developers can more easily learn about it.
To associate your repository with the voxrt topic, visit your repo's landing page and select "manage topics."