Silero VAD: pre-trained enterprise-grade Voice Activity Detector
-
Updated
Jun 11, 2025 - Python
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
An audio/acoustic activity detection and audio segmentation tool
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
TranscribeTube is a Python tool that transcribes and generates subtitles for videos from local files or YouTube links using Hugging Face models. It features an interactive Gradio web interface, allowing users to easily upload videos, select languages, and download subtitles in SRT format.
DΞCIBΞLION is an audio intelligence module forged in the labs of OBINexus, where noise meets logic and shouting is a feature, not a bug. It mathematically analyzes human vocal input to determine emotional projection through log-scaled loudness evaluation, using a sacred constant: 85 dB.
A Python project that handles speech commands and retrieves results from Google or Wikipedia based on the spoken input. Functions are organized in separate files, with a single raw file to execute the project. This repository is intended for project purposes and will be updated with additional features in the future.
Add a description, image, and links to the voice-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-detection topic, visit your repo's landing page and select "manage topics."