Skip to content
/ vad Public
forked from ricky0123/vad

Voice activity detector (VAD) for the browser with a simple API

License

Notifications You must be signed in to change notification settings

DOSYCORPS/vad

Repository files navigation

Voice Activity Detection for the Browser

npm version

This package aims to provide an accurate, user-friendly voice activity detector that runs in the browser. Currently, it runs Silero VAD [1] in the browser using ONNX Runtime Web.

A demo is hosted at vad-demo-script.vercel.app. The source code for the demo can be found here. A separate demo showing how to use the VAD with a bundler like webpack can be found here.

The API works as follows:

  1. Create the VAD object with a line such as

    const myvad = await vad.MicVAD.new(options)

    options can include any of the parameters defined here. It essentially consists of callbacks that run on every audio frame, whenever a speech start is detected, whenever speech ends, etc, as well as parameters that control the voice activity detection algorithm.

  2. Start and pause the VAD object as needed with myvad.start() and myvad.pause(). The object starts in the paused state.

References

[1] Silero Team. (2021). Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier. GitHub, GitHub repository, https://github.com/snakers4/silero-vad, hello@silero.ai.

About

Voice activity detector (VAD) for the browser with a simple API

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 52.5%
  • JavaScript 37.0%
  • HTML 9.7%
  • Shell 0.8%