OpenWakeWord Evaluation and Adversarial Data Generation

This repository evaluates the OpenWakeWord engine's performance in creating custom wake words. It is a fork of the synthetic speech dataset generation repository by OpenWakeWord, available at https://github.com/dscripka/synthetic_speech_dataset_generation, and uses some scripts from https://github.com/dscripka/openWakeWord.

Introduction

We focus on generating adversarial synthetic data to test the engine's robustness. Adversarial data consists of words and phrases that sound phonetically similar to the wake word but should not trigger it, which helps assess false accept and false reject rates.
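To make the idea of "phonetically similar" text concrete, here is a minimal sketch that builds near-miss phrases by swapping the first letter of each word for a similar-sounding consonant. The `CONFUSABLE_ONSETS` table and both function names are hypothetical and exist only for this illustration; the repository's own script may work quite differently, for example on phonemes rather than spelling.

```python
from itertools import product

# Hypothetical, hand-written table of consonants that are easy to confuse.
# This is a spelling-level approximation, not a real phoneme model.
CONFUSABLE_ONSETS = {
    "b": ["p"], "p": ["b"],
    "d": ["t"], "t": ["d"],
    "g": ["k"], "k": ["g"],
    "m": ["n"], "n": ["m"],
}


def word_variants(word):
    """Return the word itself plus versions with a confusable first letter."""
    variants = [word]
    for replacement in CONFUSABLE_ONSETS.get(word[0], []):
        variants.append(replacement + word[1:])
    return variants


def adversarial_phrases(phrase):
    """Combine per-word variants and drop the unmodified wake phrase."""
    words = phrase.lower().split()
    combos = product(*(word_variants(w) for w in words))
    return [" ".join(c) for c in combos if list(c) != words]


print(adversarial_phrases("hey mycroft"))  # ['hey nycroft']
```

A table this small yields very few candidates; a realistic generator would need a pronunciation dictionary or phonemizer to cover vowel and mid-word substitutions.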

Setup

Clone the Repository:

git clone https://github.com/yourusername/openwakeword-evaluation.git
cd openwakeword-evaluation

Set Up Virtual Environment:

pip3 install virtualenv
virtualenv venv
source venv/bin/activate

Install Dependencies:

pip3 install -r requirements.txt
apt-get install espeak

Generating Adversarial Text

Command to Run:

python generate_adversarial_text.py "hey mycroft" 1 adversarial_texts.txt

Generating Synthetic Speech

Command to Run:

python3 generate_speech.py --model VITS --input_file adversarial_texts.txt --n_speakers 100 --output_dir aout --max_per_speaker 5

Evaluating OpenWakeWord Engine

Command to Run:

python3 wakeword_test.py aout
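This README does not describe how wakeword_test.py decides that a clip triggered the wake word. One common approach, sketched below as an assumption rather than as the script's actual logic, is to take the per-frame scores a wake word model produces for a clip and count how many times the score rises above a threshold. The function name, the 0.5 threshold, and the sample scores are all hypothetical.

```python
def count_activations(scores, threshold=0.5):
    """Count how many times the score crosses the threshold from below.

    Consecutive frames above the threshold count as one activation.
    """
    count = 0
    above = False
    for score in scores:
        is_above = score >= threshold
        if is_above and not above:
            count += 1
        above = is_above
    return count


# Hypothetical per-frame scores for one adversarial clip.
frame_scores = [0.01, 0.20, 0.70, 0.90, 0.30, 0.60, 0.10]
print(count_activations(frame_scores))  # 2
```

Counting rising edges rather than raw frames avoids reporting one long activation as many separate ones; any clip with a count above zero would then be a false accept on adversarial audio.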

Results:

False accepts are 3.0 %
Details of false accepts:
{'aout/1d4f5a333c4c49279b59e350054da39c.wav': 5}
{'aout/bdb669a443d249408eeee25ed5005c95.wav': 2}
{'aout/8f1f817fb995479ab8389aa499159a8e.wav': 4}
{'aout/972975ef4ace498aa2d122a523c05487.wav': 5}
{'aout/e3e9868f9996451fb495194bb944d63b.wav': 3}
{'aout/8af19950c9904b8cb98b91ccdcee2785.wav': 4}
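As a rough illustration of how a summary like the one above could be produced, the sketch below computes a false accept percentage from a mapping of clip path to detection count. The meaning of the per-file integers is assumed here to be a detection count, the function name is hypothetical, and the total of 200 clips is an assumption chosen only for the example; the README does not state how many clips were evaluated.

```python
def false_accept_rate(detections, total_clips):
    """Percentage of evaluated clips that triggered at least one detection."""
    if total_clips <= 0:
        raise ValueError("total_clips must be positive")
    false_accepts = sum(1 for count in detections.values() if count > 0)
    return 100.0 * false_accepts / total_clips


# Clips reported above; the integer is assumed to be a detection count.
detections = {
    "aout/1d4f5a333c4c49279b59e350054da39c.wav": 5,
    "aout/bdb669a443d249408eeee25ed5005c95.wav": 2,
    "aout/8f1f817fb995479ab8389aa499159a8e.wav": 4,
    "aout/972975ef4ace498aa2d122a523c05487.wav": 5,
    "aout/e3e9868f9996451fb495194bb944d63b.wav": 3,
    "aout/8af19950c9904b8cb98b91ccdcee2785.wav": 4,
}

TOTAL_CLIPS = 200  # hypothetical: the evaluated clip count is not given here

rate = false_accept_rate(detections, TOTAL_CLIPS)
print(f"False accepts are {rate:.1f} %")
```

Reporting the denominator alongside the percentage makes runs with different `--n_speakers` or `--max_per_speaker` settings easier to compare.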

References

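•	OpenWakeWord GitHub Repository: https://github.com/dscripka/openWakeWord
•	Synthetic Speech Dataset Generation Repository: https://github.com/dscripka/synthetic_speech_dataset_generation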
