Voice Node

This contains a satellite voice assistant node for my Homenode home automation system. A lot of the AI features are contained within that system so this node just handles wake word detection, audio processing, and streaming audio to/from the server.

This project aims for cheap hardware. You will most likely get better performance designing with an XMOS hardware based system but you should be able to splatter more of these around the house and hopefully make use of any software improvements in the future.

Hardware

Audio Pipeline

The audio processing is challenging with the Gemini Live API as any speaker feedback will interrupt the model's output.

1. Dynamic Capture Volume Control

The system dynamically adjusts microphone capture volume in sync with audio playback. When the speaker is active, capture volume is reduced minimizing the amount of speaker output picked up by the microphone This adjustment occurs before any acoustic echo cancellation (AEC) or noise reduction (NR) is applied. While hacky, this has a big impact and work well my usage of the device as I will usually I will speak louder to cancel or correct an action.

2. Acoustic Echo Cancellation (AEC)

Acoustic Echo Cancellation (AEC) is implemented using SpeexDSP. This needs to be aligned to your audio pipeline latency but is the most effective way to remove speaker echo from the mic input.

3. Noise Reduction

Non-stationary spectral gating via noisereduce is used to remove residual echo and background noise. These parameters should be adjusted to optimize performance on your setup. You may be able to run RNNoise on a Pi4+ but I was not able to keep up with input.

Deployment

The image folder contains scripts to provision a pre-flashed base Raspbian Bookworm Lite (64-bit) image. We setup some base kernel drivers and Docker on the device then run the main application in a privileged container. The script can be re-run for updates. You can setup network, hostname, SSH using Raspberry Pi Imager allowing for a full headless setup.

To build the Docker image you can run the following command from the root folder:

docker build --platform=linux/arm64 -t voicenode -f image/Dockerfile .

To deploy just run the following with your hostname or IP address:

./image/provision.ps1 voicenode.local

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
image		image
src		src
.gitignore		.gitignore
.python-version		.python-version
Hola_casita.onnx		Hola_casita.onnx
Hola_casita.tflite		Hola_casita.tflite
README.md		README.md
asound.state		asound.state
pyproject.toml		pyproject.toml
voicenode.3mf		voicenode.3mf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice Node

Hardware

Audio Pipeline

1. Dynamic Capture Volume Control

2. Acoustic Echo Cancellation (AEC)

3. Noise Reduction

Deployment

About

Uh oh!

Contributors 2

Uh oh!

Languages

prasha-au/voicenode

Folders and files

Latest commit

History

Repository files navigation

Voice Node

Hardware

Audio Pipeline

1. Dynamic Capture Volume Control

2. Acoustic Echo Cancellation (AEC)

3. Noise Reduction

Deployment

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors 2

Uh oh!

Languages