SoundWave

Real-Time Audio Processing and Speech-to-Text with Kafka and Flink

Project Overview

SoundWave is a real-time audio processing pipeline that ingests audio streams from multiple sources, applies audio processing (primarily pitch shifting), and stores the processed audio, which can then be processed further to generate a transcript. It is built on Apache Kafka, Apache Flink, and GlusterFS, and uses a distributed setup for fault tolerance, data redundancy, efficient storage, and scalability.

Architecture

Audio Processing
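
The pitch-shift step named in the overview can be illustrated with a small sketch. This is not SoundWave's implementation: it assumes mono audio held as a list of float samples, and it uses the simplest possible technique, resampling with linear interpolation, which changes the duration by the same factor as the pitch. The function names `pitch_shift_resample` and `sine` are hypothetical.

```python
import math


def pitch_shift_resample(samples, semitones):
    """Naive pitch shift by resampling with linear interpolation.

    Reading the input `factor` times faster raises the pitch by
    `semitones` and shortens the clip by the same factor.
    """
    if not samples:
        return []
    # One semitone is a frequency ratio of 2**(1/12).
    factor = 2.0 ** (semitones / 12.0)
    out_len = max(1, int(len(samples) / factor))
    last = len(samples) - 1
    out = []
    for i in range(out_len):
        pos = i * factor              # fractional read position in the input
        lo = min(int(pos), last)      # sample at or before the position
        hi = min(lo + 1, last)        # next sample, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


def sine(freq_hz, sample_rate, n):
    """Generate n samples of a sine tone, used here only as test input."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]
```

Shifting a one-second 440 Hz tone up by 12 semitones with this sketch yields a half-second clip one octave higher. Keeping the original duration requires a time-stretching technique such as a phase vocoder, which is out of scope for this illustration.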

Kafka Setup

Download the latest Kafka release from here and extract it:

tar -xzf kafka_2.13-3.8.1.tgz
cd kafka_2.13-3.8.1

Run the following commands in separate terminal sessions, in the order shown, so that ZooKeeper is running before the Kafka broker starts:

bin/zookeeper-server-start.sh config/zookeeper.properties
bin/kafka-server-start.sh config/server.properties

Your Kafka broker is now up and running.
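
A quick way to confirm that something is listening on the broker port is a plain TCP check. The sketch below assumes the broker uses Kafka's default listener port 9092 on localhost (adjust if `config/server.properties` was changed); the function name `broker_reachable` is hypothetical. A successful connection only shows that the port is open, not that the process behind it is a healthy Kafka broker.

```python
import socket


def broker_reachable(host="localhost", port=9092, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError (refused, timed out, unresolved host).
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `broker_reachable()` after both services have started should return `True`; if it returns `False`, check the logs in the two terminal sessions.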

Flink Setup

Download the Flink SQL connector JAR from here, then update the JAR path in flink.properties so that it points to the downloaded file.
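
As an illustration of how a JAR path from flink.properties might be consumed, the sketch below reads a simple `key=value` properties file and converts the path into a `file://` URI, the form Flink's `pipeline.jars` option expects. The key name `jar.path` and both function names are assumptions made for this example; check the actual key used in the repository's flink.properties.

```python
from pathlib import Path


def load_properties(path):
    """Parse a simple key=value properties file, skipping blanks and comments."""
    props = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def jar_uri(props, key="jar.path"):
    """Turn the configured JAR path into an absolute file:// URI.

    `key` is a hypothetical property name used only for illustration.
    """
    return Path(props[key]).expanduser().resolve().as_uri()
```

With `jar.path=/opt/flink/lib/connector.jar` in the file, `jar_uri(load_properties("flink.properties"))` produces a `file://` URI that could then be passed to `pipeline.jars`.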

NeonBeach/soundwave

Folders and files

Latest commit

History

Repository files navigation

SoundWave

Table of Contents

Project Overview

Architecture

Audio Processing

Kafka Setup

Flink Setup

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages