Skip to content

A Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition. The goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network.

Notifications You must be signed in to change notification settings

klaudiozdrava/Digit-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Digit-Recognition

A Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition. The goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network. The dataset that was used for the purpose of this assigment is AudioMNIST.

The steps of the algorithm are :

  1. We train a simple Feed Forward Neural Network model using only Mel Spectogram as features.
  2. Seperate foreground from background information using REPET algorithm.
  3. In the foreground signal,we extract digits information using sliding window technique.
  4. Finally we feed our model with these digits and make predictions.

To run this project :

  1. You should download the necessary libraries from requirement.txt and also the audio dataset.
  2. Run Dataset.py first and after run Network.py.
  3. Finally you should run the prediction.py and insert the file path when prompted.

About

A Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition. The goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages