Digit-Recognition

A Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition. The goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network. The dataset that was used for the purpose of this assigment is AudioMNIST.

The steps of the algorithm are :

We train a simple Feed Forward Neural Network model using only Mel Spectogram as features.
Seperate foreground from background information using REPET algorithm.
In the foreground signal,we extract digits information using sliding window technique.
Finally we feed our model with these digits and make predictions.

To run this project :

You should download the necessary libraries from requirement.txt and also the audio dataset.
Run Dataset.py first and after run Network.py.
Finally you should run the prediction.py and insert the file path when prompted.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Dataset.py		Dataset.py
ExtractDigits.py		ExtractDigits.py
LibrosaClassification.py		LibrosaClassification.py
Network.py		Network.py
Prediction.py		Prediction.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Digit-Recognition

About

Uh oh!

Releases

Packages

Languages

klaudiozdrava/Digit-Recognition

Folders and files

Latest commit

History

Repository files navigation

Digit-Recognition

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages