Audio signal classification using deep learning algorithms

In this thesis we compared the performance of multiple feature parameters for environmental sound classification problems by developing multiple evaluating models. Specifically, as audio representation of two different datasets, we used raw waveforms, log-mel spectrograms and short-time Fourier transforms. Finally we set four different experiments and each one of them was divided in two discrete audio representation modes. For their evaluation and also for comparability purposes we developed hybrid CNN models. Along with comparing each mode within each experiment, we also compared the performances achieved by using each different dataset through inspecting and examining the factors of structure, the technical features and various prospects of the initial data distribution, respectively for each dataset. The nature of this research additionally enabled us to seek for potential environmental class-conditional audio features.

Download

You can access the uploaded document in this link

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
Results		Results
analysis_parameters		analysis_parameters
arch		arch
code		code
datanalysis		datanalysis
other		other
README.md		README.md
contents.jpg		contents.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio signal classification using deep learning algorithms

Download

Contents

Folders' Structure

Tools

Datasets

Audio_representations

Data_augmentation

Method

General method scheme concerning all experiments:

Models

1.raw architecture scheme

2.flat architecture scheme

3.mel architecture scheme

4.stfts architecture scheme

Results

Average

analytical fold results

analytical class results

About

Releases

Packages

Languages

pasquale90/mthesis

Folders and files

Latest commit

History

Repository files navigation

Audio signal classification using deep learning algorithms

Download

Contents

Folders' Structure

Tools

Datasets

Audio_representations

Data_augmentation

Method

General method scheme concerning all experiments:

Models

1.raw architecture scheme

2.flat architecture scheme

3.mel architecture scheme

4.stfts architecture scheme

Results

Average

analytical fold results

analytical class results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages