Skip to content

Environmental sound classification using deep learning algorithms

Notifications You must be signed in to change notification settings

pasquale90/mthesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio signal classification using deep learning algorithms

University Subject Subject City Year

In this thesis we compared the performance of multiple feature parameters for environmental sound classification problems by developing multiple evaluating models. Specifically, as audio representation of two different datasets, we used raw waveforms, log-mel spectrograms and short-time Fourier transforms. Finally we set four different experiments and each one of them was divided in two discrete audio representation modes. For their evaluation and also for comparability purposes we developed hybrid CNN models. Along with comparing each mode within each experiment, we also compared the performances achieved by using each different dataset through inspecting and examining the factors of structure, the technical features and various prospects of the initial data distribution, respectively for each dataset. The nature of this research additionally enabled us to seek for potential environmental class-conditional audio features.

Download

You can access the uploaded document in this link

Contents

Folders' Structure

Tools

Datasets

Audio_representations

  • 1.raw : 1D raw waveform
  • 2.flat : 1D flattened log mel-spectogram
  • 3.mel : 2D log mel-spectogram
  • 4.stfts : 2D short-time Fourier transform

Data_augmentation

  • Audio data augmentation: White Noise, Time Stretching, Time Shifting, Pitch Shifting
  • Image data augmentation (2D exps) : Random Flip, Random Erasing

Method

General method scheme concerning all experiments:

Models

1.raw architecture scheme

2.flat architecture scheme

3.mel architecture scheme

4.stfts architecture scheme

Results

Average

About

Environmental sound classification using deep learning algorithms

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages