Skip to content

yug-am/Music-Genre-Classfication

Repository files navigation

Music genre classification

Libraries used

About Dataset

GTZAN dataset

The GTZAN dataset is the most-used public dataset for evaluation in machine listening research for music genre recognition (MGR). The files were collected in 2000-2001 from a variety of sources including personal CDs, radio, microphone recordings, in order to represent a variety of recording conditions. 30 seconds audio files are arranged in folders sorted by genres.

First Publication with this dataset

G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," in IEEE Transactions on Speech and Audio Processing, vol. 10, no. 5, pp. 293-302, July 2002, doi: 10.1109/TSA.2002.800560.

publication

Dataset link

Caveat

Latest numpy, numpy v1.24.0 has issues with librosa library Degrade to version <=1.20.0

Results

Basic neural network

Basic_nn_accuarcy

Basic_nn_accuarcy

Generalized neural network(Overfittting fixed)

generalized_nn_accuarcy

generalized_nn_accuarcy

Convolutional neural network

cnn_accuarcy

cnn_accuarcy

Github

About

Music Genre Classfication with deep learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages