Instructor : E. Nazerfard
Semester: Spring 2023
This repository consists of Data mining projects at Amirkabir University of technology.
A mini project which helps to undrestand concepts such as:
-
Garbage in, Garbage out
-
One hot encoding and Label encoding
-
Data augmentation
-
Down sampling and Upsampling
-
Imbalanced dataset techniques such as smotetomek and smoteenn
-
Noramlization
-
Principle component analysis
-
3D Data visualization and box plot
Libraries: Scikit-learn, Pandas, Imbalanced-learn, Matplotlib
Dataset: Palmer penguin
A mini project which helps to undrestand concepts such as:
-
Q-box
-
Linear Regression Vs Polynomial Regression
-
Classification using : Decision tree, Random forest, KNN, Linear & Non-Linear SVM
-
Mutli-class classification using Deep learning
-
Confusion matrix
Libraries: Scikit-learn, Tensorflow, Pandas, Numpy , Matplotlib
Dataset: House price
-
Create Similarity matrix using Cosine Similarity and euclidean distance
-
Implementing Kmeans algorithm.
Results :
Libraries: Scikit-learn,matplotlib, numpy
Dataset: Kmeans dataset
A project which wants to make some prediction of persian music dataset.
-
Consists of EDA and PCA visualization
-
Implementing Regression for popularity prediction
-
Implementing Classification for traditional music prediction
Libraries: Scikit-learn, matplotlib, numpy
Dataset: Persian Spotify