This repository contains my notes, practices, and solutions for the Applied Data Science Course offered by Sharif University of Technology (EE-879). The content covers various data science concepts, hands-on exercises, and solutions to course assignments.
For more details about the course, visit the official course website: Applied Data Science - Sharif University of Technology, Spring 2025
The course covers the following key topics:
- π Introduction to Pandas
- π§Ή Data Cleaning and Preprocessing
- π Data Visualization
- βοΈ Feature Engineering and Dimensionality Reduction
- π― Different Problem Types and Accuracy Measures
- π Regression Methods
- π Classification Methods
- π Multiclass/Multilabel Classification and Boosting
- π§ Neural Networks
- π Deep Learning
- πΌοΈ Deep Learning Application: Image Classification
- π€ Generative AI
- π Model Explainability and Imbalanced Data Problems
The following datasets are utilized in this repository:
- π Car Features and MSRP
- π§ Stroke Prediction
- π Ames Iowa Housing Data
- π¨ Hotel Booking Demand
- π₯ Avocado Prices
- Assignment 1: Kaggle mini tutorial for introduction to Pandas.
- Assignment 2:
- Exploratory Analysis and Data Cleaning on Stroke Prediction dataset.
- Kaggle mini tutorial for Data Cleaning.
- Assignment 3:
- Data Visualization on Hotel Booking Demand
- Kaggle mini tutorial for Data Visualization.
- Practice 1: Explore pandas features on Car Features and MSRP dataset.
- Notes β Summarized lecture notes & key concepts
- Practices β Hands-on coding exercises & projects
- Assignments β Solutions to course assignments
Stay tuned for continuous updates! π