Leonardo Cavalcante Araújo
Data Analytics Full-Time FEB2021, Paris & March 25th 2021
Customer analytics: Data Analysis and Visualizations using the Airline Passenger Satisfaction dataset from Kaggle.
- Socio-demographics data from customers.
- Service attributes satisfaction grades given by the customers.
- Satisfaction/Dissatisfaction of customers.
- Predict dissatisfaction: predictive analysis using
sklearn
library. - Customers Clustering: classification analysis using
sklearn
library.
- Data search, import and setting the libraries
Pandas
,Numpy
,Matplotlib
,Seaborn
andSKlearn
. - Data cleaning Clean data and prepare for analysis.
- Data Analysis and Visualization.
- Dissatisfaction Prediction Analysis: dataset preparation, normalization of attributes, one hot encoding, feature selection and applying model.
- Customer Clustering: same steps as before, but with one difference: features transformation with Principal Component Analysis (PCA). Thus, clustering customers in 5 different customer segments.
- Presentation: construction and oral presentation to the students of Ironhack Data Cohort.
- Repository "https://github.com/leo-cavalcante/airline-passenger-satisfaction" : you may find the main Python code as well as the data used. You may find the Google Slides in the 'Links' section below or by clicking on the cover photo on this document.
PS.: individual project.
Here you may find the relevant links for the repository, the main code and the final presentation slides.
GitHub Repository: airline-passenger-satisfaction