EDA project. Cleaning and analyzing data on VNL 2023.
Import the dataset and get a feel for it. In other words, view the columns, general statistics, data types of the values.
After getting a fair understanding of the dataset, prepare the dataset for analysis. This means, remove an duplicates, null values, unused observations, etc.
Look at each feature themselves to understand distribution and potential outliers. Univariate analysis with boxplots, histograms, bar graphs.
Look at features together to understand relationships between them. Multivariate analysis with scatterplots, heatmaps, pairplots.
After visualizing the data and statistics, try to draw up a question to be answered from the dataset.
After everything, one can begin to perform more advanced statistical analysis with machine learning models or other forms of analysis.