ML Project

1. First, split the data into a training set and a validation set (no separate test set).
2. Perform EDA on the training set only.
3. Apply data preparation to the training set, the validation set, and Kaggle's test set.
4. Try at least 3 regression models.
5. Choose the best model. The evaluation metric is MSLE (Mean Squared Logarithmic Error): https://scikit-learn.org/stable/modules/generated/sklearn.metrics.mean_squared_log_error.html
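
The workflow above could be sketched roughly as follows. This is only an illustration under stated assumptions: the data is synthetic (the real project would load Kaggle's files), the three candidate models are arbitrary picks, and the log-transform of the target plus the clipping of predictions are choices made here so that MSLE is always defined, not requirements of the project.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_log_error
from sklearn.model_selection import train_test_split

# Assumption: synthetic stand-in data with a strictly positive target.
# In the real project, X and y would come from Kaggle's training file.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
coef = np.array([0.5, -0.3, 0.2, 0.0, 0.1])
y = np.exp(1.0 + X @ coef + rng.normal(scale=0.1, size=500))

# Step 1: split into a training set and a validation set (no test set).
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Step 4: at least three regression models (illustrative choices).
candidates = {
    "linear": LinearRegression(),
    "random_forest": RandomForestRegressor(n_estimators=50, random_state=0),
    "gradient_boosting": GradientBoostingRegressor(random_state=0),
}

# Step 5: score each model on the validation set with MSLE.
scores = {}
for name, model in candidates.items():
    # Fit on log1p(y) and map predictions back with expm1, which
    # matches the log scale that MSLE measures errors on.
    wrapped = TransformedTargetRegressor(
        regressor=model, func=np.log1p, inverse_func=np.expm1
    )
    wrapped.fit(X_train, y_train)
    # Clip at zero so no negative prediction reaches the metric.
    pred = np.clip(wrapped.predict(X_val), 0, None)
    scores[name] = mean_squared_log_error(y_val, pred)

best_name = min(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name}: MSLE = {score:.5f}")
print("best model:", best_name)
```

EDA and data preparation (steps 2 and 3) are omitted here because they depend on the actual columns. Whatever preparation is used, fitting it on the training set and then applying it unchanged to the validation set and Kaggle's test set keeps the validation score honest.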