In this repository it is shared the code of the bachelor's degree final project in Software Engineering:
Machine Learning applied to Wind and Waves Modelling
The repository is structured in three folders:
-
EDA and Data cleaning contains .m files needed during the developing of Data loading, Exploratory Data Analysis and Data cleaning phases. Here is included the code for:
- Data loading from the web page of NOAA
- Variables analysis and plots visualization,
- Correlation analysis implementation and
- Data cleaning; which include missing values detection and deletion as well as outliers deletion.
-
Features selection and extraction contains the calculation of F-test score for measuring the significance of the variables as well as .m files for data combination that let us obtain the data sets used during the Training phase.
-
Models evaluation contain a main file .mlx where we test the trained models with data from 2019 and 2020 (Real Time) and we plot "responses vs predicted" plot. We include also two of the files for two concrete models (as examples) used to plot the Learning Curves to evaluate models.
Before executing the code, install:
And install from Matlab apps manager: