MSc Project: Multi-Modal Data Fusion Through Contrastive Learning in Geoscience

This repository contains code and experiments for an MSc thesis focused on contrastive learning and decoder design for multi-modal geoscientific data.

Repository Structure

.
├── part_1_exploration/
├── part_2_decoder/
├── part_3_robust/
├── part_4_improved_decoder/
├── part_5_further_analysis/
├── part_6_multimodal/
└── requirements.txt

Project Parts

Part 1: Exploration
- Initial experiments and exploration of contrastive learning
- Trains SimCLR, Barlow Twins, and Supervised Contrastive Learning methods
- Compares them with an autoencoder approach
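
SimCLR trains an encoder by pulling two augmented views of the same sample together and pushing all other samples apart, using the NT-Xent loss. The sketch below is a dependency-free illustration of that loss and is not the repository's implementation; the function name `nt_xent_loss`, the default temperature, and the toy embeddings are assumptions made for the example. It also assumes every embedding is non-zero.

```python
import math


def _normalise(vector):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def nt_xent_loss(z_a, z_b, temperature=0.5):
    """NT-Xent loss for N positive pairs (z_a[i], z_b[i]).

    z_a and z_b are lists of embedding vectors for two views of the same batch.
    """
    n = len(z_a)
    z = [_normalise(v) for v in list(z_a) + list(z_b)]  # 2N unit vectors
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # index of the other view of sample i
        sims = [
            sum(p * q for p, q in zip(z[i], z[k])) / temperature
            for k in range(2 * n)
        ]
        # The denominator covers every other embedding, excluding i itself.
        log_denominator = math.log(
            sum(math.exp(s) for k, s in enumerate(sims) if k != i)
        )
        total += log_denominator - sims[positive]
    return total / (2 * n)


# Toy check: matching views should give a lower loss than swapped views.
aligned = nt_xent_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = nt_xent_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

In a real training loop the same computation would normally be vectorised in the project's deep-learning framework and applied to encoder outputs for two augmentations of each batch.
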
Part 2: Decoder
- Adds a decoder to the contrastive learning methods to provide a direct comparison to the autoencoder approach
- Compares two methods of adding the decoder
- Compares instance norm and batch norm approaches
- Evaluates the approaches on the downstream task of weather forecasting
Part 3: Robust
- Attempts to make the chosen SimCLR approach more robust and improve its downstream performance
- Trials numerous approaches and combines them into one final solution
Part 4: Improved Decoder
- Makes the final improvements to the decoder architecture
- Compares the final approach with a more representative autoencoder
- Adds two further tasks: data assimilation and conditional latent diffusion modelling
- Expands the downstream forecasting task with added stride and changing context windows
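
To make the stride and context-window terminology concrete, the sketch below builds (context, target) pairs from a time series. It is only an illustration: the helper name `make_windows` is hypothetical, and it assumes that "stride" means the spacing between consecutive samples inside a window, which may differ from how the repository defines it.

```python
def make_windows(series, context_len, stride=1):
    """Return (context, target) pairs from a sequence.

    Each context holds `context_len` samples spaced `stride` steps apart,
    and the target is the sample one further stride after the context.
    """
    span = context_len * stride  # offset from the first context sample to the target
    pairs = []
    for start in range(len(series) - span):
        context = [series[start + k * stride] for k in range(context_len)]
        target = series[start + span]
        pairs.append((context, target))
    return pairs


# Toy series of ten time steps.
series = list(range(10))
dense = make_windows(series, context_len=3, stride=1)
strided = make_windows(series, context_len=3, stride=2)
```

With these toy inputs, a larger stride covers a longer time span per window and therefore yields fewer training pairs from the same series.
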
Part 5: Further Analysis
- Analyses the latent space to link smoothness to forecasting performance
- Implements changes to batch size, cycle loss, and alpha decay based on these findings
Part 6: Multimodal
- Implements several multimodal late fusion methods:
  - Average pooling
  - Self-attention
  - GNN
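
Late fusion combines embeddings that were produced separately for each modality. As a rough illustration of the simplest option listed above, average pooling, the sketch below averages the per-modality vectors element-wise. The function name `average_pool_fusion` and the example modalities are hypothetical, and the sketch assumes every modality has already been encoded to the same embedding dimension.

```python
def average_pool_fusion(modality_embeddings):
    """Fuse per-modality embedding vectors by element-wise averaging."""
    if not modality_embeddings:
        raise ValueError("need at least one modality embedding")
    dim = len(modality_embeddings[0])
    if any(len(e) != dim for e in modality_embeddings):
        raise ValueError("all embeddings must share the same dimension")
    count = len(modality_embeddings)
    # zip(*...) groups the values that sit at the same position in each vector.
    return [sum(values) / count for values in zip(*modality_embeddings)]


# Hypothetical embeddings for two modalities of the same sample.
temperature_embedding = [1.0, 2.0]
pressure_embedding = [3.0, 6.0]
fused = average_pool_fusion([temperature_embedding, pressure_embedding])
```

Average pooling has no learnable parameters and weights every modality equally, which makes it a natural baseline for learned fusion methods such as self-attention or a GNN.
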

Getting Started

Clone the repository
```
git clone <repo-url>
```

Create and activate a virtual environment

python -m venv venv
source venv/bin/activate

Install dependencies
```
pip install -r requirements.txt
```

Details on Folder Structure

Each model directory contains its own training and testing scripts.

Each model also has several notebooks that produce the results presented in the report:

- eval.ipynb - Evaluates the model for single-step forecasting
- eval_autoregressive.ipynb - Evaluates the model for autoregressive forecasting
- eval_autoregressive_seed_avg.ipynb - Evaluates the model for autoregressive forecasting for strided data
- eval_latent.ipynb - Evaluates the model for conditional latent diffusion
- visual.ipynb - Visualises the latent space and computes smoothness metrics
- visual_window_next_t.ipynb - Plots trajectories of context windows with the next step
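
The difference between single-step and autoregressive evaluation can be sketched as follows: a single-step forecast predicts one value from a ground-truth context window, whereas an autoregressive rollout feeds each prediction back into the window to forecast several steps ahead. This is a generic illustration and not code from the notebooks; `autoregressive_forecast` and the toy `linear_extrapolation` model are hypothetical names.

```python
def autoregressive_forecast(model, window, steps):
    """Roll a one-step model forward, feeding its predictions back as context."""
    window = list(window)  # copy so the caller's window is not modified
    predictions = []
    for _ in range(steps):
        next_value = model(window)
        predictions.append(next_value)
        # Slide the context: drop the oldest value, append the new prediction.
        window = window[1:] + [next_value]
    return predictions


def linear_extrapolation(window):
    """Toy stand-in for a trained forecaster: continue the last observed trend."""
    return window[-1] + (window[-1] - window[-2])


context = [1.0, 2.0, 3.0]
single_step = linear_extrapolation(context)
rollout = autoregressive_forecast(linear_extrapolation, context, steps=3)
```

Because each step in the rollout consumes earlier predictions, errors can compound over the horizon, which is why the two evaluation modes are usually reported separately.
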

Notes

Each subfolder contains scripts and modules for the specific experiments and model variants.

For questions or collaboration, please get in touch with me.

About

2024–2025 MSc Computing (AI & ML) thesis: Multi-Modal Data Fusion Through Contrastive Learning in Geoscience

nathanwbailey/msc_project
