Spatiotemporal Temperature Fusion Network & StarFM

Using machine learning to turn unreadable or missing satellite images into usable and rich data.

An example implementation of the STTFN.

This repository, and it's contents, are the result of the 2023-2024 STTFN Senior Project at Oregon State University with constellr gmbh.

This repository has a strict license to ensure read-only rights, for educational and research purposes. Check the LICENSE.md for more.

What is a Spatiotemporal Temperature Fusion Network? (STTFN)

Our implementation is based on the works of Zhixianh Yin, Penghai Wu, and Giles M. Foody in "Spatiotemporal Fusion of Land Surface Temperature Based on a Convolutional Neural Network"

The specific research paper, including the model architecture, can be found here: https://ieeexplore.ieee.org/document/9115890

A Spatiotemporal Temperature Fusion Network (STTFN) is a multiscale fusion-based convolutional neural network utilized to build nonlinear relationships between input and output images -- in this case, Land Surface Temperature (LST) satellite imaging. By utilizing two convolutional neural networks to predict a forward sequence and a backward sequence, we can predict a "middle" sequence, filling in possible missing or damaged LST satellite imaging.

In remote sensing, MODIS satellite data is considered lower resolution, but common -- while Landsat satellite data is higher resolution, but rarer. Utilizing this STTFN method, we aim to use the context of surrounding Landsat and MODIS imagery to infer Landsat-quality imaging for days where there would typically be none. For later descriptions in this document, Landsat imagery will be referred to as L# (with the # dictating the timestamp in relation to the prediction image), and MODIS will be referred to as M#, with the same rules.

We implemented the STTFN research paper, which is provided in the .ipnyb file, and trained on Oregon State University's HPC Clusters. For performance metrics, we used Root Mean Squared Error (RMSE) and Structural Similarity (SSIM). The .ipnyb file also includes an implementation of a "comparison model", the STARFM model (original implementation and repository can be found at https://github.com/nmileva/starfm4py).

An example of our implemented STTFN results can be seen below.

Pictured above is the "target MODIS image".

The MODIS image has been converted into the "inferred Landsat-Quality" model output.

The model output's SSIM: 0.9968653789596235

The model output's RMSE: 1.726944568447375

The "Goal" Landsat (Notice the similarity, and how the Model Output ignores the Data's artifacts)

The Underlying Architecture

The general architecture of the Spatiotemporal Temperature Fusion Network consists of three primary factors:

A trained forward convolutional neural network.
A trained backward convolutional neural network.
An STC-Weighting function to average the results of the above-mentioned CNNs for the final prediction.

The architecture of the model, as well as the training flow, is specified below:

Source: "Spatiotemporal Fusion of Land Surface Temperature Based on a Convolutional Neural Network" (https://ieeexplore.ieee.org/document/9115890)

The Convolutional Neural Network

Both the forward trained and the backward trained neural network have the exact same shape and architecture, their difference comes from the training data provided. For the forward trained network, the model is fed days/timestamps leading up to the prediction-needed/missing/damaged image, while the backward trained network is fed days/timesamps counting down from the prediction-needed/missing/damaged image. In this way, there are two models -- one essentially trying to predict the "next sequence", and another trying to predict the "precursor".

Each Convolutional Neural Network has three defining networks that process and learn from the images provided -- a Super-Resolution Net, an Integration Net and Extraction Net.

The specific architecture for the Convolutional Neural Network can be seen below:

Source: "Spatiotemporal Fusion of Land Surface Temperature Based on a Convolutional Neural Network" (https://ieeexplore.ieee.org/document/9115890)

The STC-Weighting Function

The final part of the STTFN is to weigh the average of the forward trained CNN and the backward trained CNN for an accurate upscaling of the given MODIS image.

The STC-Weighting function is the function that calculates the weights/preference given over a certain CNN output based on it's accuracy to the given MODIS image the STTFN is trying to upscale. The formula is specified below:

Source: "Spatiotemporal Fusion of Land Surface Temperature Based on a Convolutional Neural Network" (https://ieeexplore.ieee.org/document/9115890)

i denotes the timestamp in relation to the missing image, which is i = 2. i = 1 refers to a previous time, and i = 3 refers to a time after. L# and M#, as stated above, refers to the image provided at that time. For example, M2 is the "second" timestamp MODIS image, which is the one to be upscaled by the STTFN. L1 is the Landsat image in a previous timestamp.

The first formula is calculates the weight parameter for each CNN output. Essentially, the weight parameter for a specific CNN output is calculated by element-wise subtracting the given MODIS and the CNN output, estimating CNN accuracy. That accuracy is this used to show a preference towards a specific CNN output in that weight parameter.

The second formula, which then produces the final STTFN output prediction, uses the calculated weight parameters to find the most accurate estimation of the combined CNN outputs.

STARFM: What is it?

StarFM is an established baseline model we are comparing STTFN’s performance with in terms of RMSE and SSIM. The model functions the same as the model listed in the original repo (https://github.com/nmileva/starfm4py), including use of the same parameters’ values.

For testing, we put the StarFM model inside a class to substantiate instances and make the model’s parameters part of its class.

Using the .ipnyb

We encourage anyone looking to try out the .ipnyb file, though if you don't have access to University-level HPC Clusters, then the L4 GPU on the Google Colab should work just fine for this purpose.

Provide a named area of interest compatible with the Microsoft Planetary Computer connectors, and test your area of interest.

We have provided example areas of interest, named within the repository.

For the StarFM, you can change the TEST_POINT global variable in Imports to a different Test Data Point to test with. If TEST_POINT is >= the number of datapoints available, it defaults to the last data point available in trainingDataSet (check StarFM Test Run). Model outputs, including RMSE and SSIM are displayed in Testing Results.

Contacts

If you have any questions, or struggle with the code, please feel free to contact us at any of the emails below or create a Github Issue, which we will look to resolve.

Team Members:

Todd Goldfarb (tcgoldfarb@gmail.com)

Joseph Balaty (balatyj@oregonstate.edu)

Jarod Lokrantz (lokrantj@oregonstate.edu)

Mahmoud Fahkry (fakhryk@oregonstate.edu)

Link to the Repository (if you are on pages): https://github.com/Todd-C-Goldfarb/STTFN-OSU

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Spatiotemporal Temperature Fusion Network & StarFM

Using machine learning to turn unreadable or missing satellite images into usable and rich data.

What is a Spatiotemporal Temperature Fusion Network? (STTFN)

The Underlying Architecture

The Convolutional Neural Network

The STC-Weighting Function

STARFM: What is it?

Using the .ipnyb

Contacts

Files

README.md

Latest commit

History

README.md

File metadata and controls

Spatiotemporal Temperature Fusion Network & StarFM

Using machine learning to turn unreadable or missing satellite images into usable and rich data.

What is a Spatiotemporal Temperature Fusion Network? (STTFN)

The Underlying Architecture

The Convolutional Neural Network

The STC-Weighting Function

STARFM: What is it?

Using the .ipnyb

Contacts