feat(training): support torch losses #398

JPXKQX · 2025-07-03T14:42:11Z

Description

This PR adds a TorchLoss class to use any torch.nn class without needing to implement them inside Anemoi. This would allow us to delete the mae.py, huber.py, and the new smooth_l1.py proposed in #367 .

We would move from

training_loss:
   _target_: anemoi.training.losses.MAELoss
  scalers: ['pressure_level', 'general_variable', 'nan_mask_weights', 'node_weights']
  ignore_nans: False

to

training_loss:
   _target_: anemoi.training.losses.TorchLoss
  loss: 
     _target_: torch.nn.L1Loss
  scalers: ['pressure_level', 'general_variable', 'nan_mask_weights', 'node_weights']
  ignore_nans: False

What problem does this change solve?

This PR reduces code duplication, as it avoids the need to reimplement functionality that is already available in Torch. It also prevents the need for future PRs to test new loss functions inside Anemoi.

As a contributor to the Anemoi framework, please ensure that your changes include unit tests, updates to any affected dependencies and documentation, and have been tested in a parallel setting (i.e., with multiple GPUs). As a reviewer, you are also responsible for verifying these aspects and requesting changes if they are not adequately addressed. For guidelines about those please refer to https://anemoi.readthedocs.io/en/latest/

By opening this pull request, I affirm that all authors agree to the Contributor License Agreement.

for more information, see https://pre-commit.ci

ssmmnn11

Nice!

One question: what tests did you run?

ssmmnn11 · 2025-07-12T07:14:39Z

training/src/anemoi/training/losses/torch.py

+        Parameters
+        ----------
+        pred : torch.Tensor
+            Prediction tensor, shape (bs, ensemble, lat*lon, n_outputs)


Minor: do we write lat*lon everywhere? better gridpoints?

mchantry · 2025-08-14T07:41:54Z

@JPXKQX is this labelled draft for a reason?

support torch loss

a0a984f

github-actions bot added the training label Jul 3, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

b9d9d9e

for more information, see https://pre-commit.ci

JPXKQX mentioned this pull request Jul 3, 2025

feat(models): interpolation mappers and point-mlp proccesor #367

Open

5 tasks

JPXKQX self-assigned this Jul 3, 2025

JPXKQX and others added 2 commits July 3, 2025 16:37

Merge branch 'main' into feat/import-torch-losses

40c8ad2

pre-commit

2dcfc80

anaprietonem added this to Anemoi-dev Jul 10, 2025

github-project-automation bot moved this to Now In Progress in Anemoi-dev Jul 10, 2025

ssmmnn11 requested changes Jul 12, 2025

View reviewed changes

github-project-automation bot moved this from Now In Progress to Under Review in Anemoi-dev Jul 12, 2025

mchantry added the ATS Approval Not Needed No approval needed by ATS label Aug 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(training): support torch losses #398

feat(training): support torch losses #398

Uh oh!

JPXKQX commented Jul 3, 2025 •

edited

Loading

Uh oh!

ssmmnn11 left a comment

Uh oh!

ssmmnn11 Jul 12, 2025

Uh oh!

mchantry commented Aug 14, 2025

Uh oh!

Uh oh!

feat(training): support torch losses #398

Are you sure you want to change the base?

feat(training): support torch losses #398

Uh oh!

Conversation

JPXKQX commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

What problem does this change solve?

Uh oh!

ssmmnn11 left a comment

Choose a reason for hiding this comment

Uh oh!

ssmmnn11 Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

mchantry commented Aug 14, 2025

Uh oh!

Uh oh!

JPXKQX commented Jul 3, 2025 •

edited

Loading