refactor variable scaling, pressure level scalings only applied in specific circumstances #52

sahahner · 2024-12-27T13:35:33Z

Solve the problem explained in issue #7 by refactoring the variable scalings into a general variable scaling and a pressure level scaling.
@mc4117 , @pinnstorm and me came up with a new structure. This PR implements this.

This is first draft. Feedback very welcome!

allow several variable level scaling (i.e. pressure level and model level)
implement/update tests
decide: do we want to allow scaling by variable_ref and variable_name, i.e. scale q_50 by q and q_50?

b8raoult · 2024-12-30T09:35:33Z

Please consider using the knowledge about variables that come from the dataset metadata. See https://github.com/ecmwf/anemoi-transform/blob/7cbf5f3d4baa37453022a5a97e17cc71a5b8ceeb/src/anemoi/transform/variables/__init__.py#L47

sahahner · 2024-12-30T09:51:50Z

Please consider using the knowledge about variables that come from the dataset metadata. See https://github.com/ecmwf/anemoi-transform/blob/7cbf5f3d4baa37453022a5a97e17cc71a5b8ceeb/src/anemoi/transform/variables/__init__.py#L47

We have given this some thought, and after wanting to use the information from the dataset in the beginning, I have opted for allowing the definition of our own groups here to use different scaling for self-defined groups.
Also, I was also told that it is possible to build datasets without information about the variable types and therefore not to rely on that metadata.
If you have strong opinions on this I am happy to discuss it again.

mc4117 · 2024-12-30T11:01:08Z

training/src/anemoi/training/train/forecaster.py

+            data_indices,
+        ).get_variable_scaling()
+
+        # Instantiate the pressure level scaling class with the training configuration


I don't know if this is possible but I wonder if here we could instantiate from a list of scalars rather than specific ones? I think this is how it is done for the validation metrics

Yes, this would be useful, as we want to allow more scalar methods for model levels/tendency/etc.
I will have a look.

training/src/anemoi/training/train/scaling.py

mc4117 · 2024-12-31T09:55:38Z

training/src/anemoi/training/train/forecaster.py

+
+        # Instantiate the pressure level scaling class with the training configuration
+        pressurelevelscaler = instantiate(
+            config.training.pressure_level_scaler,


I think this config location is wrong? It should be training.variable_loss_scaling.pressure_level_scalar

Indeed, in the config file, the pressure_level_scalar should be defined directly in training as before.

…umstances' of https://github.com/ecmwf/anemoi-core into 7-pressure-level-scalings-only-applied-in-specific-circumstances

FussyDuck · 2025-01-02T11:48:17Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ sahahner
❌ mc4117
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

mc4117 · 2025-01-07T10:13:48Z

training/src/anemoi/training/train/scaling.py

+    @abstractmethod
+    def get_variable_scaling(self) -> np.ndarray: ...
+
+    def get_variable_group(self, variable_name: str) -> tuple[str, str, int]:


I think maybe we should put this function in an utils. I only say this because I want to use it in another application outside of the loss scalings

JPXKQX · 2025-01-08T11:43:58Z

Hi, I would like to know what you think about making all scalers explicit in the config file. Something similar to the additional_scalers: field, but including not only the scalers per variable, but also the node_loss_weight,... The positive aspect I see is that there would be more homogeneity in the scalers defined in the metrics/loss fields.

sahahner added 3 commits December 27, 2024 10:15

first version of refactor of variable scaling

511ed18

config training changes

7ddf6d6

avoid multiple scaling

3ddeccc

sahahner linked an issue Dec 27, 2024 that may be closed by this pull request

Loss scalings #5

Open

2 tasks

sahahner linked an issue Dec 30, 2024 that may be closed by this pull request

Pressure Level Scalings only applied in specific circumstances #7

Open

mc4117 reviewed Dec 30, 2024

View reviewed changes

training/src/anemoi/training/train/scaling.py Outdated Show resolved Hide resolved

docstring and explain variable reference

be4602c

mc4117 reviewed Dec 31, 2024

View reviewed changes

mc4117 added 4 commits December 31, 2024 10:47

fix to config for pressure level scaler

195af07

instantiating scalars as a list

2644c18

preparing for tendency losses

718fc57

Merge branch '7-pressure-level-scalings-only-applied-in-specific-circ…

a34ac02

…umstances' of https://github.com/ecmwf/anemoi-core into 7-pressure-level-scalings-only-applied-in-specific-circumstances

sahahner changed the title ~~pressure level scalings only applied in specific circumstances~~ refactor variable scaling, pressure level scalings only applied in specific circumstances Jan 2, 2025

log the variable level scaling information as before

b91af11

HCookie added the training label Jan 6, 2025

HCookie self-requested a review January 6, 2025 14:36

mc4117 reviewed Jan 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor variable scaling, pressure level scalings only applied in specific circumstances #52

refactor variable scaling, pressure level scalings only applied in specific circumstances #52

sahahner commented Dec 27, 2024 •

edited

Loading

b8raoult commented Dec 30, 2024

sahahner commented Dec 30, 2024

mc4117 Dec 30, 2024

sahahner Dec 30, 2024

mc4117 Dec 31, 2024

sahahner Dec 31, 2024

FussyDuck commented Jan 2, 2025 •

edited

Loading

mc4117 Jan 7, 2025

JPXKQX commented Jan 8, 2025

refactor variable scaling, pressure level scalings only applied in specific circumstances #52

Are you sure you want to change the base?

refactor variable scaling, pressure level scalings only applied in specific circumstances #52

Conversation

sahahner commented Dec 27, 2024 • edited Loading

b8raoult commented Dec 30, 2024

sahahner commented Dec 30, 2024

mc4117 Dec 30, 2024

Choose a reason for hiding this comment

sahahner Dec 30, 2024

Choose a reason for hiding this comment

mc4117 Dec 31, 2024

Choose a reason for hiding this comment

sahahner Dec 31, 2024

Choose a reason for hiding this comment

FussyDuck commented Jan 2, 2025 • edited Loading

mc4117 Jan 7, 2025

Choose a reason for hiding this comment

JPXKQX commented Jan 8, 2025

sahahner commented Dec 27, 2024 •

edited

Loading

FussyDuck commented Jan 2, 2025 •

edited

Loading