Add non-trivial convolutional encoder/decoder #90

jemrobinson · 2025-09-09T09:41:48Z

Further simplify the naive linear encoder/decoder
Add a non-linear CNN with activation functions

Testing

The following validation results come from the same minimal training (Dec 2019) and validation (Jan01-Jan15 2020) data.

I'm a bit worried about the fact that 500 epochs and 20 epochs look essentially the same, but perhaps this is due to the very small amount of training data?

Thoughts @marianovitasari20 @louisavz ?
Should we be doing some sort of normalisation at data-loading/data-prep time?

naive-null-naive: 781 params

  | Name      | Type               | Params | Mode
---------------------------------------------------------
0 | encoder_0 | NaiveLinearEncoder | 700    | train
1 | encoder_1 | NaiveLinearEncoder | 40     | train
2 | processor | NullProcessor      | 0      | train
3 | decoder   | NaiveLinearDecoder | 41     | train
---------------------------------------------------------
781       Trainable params
0         Non-trainable params
781       Total params
0.003     Total estimated model params size (MB)
14        Modules in train mode
0         Modules in eval mode

20 epochs

media_images_sea-ice_concentration-static-maps_0_7af4f583bbaa43c82447

500 epochs

sea-ice_concentration-static-maps_0_86a172b8cee2c183f726

cnn-null-cnn: 1.8M params

  | Name      | Type          | Params | Mode
----------------------------------------------------
0 | encoder_0 | CNNEncoder    | 1.3 M  | train
1 | encoder_1 | CNNEncoder    | 1.4 K  | train
2 | processor | NullProcessor | 0      | train
3 | decoder   | CNNDecoder    | 937    | train
----------------------------------------------------
1.3 M     Trainable params
0         Non-trainable params
1.3 M     Total params
5.286     Total estimated model params size (MB)
95        Modules in train mode
0         Modules in eval mode

20 epochs

sea-ice_concentration-static-maps_0_7952c2b73d4a517da08b

500 epochs

sea-ice_concentration-static-maps_0_3a9925cfa25e18d1c9e8

naive-unet-naive: 11M params

  | Name      | Type               | Params | Mode
---------------------------------------------------------
0 | encoder_0 | NaiveLinearEncoder | 700    | train
1 | encoder_1 | NaiveLinearEncoder | 40     | train
2 | processor | UNetProcessor      | 11.0 M | train
3 | decoder   | NaiveLinearDecoder | 41     | train
---------------------------------------------------------
11.0 M    Trainable params
0         Non-trainable params
11.0 M    Total params
43.901    Total estimated model params size (MB)
102       Modules in train mode
0         Modules in eval mode
https://wandb.ai/turing-seaice/leaderboard/runs/aizqh771

20 epochs

media_images_sea-ice_concentration-static-maps_0_680f19e0583b1348bd22

500 epochs

sea-ice_concentration-static-maps_0_eee9235d21ec8f57c46b

cnn-unet-cnn: 12.8M params

  | Name      | Type          | Params | Mode
----------------------------------------------------
0 | encoder_0 | CNNEncoder    | 1.3 M  | train
1 | encoder_1 | CNNEncoder    | 1.4 K  | train
2 | processor | UNetProcessor | 11.0 M | train
3 | decoder   | CNNDecoder    | 937    | train
----------------------------------------------------
12.3 M    Trainable params
0         Non-trainable params
12.3 M    Total params
49.184    Total estimated model params size (MB)
183       Modules in train mode
0         Modules in eval mode

20 epochs

sea-ice_concentration-static-maps_0_d914aba2473456ae47f2

500 epochs

sea-ice_concentration-static-maps_0_b05c21a410d986073ea7

github-actions · 2025-09-09T09:44:34Z

Coverage report

Click to see where and how coverage changed

File	Statements	Missing	Coverage	Coverage (new stmts)	Lines missing
ice_station_zebra/models/common
__init__.py
conv_block_downsample.py
conv_block_upsample.py
resizing_average_pool_2d.py
ice_station_zebra/models/decoders
__init__.py
base_decoder.py
cnn_decoder.py
naive_linear_decoder.py
ice_station_zebra/models/encoders
__init__.py
base_encoder.py
cnn_encoder.py
naive_linear_encoder.py
Project Total

_{This report was generated by python-coverage-comment-action}

…nd ConvBlockDownsample

jemrobinson · 2025-09-15T08:52:51Z

Latent space visualisation

I've done some quick visualisations of the latent space for NaiveLinear and CNN encoders.

Naive linear

CNN

Normalisation

Now we can see that the CNN has actually lost some information during the encoding process, let's try adding a normalisation layer as the first step of the encoder.

CNN with initial `BatchNorm`

CNN with initial `LayerNorm`

CNN with initial `InstanceNorm2d`

jemrobinson · 2025-09-15T15:05:58Z

After adding a normalisation layer and running for 20 epochs

cnn-unet-cnn

media_images_sea-ice_concentration-static-maps_0_52979194e3758d2a8340

naive-unet-naive

media_images_sea-ice_concentration-static-maps_0_ad40b52f99d54facc23d

IFenton · 2025-09-16T08:30:47Z

Looking at this, I'm wondering about whether we need to do any regridding / reprojecting. At the moment, the SIC is centred round the south pole (-90 lat, 0 long), but I assume the ERA5 data is centred on something like -45 lat, 0 long?

IFenton · 2025-09-16T08:51:42Z

Reviewing this

jemrobinson · 2025-09-16T11:46:35Z

@IFenton: I could see that the different scales/grids for ERA5 and OSISAF might mean that we end up depending more heavily on OSISAF but not that we can't produce anything at all. I wonder whether there's an additional issue in the decoder?

IFenton · 2025-09-16T14:18:16Z

@jemrobinson Curiously, I've just done a couple of runs of naive_unet_naive, and it's producing pretty good results, e.g. https://wandb.ai/turing-seaice/leaderboard/runs/3g3iduk3?nw=nwuserifenton. It's just running on the defaults, so not quite sure what's going on

IFenton

LGTM

IFenton · 2025-09-16T08:35:45Z

ice_station_zebra/config/model/cnn_ddpm_cnn.yaml

@@ -0,0 +1,22 @@
+_target_: ice_station_zebra.models.EncodeProcessDecode
+
+name: encode-ddpm-decode


Suggested change

name: encode-ddpm-decode

name: cnn-ddpm-cnn

jemrobinson added 10 commits September 8, 2025 16:45

✨ Add a ResizingAveragePool2d class

2b11e2e

✨ Add a downsampling ConvBlock class

6d0a4b2

✨ Add a CNN encoder

51f5fc1

✨ Add an upsampling ConvBlock class

3792c73

✨ Add a CNN decoder

92c379b

🚨 Linting fixes for ResizingAveragePool2d

8a1d304

🎨 Simplify and rename NaiveLinearEncoder

532c15e

🎨 Simplify and rename NaiveLinearDecoder

7415568

🎨 Ensure that CNNDecoder and CNNEncoder are mirror-images

92a8683

✅ Update test imports

b50760d

jemrobinson added 7 commits September 9, 2025 14:55

✅ Add tests for CNNEncoder

1b79424

✅ Add tests for CNNDecoder

242e851

✅ Fix decoder test arguments

78c689e

🔧 Add a CNN-UNet-CNN model

1c25e8b

🔧 Decrease latent space size in CNN-UNet-CNN

4cfeecf

🎨 Add an additional size-preserving conv layer to ConvBlockUpsample a…

efb50b5

…nd ConvBlockDownsample

✅ Better structure of test functions

0f61efc

jemrobinson force-pushed the 45-add-convolutional-encoder branch from 528a25c to 0f61efc Compare September 12, 2025 12:48

jemrobinson added 2 commits September 15, 2025 08:40

🔧 Added additional configs

85d555b

Merge branch 'main' into 45-add-convolutional-encoder

7ef4811

🎨 Add a BatchNorm2d layer in front of encoders

7a3c07c

jemrobinson requested review from IFenton and a team September 15, 2025 15:06

IFenton approved these changes Sep 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add non-trivial convolutional encoder/decoder #90

Add non-trivial convolutional encoder/decoder #90

Uh oh!

jemrobinson commented Sep 9, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 9, 2025 •

edited

Loading

Uh oh!

jemrobinson commented Sep 15, 2025 •

edited

Loading

Uh oh!

jemrobinson commented Sep 15, 2025

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

jemrobinson commented Sep 16, 2025 •

edited

Loading

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

IFenton left a comment

Uh oh!

IFenton Sep 16, 2025

Uh oh!

Uh oh!

		@@ -0,0 +1,22 @@
		_target_: ice_station_zebra.models.EncodeProcessDecode

		name: encode-ddpm-decode

Add non-trivial convolutional encoder/decoder #90

Are you sure you want to change the base?

Add non-trivial convolutional encoder/decoder #90

Uh oh!

Conversation

jemrobinson commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing

naive-null-naive: 781 params

cnn-null-cnn: 1.8M params

naive-unet-naive: 11M params

cnn-unet-cnn: 12.8M params

Uh oh!

github-actions bot commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage report

Uh oh!

jemrobinson commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Latent space visualisation

Naive linear

CNN

Normalisation

CNN with initial BatchNorm

CNN with initial LayerNorm

CNN with initial InstanceNorm2d

Uh oh!

jemrobinson commented Sep 15, 2025

cnn-unet-cnn

naive-unet-naive

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

jemrobinson commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IFenton commented Sep 16, 2025

Uh oh!

IFenton left a comment

Choose a reason for hiding this comment

Uh oh!

IFenton Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jemrobinson commented Sep 9, 2025 •

edited

Loading

github-actions bot commented Sep 9, 2025 •

edited

Loading

jemrobinson commented Sep 15, 2025 •

edited

Loading

CNN with initial `BatchNorm`

CNN with initial `LayerNorm`

CNN with initial `InstanceNorm2d`

jemrobinson commented Sep 16, 2025 •

edited

Loading