
Create config for jepa forecasting finetuning, and fix jepa finetuning #1946

Open
csjfwang wants to merge 48 commits into ecmwf:develop from csjfwang:develop-add-jepa-forecast-finetune

Conversation

@csjfwang
Contributor

Description

  1. Create config_jepa_forecasting_finetuning.yml
  2. Fix issue #1943 (JEPA fine-tuning fails with 2D-RoPE)

Issue Number

Closes #1943

Is this PR a draft? Mark it as draft.

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a hedgedoc in the github issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

"student-teacher": {
enabled: False,
type: LossLatentSSLStudentTeacher,
type: Disabled,
Contributor

Why was this change necessary?

Contributor Author

Because this code path is not filtered by enabled: False:

    if v.type == "LossLatentSSLStudentTeacher"
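For illustration, a minimal sketch of the missing guard (the field names enabled and type follow the snippets above; the loop, container, and loss list are assumptions, not the repo's actual code):

    from types import SimpleNamespace

    def active_losses(losses):
        # Yield only enabled losses, so a disabled loss never reaches the
        # type dispatch below -- this is the filter that was missing.
        return (v for v in losses if getattr(v, "enabled", True))

    losses = [
        SimpleNamespace(enabled=False, type="LossLatentSSLStudentTeacher"),
        SimpleNamespace(enabled=True, type="LossForecast"),  # hypothetical name
    ]

    for v in active_losses(losses):
        if v.type == "LossLatentSSLStudentTeacher":
            print("setting up student-teacher loss")  # not reached: it is disabled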

Contributor

Thank you!

        enabled: False,
        masking_strategy: "random",
    -   num_samples: 1,
    +   num_samples: 0,
Contributor

If enabled: False is already set, why do we need num_samples: 0?

Contributor Author

Since I re-used the function get_batch_size_from_config(), which does not filter by enabled: False, num_samples: 1 would still be used. I will try to send another fix to avoid this:

    self.batch_size_per_gpu = get_batch_size_from_config(cf.training_config)
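For context, a minimal sketch of a batch-size helper that applies the enabled filter (the repo's actual get_batch_size_from_config likely differs; the config shape is inferred from the YAML snippets above):

    from types import SimpleNamespace

    def get_batch_size_from_config_filtered(entries):
        # Hypothetical variant: count num_samples only for enabled entries,
        # so a disabled strategy with num_samples: 1 no longer inflates the
        # batch size. An unfiltered version would count it regardless.
        return sum(
            getattr(e, "num_samples", 0)
            for e in entries
            if getattr(e, "enabled", True)
        )

    entries = [
        SimpleNamespace(enabled=False, masking_strategy="random", num_samples=1),
        SimpleNamespace(enabled=True, masking_strategy="forecast", num_samples=2),
    ]
    print(get_batch_size_from_config_filtered(entries))  # 2: disabled entry skipped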

Contributor

Oh I see, I guess in effect this part of the code gets the unfiltered config, and that is the source of the error.

For now set it to 0 samples, but maybe raise an issue!

Contributor Author

Thank you! Then I will keep num_samples: 0 and raise an issue to report the unfiltered config!

    # granted to it by virtue of its status as an intergovernmental organisation
    # nor does it submit to any jurisdiction.

    embed_orientation: "channels"
Contributor

Let's remove the model params of the encoder, because they should be taken from the base_config anyway.

Contributor Author

@sophie-xhonneux I removed the params related to the encoder, can you take another look?
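For context, a small sketch of the base-config fallback being relied on here (merge_configs and the keys shown are illustrative assumptions, not the repo's actual merge logic):

    def merge_configs(base, override):
        # Recursively overlay the finetuning config on the base config:
        # any key absent from the override falls back to the base value,
        # which is why encoder params need not be repeated.
        merged = dict(base)
        for key, value in override.items():
            if isinstance(value, dict) and isinstance(merged.get(key), dict):
                merged[key] = merge_configs(merged[key], value)
            else:
                merged[key] = value
        return merged

    base_config = {"encoder": {"embed_orientation": "channels", "num_layers": 8}}
    finetune_config = {"training": {"lr": 1e-4}}  # no encoder params duplicated
    print(merge_configs(base_config, finetune_config)["encoder"])  # taken from base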
