Improving the ConvGRU baseline (i.e. making ConvGRU an ensemble model)

As mentioned on Slack, during Christmas @camilletti and I had some fun refreshing the ConvGRU baseline already in [mlcast](https://github.com/mlcast-community/mlcast/blob/main/src/mlcast/modules/convgru_modules.py).

What bothered me was that all the advanced models we're considering (LDCast, etc.) are generative and produce ensembles by nature, while ConvGRU was purely deterministic.

**The fix turned out to be surprisingly simple:**
- Feed noise into the decoder instead of zeros
- Forward the decoder N times → N ensemble members
- Replace MSE/MAE with CRPS loss (à la AIFS)
- profit! :)

The architecture stays unchanged. Deterministic training is still possible, so we lose nothing.
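To make the steps above concrete, here is a minimal NumPy sketch. It is not the code from the linked repository: `toy_decoder` is a hypothetical stand-in for the real ConvGRU decoder, the weights and shapes are made up, and the `fair` flag is an assumption about which CRPS variant one might want. It only illustrates the idea of feeding noise into the decoder, running it N times, and scoring the members with an ensemble CRPS.

```python
import numpy as np


def toy_decoder(hidden, x, w_h=0.9, w_x=0.5):
    """Hypothetical stand-in for the ConvGRU decoder: one recurrent update."""
    return np.tanh(w_h * hidden + w_x * x)


def sample_ensemble(hidden, n_members, n_steps, rng):
    """Run the decoder n_members times, feeding Gaussian noise instead of zeros."""
    members = []
    for _ in range(n_members):
        h = hidden.copy()  # every member starts from the same encoder state
        frames = []
        for _ in range(n_steps):
            noise = rng.standard_normal(h.shape)  # replaces the all-zero input
            h = toy_decoder(h, noise)
            frames.append(h)
        members.append(np.stack(frames))
    return np.stack(members)  # shape: (n_members, n_steps, H, W)


def crps_ensemble(members, obs, fair=False):
    """Ensemble CRPS estimate, E|X - y| - 0.5 * E|X - X'|, averaged over the grid.

    members: (M, ...) array of ensemble members, obs: (...) array.
    fair=True uses the M * (M - 1) normalisation for the spread term.
    """
    m = members.shape[0]
    if fair and m < 2:
        raise ValueError("fair CRPS needs at least two members")
    skill = np.abs(members - obs[None]).mean(axis=0)
    pair_sum = np.abs(members[:, None] - members[None, :]).sum(axis=(0, 1))
    denom = m * (m - 1) if fair else m * m
    spread = pair_sum / (2.0 * denom)
    return float((skill - spread).mean())
```

With a single member and `fair=False` the spread term vanishes and the loss reduces to MAE, which is consistent with deterministic training remaining possible.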

**Preliminary results** on Italian radar data look promising: a decent rank histogram and a better (lower) CRPS than a STEPS ensemble (pysteps):

![Image](https://github.com/user-attachments/assets/47485012-f723-4425-8489-7679f183464f)

![Image](https://github.com/user-attachments/assets/59cc6ba5-8af4-4606-8eeb-0cd08150b053)
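For anyone who wants to reproduce this kind of check on their own data, a rank histogram can be computed roughly as follows. This is a generic sketch, not the evaluation code behind the plots above; the function name and the random tie-breaking are my assumptions (ties matter for radar fields, where many pixels are exactly zero).

```python
import numpy as np


def rank_histogram(members, obs, rng=None):
    """Histogram of the observation's rank within the ensemble, over all grid points.

    members: (M, ...) array, obs: (...) array. Returns M + 1 counts.
    """
    rng = rng if rng is not None else np.random.default_rng()
    m = members.shape[0]
    below = (members < obs[None]).sum(axis=0)   # members strictly below the obs
    ties = (members == obs[None]).sum(axis=0)   # members exactly equal to the obs
    # Break ties at random: add a uniform integer in [0, ties] per grid point.
    ranks = below + rng.integers(0, ties + 1)
    return np.bincount(ranks.ravel(), minlength=m + 1)
```

A well-calibrated ensemble gives a roughly flat histogram; a U shape points to under-dispersion and a dome shape to over-dispersion.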

Current implementation lives here: https://github.com/DSIP-FBK/ConvGRU-Ensemble

Full evaluation is ongoing. If results hold up, I'll open a PR to bring this into mlcast.

(Better names than "ConvGRU-CRPS" are welcome!)

