Designing configuration infrastructure for the mlcast Python package #5

@franchg and I have just been [having a discussion](https://mlcast.slack.com/archives/C09VC47ANG2/p1764230325293959) about how to design the configuration in `mlcast`. I used ChatGPT to summarise our conversation and produce the summary of the two approaches and the pros-and-cons tables below:

### Summary of the Two Configuration Approaches

We are considering two alternative approaches for handling configuration in the project:

1. **Hydra + Pydantic**  
   Uses Hydra to manage hierarchical YAML configurations and dynamically instantiate Python objects via `_target_`. Pydantic is used for schema validation. This approach treats the YAML config as both configuration and a class instantiation template, enabling highly flexible and composable setups.

2. **Dataclasses + dataclasses-wizard**  
   Treats configuration as Python code expressed through `dataclasses`, with `dataclasses-wizard` providing YAML serialization/deserialization. YAML is a passive storage format rather than an active driver of object creation. This approach emphasizes type safety, static tooling support, and conceptual simplicity by avoiding dynamic class instantiation in configuration files.


### Hydra + Pydantic

| Category | Pros | Cons |
|----------|------|------|
| Flexibility | Highly flexible configuration system; `_target_` allows substituting whole classes without code changes | Flexibility comes at the cost of complexity; config effectively becomes executable code |
| Modularity | Strong support for hierarchical config composition and experiment overrides | Hard to trace which classes are instantiated until runtime |
| Ecosystem | Mature tooling and widely used in ML training frameworks | Requires learning Hydra concepts (composition, instantiation, overrides) in addition to PyTorch/Lightning |
| User Experience | Powerful for advanced users who need dynamic architecture changes | Steep learning curve; difficult for newcomers and harder to teach (as observed in AIWCAS school) |
| Type Safety | Pydantic provides runtime validation | In practice, config typing often degrades to `Any`, reducing discoverability and static guarantees |
| Tooling Support | Hydra CLI utilities, sweeps, config stores | IDEs can't reliably infer instantiated types → limited autocomplete and code navigation |
| Separation of Concerns | YAML can define whole object graphs | Mixing Python class references into YAML blurs the boundary between config and code |

Examples:
- anemoi:
   - config in yaml: https://github.com/ecmwf/anemoi-core/blob/main/training/src/anemoi/training/config/lam.yaml
   - validation schema: https://github.com/ecmwf/anemoi-core/blob/main/training/src/anemoi/training/schemas/base_schema.py#L94
   - object instantiation: https://github.com/ecmwf/anemoi-core/blob/main/models/src/anemoi/models/models/encoder_processor_decoder.py#L36
- general purpose pytorch-lightning + hydra setups:
   - https://github.com/ashleve/lightning-hydra-template
   - https://github.com/franchg/yalht
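
To make the `_target_` mechanism concrete for readers who have not used Hydra, here is a minimal sketch of the idea. It is a simplified stand-in written for this discussion, not Hydra's implementation: the `instantiate` helper below is hypothetical, and the targets are standard-library classes chosen only so the snippet runs without extra dependencies. The dictionary plays the role of a parsed YAML file.

```python
import importlib
from typing import Any


def instantiate(node: Any) -> Any:
    """Recursively build objects from dicts that carry a `_target_` key.

    Illustrative stand-in only; Hydra's real instantiation logic does much more.
    """
    if isinstance(node, list):
        return [instantiate(item) for item in node]
    if not isinstance(node, dict):
        return node  # plain value (int, str, ...)

    # Build nested objects first so they can be passed as constructor arguments.
    kwargs = {key: instantiate(value) for key, value in node.items() if key != "_target_"}
    if "_target_" not in node:
        return kwargs  # ordinary mapping, nothing to construct

    # The dotted path in the config decides which class gets imported and called.
    module_name, _, attr_name = node["_target_"].rpartition(".")
    target = getattr(importlib.import_module(module_name), attr_name)
    return target(**kwargs)


# What a YAML config would look like after parsing (assumed content).
config = {
    "step": {"_target_": "datetime.timedelta", "minutes": 5},
    "ratio": {"_target_": "fractions.Fraction", "numerator": 3, "denominator": 4},
}

objects = instantiate(config)
print(objects["step"].total_seconds())  # 300.0
print(objects["ratio"])  # 3/4
```

Swapping a class here only requires editing a string in the config, which is the flexibility listed in the table. The same property is why the concrete types are unknown to an IDE or type checker until the config is loaded at runtime.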


### Dataclasses + dataclasses-wizard

| Category | Pros | Cons |
|----------|------|------|
| Mental Model | Configuration is plain Python code; easier to reason about | Lacks Hydra’s built-in config composition and sweeping mechanisms out of the box |
| Type Safety | Strong typing and inheritance via dataclasses make config structure explicit | Requires disciplined design if config grows large or complex |
| Tooling Support | Excellent IDE support: autocomplete, type hints, static analysis, navigation | Fewer higher-level utilities; some features must be implemented manually |
| User Experience | Lower cognitive load; no hidden instantiation magic | Less dynamic than `_target_`-based systems if users expect full pluggability |
| Serialization | dataclasses-wizard handles YAML round-tripping seamlessly | Not as feature-rich for configuration lifecycle management as Hydra |
| Debuggability | No dynamic class instantiation hidden in config files; code paths are explicit | Fewer "batteries included" for large-scale experiment management |
| Separation of Concerns | Config remains config; no Python identifiers or imports leaked into YAML | If the project requires dynamic class selection at runtime, patterns must be implemented intentionally |
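
As a rough illustration of the "config is plain Python" idea, the sketch below defines a small config as nested dataclasses and fills it from a dictionary, which stands in for a parsed YAML file. The class names, fields, and the `from_dict` helper are hypothetical and use only the standard library; in the proposal above, dataclasses-wizard would take over the YAML parsing and conversion step.

```python
from dataclasses import asdict, dataclass, field, fields, is_dataclass
from typing import Any, Dict, get_type_hints


@dataclass
class OptimizerConfig:
    learning_rate: float = 1e-3
    weight_decay: float = 0.0


@dataclass
class TrainingConfig:
    batch_size: int
    max_epochs: int = 10
    optimizer: OptimizerConfig = field(default_factory=OptimizerConfig)


def from_dict(cls: type, data: Dict[str, Any]) -> Any:
    """Build a dataclass from a dict, rejecting unknown keys and wrong types.

    Simplified: handles only plain classes (int, float, str, ...) and nested
    dataclasses, not generics such as list[int] or Optional[...].
    """
    hints = get_type_hints(cls)
    unknown = set(data) - {f.name for f in fields(cls)}
    if unknown:
        raise ValueError(f"unknown keys for {cls.__name__}: {sorted(unknown)}")

    kwargs = {}
    for name, value in data.items():
        expected = hints[name]
        if is_dataclass(expected):
            value = from_dict(expected, value)  # recurse into nested config
        elif not isinstance(value, expected):
            raise TypeError(
                f"{cls.__name__}.{name}: expected {expected.__name__}, "
                f"got {type(value).__name__}"
            )
        kwargs[name] = value
    # Missing required fields raise TypeError from the dataclass constructor.
    return cls(**kwargs)


# Stand-in for the result of parsing a YAML file (assumed content).
raw = {"batch_size": 16, "optimizer": {"learning_rate": 0.0005}}
config = from_dict(TrainingConfig, raw)
print(config.optimizer.learning_rate)
print(asdict(config))
```

Because `config.optimizer` is statically known to be an `OptimizerConfig`, editors can autocomplete its fields, and a misspelled key in the file fails at load time rather than deep inside training code.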


Examples:
- mllam:
   - config in yaml: https://github.com/mllam/mllam-data-prep/blob/main/example.danra.yaml
   - dataclasses for config (validation through type annotations): https://github.com/mllam/mllam-data-prep/blob/main/mllam_data_prep/config.py#L301
   - use of config values (no object instantiation based on class-names in config): https://github.com/mllam/mllam-data-prep/blob/main/mllam_data_prep/create_dataset.py#L117
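
The table notes that dynamic class selection must be implemented intentionally under this approach. One common way to do that is an explicit registry that maps short names in the config to classes in code. The sketch below is an assumption about how this could look, not something taken from mllam-data-prep or mlcast; `Persistence`, `Climatology`, `MODEL_REGISTRY`, `ModelConfig`, and `build_model` are all hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict


class Persistence:
    """Placeholder model class (hypothetical)."""

    def __init__(self, lead_steps: int) -> None:
        self.lead_steps = lead_steps


class Climatology:
    """Placeholder model class (hypothetical)."""

    def __init__(self, lead_steps: int) -> None:
        self.lead_steps = lead_steps


# The only classes a config can select are the ones listed here, in code.
MODEL_REGISTRY: Dict[str, type] = {
    "persistence": Persistence,
    "climatology": Climatology,
}


@dataclass
class ModelConfig:
    name: str
    lead_steps: int = 6

    def __post_init__(self) -> None:
        # Fail early, at config construction, if the name is not registered.
        if self.name not in MODEL_REGISTRY:
            raise ValueError(
                f"unknown model {self.name!r}; choose from {sorted(MODEL_REGISTRY)}"
            )


def build_model(config: ModelConfig):
    """Look up the class by its registered name and construct it."""
    return MODEL_REGISTRY[config.name](lead_steps=config.lead_steps)


model = build_model(ModelConfig(name="persistence", lead_steps=12))
print(type(model).__name__, model.lead_steps)
```

The YAML then contains only a short name such as `persistence` rather than a Python import path, and the set of selectable classes can be found by reading the registry.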

If anyone has further thoughts on this, please join the discussion here by posting comments ☺️
