
ml-lab

ML research control plane — experiment lifecycle, model registry, cloud training launcher

Demo coming soon

Why

Running ML experiments across local hardware and cloud GPUs leaves you with scattered checkpoints, siloed W&B projects, and no systematic way to compare results. ml-lab connects existing tools (ml-experiment-scaffold, gpu-server-test-suite, llm-wiki) into a unified 7-stage lifecycle: preflight → init → configure → train → eval → register → publish. The same configs run locally on an RTX 5070 Ti and on cloud A100s.

Features

  • Experiment initialization from ml-experiment-scaffold templates
  • GPU preflight checks via gpu-server-test-suite before training
  • Config validation catches impossible hyperparameter combos (e.g. fp8 training on unsupported hardware, configs that would OOM)
  • Cloud training with rsync + SSH to RunPod, Lambda, or vast.ai
  • Model registry — append-only JSONL with eval scores, config hashes, metadata (a sketch of the record format follows this list)
  • Cross-experiment leaderboard for comparing models across methods and seeds
  • Automated W&B sync for Device Guard environments via WSL
  • Knowledge integration — publish findings to llm-wiki
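
To make the registry format concrete, here is a minimal sketch of appending one record to registry/models.jsonl. The field names (experiment, checkpoint, config_hash, eval, registered_at) are illustrative assumptions; the actual schema is documented in registry/README.md.

# Append one record to the append-only registry (field names are illustrative).
import hashlib
import json
import time
from pathlib import Path

def register_model(registry: Path, experiment: str, checkpoint: str,
                   config_text: str, eval_scores: dict) -> dict:
    record = {
        "experiment": experiment,            # e.g. "2026-04-gsm8k-grpo"
        "checkpoint": checkpoint,            # path or URI of the saved weights
        "config_hash": hashlib.sha256(config_text.encode()).hexdigest()[:12],
        "eval": eval_scores,                 # e.g. {"gsm8k": 0.62}
        "registered_at": time.strftime("%Y-%m-%dT%H:%M:%SZ", time.gmtime()),
    }
    with registry.open("a", encoding="utf-8") as f:  # append-only: never rewrite history
        f.write(json.dumps(record) + "\n")
    return record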

Architecture

graph TD
    ML[ml-lab<br/>Control Plane] --> SC[ml-experiment-scaffold<br/>Template]
    ML --> GPU[gpu-server-test-suite<br/>Preflight]
    ML --> WIKI[llm-wiki<br/>Knowledge Base]

    subgraph "Experiment Lifecycle"
        P[1. Preflight] --> I[2. Init]
        I --> C[3. Configure]
        C --> T[4. Train]
        T --> E[5. Eval]
        E --> R[6. Register]
        R --> PB[7. Publish]
    end

    ML --> P

    subgraph "Training Targets"
        LOCAL[Local RTX 5070 Ti]
        CLOUD[Cloud A100/H100]
    end

    T --> LOCAL
    T --> CLOUD

Quick Start

# Clone
git clone https://github.com/t-timms/ml-lab.git
cd ml-lab

# Install
pip install -e ".[dev]"

# Create a new experiment
make new-experiment NAME=gsm8k-grpo

# Validate config
make validate-config EXP=2026-04-gsm8k-grpo

# Run preflight + train
make train EXP=2026-04-gsm8k-grpo

# Register model after training
make register EXP=2026-04-gsm8k-grpo

# View leaderboard
make leaderboard
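
The validate-config step above is where impossible hyperparameter combinations get caught before any GPU time is spent. The checks below are only an assumed illustration of that idea, not the actual rules in src/ml_lab/config_validator.py:

# Illustrative config sanity checks (assumed rules, not the real validator).
def validate(cfg: dict) -> list[str]:
    errors: list[str] = []
    # fp8 training only makes sense on hardware/kernels that support it.
    if cfg.get("precision") == "fp8" and not cfg.get("fp8_supported", False):
        errors.append("fp8 requested but the target GPU does not support it")
    # Crude OOM guard: tokens per step vs. a per-GPU budget.
    tokens = cfg.get("batch_size", 1) * cfg.get("seq_len", 2048)
    if tokens > cfg.get("max_tokens_per_step", 65536):
        errors.append(f"batch_size * seq_len = {tokens} will likely OOM")
    return errors

assert validate({"precision": "bf16", "batch_size": 8, "seq_len": 4096}) == []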

Project Structure

ml-lab/
├── experiments/          # Experiment instances (from scaffold template)
│   └── YYYY-MM-<name>/  # Each experiment with configs, src, results
├── registry/
│   ├── models.jsonl      # Append-only model index
│   └── README.md         # Schema documentation
├── cloud/
│   ├── providers.yaml    # RunPod/Lambda/vast.ai configs
│   ├── launch.py         # rsync + SSH orchestrator
│   ├── Dockerfile.train  # Training container
│   └── setup_remote.sh   # One-shot remote env setup
├── scripts/
│   ├── new_experiment.py # Init from scaffold template
│   ├── preflight.py      # GPU health check
│   ├── register_model.py # Post-training registration
│   ├── cross_compare.py  # Leaderboard generator
│   ├── sync_wandb.py     # WSL-based W&B sync
│   └── research_to_wiki.py # Push findings to llm-wiki
├── src/ml_lab/
│   ├── cli.py            # Click CLI
│   └── config_validator.py # Config validation
├── tests/                # pytest test suite
├── Makefile              # Top-level orchestration
└── pyproject.toml
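
cloud/launch.py is described above as an rsync + SSH orchestrator. A minimal sketch of that pattern is shown below; the remote path, training entry point, and flags are placeholder assumptions rather than the values in providers.yaml:

# Minimal rsync + SSH launch pattern (host, paths, and entry point are placeholders).
import subprocess

def launch(host: str, experiment_dir: str, remote_dir: str = "~/ml-lab-run") -> None:
    # Sync the local experiment directory to the cloud GPU box.
    subprocess.run(
        ["rsync", "-az", "--delete", f"{experiment_dir}/", f"{host}:{remote_dir}/"],
        check=True,
    )
    # Start training in the background so the SSH session can disconnect.
    subprocess.run(
        ["ssh", host, f"cd {remote_dir} && nohup python train.py > train.log 2>&1 &"],
        check=True,
    )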

Development

# Run tests
make test

# Lint + format
make lint

# Run specific test
pytest tests/test_config_validator.py -v

License

MIT
