feat: experience-guided training with continual learning replay by pearlq12345 · Pull Request #98 · MINT-SJTU/RoboClaw

pearlq12345 · 2026-05-07T10:43:50Z

What

Add ExperienceStore: append-only JSONL store that records training outcomes (dataset, policy, result, lesson)
Add get_replay_datasets(): retrieves historically successful datasets for the same policy to mix into new training runs
TrainSession now records every training submission and completion as a structured experience
TrainSession generates experience_hint from past terminal-state records (success/failed/error/stopped only, not submitted)
Add continual_learning flag (default false): when enabled, mixes historical datasets into the current training run to prevent catastrophic forgetting (ref: continual learning survey, arxiv 2302.00487)
HTTP training route and UI Training Center checkbox expose the flag — off by default, does not affect existing workflows

Testing

7 unit tests in tests/test_experience_replay.py and tests/test_train_experience.py
Tests are hermetically isolated via per-test tmp_path fixtures

… from hints

…checks - Validates checkpoint path exists before inference starts - Checks action_dim in config.json matches manifest follower motor count - Warns on device mismatch between checkpoint and manifest - Warns on dataset repo_id mismatch between checkpoint and current dataset - Warns if dataset codebase_version is older than v2.1 - Hooked into EmbodiedService.start_inference() - 4 new tests covering missing checkpoint, action_dim mismatch, version warning, passing case

…nferenceConfigVerifier Experience replay for continual learning was premature: - lerobot BC models train from scratch, no catastrophic forgetting - no checkpoint-resume support, so replay has no effect - confusing for research users testing different models Kept: - ExperienceStore: records training history, surfaces hints for next run - InferenceConfigVerifier: checkpoint/dataset consistency checks before inference

Xiaofang Wu added 6 commits May 6, 2026 11:17

Refactor training policy registry

d032d24

feat: add experience-guided training hints

1ec1b00

fix: align ExperienceStore path with agent workspace root

547ab70

fix: isolate ExperienceStore path in tests, filter submitted outcomes…

8c69852

… from hints

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: experience-guided training with continual learning replay#98

feat: experience-guided training with continual learning replay#98
pearlq12345 wants to merge 6 commits into
MINT-SJTU:mainfrom
pearlq12345:feat/experience-loop

pearlq12345 commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pearlq12345 commented May 7, 2026

What

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant