feat: SG/LP/Biased/Residual/GN/TD-MPPI (22~27번째 변형) by Geonhee-LEE · Pull Request #18 · Geonhee-LEE/learning_mppi

Geonhee-LEE · 2026-03-19T12:01:20Z

Summary

6종 MPPI 변형 일괄 구현 (22~27번째):

SG-MPPI (22번째): Denoising Score Matching + score-guided 샘플링
LP-MPPI (23번째): Butterworth LPF 기반 주파수 도메인 smoothness
Biased-MPPI (24번째): 혼합 분포 샘플링 — J개 보조 정책 + (K-J)개 가우시안
Residual-MPPI (25번째): 사전 정책 nominal + MPPI 잔차 최적화 + KL 페널티
GN-MPPI (26번째): 가우스-뉴턴 2차 업데이트 + 병렬 라인 서치 + MPPI 폴백
TD-MPPI (27번째): TD-learned terminal value V(x_T)로 짧은 호라이즌 장기 계획

주요 추가

구분	내용
컨트롤러	8개 신규
테스트	6개 신규 (~170 tests)
벤치마크	6개 신규 (4-Way × 4 시나리오)
전체	1519 tests 통과

Test plan

python -m pytest tests/ (1519/1519 통과)
6개 벤치마크 --all-scenarios 실행 확인

🤖 Generated with Claude Code

Denoising Score Matching으로 비용 지형의 score function을 학습하고, MPPI 가우시안 노이즈에 score 방향 bias를 추가하여 저비용 영역으로 유도. Score 미학습 시 순수 가우시안 fallback으로 graceful degradation 보장. - SGMPPIController: 단일/다중 반복 + DIAL 결합 + 온라인 학습 - ScoreNetwork: SigmaEmbedding + zero-init 출력층 - ScoreMatchingTrainer: 다중 σ 스케일 DSM Loss 학습 - 28 tests, 4-Way 벤치마크 (Vanilla vs DIAL vs Flow vs SG, 4 시나리오) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

LP-MPPI: Butterworth LPF 기반 주파수 도메인 smoothness (Kicki 2025) Biased-MPPI: 혼합 분포 샘플링 — J개 보조 정책 + (K-J)개 가우시안 (Trevisan 2024) Residual-MPPI: 사전 정책 nominal + MPPI 잔차 최적화 + KL 페널티 (Wang 2025) GN-MPPI: 가우스-뉴턴 2차 업데이트 + 병렬 라인 서치 + MPPI 폴백 (Homburger 2025) TD-MPPI: TD-learned terminal value V(x_T)로 짧은 호라이즌 장기 계획 (Crestaz 2026) - 5개 컨트롤러, 5개 벤치마크 (4-Way × 4 시나리오), 140개 테스트 추가 - AncillaryPolicy ABC + 5개 내장 정책 (PurePursuit/Braking/Feedback/MaxSpeed/PreviousSolution) - ValueNetwork + TDValueLearner + TDExperienceBuffer (온라인 TD 학습) - 전체 1519 tests 통과 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Geonhee-LEE and others added 2 commits March 19, 2026 21:00

Geonhee-LEE changed the title ~~feat: SG-MPPI (22번째 변형) — Score-Guided MPPI~~ feat: SG/LP/Biased/Residual/GN/TD-MPPI (22~27번째 변형) Mar 20, 2026

Geonhee-LEE merged commit 72b7e32 into main Mar 20, 2026
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: SG/LP/Biased/Residual/GN/TD-MPPI (22~27번째 변형)#18

feat: SG/LP/Biased/Residual/GN/TD-MPPI (22~27번째 변형)#18
Geonhee-LEE merged 2 commits intomainfrom
feature/score-guided-mppi

Geonhee-LEE commented Mar 19, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Geonhee-LEE commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

주요 추가

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Geonhee-LEE commented Mar 19, 2026 •

edited

Loading