Skip to content

wm-research/PhyGenesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PhyGenesis: Toward Physically Consistent Driving Video World Models under Challenging Trajectories

Jiawei Zhou1,2*, Zhenxin Zhu2*, Lingyi Du1*, Linye Lyu3, Lijun Zhou2, Zhanqian Wu2,
Hongcheng Luo2, Zhuotao Tian4, Bing Wang2, Guang Chen2, Hangjun Ye2, Haiyang Sun2†, Yu Li1✉

1 Zhejiang University     2 Xiaomi EV     3 The Hong Kong Polytechnic University     4 Shenzhen Loop Area Institute

(*) Equal contribution. (†) Project leader. (✉) Corresponding Author.

Paper PDF Project Page

News

[2026/03/20] Project page is officially released. Welcome to check our visual results! [2026/03/xx] We will release our Paper on arXiv soon! Please stay tuned! ☕️

Updates

  • Release Project Page
  • Release Paper
  • Release inference & training codes
  • Release model weights

Abstract

Video generation models have shown strong potential as world models for autonomous driving simulation. However, existing approaches are primarily trained on real-world driving datasets, which mostly contain natural and safe driving scenarios. As a result, current models often fail when conditioned on challenging or counterfactual trajectories—such as imperfect trajectories generated by simulators or planning systems—producing videos with severe physical inconsistencies and artifacts.

To address this limitation, we propose PhyGenesis, a world model designed to generate driving videos with high visual fidelity and strong physical consistency. Our framework consists of two key components: (1) a physical condition generator that transforms potentially invalid trajectory inputs into physically plausible conditions, and (2) a physics-enhanced video generator that produces high-fidelity multi-view driving videos under these conditions.

To effectively train these components, we construct a large-scale, physics-rich heterogeneous dataset. Specifically, in addition to real-world driving videos, we generate diverse challenging driving scenarios using the CARLA simulator, from which we derive supervision signals that guide the model to learn physically grounded dynamics under extreme conditions. This challenging-trajectory learning strategy enables trajectory correction and promotes physically consistent video generation. Extensive experiments demonstrate that PhyGenesis consistently outperforms state-of-the-art methods, especially on challenging trajectories.

Overview

Acknowledgments

Citation

If you find PhyGenesis is useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.

@article{zhou2026phygenesis,
  title={Toward Physically Consistent Driving Video World Models under Challenging Trajectories},
  author={Zhou, Jiawei and Zhu, Zhenxin and Du, Lingyi and Lyu, Linye and Zhou, Lijun and Wu, Zhanqian and Luo, Hongcheng and Tian, Zhuotao and Wang, Bing and Chen, Guang and Sun, Haiyang and Ye, Hangjun and Li, Yu},
  journal={arXiv preprint},
  year={2026}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors