Matrix Dataset Document#
The Matrix Dataset#
The Matrix dataset was first introduced by the Matrix team in the paper “The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control.” This dataset is specifically designed for training world models and comprises millions of video sequences accompanied by corresponding control signals.
Visual Content:
- Forza Horizon 5: 1000k video-control signal pairs.
Data format: 60 FPS, 2560×1600P, 4-6s with control signals
Scene: Driving across woods, grass, sea, field, river, others (ratio: 12%:15%:18%:16%:15%:9%:15%)
- Cyberpunk 2077: 300k video-control signal pairs.
Data format: 60 FPS, 2560×1600P, 4-6s with control signals
Scene: Dense urban environments with skyscrapers, including day-night cycles and indoor-outdoor scenes (ratio: 1:3).

Citations and publications#
@misc{feng2024matrixinfinitehorizonworldgeneration,
title={The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control},
author={Ruili Feng and Han Zhang and Zhantao Yang and Jie Xiao and Zhilei Shu and Zhiheng Liu and Andy Zheng and Yukun Huang and Yu Liu and Hongyang Zhang},
year={2024},
eprint={2412.03568},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2412.03568},
}