๐ Matrix Dataset Documentation
Last Updated: 2025-03-25
๐ Project Page ๐ Research Paper ๐ Dataset Docs ๐ฆGitHub
๐ Matrix Dataset
The Matrix dataset was first introduced by the Matrix team in the paper "The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control." This dataset is specifically designed for training world models and comprises millions of video sequences accompanied by corresponding control signals.
๐๏ธ Forza Horizon 5

๐ผ๏ธ Visual Content
- Multi-scene driving sequences
- Desert/Ocean/Grassland biomes
- Dynamic weather system
๐ฆ Dataset Specs
- 1.2M video-control pairs
- Standardized 6-second clips
- Resolution: 2560ร1600 @ 60FPS
๐ Cyberpunk 2077

๐ผ๏ธ Visual Content
- Dense urban landscapes
- Day-night cycle (1:3 ratio)
- Indoor-outdoor transitions
๐ฆ Dataset Specs
- 1.0M video-control pairs
- Standardized 6-second clips
- Resolution: 2560ร1600 @ 60FPS
โ๏ธ Data collection&processing
๐ฅ Research Team
Core Members
- โข Ruili Feng - Tongyi Lab
- โข Han Zhang - Tongyi Lab
- โข Zhantao Yang - Tongyi Lab
- โข Jie Xiao - Tongyi Lab
- โข Zhilei Shu - Tongyi Lab
- โข Zhiheng Liu - Tongyi Lab
- โข Andy Zheng - Waterloo Univ
- โข ShangWen Zhu - SJTU Univ
- โข Yukun Huang - HKU
- โข Yu Liu - Tongyi Lab
- โข Hongyang Zhang - Vector
๐ Publications
@misc{feng2024matrixinfinitehorizonworldgeneration,
title={The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control},
author={Ruili Feng and Han Zhang and Zhantao Yang and Jie Xiao and Zhilei Shu and Zhiheng Liu and Andy Zheng and Yukun Huang and Yu Liu and Hongyang Zhang},
year={2024},
eprint={2412.03568},
url={https://arxiv.org/abs/2412.03568}
}
๐ Contact: https://matrixteam-ai.github.io/