04:00
2026-05-25
arxiv.org
robotics
GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation
Researchers have developed GEM-4D, a geometry-grounded video world model that generates physically consistent video predictions for robot manipulation by injecting dense 4D correspondence supervision โฆ