{"slug": "world-action-models-a-survey", "title": "World Action Models: A Survey", "summary": "A new survey on World Action Models (WAMs) clarifies the boundaries between broad world models, video generation models, and action-grounded video world models, organizing existing works by what they generate and their design components. The survey identifies a trend toward methods that generate less of the future while preserving what control requires, trading representational richness against compute, memory, latency, and action-label cost.", "body_md": "# Computer Science > Robotics\n\n[Submitted on 18 Jun 2026]\n\n# Title:World Action Models: A Survey\n\n[View PDF](/pdf/2606.20781)\n\n[HTML (experimental)](https://arxiv.org/html/2606.20781v1)\n\nAbstract:World Action Models (WAMs) are embodied predictive-action models that make a forecast of the future available to action. Recent WAMs repurpose large video generation models, and a parallel line relies on language or vision-language backbones without a video-generation core. This rapid expansion has blurred the boundary among broad world models, video generation models, action-grounded video world models, Vision-Language-Action policies, and WAMs. This survey gives the field a common account. It first clarifies these boundaries, then organizes existing works through two complementary views. The first view asks what each method is required to generate, spanning rendered futures, latent futures, and video-generation-free action reasoning. The second view decomposes each method by predictive substrate, backbone, action coupling, and deployment regime. This anatomy supports a unified discussion of interactability, causality, persistence, physical plausibility, and generalization, followed by data, evaluation, and open challenges. Across these axes, a consistent design pattern emerges: WAMs are not simply video generators with action heads, but predictive-action methods whose design choices trade representational richness against compute, memory, latency, and action-label cost. The field is moving toward methods that generate less of the future while preserving what control requires. The survey homepage is available at[this https URL].\n\n### References & Citations\n\nLoading...\n\n# Bibliographic and Citation Tools\n\nBibliographic Explorer\n\n*(*[What is the Explorer?](https://info.arxiv.org/labs/showcase.html#arxiv-bibliographic-explorer))\nConnected Papers\n\n*(*[What is Connected Papers?](https://www.connectedpapers.com/about))\nLitmaps\n\n*(*[What is Litmaps?](https://www.litmaps.co/))\nscite Smart Citations\n\n*(*[What are Smart Citations?](https://www.scite.ai/))# Code, Data and Media Associated with this Article\n\nalphaXiv\n\n*(*[What is alphaXiv?](https://alphaxiv.org/))\nCatalyzeX Code Finder for Papers\n\n*(*[What is CatalyzeX?](https://www.catalyzex.com))\nDagsHub\n\n*(*[What is DagsHub?](https://dagshub.com/))\nGotit.pub\n\n*(*[What is GotitPub?](http://gotit.pub/faq))\nHugging Face\n\n*(*[What is Huggingface?](https://huggingface.co/huggingface))\nScienceCast\n\n*(*[What is ScienceCast?](https://sciencecast.org/welcome))# Demos\n\n# Recommenders and Search Tools\n\nInfluence Flower\n\n*(*[What are Influence Flowers?](https://influencemap.cmlab.dev/))\nCORE Recommender\n\n*(*[What is CORE?](https://core.ac.uk/services/recommender))# arXivLabs: experimental projects with community collaborators\n\narXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.\n\nBoth individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.\n\nHave an idea for a project that will add value for arXiv's community? [ Learn more about arXivLabs](https://info.arxiv.org/labs/index.html).", "url": "https://wpnews.pro/news/world-action-models-a-survey", "canonical_source": "https://arxiv.org/abs/2606.20781", "published_at": "2026-06-24 23:41:19+00:00", "updated_at": "2026-06-25 00:13:56.889758+00:00", "lang": "en", "topics": ["artificial-intelligence", "machine-learning", "robotics", "computer-vision", "ai-research"], "entities": ["World Action Models", "WAMs"], "alternates": {"html": "https://wpnews.pro/news/world-action-models-a-survey", "markdown": "https://wpnews.pro/news/world-action-models-a-survey.md", "text": "https://wpnews.pro/news/world-action-models-a-survey.txt", "jsonld": "https://wpnews.pro/news/world-action-models-a-survey.jsonld"}}