GUI Agents Papers
Star · 751

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Mingyu Ouyang, Siyuan Hu, Kevin Qinghong Lin, Hwee Tou Ng, Mike Zheng Shou

🏛 Institutions
NUS, Oxford
📅 Date
April 8, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

GameWorld is a standardized browser-based benchmark for multimodal game agents with 34 games and 170 tasks. Two game agent interfaces are studied: (i) computer-use agents that directly emit keyboard and mouse controls, and (ii) generalist multimodal agents that act in a semantic action space.

Open paper arXiv Edit on GitHub Report issue
Related papers