GUI Agents Papers
Star · 821

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Mingyu Ouyang , Siyuan Hu , Kevin Qinghong Lin , Hwee Tou Ng , Mike Zheng Shou

🏛 Institutions
NUS , Oxford
📅 Date
April 8, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

GameWorld is a standardized browser-based benchmark for multimodal game agents with 34 games and 170 tasks. Two game agent interfaces are studied: (i) computer-use agents that directly emit keyboard and mouse controls, and (ii) generalist multimodal agents that act in a semantic action space.

Open paper arXiv Report issue
Related papers (24)