GUI Agents Papers
Star · 751

R-WoM: Retrieval-augmented World Model For Computer-use Agents

Kai Mei, Jiang Guo, Shuaichen Chang, Mingwen Dong, Dongkyu Lee, Xing Niu, Jiarong Jiang

🏛 Institutions
Rutgers University, AWS Agentic AI
📅 Date
October 13, 2025
📑 Publisher
ICLR 2026 (Poster)
💻 Env
General GUI
🔑 Keywords
TLDR

This paper tests whether LLMs can act as world models for computer-use agents and finds that simulation quality degrades sharply on full-procedure planning even when short-range prediction remains reasonable. R-WoM addresses this by grounding simulated rollouts with retrieved up-to-date tutorials, improving performance on OSWorld and WebArena, especially on longer-horizon tasks.

Open paper arXiv Edit on GitHub Report issue
Related papers