R-WoM: Retrieval-augmented World Model For Computer-use Agents

Kai Mei , Jiang Guo , Shuaichen Chang , Mingwen Dong , Dongkyu Lee , Xing Niu , Jiarong Jiang

🏛 Institutions: Rutgers University , AWS Agentic AI
📅 Date: October 13, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: General GUI
🔑 Keywords: world model tutorial retrieval future state prediction OSWorld WebArena R-WoM

TLDR

This paper tests whether LLMs can act as world models for computer-use agents and finds that simulation quality degrades sharply on full-procedure planning even when short-range prediction remains reasonable. R-WoM addresses this by grounding simulated rollouts with retrieved up-to-date tutorials, improving performance on OSWorld and WebArena, especially on longer-horizon tasks.

Open paper arXiv Report issue