GUI Agents Papers
Star · 821

GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning

Haolong Yan , Yeqing Shen , Xin Huang , Jia Wang , Kaijun Tan , Zhixuan Liang , Hongxin Li , Zheng Ge , Osamu Yoshie , Si Li , Xiangyu Zhang , Daxin Jiang

🏛 Institutions
Beijing University of Posts and Telecommunications , StepFun , Waseda University , Institute of Automation , CAS
📅 Date
December 2, 2025
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

GUI Exploration Lab is a simulation engine for studying screen navigation, exposing full screen and navigation-graph structure so agents can be trained and evaluated without proprietary GUI environments. The paper compares supervised fine-tuning, single-turn RL, and multi-turn RL, and finds that multi-turn RL is what most clearly induces exploratory navigation behavior.

Open paper arXiv Report issue
Related papers (24)