GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Rui Yang, Qianhui Wu, Zhaoyang Wang, Hanyang Chen, Ke Yang, Hao Cheng, Huaxiu Yao, Baoling Peng, Huan Zhang, Jianfeng Gao, Tong Zhang

🏛 Institutions: UIUC, Microsoft, UNC
📅 Date: February 25, 2026
📑 Publisher: arXiv
💻 Env: General GUI
🔑 Keywords: post-training, reinforcement learning, action-aware supervision, partial verifiability, GUI reasoning dataset, GUI-Libra

TLDR

GUI-Libra is a post-training recipe for native GUI agents that combines curated reasoning data, action-aware supervised fine-tuning, and partially verifiable RL. It targets the mismatch between chain-of-thought reasoning and grounding, and improves both step-level accuracy and end-to-end task completion on web and mobile benchmarks.
