GUI Agents Papers
Star · 821

OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards

Zehao Li , Zhenyu Wu , Yibo Zhao , Bowen Yang , Jingjing Xie , Zhaoyang Liu , Zhoumianze Liu , Kaiming Jin , Jianze Liang , Zonglin Li , Feng Wu , Bowen Zhou , Zun Wang , Zichen Ding

🏛 Institutions
USTC , Shanghai AI Laboratory , CUHK MMLab , HKUST , NUS
📅 Date
March 19, 2026
📑 Publisher
arXiv
💻 Env
General GUI
🔑 Keywords
TLDR

OS-Themis is a scalable critic framework for GUI reward modeling that breaks trajectories into verifiable milestones and audits the evidence chain before issuing a verdict. It improves AndroidWorld training and filtering loops and introduces OmniGUIRewardBench as a cross-platform benchmark for GUI outcome rewards.

Open paper arXiv Report issue
Related papers (24)