IntentScore: Intent-Conditioned Action Evaluation for Computer-Use Agents

Rongqian Chen , Yu Li , Zeyu Fang , Sizhe Tang , Weidong Cao , Tian Lan

🏛 Institutions: George Washington University
📅 Date: April 6, 2026
📑 Publisher: arXiv
💻 Env: Desktop
🔑 Keywords: reward model model plan-aware reward contrastive alignment margin ranking OSWorld IntentScore

TLDR

IntentScore is a plan-aware reward model for computer-use agents trained from 398K offline GUI interaction steps across three OSes, using contrastive alignment and margin ranking objectives. It achieves 97.5% pairwise discrimination and, when used as a re-ranker for Agent S3 on OSWorld, improves task success rate by 6.9 points.

Open paper arXiv Report issue