Efficient Agent Training for Computer Use

🏛 Institutions: SJTU, SII, Generative AI Research Lab (GAIR)
📅 Date: May 20, 2025
📑 Publisher: ICLR 2026 (Poster)
💻 Env: Desktop
🔑 Keywords: model, dataset, benchmark, trajectory augmentation, WindowsAgentArena-V2, PC Agent-E

TLDR

This paper studies data-efficient training for desktop computer-use agents. Starting from only 312 human trajectories, the authors augment the data with diverse action decisions sampled from Claude 3.7 Sonnet. The resulting model, PC Agent-E, substantially outperforms its base model and surpasses Claude 3.7 Sonnet on WindowsAgentArena-V2. The authors also release the improved benchmark alongside the training recipe.
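
The augmentation idea can be illustrated with a minimal sketch. It is not the paper's implementation: the names `Step`, `TrainingExample`, `augment_trajectory`, and `toy_propose` are hypothetical, and the `propose` callback is a stand-in for sampling an action decision from a model such as Claude 3.7 Sonnet. The sketch assumes that each sampled alternative is conditioned on the human action history up to that step, and that duplicates are dropped.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Step:
    observation: str  # e.g. a screenshot reference or a textual state description
    action: str       # the action the human took at this step


@dataclass(frozen=True)
class TrainingExample:
    task: str
    history: Tuple[str, ...]  # human actions taken before this step
    observation: str
    action: str               # target action decision (human or sampled)


# Hypothetical signature: (task, history, observation, sample_index) -> action
Proposer = Callable[[str, Tuple[str, ...], str, int], str]


def augment_trajectory(task: str, steps: List[Step],
                       propose: Proposer, k: int = 3) -> List[TrainingExample]:
    """Expand one human trajectory with up to k alternative actions per step."""
    examples: List[TrainingExample] = []
    history: List[str] = []
    for step in steps:
        candidates = [step.action]  # always keep the human action
        for i in range(k):
            alt = propose(task, tuple(history), step.observation, i)
            if alt not in candidates:  # drop duplicate decisions
                candidates.append(alt)
        for action in candidates:
            examples.append(
                TrainingExample(task, tuple(history), step.observation, action)
            )
        history.append(step.action)  # the history follows the human path only
    return examples


def toy_propose(task: str, history: Tuple[str, ...],
                observation: str, i: int) -> str:
    """Deterministic stand-in for a model call (assumption, for illustration)."""
    options = ["click(Start)", "press(Win)", "click(Start)"]
    return options[i % len(options)]


steps = [Step("desktop", "click(Start)"), Step("start menu", "type(notepad)")]
examples = augment_trajectory("Open Notepad", steps, toy_propose, k=3)
```

In this toy run the first step yields two distinct action decisions and the second yields three, so two human steps expand into five training examples.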
