HATS: Hardness-Aware Trajectory Synthesis for GUI Agents

Rui Shao, Ruize Gao, Bin Xie, Yixing Li, Kaiwen Zhou, Shuai Wang, Weili Guan, Gongwei Chen

🏛 Institutions: HIT-Shenzhen, NUS, CNRS@CREATE, Shenzhen Loop Area Institute, Huawei Noah's Ark Lab
📅 Date: March 12, 2026
📑 Publisher: CVPR 2026
💻 Env: Web, Mobile
🔑 Keywords: trajectory synthesis, semantic ambiguity, hardness-aware exploration, alignment-guided refinement, WebArena, AndroidWorld

TLDR

HATS synthesizes GUI-agent training trajectories by modeling action hardness as semantic ambiguity and combining hardness-driven exploration with alignment-guided refinement, improving data quality and downstream performance on both WebArena and AndroidWorld.

Related papers (24)

WebWorld: A Large-Scale World Model for Web Agent Training
February 16, 2026 · arXiv

SE-GA: Memory-Augmented Self-Evolution for GUI Agents
May 16, 2026 · arXiv

The Tool Illusion: Rethinking Tool Use in Web Agents
April 3, 2026 · arXiv

When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation
April 1, 2026 · arXiv

WebArena-Infinity: Generating Browser Environments with Verifiable Tasks at Scale
March 2026 · Blog Post

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
March 25, 2026 · arXiv

AI Planning Framework for LLM-Based Web Agents
March 13, 2026 · arXiv

OpAgent: Operator Agent for Web Navigation
February 14, 2026 · arXiv

Adaptive Milestone Reward for GUI Agents
February 12, 2026 · arXiv

DynaWeb: Model-Based Reinforcement Learning of Web Agents
January 29, 2026 · arXiv

ColorBrowserAgent: Complex Long-Horizon Browser Agent with Adaptive Knowledge Evolution
January 12, 2026 · arXiv

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
December 14, 2025 · arXiv

WALT: Web Agents that Learn Tools
October 1, 2025 · ICLR 2026 (Poster)

MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents
September 10, 2025 · ICLR 2026 (Poster)

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
September 1, 2025 · NeurIPS 2025 (Poster)

Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
July 2025 · Findings of ACL 2025

Go-Browse: Training Web Agents with Structured Exploration
June 4, 2025 · ICLR 2026 (Poster)

Inducing Programmatic Skills for Agentic Tasks
April 9, 2025 · COLM 2025

AppVLM: A Lightweight Vision Language Model for Online App Control
February 10, 2025 · arXiv

Beyond Browsing: API-Based Web Agents
October 24, 2024 · Findings of ACL 2025

Large Language Models Can Self-Improve At Web Agent Tasks
May 30, 2024 · arXiv

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
May 23, 2024 · ICLR 2025 (Poster)

SteP: Stacked LLM Policies for Web Actions
October 5, 2023 · COLM 2024

WebArena: A Realistic Web Environment for Building Autonomous Agents
July 25, 2023 · ICLR 2024 (Poster)