GUI Agents Papers
Star · 809

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification

Zehai He , Wenyi Hong , Zhen Yang , Ziyang Pan , Mingdao Liu , Xiaotao Gu , Jie Tang

🏛 Institutions
Tsinghua , Zhipu
📅 Date
March 27, 2026
📑 Publisher
arXiv
💻 Env
Web
🔑 Keywords
TLDR

Vision2Web is a hierarchical benchmark for visual website development that spans static UI-to-code, interactive frontend reproduction, and full-stack website construction. It evaluates coding agents with workflow-based verification using a GUI agent verifier and a VLM judge, and shows that current models still struggle badly on full-stack tasks.

Open paper arXiv Report issue
Related papers (24)