GUI Agents Papers
Star · 751

Improved GUI Grounding via Iterative Narrowing

Anthony Nguyen

🏛 Institutions
Algoma University
📅 Date
November 18, 2024
📑 Publisher
arXiv
💻 Env
Desktop Mobile Web
🔑 Keywords
TLDR

Iterative Narrowing is a visual-prompting framework for GUI grounding that repeatedly zooms into smaller image regions to refine predictions. The paper shows that this simple test-time strategy improves both general and fine-tuned VLMs on one-shot grounding across multiple UI platforms.

Open paper arXiv Edit on GitHub Report issue
Related papers