GUI Agents Papers
Star · 751

ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents

Jakub Hoscilowicz, Bartosz Maj, Bartosz Kozakiewicz, Oleksii Tymoshchuk, Artur Janicki

🏛 Institutions
Samsung R&D Poland, Warsaw University of Technology
📅 Date
October 9, 2024
📑 Publisher
SIGDIAL 2025
💻 Env
Mobile
🔑 Keywords
TLDR

Proposes ClickAgent, a mobile agent framework that separates high-level reasoning from precise UI element localization. By pairing an MLLM planner with a dedicated grounding component, it improves task success on AITW and on real-device Android evaluations.

Open paper Edit on GitHub Report issue
Related papers