GUI Agents Papers
Star · 821

ClickAgent: Enhancing UI Location Capabilities of Autonomous Agents

Jakub Hoscilowicz , Bartosz Maj , Bartosz Kozakiewicz , Oleksii Tymoshchuk , Artur Janicki

🏛 Institutions
Samsung R&D Poland , Warsaw University of Technology
📅 Date
October 9, 2024
📑 Publisher
SIGDIAL 2025
💻 Env
Mobile
🔑 Keywords
TLDR

Proposes ClickAgent, a mobile agent framework that separates high-level reasoning from precise UI element localization. By pairing an MLLM planner with a dedicated grounding component, it improves task success on AITW and on real-device Android evaluations.

Open paper Report issue
Related papers (24)