GUI Agents Papers

Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning

Bryan Wang, Gang Li, Xin Zhou, Zhourong Chen, Tovi Grossman, Yang Li

🏛 Institutions
University of Toronto
📅 Date
August 6, 2021
📑 Publisher
UIST 2021
💻 Env
Mobile
🔑 Keywords
TLDR

Screen2Words introduces a mobile UI summarization task with over 112k human-written summaries covering 22,417 Android screens. It studies how to generate concise screen-level descriptions from the multimodal content of a UI rather than only captioning individual elements.
