GUI Agents Papers
Star · 821

A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility

Andrea Burns , Deniz Arsan , Sanjna Agrawal , Ranjitha Kumar , Kate Saenko , Bryan A. Plummer

🏛 Institutions
Boston University , UIUC , MIT-IBM Watson AI Lab
📅 Date
February 4, 2022
📑 Publisher
ECCV 2022
💻 Env
Mobile
🔑 Keywords
TLDR

This paper introduces MoTIF, a mobile-app navigation dataset where commands may be infeasible or ambiguous in the current UI state. In addition to action demonstrations, it adds feasibility labels and follow-up questions, making it a benchmark for both navigation and uncertainty resolution.

Open paper Report issue
Related papers (24)