GUI Agents Papers
Star · 751

Programming with Pixels: Can Computer-Use Agents do Software Engineering?

Pranjal Aggarwal, Sean Welleck

🏛 Institutions
CMU
📅 Date
February 24, 2025
📑 Publisher
ICLR 2026 (Poster)
💻 Env
Desktop
🔑 Keywords
TLDR

This paper introduces Programming with Pixels, a visual IDE environment for evaluating whether generalist computer-use agents can handle software engineering tasks rather than only simple desktop or web interactions. It also presents PwP-Bench, a benchmark spanning 15 software-engineering tasks across languages and modalities. The results show that purely visual computer-use agents lag behind specialist coding agents, but narrow text APIs such as file editing and bash dramatically narrow that gap.

Open paper arXiv Edit on GitHub Report issue
Related papers