GUI Agents Papers
Star · 821

Programming with Pixels: Can Computer-Use Agents do Software Engineering?

Pranjal Aggarwal , Sean Welleck

🏛 Institutions
CMU
📅 Date
February 24, 2025
📑 Publisher
ICLR 2026 (Poster)
💻 Env
Desktop
🔑 Keywords
TLDR

This paper introduces Programming with Pixels, a visual IDE environment for evaluating whether generalist computer-use agents can handle software engineering tasks rather than only simple desktop or web interactions. It also presents PwP-Bench, a benchmark spanning 15 software-engineering tasks across languages and modalities. The results show that purely visual computer-use agents lag behind specialist coding agents, but narrow text APIs such as file editing and bash dramatically narrow that gap.

Open paper arXiv Report issue
Related papers (24)