Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 69
OpenCUA: Open Foundations for Computer-Use Agents Collection This is the official versions of OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 7 items • Updated 7 days ago • 14
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization Paper • 2310.16427 • Published Oct 25, 2023 • 2
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis Paper • 2505.13227 • Published May 19 • 46