- Executable Code Actions Elicit Better LLM Agents
  Paper • 2402.01030 • Published • 164
- ReAct: Synergizing Reasoning and Acting in Language Models
  Paper • 2210.03629 • Published • 27
- SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
  Paper • 2405.15793 • Published • 7
- DevBench: A Comprehensive Benchmark for Software Development
  Paper • 2403.08604 • Published • 2