HomeResearch LibraryWebArena: A Realistic Web Environment for Building Auto…
benchmarkChapter 1ICLR 2024 · 2023

WebArena: A Realistic Web Environment for Building Autonomous Agents

Shuyan Zhou (CMU), Frank F. Xu (CMU)

Abstract

We present WebArena, a standalone, self-hostable web environment for building autonomous agents. WebArena includes realistic web applications with functional tools, user interfaces, and data.

Key Contributions

  • Realistic web benchmark for agents
  • Self-hostable evaluation environment
  • Functional web application simulation

Topics

web agentsbenchmarkrealistic environmentautonomous agents