benchmarkChapter 1ICLR 2024 · 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou (CMU), Frank F. Xu (CMU)
Abstract
We present WebArena, a standalone, self-hostable web environment for building autonomous agents. WebArena includes realistic web applications with functional tools, user interfaces, and data.
Key Contributions
- →Realistic web benchmark for agents
- →Self-hostable evaluation environment
- →Functional web application simulation
Topics
web agentsbenchmarkrealistic environmentautonomous agents
Relevance Scores
Long-Horizon Score88
Enterprise Score82
Completeness82
Paper Info
Year2023
VenueICLR 2024
Typebenchmark
ChapterCh. 1
Authors2
Frameworks
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 1 papers →