Chapter 7 · 2025
A Survey on Evaluation of LLM-based Agents
Yipeng Li, Mahmoud Mohammadi, Jane Lo
Abstract
This survey provides a comprehensive overview of the evaluation methodologies for LLM-based agents. It categorizes existing approaches, discusses common challenges, and highlights key metrics used to assess agent performance, reliability, and safety. The paper aims to bring clarity to the fragmented landscape of LLM agent evaluation and identify future research directions.
Topics
LLM-based agentsevaluationsurveybenchmarkingmetrics
Relevance Scores
Long-Horizon Score85
Enterprise Score80
Completeness75
Paper Info
Year2025
Venue
Type
ChapterCh. 7
Authors3
Zone III Analysis
Related Papers
A Survey on Large Language Model based Autonomous Agent…
2023 · Ch.1
The Landscape of Emerging AI Agent Frameworks
2024 · Ch.1
Towards Long-Horizon Planning with LLMs: A Survey
2024 · Ch.2
Attention Is All You Need
2017 · Ch.7
View all Chapter 7 papers →