Chapter 7 · 2024
Towards Automated Evaluation of LLM-based Multi-turn Dialogue Systems
Zekun Li, Wenhu Chen, Shiyang Li
Abstract
We propose an automated evaluation framework for multi-turn dialogue systems that assesses coherence, consistency, and task completion across extended conversations.
Topics
dialogue evaluationmulti-turncoherenceautomated testing
Relevance Scores
Long-Horizon Score82
Enterprise Score79
Completeness80
Paper Info
Year2024
Venue
Type
ChapterCh. 7
Authors3
Zone III Analysis
Related Papers
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Generative Agents: Interactive Simulacra of Human Behav…
2023 · Ch.2
MemGPT: Towards LLMs as Operating Systems
2023 · Ch.2
View all Chapter 7 papers →