Chapter 8 · 2026
Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention
Rakshith Vasudev, Melisa Russak, Dan Bikel
Abstract
This paper investigates the effectiveness of LLM critic models in improving agent reliability. It demonstrates that while LLM critics can have high offline accuracy, their interventions at deployment time can lead to severe performance degradation. The study identifies a disruption-recovery tradeoff and proposes a pre-deployment test to estimate whether intervention is likely to help or harm, preventing regressions before deployment.
Topics
LLM agentsfailure predictionfailure preventionreliabilitycritic modelsdeploymentperformance degradation
Relevance Scores
Long-Horizon Score65
Enterprise Score60
Completeness75
Paper Info
Year2026
Venue
Type
ChapterCh. 8
Authors3
Zone III Analysis
Related Papers
AgentBench: Evaluating LLMs as Agents
2023 · Ch.1
A Survey on Large Language Model based Autonomous Agent…
2023 · Ch.1
Semantic Uncertainty: Linguistic Invariances for Uncert…
2023 · Ch.3
Agent Drift: Semantic Degradation in Long-Running Auton…
2024 · Ch.1
View all Chapter 8 papers →