system architectureChapter 5Apple ML Research · 2025
Reinforced Agent Inference Feedback
Apple ML Research Team (Apple)
Abstract
We present a method for improving tool-calling agents at inference time through a reviewer agent that evaluates tool calls before execution. The reviewer provides feedback that allows the primary agent to correct its tool calls without retraining.
Key Contributions
- →Reviewer agent for tool call validation
- →Inference-time correction without retraining
- →Reduced tool call errors in production
Topics
inference-time feedbacktool useself-correctionreviewer agent
Relevance Scores
Long-Horizon Score91
Enterprise Score89
Completeness87
Paper Info
Year2025
VenueApple ML Research
Typesystem architecture
ChapterCh. 5
Authors1
Zone III Analysis
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 5 papers →