system architectureChapter 5ICLR 2024 · 2023
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimisation
Weiran Yao (Salesforce Research), Shelby Heinecke (Salesforce Research)
Abstract
We present Retroformer, a framework for improving language agents through retrospective policy gradient optimization. Retroformer learns from past trajectories to improve future performance without manual reward engineering.
Key Contributions
- →Retrospective policy gradient for agents
- →Trajectory-based learning
- →Automated reward signal
Topics
policy gradientretrospective learningagent improvementRL
Relevance Scores
Long-Horizon Score85
Enterprise Score74
Completeness74
Paper Info
Year2023
VenueICLR 2024
Typesystem architecture
ChapterCh. 5
Authors2
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 5 papers →