HomeResearch LibraryRetroformer: Retrospective Large Language Agents with P…
system architectureChapter 5ICLR 2024 · 2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimisation

Weiran Yao (Salesforce Research), Shelby Heinecke (Salesforce Research)

Abstract

We present Retroformer, a framework for improving language agents through retrospective policy gradient optimization. Retroformer learns from past trajectories to improve future performance without manual reward engineering.

Key Contributions

  • Retrospective policy gradient for agents
  • Trajectory-based learning
  • Automated reward signal

Topics

policy gradientretrospective learningagent improvementRL