Retroformer: Retrospective Large Language Agents with Policy Gradient Optimisation

Weiran Yao (Salesforce Research), Shelby Heinecke (Salesforce Research)

Abstract

We present Retroformer, a framework for improving language agents through retrospective policy gradient optimization. Retroformer learns from past trajectories to improve future performance without manual reward engineering.

Key Contributions

→Retrospective policy gradient for agents
→Trajectory-based learning
→Automated reward signal

Topics

policy gradientretrospective learningagent improvementRL

Relevance Scores

Long-Horizon Score85

Enterprise Score74

Completeness74

Paper Info

Year2023

VenueICLR 2024

Typesystem architecture

ChapterCh. 5

Authors2

Frameworks

PASF AEGIS