system architectureChapter 2arXiv · 2023
MemGPT: Towards LLMs as Operating Systems
Charles Packer (UC Berkeley), Sarah Wooders (UC Berkeley), Kevin Lin (UC Berkeley)
Abstract
We present MemGPT, a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window. MemGPT knows when to push critical information to a vector database and when to retrieve it.
Key Contributions
- →Hierarchical memory management for LLMs
- →Self-directed memory paging
- →Persistent agent state across sessions
Eigenvector Commentary
MemGPT is the most directly applicable paper for enterprise long-horizon agents. The OS memory management metaphor is exactly right: agents need virtual memory, not just a context window. Every enterprise deployment of a long-running agent should implement some variant of this architecture.
Topics
memory managementcontext windowlong-horizon agentspersistent memory
Relevance Scores
Long-Horizon Score95
Enterprise Score88
Completeness91
Paper Info
Year2023
VenuearXiv
Typesystem architecture
ChapterCh. 2
Authors3
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 2 papers →