MemGPT: Towards LLMs as Operating Systems

Charles Packer (UC Berkeley), Sarah Wooders (UC Berkeley), Kevin Lin (UC Berkeley)

Abstract

We present MemGPT, a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window. MemGPT knows when to push critical information to a vector database and when to retrieve it.

Eigenvector Insight — Zone III / PASF-PADE AnalysisNot part of the original paper

Eigenvector Research — Marco van Hurne

How this paper contributes to solving the Zone III problem (PASF-PADE)

MemGPT is the most directly applicable paper for enterprise long-horizon agents. The OS memory management metaphor is exactly right: agents need virtual memory, not just a context window. Every enterprise deployment of a long-running agent should implement some variant of this architecture.

Why AI is not sufficient for Zone III without this

Zone III refers to high-complexity, high-risk, long-running agentic workflows — the class of enterprise AI deployments where a single failure can cascade across hundreds of steps. Standard AI models, trained to predict the next token, are not inherently designed for durable, governed, multi-step execution. This paper addresses one or more of the structural gaps that make Zone III deployments unsafe without explicit architectural intervention.

Key Contributions

→Hierarchical memory management for LLMs
→Self-directed memory paging
→Persistent agent state across sessions

Topics

memory managementcontext windowlong-horizon agentspersistent memory

Relevance Scores

Long-Horizon Score95

Enterprise Score88

Completeness91

Paper Info

Year2023

VenuearXiv

Typesystem architecture

ChapterCh. 2

Authors3

Zone III Analysis

Frameworks

PASF PADE GRAF