system architectureChapter 2arXiv · 2023
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Albert Gu (CMU), Tri Dao (Princeton)
Abstract
We present Mamba, a selective state space model that achieves linear-time sequence modeling. Mamba selectively propagates or forgets information based on content, enabling efficient long-sequence processing.
Key Contributions
- →Selective state space model
- →Linear-time sequence modeling
- →Content-based information selection
Topics
state space modelsefficient attentionlong sequencesarchitecture
Relevance Scores
Long-Horizon Score85
Enterprise Score78
Completeness74
Paper Info
Year2023
VenuearXiv
Typesystem architecture
ChapterCh. 2
Authors2
Zone III Analysis
Frameworks
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 2 papers →