system architecture · Chapter 7 · NeurIPS 2017 · 2017
Attention Is All You Need
Ashish Vaswani (Google Brain), Noam Shazeer (Google Brain)
Abstract
We propose the Transformer, a model architecture based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. The Transformer achieves state-of-the-art results on machine translation tasks.
Key Contributions
- Transformer architecture
- Self-attention mechanism
- Multi-head attention
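As a rough illustration of the self-attention contribution listed above, the following minimal sketch computes scaled dot-product attention, softmax(Q K^T / sqrt(d_k)) V, in plain Python. The function names and the toy `tokens` sequence are assumptions made for this example. It leaves out the learned projection matrices, masking, and the multiple parallel heads that a full Transformer layer uses, so it should be read as a sketch of the idea rather than as the authors' implementation.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for softmax(Q K^T / sqrt(d_k)) V.

    queries, keys, and values are lists of equal-length float vectors.
    (Hypothetical helper for illustration; not taken from the source page.)
    """
    d_k = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        w = softmax(scores)
        weights.append(w)
        # Each output is a weighted average of the value vectors.
        outputs.append([
            sum(wj * v[c] for wj, v in zip(w, values))
            for c in range(len(values[0]))
        ])
    return outputs, weights


# Self-attention: the same toy sequence acts as queries, keys, and values.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = scaled_dot_product_attention(tokens, tokens, tokens)
```

Each row of `weights` sums to 1, so every output vector is a convex combination of the input vectors. Multi-head attention would, roughly, run several such computations in parallel on different learned projections of the inputs and concatenate the results.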
Topics
transformer, attention mechanism, neural architecture, foundational
Relevance Scores
Long-Horizon Score: 70
Enterprise Score: 75
Completeness: 75
Paper Info
Year: 2017
Venue: NeurIPS 2017
Type: system architecture
Chapter: Ch. 7
Authors: 2
Zone III Analysis
Frameworks
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1