empirical studyChapter 5arXiv · 2024
Scaling LLM Test-Time Compute Optimally
Charlie Snell (UC Berkeley), Jaehoon Lee (Google DeepMind)
Abstract
We study how to optimally scale test-time compute for LLMs. We find that the optimal allocation of test-time compute depends on the difficulty of the problem and the capabilities of the model.
Key Contributions
- →Optimal test-time compute allocation
- →Difficulty-adaptive compute scaling
- →Inference-time improvement methods
Topics
test-time computeinference scalingcompute optimizationreasoning
Relevance Scores
Long-Horizon Score83
Enterprise Score77
Completeness76
Paper Info
Year2024
VenuearXiv
Typeempirical study
ChapterCh. 5
Authors2
Zone III Analysis
Related Papers
ReAct: Synergizing Reasoning and Acting in Language Mod…
2023 · Ch.1
Reflexion: Language Agents with Verbal Reinforcement Le…
2023 · Ch.1
Tree of Thoughts: Deliberate Problem Solving with Large…
2023 · Ch.1
Toolformer: Language Models Can Teach Themselves to Use…
2023 · Ch.1
View all Chapter 5 papers →