HomeResearch LibraryAdaptive Retrieval-Augmented Generation for Conversatio…
system architectureChapter 2arXiv · 2023

Adaptive Retrieval-Augmented Generation for Conversational Systems

Weizhi Wang (UCSB), Li Dong (Microsoft Research)

Abstract

We present FLARE, an active retrieval augmented generation method that adaptively decides when and what to retrieve during generation. FLARE uses upcoming sentence prediction to trigger retrieval.

Eigenvector Insight — Zone III / PASF-PADE AnalysisNot part of the original paper
Eigenvector Research — Marco van Hurne
How this paper contributes to solving the Zone III problem (PASF-PADE)

FLARE's adaptive retrieval approach is essential for Zone III agents that need to maintain knowledge currency over long workflows. Rather than retrieving everything upfront, adaptive retrieval ensures agents access the right knowledge at the right time.

Why AI is not sufficient for Zone III without this

Zone III refers to high-complexity, high-risk, long-running agentic workflows — the class of enterprise AI deployments where a single failure can cascade across hundreds of steps. Standard AI models, trained to predict the next token, are not inherently designed for durable, governed, multi-step execution. This paper addresses one or more of the structural gaps that make Zone III deployments unsafe without explicit architectural intervention.

Key Contributions

  • Active retrieval triggering
  • Upcoming sentence prediction
  • Adaptive knowledge access

Topics

RAGadaptive retrievalconversational AIknowledge grounding