HomeResearch LibraryFrom RAG to Agentic RAG for Faithful Islamic Question A…
Chapter 4 · 2026

From RAG to Agentic RAG for Faithful Islamic Question Answering

Gagan Bhatia, Hamdy Mubarak, Mustafa Jarrar

Abstract

LLMs are increasingly used for Islamic question answering, where ungrounded responses may carry serious religious consequences. Yet standard MCQ/MRC-style evaluations do not capture key real-world failure modes, notably free-form hallucinations and whether models appropriately abstain when evidence is lacking. To shed a light on this aspect we introduce ISLAMICFAITHQA, a 3,810-item bilingual (Arabic/English) generative benchmark with atomic single-gold answers, which enables direct measurement of hallucination and abstention.

Eigenvector Insight — Zone III / PASF-PADE AnalysisNot part of the original paper
Eigenvector Research — Marco van Hurne
How this paper contributes to solving the Zone III problem (PASF-PADE)

This paper contributes useful building blocks for Zone III architecture through its work on Agentic RAG, Islamic Question Answering, LLMs. While not exclusively focused on enterprise deployment, the insights translate directly to the challenges of long-horizon agentic workflows. The key lesson for Zone III practitioners: the problems identified here do not disappear at scale — they compound. Understanding them at the research level is prerequisite to solving them in production.

Why AI is not sufficient for Zone III without this

Zone III refers to high-complexity, high-risk, long-running agentic workflows — the class of enterprise AI deployments where a single failure can cascade across hundreds of steps. Standard AI models, trained to predict the next token, are not inherently designed for durable, governed, multi-step execution. This paper addresses one or more of the structural gaps that make Zone III deployments unsafe without explicit architectural intervention.

Topics

Agentic RAGIslamic Question AnsweringLLMsHallucinationBenchmarking