AI · Neutral · arXiv – CS AI · 10h ago · 6/10
The Token Tax of Epistemic Accuracy: Comparing RAG and Long-Context Architectures for Document-Grounded Generative AI Applications
Researchers compare retrieval-augmented generation (RAG) with long-context prompting for document-grounded AI applications and find that long-context prompting is more accurate (73.1% vs. 65.4%) but at a 26x higher token cost. The study frames the result as a trade-off frontier between 'epistemic accuracy' and computational expense, with significant implications for resource-constrained organizations.
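
To make the reported trade-off concrete, here is a minimal back-of-the-envelope sketch using only the three figures quoted above. It is not the study's own methodology. It assumes "26x" means long-context prompting consumes 26 times the tokens RAG does per query, and it normalizes RAG's per-query token cost to 1 unit. The function name `tradeoff` and the derived metrics are illustrative, not taken from the paper.

```python
def tradeoff(acc_rag: float, acc_long: float, token_ratio: float) -> dict:
    """Derive simple cost/accuracy figures from two accuracies and a token ratio.

    Assumption: RAG costs 1 token unit per query and long-context costs
    `token_ratio` units per query (hypothetical normalization).
    """
    # Absolute accuracy gain of long-context over RAG, in percentage points.
    gain_pp = (acc_long - acc_rag) * 100

    # Expected token units spent per correct answer under each approach.
    per_correct_rag = 1.0 / acc_rag
    per_correct_long = token_ratio / acc_long

    return {
        "gain_pp": gain_pp,
        # How many times more tokens long-context spends per correct answer.
        "cost_per_correct_ratio": per_correct_long / per_correct_rag,
        # Extra token units (relative to one RAG query) per point gained.
        "extra_units_per_pp": (token_ratio - 1.0) / gain_pp,
    }


# Figures quoted in the summary: 65.4% (RAG), 73.1% (long-context), 26x tokens.
result = tradeoff(0.654, 0.731, 26.0)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the accuracy gain is about 7.7 percentage points, while long-context prompting spends roughly 23 times as many tokens per correct answer as RAG.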