#rag News & Analysis

95 articles tagged with #rag. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

95 articles

AIBullisharXiv – CS AI · Feb 277/108

🧠

RAGdb: A Zero-Dependency, Embeddable Architecture for Multimodal Retrieval-Augmented Generation on the Edge

Researchers introduce RAGdb, a revolutionary architecture that consolidates Retrieval-Augmented Generation into a single SQLite container, eliminating the need for cloud infrastructure and GPUs. The system achieves 100% entity retrieval accuracy while reducing disk footprint by 99.5% compared to traditional Docker-based RAG stacks, enabling truly portable AI applications for edge computing and privacy-sensitive environments.

AINeutralarXiv – CS AI · Jun 236/10

🧠

The Token Tax of Epistemic Accuracy: Comparing RAG and Long-Context Architectures for Document-Grounded Generative AI Applications

Researchers compare retrieval-augmented generation (RAG) versus long-context prompting for document-grounded AI applications, finding that while long-context achieves higher accuracy (73.1% vs 65.4%), it incurs a 26x higher token cost. The study frames this trade-off as an 'epistemic accuracy' versus computational expense frontier, with significant implications for resource-constrained organizations.

AIBullisharXiv – CS AI · Jun 236/10

🧠

Context-Aware Generative AI for Automated Telecom Test Script Generation

Researchers present a context-aware generative AI framework for automated telecom test script generation that continuously adapts to live system changes rather than relying on static test suites. The system uses a knowledge graph, delta-detection engine, and RAG-enhanced AI agent to automatically create, update, or retire test cases as code, configurations, and KPIs evolve, significantly reducing manual testing effort.

AINeutralarXiv – CS AI · Jun 196/10

🧠

AI Economist Agent: An Agentic Framework for Model-Grounded Economic Analysis with RAG, Knowledge Graphs, and Large Language Models

Researchers propose an AI economist agent that combines large language models with knowledge graphs and retrieval-augmented generation (RAG) to produce grounded economic analyses. Rather than relying solely on LLM-generated narratives, the framework grounds economic claims in explicit model-based computations and retrieved evidence, tested on inflation analysis and bank stress-testing scenarios.

AINeutralarXiv – CS AI · Jun 96/10

🧠

A Multi-Agent System for IPMSM Design Optimization via an FEA-AI Hybrid Approach

Researchers propose an automated multi-agent AI system for optimizing Interior Permanent Magnet Synchronous Motor (IPMSM) design that combines retrieval-augmented generation, finite element analysis, and machine learning surrogates. The framework addresses traditional bottlenecks in motor design by automating problem setup, reducing computational costs, and improving prediction reliability through uncertainty-aware switching between AI inference and high-fidelity simulation.

AIBullisharXiv – CS AI · Jun 86/10

🧠

MHA-RAG: Improving Efficiency, Accuracy, and Consistency by Encoding Exemplars as Soft Prompts

Researchers introduce MHA-RAG, a framework that encodes domain-specific exemplars as soft prompts instead of text, achieving 20-point performance improvements over standard RAG while reducing inference costs by 10X. The approach demonstrates order-invariant performance across multiple question-answering benchmarks, addressing key challenges in adapting foundation models to new domains with limited data.

AINeutralarXiv – CS AI · Jun 86/10

🧠

Evidence Graph Consistency in Retrieval-Augmented Generation: A Model-Dependent Analysis of Hallucination Detection

Researchers propose Evidence Graph Consistency (EGC), a framework to detect hallucinations in Retrieval-Augmented Generation systems by analyzing structural relationships among evidence pieces. Testing across six LLMs reveals a critical finding: the method works as expected for Llama-2 but shows reversed diagnostic signals for GPT-4, GPT-3.5, and Mistral-7B, suggesting hallucination patterns differ fundamentally across model families.

🧠 GPT-4🧠 Llama

AINeutralarXiv – CS AI · Jun 56/10

🧠

Towards Generalization of Block Attention via Automatic Segmentation and Block Distillation

Researchers introduce SemanticSeg, a large semantic segmentation dataset, and block distillation framework to improve block attention mechanisms for long-context language models. The approach uses a frozen full-attention teacher to train block-attention students more efficiently, addressing key challenges in KV cache reuse for applications like RAG.

AIBullisharXiv – CS AI · Jun 46/10

🧠

Overview of the EReL@MIR 2025 Multimodal Document Retrieval Challenge (Track 1)

The EReL@MIR 2025 Multimodal Document Retrieval Challenge invited teams to build retrieval systems handling both closed-set document page retrieval and open-domain Wikipedia passage retrieval from text and image queries. The competition attracted 22 teams with 586 submissions, with winning systems favoring decoder-based Multimodal-LLM embedders over traditional CLIP-style encoders.

AINeutralarXiv – CS AI · Jun 46/10

🧠

MemoryDocDataSet: A Benchmark for Joint Conversational Memory and Long Document Reasoning

Researchers introduce MemoryDocDataSet, a new benchmark for evaluating AI systems that must simultaneously handle multi-session conversational memory and long document reasoning. The synthetic dataset reveals a significant performance gap in current architectures, with the best baseline achieving only 35.8% F1 on tasks requiring joint memory-document navigation.

AINeutralarXiv – CS AI · Jun 26/10

🧠

ForeSci: Evaluating LLM Agents for Forward-Looking AI Research Judgment

ForeSci introduces a new benchmark for evaluating whether large language model agents can make forward-looking research decisions using only historical evidence, testing 500 tasks across AI domains. The research reveals that while explicit evidence organization improves traceability, a fundamental evidence-decision decoupling problem persists where agents cite relevant sources but reach incorrect conclusions.

AIBullisharXiv – CS AI · Jun 26/10

🧠

Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

Researchers introduce Critic-R, a framework that improves agentic search systems by creating a feedback loop between reasoning agents and retrieval models. The approach uses a critic model to evaluate whether retrieved context supports reasoning steps and includes two mechanisms: Critic-R-Zero for query refinement at inference time, and Critic-Embed for training retrievers without manual annotations, demonstrating significant improvements on multi-hop question-answering benchmarks.

AINeutralarXiv – CS AI · May 286/10

🧠

EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design

Researchers introduce EngiAI, a multi-agent LLM framework with a comprehensive benchmark suite for evaluating AI systems on complex engineering design tasks combining simulation, retrieval, and manufacturing. The framework reveals significant performance gaps between proprietary models (96-97% task completion) and open-source alternatives (55-78%), with conditional reasoning emerging as a critical failure point.

AINeutralarXiv – CS AI · May 276/10

🧠

LitSeg: Narrative-Aware Document Segmentation for Literary RAG

Researchers introduce LitSeg, a narrative-theory-guided framework for intelligently segmenting literary documents in Retrieval-Augmented Generation systems. The method uses multi-stage prompting to identify plot events and narrative structures, with a lightweight variant (LitSeg-Lite) that distills this complexity into a single inference pass, demonstrating improved retrieval accuracy for literary RAG applications.

AINeutralarXiv – CS AI · May 276/10

🧠

AI Agent for Reverse-Engineering Legacy Finite-Difference Code and Translating to Devito

Researchers have developed an AI agent framework that automates the translation of legacy finite-difference code into Devito, a modern computational framework. The system combines retrieval-augmented generation (RAG) with large language models and implements reinforcement learning feedback mechanisms to enable dynamic code transformation with validation across correctness, structure, and API compliance.

AIBullisharXiv – CS AI · May 116/10

🧠

From Surface Learning to Deep Understanding: A Grounded AI Tutoring System for Moodle

Researchers have developed an AI Teaching & Learning Assistant, a Moodle plugin using Retrieval-Augmented Generation (RAG) to provide students with Socratic tutoring while enabling educators to supervise content generation. The system grounds LLM responses in teacher-provided materials to minimize hallucinations and misinformation, achieving high faithfulness scores (0.97) and strong user satisfaction (4.00/5.00 rating).

AIBullisharXiv – CS AI · May 96/10

🧠

Towards Dependable Retrieval-Augmented Generation Using Factual Confidence Prediction

Researchers propose a two-stage approach to improve reliability in retrieval-augmented generation (RAG) systems by using conformal prediction to filter retrieved content and an attention-based classifier to detect factual inconsistencies. The framework achieves up to 6% answer quality improvement and 77% inconsistency detection, advancing toward certified RAG systems for production AI applications.

AINeutralarXiv – CS AI · May 46/10

🧠

Evaluating Legal Reasoning Traces with Legal Issue Tree Rubrics

Researchers introduce LEGIT, a 24K-instance legal reasoning dataset with hierarchical argument trees that serve as evaluation rubrics for LLM-generated legal reasoning. The study reveals that LLM legal reasoning performance depends critically on both issue coverage and correctness, with RAG and reinforcement learning offering complementary improvements.

AINeutralarXiv – CS AI · Apr 146/10

🧠

Do We Still Need GraphRAG? Benchmarking RAG and GraphRAG for Agentic Search Systems

A new benchmark study (RAGSearch) evaluates whether agentic search systems can reduce the need for expensive GraphRAG pipelines by dynamically retrieving information across multiple rounds. Results show agentic search significantly improves standard RAG performance and narrows the gap to GraphRAG, though GraphRAG retains advantages for complex multi-hop reasoning tasks when preprocessing costs are considered.

🏢 Meta

AINeutralarXiv – CS AI · Apr 146/10

🧠

CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning

Researchers introduce CodaRAG, a framework that enhances Retrieval-Augmented Generation by treating evidence retrieval as active associative discovery rather than passive lookup. The system achieves 7-10% gains in retrieval recall and 3-11% improvements in generation accuracy by consolidating fragmented knowledge, navigating multi-dimensional pathways, and eliminating noise.

AIBullisharXiv – CS AI · Apr 76/10

🧠

Compliance-by-Construction Argument Graphs: Using Generative AI to Produce Evidence-Linked Formal Arguments for Certification-Grade Accountability

Researchers propose a compliance-by-construction architecture that integrates Generative AI with structured formal argument representations to ensure accountability in high-stakes decision systems. The approach uses typed Argument Graphs, retrieval-augmented generation, validation constraints, and provenance ledgers to prevent AI hallucinations while maintaining traceability for regulatory compliance.

AIBullisharXiv – CS AI · Apr 76/10

🧠

Scaling DPPs for RAG: Density Meets Diversity

Researchers propose ScalDPP, a new retrieval mechanism for RAG systems that uses Determinantal Point Processes to optimize both density and diversity in context selection. The approach addresses limitations in current RAG pipelines that ignore interactions between retrieved information chunks, leading to redundant contexts that reduce effectiveness.

AIBullisharXiv – CS AI · Apr 76/10

🧠

GROUNDEDKG-RAG: Grounded Knowledge Graph Index for Long-document Question Answering

Researchers introduced GroundedKG-RAG, a new retrieval-augmented generation system that creates knowledge graphs directly grounded in source documents to improve long-document question answering. The system reduces resource consumption and hallucinations while maintaining accuracy comparable to state-of-the-art models at lower cost.

AIBullisharXiv – CS AI · Apr 66/10

🧠

Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference

A large-scale study of prompt compression techniques for LLMs found that LLMLingua can achieve up to 18% speed improvements when properly configured, while maintaining response quality across tasks. However, compression benefits only materialize under specific conditions of prompt length, compression ratio, and hardware capacity.

AIBullisharXiv – CS AI · Mar 276/10

🧠

Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system

Researchers developed a framework integrating large language models with knowledge graphs to provide programming feedback and exercise recommendations. The hybrid GenAI-adaptive approach outperformed traditional adaptive learning and GenAI-only modes, producing more correct code submissions and fewer incomplete attempts across 4,956 code submissions.

← PrevPage 2 of 4Next →