#rag-systems News & Analysis

96 articles tagged with #rag-systems. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

96 articles

AINeutralarXiv – CS AI · May 127/10

🧠

MedMeta: A Benchmark for LLMs in Synthesizing Meta-Analysis Conclusion from Medical Studies

Researchers introduce MedMeta, a benchmark evaluating how well large language models can synthesize conclusions from medical meta-analyses using only study abstracts. The study reveals that retrieval-augmented generation (RAG) significantly outperforms parametric-only approaches, but all current models struggle with evidence synthesis and fail to properly reject contradictory findings, achieving only marginally above-average performance even under ideal conditions.

AIBearisharXiv – CS AI · May 117/10

🧠

From Clouds to Hallucinations: Atmospheric Retrieval Hijacking in Remote Sensing Vision-Language RAG

Researchers introduce CloudWeb, an adversarial attack that manipulates remote sensing images with realistic cloud and haze patterns to hijack vision-language retrieval systems in multimodal RAG pipelines. The attack achieves significant success rates—increasing weather-related evidence injection from 0.71% to 43.29% on benchmark tests—demonstrating that input-space threats to retrieval stages remain largely undefended in production systems.

🏢 OpenAI

AIBullisharXiv – CS AI · May 117/10

🧠

LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

LARAG introduces a link-aware retrieval strategy that improves RAG systems by leveraging hyperlink structures already present in technical documentation, rather than treating documents as flat text collections. The approach achieves better answer quality with fewer computational resources, demonstrating that implicit graph-like retrieval through existing metadata can enhance AI system performance.

AIBullisharXiv – CS AI · May 97/10

🧠

Agentic Retrieval-Augmented Generation for Financial Document Question Answering

Researchers introduce FinAgent-RAG, an advanced AI framework designed to answer complex financial questions by combining iterative retrieval, reasoning, and self-verification. The system achieves 76-78% accuracy on financial benchmarks while reducing computational costs by 41%, demonstrating practical viability for institutional financial analysis.

AINeutralarXiv – CS AI · May 17/10

🧠

NanoKnow: How to Know What Your Language Model Knows

Researchers release NanoKnow, a benchmark dataset that reveals how large language models acquire and encode knowledge by leveraging nanochat's fully transparent pre-training data. The study demonstrates that LLM accuracy depends heavily on answer frequency in training data, and that parametric knowledge and external evidence serve complementary roles in model outputs.

AIBullisharXiv – CS AI · May 17/10

🧠

NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains

Researchers introduce NeocorRAG, a new framework that optimizes retrieval quality in Retrieval-Augmented Generation (RAG) systems by using Evidence Chains, achieving state-of-the-art performance while reducing token consumption by 80% compared to comparable methods. The framework addresses a critical gap where improvements in retrieval metrics don't consistently translate to better reasoning accuracy.

AIBullisharXiv – CS AI · May 17/10

🧠

Think it, Run it: Autonomous ML pipeline generation via self-healing multi-agent AI

Researchers have developed a multi-agent AI system that autonomously generates machine learning pipelines from datasets and natural-language instructions, achieving 84.7% success rate across 150 diverse tasks. The architecture integrates self-healing mechanisms and adaptive learning to reduce manual development time and improve robustness.

AIBullisharXiv – CS AI · Apr 157/10

🧠

DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding

Researchers introduce DocSeeker, a multimodal AI system designed to improve long document understanding by implementing structured analysis, localization, and reasoning workflows. The breakthrough addresses critical limitations in existing large language models that struggle with lengthy documents due to high noise levels and weak training signals, achieving superior performance on both short and ultra-long documents.

AIBullisharXiv – CS AI · Apr 157/10

🧠

Reasoning Graphs: Self-Improving, Deterministic RAG through Evidence-Centric Feedback

Researchers introduce reasoning graphs, a persistent knowledge structure that improves language model reasoning accuracy by storing and reusing chains of thought tied to evidence items. The system achieves 47% error reduction on multi-hop questions and maintains deterministic outputs without model retraining, using only context engineering.

AIBearisharXiv – CS AI · Apr 147/10

🧠

ADAM: A Systematic Data Extraction Attack on Agent Memory via Adaptive Querying

Researchers have developed ADAM, a novel privacy attack that exploits vulnerabilities in Large Language Model agents' memory systems through adaptive querying, achieving up to 100% success rates in extracting sensitive information. The attack highlights critical security gaps in modern LLM-based systems that rely on memory modules and retrieval-augmented generation, underscoring the urgent need for privacy-preserving safeguards.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

Researchers introduce Disco-RAG, a discourse-aware framework that enhances Retrieval-Augmented Generation (RAG) systems by explicitly modeling discourse structures and rhetorical relationships between retrieved passages. The method achieves state-of-the-art results on question answering and summarization tasks without fine-tuning, demonstrating that structural understanding of text significantly improves LLM performance on knowledge-intensive tasks.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Persistent Identity in AI Agents: A Multi-Anchor Architecture for Resilient Memory and Continuity

Researchers introduce soul.py, an open-source architecture addressing catastrophic forgetting in AI agents by distributing identity across multiple memory systems rather than centralizing it. The framework implements persistent identity through separable components and a hybrid RAG+RLM retrieval system, drawing inspiration from how human memory survives neurological damage.

AIBullisharXiv – CS AI · Apr 137/10

🧠

CSAttention: Centroid-Scoring Attention for Accelerating LLM Inference

Researchers introduce CSAttention, a training-free sparse attention method that accelerates LLM inference by 4.6x for long-context applications. The technique optimizes the offline-prefill/online-decode workflow by precomputing query-centric lookup tables, enabling faster token generation without sacrificing accuracy even at 95% sparsity levels.

AIBearisharXiv – CS AI · Mar 277/10

🧠

PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems

Researchers have developed PIDP-Attack, a new cybersecurity threat that combines prompt injection with database poisoning to manipulate AI responses in Retrieval-Augmented Generation (RAG) systems. The attack method demonstrated 4-16% higher success rates than existing techniques across multiple benchmark datasets and eight different large language models.

AINeutralarXiv – CS AI · Mar 117/10

🧠

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

Researchers have developed an open-source benchmark dataset to evaluate AI systems' compliance with the EU AI Act, specifically focusing on NLP and RAG systems. The dataset enables automated assessment of risk classification, article retrieval, and question-answering tasks, achieving 0.87 and 0.85 F1-scores for prohibited and high-risk scenarios.

AIBullisharXiv – CS AI · Jun 256/10

🧠

CausalRAG2: Hierarchical Causal Knowledge Graph Design for RAG

Researchers introduce CausalRAG2, a framework that improves retrieval-augmented generation (RAG) systems by incorporating causal reasoning into knowledge graph design, addressing limitations in current entity-centric approaches. The framework uses hierarchical modules with causal gating to reduce spurious correlations and enable scalable reasoning, accompanied by a new HolisQA benchmark for comprehensive evaluation.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Memory Makes the Difference: Evaluating How Different Memory Roles Shape Conversational Agents

Researchers present a taxonomy of memory roles in RAG-based conversational AI systems, demonstrating that different memory types—such as clarifying versus irrelevant memories—substantially shape response quality, factual accuracy, and personalization. Using a user-centric evaluation framework, the study reveals that memory function matters more than just storage mechanisms, with implications for developing more effective conversational agents.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Is GraphRAG Needed? From Basic RAG to Graph-/Agentic Solutions with Context Optimization

Researchers present a comprehensive framework comparing RAG (Retrieval-Augmented Generation) variants—including GraphRAG, Modular RAG, and Agentic RAG—across 9 standardized scenarios. They introduce a novel context optimization method that reduces token usage by 19-53% while identifying a retrieval-generation gap suggesting advanced retrieval methods may not proportionally improve output quality.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Graph-Enhanced Large Language Models for Spatial Search

Researchers propose enhancing Large Language Models with graph-based spatial reasoning capabilities to address current limitations in understanding physical world questions. The work aims to enable search engines and LLMs to better answer complex spatial queries relevant to urban planning, engineering, and travel domains by integrating graph data structures.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Tell Me: An LLM-powered Mental Well-being Assistant with RAG, Synthetic Dialogue Generation, and Agentic Planning

Researchers have developed Tell Me, an LLM-powered mental health support system that combines retrieval-augmented generation for personalized dialogue, synthetic therapist-client conversation generation for research purposes, and an agentic AI crew for creating adaptive self-care plans. The system demonstrates how large language models can expand access to mental well-being resources while maintaining clear boundaries that it complements rather than replaces professional therapy.

AINeutralarXiv – CS AI · Jun 236/10

🧠

From RAG to Agentic RAG for Faithful Islamic Question Answering

Researchers introduced IslamicFaithQA, a 3,810-item bilingual benchmark and agentic RAG framework designed to improve the accuracy and reliability of Islamic question-answering systems. The work addresses critical gaps in LLM evaluation by measuring hallucination rates and abstention capabilities, achieving state-of-the-art performance through iterative evidence-seeking mechanisms grounded in Qur'anic text.

🏢 Hugging Face

AIBullisharXiv – CS AI · Jun 196/10

🧠

Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why

Researchers deployed ACIE, an on-premise agentic RAG system at University Medicine Essen, to extract clinical information from fragmented patient records spanning hundreds of documents. Clinicians validated 7,326 extractions with 96.5% acceptance rates, demonstrating that agentic architectures with explicit reasoning can overcome standard RAG failures in handling temporal dependencies and missing metadata in healthcare contexts.

AINeutralarXiv – CS AI · Jun 116/10

🧠

The Structural Attention Tax: How Retrieval Format Hijacks In-Context Learning Independent of Content

Researchers identify a 'structural attention tax' where knowledge graph formats capture 2-3x more model attention than semantically equivalent natural language, degrading in-context learning performance by up to 42% regardless of content relevance. The study formalizes attention decomposition into semantic and structural components, revealing that retrieval format can independently distort LLM outputs independent of knowledge quality.

AINeutralarXiv – CS AI · Jun 96/10

🧠

DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking

Researchers introduce DIVERGE, a new retrieval-augmented generation (RAG) framework that addresses a critical limitation in current AI systems: their inability to generate diverse, multiple perspectives for open-ended questions. The system achieves approximately 2x greater diversity in outputs without sacrificing quality by using iterative reflection and diversity-aware retrieval strategies.

AINeutralarXiv – CS AI · Jun 85/10

🧠

Automated Root-Cause Subclassification and No-Code Fix Generation for Invalid Bug Reports

Researchers introduce a standardized taxonomy for classifying invalid bug reports and develop AI methods to automatically identify root causes and generate no-code fixes. Testing retrieval augmented generation, vanilla LLMs, and agentic web search, they achieve 66% weighted F1-score for subclassification and 68.9% success rate for fix generation, demonstrating significant potential for automating customer support workflows.

← PrevPage 2 of 4Next →