y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#retrieval-augmented-generation News & Analysis

98 articles tagged with #retrieval-augmented-generation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

98 articles
AIBullisharXiv – CS AI · Mar 47/103
🧠

Type-Aware Retrieval-Augmented Generation with Dependency Closure for Solver-Executable Industrial Optimization Modeling

Researchers developed a type-aware retrieval-augmented generation (RAG) method that translates natural language requirements into solver-executable optimization code for industrial applications. The method uses a typed knowledge base and dependency closure to ensure code executability, successfully validated on battery production optimization and job scheduling tasks where conventional RAG approaches failed.

AINeutralarXiv – CS AI · 2d ago6/10
🧠

S3Mem: Structured Spatiotemporal Scene-Event Memory for Long-Horizon Interactive Question Answering

Researchers introduce S3MEM, a structured memory framework that improves how AI agents retrieve and answer questions about long trajectory histories. The system outperforms standard retrieval-augmented generation by organizing trajectories into scene-event units and using anchor-sensitive retrieval, achieving better accuracy with fewer tokens across multiple interactive environments.

AINeutralarXiv – CS AI · 2d ago6/10
🧠

HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering

Researchers introduce HiKEY, a hierarchical multimodal retrieval framework designed to improve document-based question answering systems by leveraging document structure as a core retrieval signal. The system addresses critical limitations in existing approaches by implementing a coarse-to-fine retrieval strategy and demonstrating significant performance improvements on ODQA benchmarks.

AINeutralarXiv – CS AI · 2d ago6/10
🧠

SERC: LDPC-Inspired Semantic Error Correction for Retrieval-Augmented Generation

Researchers propose SERC, an LDPC-inspired framework that treats LLM hallucination correction as a semantic error-correction problem using sparse verification strategies. The training-free, model-agnostic approach demonstrates superior performance on factual accuracy benchmarks while reducing computational overhead compared to dense verification methods.

🧠 Llama
AINeutralarXiv – CS AI · 2d ago6/10
🧠

RAISE: RAG Design as an Architecture Search Problem

Researchers introduce RAISE, a comprehensive framework for optimizing retrieval-augmented generation (RAG) systems by treating architecture design as a hyperparameter search problem. The study evaluates 13 optimization algorithms across seven datasets, revealing that RAG performance is highly task-dependent and no single optimization strategy universally outperforms others, highlighting the need for systematic rather than heuristic-based configuration approaches.

🏢 Meta
AINeutralarXiv – CS AI · 2d ago6/10
🧠

Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models

Researchers propose Micro-Macro Retrieval (M2R), a framework that reduces hallucination in large language models during long-form text generation by keeping key information closer to model outputs. The method combines coarse-grained external retrieval with fine-grained extraction from an internal knowledge repository, addressing a critical bottleneck where proximity of evidence to final answers directly correlates with factual accuracy.

AINeutralarXiv – CS AI · 2d ago6/10
🧠

SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents

SkillBrew introduces a multi-objective curation framework for managing skill banks in LLM agents, addressing the problem of bloated repositories filled with redundant and outdated skills. The approach treats skill bank management as a constrained optimization problem balancing utility, diversity, and query coverage, evaluated successfully on public benchmarks.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

A Fixed-Budget, Cluster-Aware Standard for LLM-as-a-Judge Evaluation: A Multi-Hop RAG Stress Test

Researchers propose a standardized measurement protocol for evaluating retrieval-augmented generation (RAG) systems using LLM judges, addressing inconsistencies in how semantic search quality is assessed. The standard fixes key variables like evidence budget and prompt while requiring cluster-aware statistical testing, revealing that previous comparisons may have overstated progress and that traditional BM25 retrieval outperforms pure semantic methods under controlled conditions.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

C-MIG: Multi-view Information Gain-based Retrieval-Augmented Generation for Clinical Diagnosis Reasoning

Researchers introduce C-MIG, a retrieval-augmented generation framework that improves clinical diagnosis reasoning by using multi-view information gain instead of binary reward signals. The method outperforms existing RAG-RL approaches on medical benchmarks by better capturing semantically relevant information and addressing credit assignment challenges in healthcare AI systems.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

DiagramRAG: A Lightweight Framework to Retrieve Scientific Diagram for Figure Generation

DiagramRAG is a new retrieval-augmented framework that converts rough sketches into publication-quality scientific diagrams by retrieving semantically and topologically compatible reference diagrams. The system achieves strong performance metrics (F1-scores of 0.848 and 0.802 on benchmark datasets) while maintaining efficient inference at 35.48 seconds per sample.

🏢 Hugging Face
AIBullisharXiv – CS AI · 3d ago6/10
🧠

CircuitLM: A Multi-Agent LLM-Aided Design Framework for Generating Circuit Schematics from Natural Language Prompts

CircuitLM is a multi-agent AI framework that converts natural language descriptions into machine-readable circuit schematics, addressing persistent hallucination and constraint-violation issues in LLM-based electronic design automation. The system uses a five-stage pipeline combining retrieval-augmented generation with dual-layer verification—electrical rule checking and LLM-as-judge evaluation—to produce structurally viable, prototype-ready circuits.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

From Norms to Indicators (N2I-RAG): An Agentic Retrieval-Augmented Generation Framework for Legal Indicator Computation

Researchers introduce N2I-RAG, an AI framework that automates computation of legal indicators from normative texts using retrieval-augmented generation with built-in validation mechanisms. The system addresses hallucination risks in traditional language models by emphasizing traceability and evidence grounding, demonstrating strong performance on French marine environmental law.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control

Researchers introduce Cordon-MAS, a new defense framework against poisoning attacks on retrieval-augmented generation (RAG) systems. The framework reduces attack success rates by 92.4% by enforcing information-flow control that prevents synthesis agents from directly accessing untrusted evidence, addressing a critical vulnerability in AI systems used for high-stakes applications.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Does RAG Know When Retrieval Is Wrong? Diagnosing Context Compliance under Knowledge Conflict

Researchers introduce Context-Driven Decomposition (CDD), a diagnostic tool that reveals how retrieval-augmented generation (RAG) systems blindly follow retrieved context even when it contradicts their underlying knowledge. Testing across multiple AI models shows CDD can improve accuracy to 64% on adversarial scenarios, though improvements don't consistently transfer across different model families, suggesting RAG systems resolve conflicts through fundamentally different mechanisms.

🧠 Claude🧠 Gemini
AINeutralarXiv – CS AI · May 126/10
🧠

The First Drop of Ink: Nonlinear Impact of Misleading Information in Long-Context Reasoning

Researchers reveal that large language models suffer from a nonlinear performance degradation when exposed to misleading information in long-context scenarios, with the majority of decline occurring when hard distractors comprise just a small fraction of the total context. This finding, termed 'The First Drop of Ink' effect, demonstrates that attention mechanisms disproportionately focus on misleading content, suggesting that upstream retrieval quality is more critical than previously understood for RAG and agentic systems.

AINeutralarXiv – CS AI · May 126/10
🧠

CDS4RAG: Cyclic Dual-Sequential Hyperparameter Optimization for RAG

Researchers introduce CDS4RAG, a novel optimization framework that improves Retrieval-Augmented Generation systems by cyclically optimizing retriever and generator hyperparameters separately rather than treating them as a monolithic unit. The method achieves up to 1.54x improvements in generation quality while demonstrating faster convergence across multiple benchmarks and language models.

AINeutralarXiv – CS AI · May 126/10
🧠

Generating Leakage-Free Benchmarks for Robust RAG Evaluation

Researchers introduce SeedRG, a benchmark generation pipeline that addresses knowledge leakage in retrieval-augmented generation (RAG) evaluation by creating novel, structurally similar test instances that cannot be answered from language models' existing parametric memory. The approach tackles the critical problem of benchmark aging, where reused datasets become less effective for evaluation as their content gets absorbed into model training.

AIBullisharXiv – CS AI · May 116/10
🧠

RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation

Researchers propose RRCM, a novel framework that enhances Large Language Model-based recommendation systems by dynamically retrieving relevant collaborative and metadata information. The system learns optimal context construction through ranking-driven optimization, addressing key challenges in balancing context quality with efficiency limitations.

AIBullisharXiv – CS AI · May 96/10
🧠

Towards Dependable Retrieval-Augmented Generation Using Factual Confidence Prediction

Researchers propose a two-stage approach to improve reliability in retrieval-augmented generation (RAG) systems by using conformal prediction to filter retrieved content and an attention-based classifier to detect factual inconsistencies. The framework achieves up to 6% answer quality improvement and 77% inconsistency detection, advancing toward certified RAG systems for production AI applications.

AINeutralarXiv – CS AI · May 96/10
🧠

Beyond Chunking: Discourse-Aware Hierarchical Retrieval for Long Document Question Answering

Researchers present a discourse-aware hierarchical framework that uses rhetorical structure theory (RST) to improve long-document question answering systems. Rather than treating documents as flat sequences, the approach leverages natural discourse structures to enhance retrieval accuracy across multiple languages and document types.

AINeutralarXiv – CS AI · May 96/10
🧠

MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents

Researchers present MEMSAD, a defense mechanism against memory poisoning attacks on retrieval-augmented LLM agents, using gradient-coupled anomaly detection to identify adversarial perturbations while maintaining retrieval performance. The work formalizes security vulnerabilities in persistent external memory systems and demonstrates that while composite defenses achieve perfect detection rates, synonym-based attacks remain undetectable by embedding-based approaches.

AIBullisharXiv – CS AI · May 76/10
🧠

CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

Researchers introduce CAR (Confidence-Aware Reranking), a training-free framework that improves document ranking in Retrieval-Augmented Generation systems by measuring how much each document increases the language model's confidence rather than just relevance. Testing across multiple datasets shows consistent improvements in ranking quality and downstream generation performance.

AINeutralarXiv – CS AI · May 76/10
🧠

Architectural Constraints Alignment in AI-assisted, Platform-based Service Development

Researchers propose a retrieval-augmented scaffolding approach that enhances AI-assisted code generation by embedding architectural constraints and infrastructure requirements during service development. The method combines platform templates with agentic clarification loops to improve production deployability and architectural consistency compared to standard AI code generation tools.

AINeutralarXiv – CS AI · Apr 206/10
🧠

Rethinking the Necessity of Adaptive Retrieval-Augmented Generation through the Lens of Adaptive Listwise Ranking

Researchers propose AdaRankLLM, an adaptive retrieval-augmented generation framework that dynamically filters irrelevant passages to reduce computational overhead while maintaining output quality. The study challenges whether adaptive retrieval remains necessary as language models grow more robust, finding that its value differs significantly between weaker and stronger models.

← PrevPage 2 of 4Next →