y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#rag News & Analysis

69 articles tagged with #rag. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

69 articles
AIBearisharXiv โ€“ CS AI ยท Mar 266/10
๐Ÿง 

Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

Research reveals that Retrieval-Augmented Generation (RAG) systems exhibit fairness issues, with queries from certain demographic groups systematically receiving higher accuracy than others. The study identifies three key factors affecting fairness: group exposure in retrieved documents, utility of group-specific documents, and attribution bias in how generators use different group documents.

๐Ÿข Meta
AINeutralarXiv โ€“ CS AI ยท Mar 266/10
๐Ÿง 

Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA

A research study on retrieval-augmented generation (RAG) systems for AI policy analysis found that improving retrieval quality doesn't necessarily lead to better question-answering performance. The research used 947 AI policy documents and discovered that stronger retrieval can paradoxically cause more confident hallucinations when relevant information is missing.

AINeutralarXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

Citation-Enforced RAG for Fiscal Document Intelligence: Cited, Explainable Knowledge Retrieval in Tax Compliance

Researchers have developed a new AI framework that uses citation-enforced retrieval-augmented generation (RAG) specifically for analyzing tax and fiscal documents. The system prioritizes transparency and explainability for tax authorities, showing improved citation accuracy and reduced AI hallucinations when tested on real IRS documents.

AINeutralarXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

Not All Queries Need Rewriting: When Prompt-Only LLM Refinement Helps and Hurts Dense Retrieval

Research reveals that LLM query rewriting in RAG systems shows highly domain-dependent performance, degrading retrieval effectiveness by 9% in financial domains while improving it by 5.1% in scientific contexts. The study identifies that effectiveness depends on whether rewriting improves or worsens lexical alignment between queries and domain-specific terminology.

AIBullisharXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

GlobalRAG is a new reinforcement learning framework that significantly improves multi-hop question answering by decomposing questions into subgoals and coordinating retrieval with reasoning. The system achieves 14.2% average improvements in performance metrics while using only 42% of the training data required by baseline models.

AIBullisharXiv โ€“ CS AI ยท Mar 166/10
๐Ÿง 

Test-Time Strategies for More Efficient and Accurate Agentic RAG

Researchers improved agentic Retrieval-Augmented Generation (RAG) systems by introducing contextualization and de-duplication modules to address inefficiencies in complex question-answering. The enhanced Search-R1 pipeline achieved 5.6% better accuracy and 10.5% fewer retrieval turns using GPT-4.1-mini.

๐Ÿง  GPT-4
AIBullisharXiv โ€“ CS AI ยท Mar 116/10
๐Ÿง 

TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation

Researchers propose TaSR-RAG, a new framework that improves Retrieval-Augmented Generation systems by using taxonomy-guided structured reasoning for better evidence selection. The system decomposes complex questions into triple sub-queries and performs step-wise evidence matching, achieving up to 14% performance improvements over existing RAG baselines on multi-hop question answering benchmarks.

AINeutralarXiv โ€“ CS AI ยท Mar 96/10
๐Ÿง 

Towards Neural Graph Data Management

Researchers introduce NGDBench, a comprehensive benchmark for evaluating neural networks' ability to work with graph databases across five domains including finance and medicine. The benchmark supports full Cypher query language capabilities and reveals significant limitations in current AI models when handling structured graph data, noise, and complex analytical tasks.

AIBullisharXiv โ€“ CS AI ยท Mar 66/10
๐Ÿง 

CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG Models

Researchers propose CTRL-RAG, a new reinforcement learning framework that improves large language models' ability to generate accurate, context-faithful responses in Retrieval-Augmented Generation systems. The method uses a Contrastive Likelihood Reward mechanism that optimizes the difference between responses with and without supporting evidence, addressing issues of hallucination and model collapse in existing RAG systems.

AIBullisharXiv โ€“ CS AI ยท Mar 37/107
๐Ÿง 

Semantic XPath: Structured Agentic Memory Access for Conversational AI

Researchers have developed Semantic XPath, a tree-structured memory system for conversational AI that improves performance by 176.7% over traditional methods while using only 9.1% of the tokens. The system addresses scalability issues in long-term AI conversations by efficiently accessing and updating structured memory instead of appending growing conversation history.

AIBullisharXiv โ€“ CS AI ยท Mar 36/106
๐Ÿง 

S5-HES Agent: Society 5.0-driven Agentic Framework to Democratize Smart Home Environment Simulation

Researchers have developed S5-HES Agent, an AI-driven framework that democratizes smart home research by enabling natural language configuration of simulations without programming expertise. The system uses large language models and retrieval-augmented generation to make smart home environment testing accessible to broader research communities beyond traditional technical experts.

$NEAR
AIBullisharXiv โ€“ CS AI ยท Mar 36/109
๐Ÿง 

GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Researchers introduce GAM-RAG, a training-free framework that improves Retrieval-Augmented Generation by building adaptive memory from past queries instead of relying on static indices. The system uses uncertainty-aware updates inspired by cognitive neuroscience to balance stability and adaptability, achieving 3.95% better performance while reducing inference costs by 61%.

AIBullisharXiv โ€“ CS AI ยท Mar 36/107
๐Ÿง 

NovaLAD: A Fast, CPU-Optimized Document Extraction Pipeline for Generative AI and Data Intelligence

NovaLAD is a new CPU-optimized document extraction pipeline that uses dual YOLO models for converting unstructured documents into structured formats for AI applications. The system achieves 96.49% TEDS and 98.51% NID on benchmarks, outperforming existing commercial and open-source parsers while running efficiently on CPU without requiring GPU resources.

AIBullisharXiv โ€“ CS AI ยท Mar 36/106
๐Ÿง 

TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents

Researchers developed TARSE, a new AI system for clinical decision-making that retrieves relevant medical skills and experiences from curated libraries to improve reasoning accuracy. The system performs test-time adaptation to align language models with clinically valid logic, showing improvements over existing medical AI baselines in question-answering benchmarks.

AIBullisharXiv โ€“ CS AI ยท Mar 36/106
๐Ÿง 

Retrieval, Refinement, and Ranking for Text-to-Video Generation via Prompt Optimization and Test-Time Scaling

Researchers introduce 3R, a new RAG-based framework that optimizes prompts for text-to-video generation models without requiring model retraining. The system uses three key strategies to improve video quality: RAG-based modifier extraction, diffusion-based preference optimization, and temporal frame interpolation for better consistency.

AINeutralarXiv โ€“ CS AI ยท Mar 36/103
๐Ÿง 

Scaling Retrieval Augmented Generation with RAG Fusion: Lessons from an Industry Deployment

Research on production RAG systems reveals that retrieval fusion techniques like multi-query retrieval and reciprocal rank fusion increase raw document recall but fail to improve end-to-end performance due to re-ranking limits and context constraints. The study found fusion variants actually decreased accuracy from 0.51 to 0.48 while adding latency overhead without corresponding benefits.

AIBearisharXiv โ€“ CS AI ยท Mar 36/104
๐Ÿง 

Wikipedia in the Era of LLMs: Evolution and Risks

A new research study analyzes how Large Language Models are impacting Wikipedia content and structure, finding approximately 1% influence in certain categories. The research warns of potential risks to AI benchmarks and natural language processing tasks if Wikipedia becomes contaminated by LLM-generated content.

AINeutralarXiv โ€“ CS AI ยท Mar 36/104
๐Ÿง 

Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment

Researchers have developed AQUA, the first watermarking framework designed to protect image copyright in Multimodal Retrieval-Augmented Generation (RAG) systems. The framework addresses a critical gap in protecting visual content within RAG-as-a-Service platforms by embedding semantic signals into synthetic images that survive the retrieval-to-generation process.

AINeutralarXiv โ€“ CS AI ยท Mar 27/1012
๐Ÿง 

An Agentic LLM Framework for Adverse Media Screening in AML Compliance

Researchers have developed an agentic LLM framework using Retrieval-Augmented Generation to automate adverse media screening for anti-money laundering compliance in financial institutions. The system addresses high false-positive rates in traditional keyword-based approaches by implementing multi-step web searches and computing Adverse Media Index scores to distinguish between high-risk and low-risk individuals.

AIBullisharXiv โ€“ CS AI ยท Mar 26/1013
๐Ÿง 

Domain-Partitioned Hybrid RAG for Legal Reasoning: Toward Modular and Explainable Legal AI for India

Researchers developed a domain-partitioned hybrid RAG system with knowledge graphs specifically for Indian legal research, combining three specialized pipelines for Supreme Court cases, statutory texts, and penal codes. The system achieved a 70% pass rate on legal questions, nearly doubling the performance of traditional RAG-only approaches at 37.5%.

AIBullisharXiv โ€“ CS AI ยท Mar 26/1017
๐Ÿง 

Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG

Researchers have developed Higress-RAG, a new enterprise-grade framework that addresses key challenges in Retrieval-Augmented Generation systems including low retrieval precision, hallucination, and high latency. The system introduces innovations like 50ms semantic caching, hybrid retrieval methods, and corrective evaluation to optimize the entire RAG pipeline for production use.

$LINK
AIBullisharXiv โ€“ CS AI ยท Feb 276/108
๐Ÿง 

Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation

Researchers developed GYWI, a scientific idea generation system that combines author knowledge graphs with retrieval-augmented generation to help Large Language Models generate more controllable and traceable scientific ideas. The system significantly outperforms mainstream LLMs including GPT-4o, DeepSeek-V3, Qwen3-8B, and Gemini 2.5 in metrics like novelty, reliability, and relevance.

AIBullisharXiv โ€“ CS AI ยท Feb 276/102
๐Ÿง 

Retrieval-Augmented Generation Assistant for Anatomical Pathology Laboratories

Researchers developed a Retrieval-Augmented Generation (RAG) assistant for anatomical pathology laboratories to replace outdated static documentation with dynamic, searchable protocol guidance. The system achieved strong performance using biomedical-specific embeddings and could transform healthcare laboratory workflows by providing technicians with accurate, context-grounded answers to protocol queries.