#information-retrieval News & Analysis

100 articles tagged with #information-retrieval. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

100 articles

AINeutralarXiv – CS AI · Jun 237/10

🧠

When Confidence Takes the Wrong Path: Diagnosing Retrieval-State Lock-In in RAG

Researchers identify 'retrieval-state lock-in,' a failure mode in retrieval-augmented generation (RAG) systems where multiple sampled answers agree despite being wrong because they condition on the same defective retrieval state. The study proposes decomposing confidence scores into three components—answer surface, evidence, and retrieval state—achieving 91.9% precision by requiring all three to agree, though this certifies only 7.7% of answers as low-risk.

AIBearisharXiv – CS AI · Jun 107/10

🧠

Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines

Researchers demonstrate that LLM-based search engines are vulnerable to ranking manipulation attacks, where adversaries craft content to game results. Using game theory, the study reveals that reducing attack success rates can paradoxically incentivize attacks, and defensive caps may fail—highlighting the need for adaptive security strategies beyond traditional defenses.

AIBullisharXiv – CS AI · Jun 57/10

🧠

HypRAG: Hyperbolic Dense Retrieval for Retrieval Augmented Generation

Researchers introduce HypRAG, a novel dense retrieval system for retrieval-augmented generation that operates in hyperbolic space rather than traditional Euclidean space. The approach achieves up to 29% performance gains over Euclidean baselines by better preserving the hierarchical structure of natural language, reducing hallucination risks in AI systems.

AIBullisharXiv – CS AI · Jun 57/10

🧠

Reducing Hallucinations in Complex Question Answering using Simple Graph-based Retrieval-Augmented Generation (long version)

Researchers present a graph-based retrieval-augmented generation (RAG) system that reduces AI hallucinations by integrating lightweight graph structures with vector search tools. Testing on Wikipedia QA benchmarks shows the approach halves hallucinated answers while improving factual precision and recall with minimal token overhead.

AIBearisharXiv – CS AI · Jun 57/10

🧠

Dense Contexts Are Hard Contexts: Lexical Density Limits Effective Context in LLMs

Researchers discovered that lexical density—the rate at which new information appears in text—significantly limits LLM effective context windows, causing near-perfect models to drop below 60% accuracy on information-dense contexts. This finding reveals that input length and needle position, traditionally blamed for context degradation, overlook a critical third factor that directly impacts real-world LLM performance on compact, information-rich data.

AINeutralarXiv – CS AI · Jun 27/10

🧠

Characterizing Web Search in The Age of Generative AI

Researchers systematically compared generative search systems (Google, OpenAI, Perplexity) with traditional Google search, revealing fundamental differences in retrieval strategies, source diversity, and output stability. Generative search synthesizes web information into coherent responses but exhibits significant variation in reliance on internal knowledge, consistency across executions, and evaluation metrics, necessitating new assessment frameworks.

🏢 OpenAI🏢 Perplexity

AIBullisharXiv – CS AI · Jun 17/10

🧠

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

DynaTree is a two-stage framework for efficient news retrieval that combines offline agentic reasoning with lightweight online subtree selection, achieving significant improvements in real-world deployment. The system demonstrated a 59-73% survival rate versus 32-53% for fixed approaches in production A/B testing, highlighting the practical value of persistent semantic expansion for time-sensitive information retrieval.

AIBullisharXiv – CS AI · May 297/10

🧠

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

OmniRetrieval is a new framework that enables unified retrieval across heterogeneous knowledge sources—including unstructured text, relational databases, knowledge graphs, and property graphs—by translating natural language queries into source-native queries rather than forcing all data into a homogenized format. The system demonstrates superior performance compared to single-source retrievers across 13 datasets and 309 knowledge bases, positioning it as a general-purpose interface that preserves the structural advantages of each knowledge source.

AIBullisharXiv – CS AI · May 297/10

🧠

No More K-means:Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval

Researchers introduce Single-stage Sparse Retrieval (SSR), a new approach that replaces clustering-based compression with sparse autoencoders for multi-vector retrieval systems. The method achieves 15x faster indexing, 50% lower retrieval latency, and improved accuracy compared to ColBERTv2, addressing critical efficiency bottlenecks in large-scale information retrieval.

AIBearisharXiv – CS AI · May 287/10

🧠

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

Researchers reveal that LLM-based search agents often rely on intrinsic knowledge rather than genuinely searching the web, with up to 44.5% of answers generated without tool use. The new LiveBrowseComp benchmark, designed to test agents on recent facts within 90 days, shows all evaluated agents drop below 2% accuracy and exposes fundamental limitations in current search-augmented AI evaluation.

🏢 Hugging Face

AIBullisharXiv – CS AI · May 277/10

🧠

ICICLE: Expanding Retrieval with In-Context Documents

Researchers introduce ICICLE, a generative retrieval framework that addresses the inefficiency of traditional corpus expansion by treating new documents as in-context evidence rather than requiring model retraining. The approach uses a copy-based routing mechanism to distinguish between parametric memory and context-provided document associations, achieving better scalability without catastrophic forgetting.

AIBullishGoogle AI Blog · May 197/10

🧠

A new era for AI Search

A major technology company announced a significant advancement in search technology by integrating artificial intelligence capabilities with traditional search engine functionality. This development represents a strategic shift toward hybrid search solutions that combine AI's generative and analytical strengths with search engines' indexing and retrieval capabilities.

AIBullisharXiv – CS AI · May 117/10

🧠

WiCER: Wiki-memory Compile, Evaluate, Refine Iterative Knowledge Compilation for LLM Wiki Systems

Researchers introduce WiCER, an iterative algorithm that solves the "compilation gap" in LLM Wiki systems—the problem of distilling raw documents into persistent knowledge artifacts without losing critical facts. The method recovers 80% of lost quality and reduces catastrophic failures by 55%, outperforming naive compilation approaches while maintaining sub-second latency advantages over traditional RAG systems.

AINeutralarXiv – CS AI · May 97/10

🧠

SkillRet: A Large-Scale Benchmark for Skill Retrieval in LLM Agents

Researchers introduce SkillRet, a large-scale benchmark dataset containing 17,810 public agent skills designed to evaluate how language model agents retrieve appropriate tools from massive skill libraries. The benchmark demonstrates that current retrieval methods struggle significantly with realistic large-scale deployments, though task-specific fine-tuning improves performance by up to 16.9 points on key metrics.

AINeutralarXiv – CS AI · May 47/10

🧠

LLM-Oriented Information Retrieval: A Denoising-First Perspective

Researchers propose that information retrieval for LLMs requires a fundamental shift toward denoising—prioritizing signal quality over quantity—because unlike humans, language models are vulnerable to hallucinations when processing noisy or irrelevant data within limited context windows. The paper introduces a four-stage framework addressing IR challenges from inaccessibility to unverifiability, with practical applications across RAG systems, coding agents, and multimodal understanding.

AIBearisharXiv – CS AI · May 17/10

🧠

How Generative AI Disrupts Search: An Empirical Study of Google Search, Gemini, and AI Overviews

A comprehensive empirical study reveals that generative AI is fundamentally reshaping web search by retrieving different sources and presenting information differently than traditional search engines. The research finds that AI Overviews appear in over half of queries, tend to prioritize Google-owned content over institutional sources, and show lower consistency and robustness compared to standard search results.

🧠 Gemini

AIBullisharXiv – CS AI · Apr 147/10

🧠

Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning

Researchers introduce GRIP, a unified framework that integrates retrieval decisions directly into language model generation through control tokens, eliminating the need for external retrieval controllers. The system enables models to autonomously decide when to retrieve information, reformulate queries, and terminate retrieval within a single autoregressive process, achieving competitive performance with GPT-4o while using substantially fewer parameters.

🧠 GPT-4

AIBullisharXiv – CS AI · Apr 137/10

🧠

EigentSearch-Q+: Enhancing Deep Research Agents with Structured Reasoning Tools

Researchers introduce Q+, a structured reasoning toolkit that enhances AI research agents by making web search more deliberate and organized. Integrated into Eigent's browser agent, Q+ demonstrates consistent benchmark improvements of 0.6 to 3.8 percentage points across multiple deep-research tasks, suggesting meaningful progress in autonomous AI agent reliability.

🏢 Anthropic🧠 GPT-4🧠 GPT-5

AIBullisharXiv – CS AI · Mar 167/10

🧠

Towards AI Search Paradigm

Researchers introduce the AI Search Paradigm, a comprehensive framework for next-generation search systems using four LLM-powered agents (Master, Planner, Executor, Writer) that collaborate to handle everything from simple queries to complex reasoning tasks. The system employs modular architecture with dynamic workflows for task planning, tool integration, and content synthesis to create more adaptive and scalable AI search capabilities.

AIBullisharXiv – CS AI · Mar 47/104

🧠

Retrieval-Augmented Robots via Retrieve-Reason-Act

Researchers introduce Retrieval-Augmented Robotics (RAR), a new paradigm enabling robots to actively retrieve and use external visual documentation to execute complex tasks. The system uses a Retrieve-Reason-Act loop where robots search unstructured visual manuals, align 2D diagrams with 3D objects, and synthesize executable plans for assembly tasks.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Exploring Information Seeking Agent Consolidation

Researchers present the first systematic study consolidating specialized information-seeking agents into a single foundation model, comparing data-level mixing with parameter-level merging across 26 methods and 10 benchmarks. Parameter-level merging achieves comparable performance to data mixing at significantly lower training cost while better preserving out-of-domain capabilities, offering practical efficiency gains for cross-domain AI deployment.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Fuzzy Quantification over OWL Ontologies and Knowledge Graphs

Researchers have developed a framework for evaluating fuzzy quantification queries over OWL ontologies and knowledge graphs, enabling retrieval of individuals matching Type I or Type II fuzzy quantified expressions. The system is agnostic to quantifier types and data sources, with Q2S2 released as an open implementation for future research.

AIBullisharXiv – CS AI · Jun 256/10

🧠

Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents

Researchers introduce Membox, a hierarchical memory architecture for LLM agents that organizes dialogue history by topic continuity rather than semantic proximity. The system uses Topic Loom to group related turns and Trace Weaver to link events across sessions, achieving 13-19 percentage point F1 improvements over existing memory systems like Mem0 and A-MEM.

🧠 GPT-4

AINeutralarXiv – CS AI · Jun 236/10

🧠

Revisiting Text Ranking in Deep Research

Researchers conducted a systematic analysis of text ranking methods in deep research tasks, examining how LLM-based agents retrieve and process web information. The study reveals that agent-generated queries follow web-search syntax favoring lexical and sparse retrievers, passage-level units outperform documents under context constraints, and a new query-translation method significantly improves retrieval effectiveness.

AINeutralarXiv – CS AI · Jun 196/10

🧠

ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research

ScaffoldAgent introduces a dynamic outline optimization framework for open-ended deep research that evolves report structures through expansion, contraction, and revision operations. The system uses utility-guided feedback mechanisms to evaluate outline modifications based on retrieval gains and coherence, demonstrating improved performance on deep research benchmarks compared to existing approaches.

Page 1 of 4Next →