AIBullisharXiv – CS AI · 2d ago7/10
🧠OmniRetrieval is a new framework that enables unified retrieval across heterogeneous knowledge sources—including unstructured text, relational databases, knowledge graphs, and property graphs—by translating natural language queries into source-native queries rather than forcing all data into a homogenized format. The system demonstrates superior performance compared to single-source retrievers across 13 datasets and 309 knowledge bases, positioning it as a general-purpose interface that preserves the structural advantages of each knowledge source.
AIBullisharXiv – CS AI · 3d ago7/10
🧠Researchers introduce RAG-Coding, an AI system using multiple LLM agents enhanced with retrieval-augmented generation to automate ICD-10-CM medical coding. The method outperforms baseline LLM approaches by 8-13% in accuracy and maintains clinical compliance by grounding decisions in official coding guidelines, while a newly released updated dataset enables evaluation against 2025 standards.
AINeutralarXiv – CS AI · May 47/10
🧠Researchers propose that information retrieval for LLMs requires a fundamental shift toward denoising—prioritizing signal quality over quantity—because unlike humans, language models are vulnerable to hallucinations when processing noisy or irrelevant data within limited context windows. The paper introduces a four-stage framework addressing IR challenges from inaccessibility to unverifiability, with practical applications across RAG systems, coding agents, and multimodal understanding.
AIBullisharXiv – CS AI · Apr 147/10
🧠Researchers introduce GRIP, a unified framework that integrates retrieval decisions directly into language model generation through control tokens, eliminating the need for external retrieval controllers. The system enables models to autonomously decide when to retrieve information, reformulate queries, and terminate retrieval within a single autoregressive process, achieving competitive performance with GPT-4o while using substantially fewer parameters.
🧠 GPT-4
AINeutralarXiv – CS AI · Apr 107/10
🧠Researchers introduce ATANT, an open evaluation framework designed to measure whether AI systems can maintain coherent context and continuity across time without confusing information across different narratives. The framework achieves up to 100% accuracy in isolated scenarios but drops to 96% when managing 250 simultaneous narratives, revealing practical limitations in current AI memory architectures.
AIBullisharXiv – CS AI · Apr 77/10
🧠Researchers propose PassiveQA, a new AI framework that teaches language models to recognize when they don't have enough information to answer questions, choosing to ask for clarification or abstain rather than hallucinate responses. The three-action system (Answer, Ask, Abstain) uses supervised fine-tuning to align model behavior with information sufficiency, showing significant improvements in reducing hallucinations.
AIBullisharXiv – CS AI · Apr 77/10
🧠Researchers propose a new approach to Generative Engine Optimization (GEO) that moves beyond current RAG-based systems to deterministic multi-agent platforms. The study introduces mathematical models for confidence decay in LLMs and demonstrates near-zero hallucination rates through specialized agent routing in industrial applications.
AINeutralarXiv – CS AI · Apr 67/10
🧠Researchers published a comprehensive technical survey on Large Language Model augmentation strategies, examining methods from in-context learning to advanced Retrieval-Augmented Generation techniques. The study provides a unified framework for understanding how structured context at inference time can overcome LLMs' limitations of static knowledge and finite context windows.
AIBullisharXiv – CS AI · Mar 277/10
🧠Researchers introduce WriteBack-RAG, a framework that treats knowledge bases in retrieval-augmented generation systems as trainable components rather than static databases. The method distills relevant information from documents into compact knowledge units, improving RAG performance across multiple benchmarks by an average of +2.14%.
AIBearisharXiv – CS AI · Mar 277/10
🧠Researchers have identified a new attack vector called Epistemic Bias Injection (EBI) that manipulates AI language models by injecting factually correct but biased content into retrieval-augmented generation databases. The attack steers model outputs toward specific viewpoints while evading traditional detection methods, though a new defense mechanism called BiasDef shows promise in mitigating these threats.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers introduce APEX-Searcher, a new framework that enhances large language models' search capabilities through a two-stage approach combining reinforcement learning for strategic planning and supervised fine-tuning for execution. The system addresses limitations in multi-hop question answering by decoupling retrieval processes into planning and execution phases, showing significant improvements across multiple benchmarks.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers propose p²RAG, a new privacy-preserving Retrieval-Augmented Generation system that supports arbitrary top-k retrieval while being 3-300x faster than existing solutions. The system uses an interactive bisection method instead of sorting and employs secret sharing across two servers to protect user prompts and database content.
$RAG
AINeutralarXiv – CS AI · Mar 177/10
🧠This research paper examines how agentic AI systems that can act autonomously challenge existing legal and financial regulatory frameworks. The authors argue that AI governance must shift from model-level alignment to institutional governance structures that create compliant behavior through mechanism design and runtime constraints.
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers introduce MMGraphRAG, a new AI framework that addresses hallucination issues in large language models by integrating visual scene graphs with text knowledge graphs through cross-modal fusion. The system uses SpecLink for entity linking and demonstrates superior performance in multimodal information processing across multiple benchmarks.
AINeutralarXiv – CS AI · Mar 57/10
🧠Researchers propose RAG-X, a diagnostic framework for evaluating retrieval-augmented generation systems in medical AI applications. The study reveals an 'Accuracy Fallacy' showing a 14% gap between perceived system success and actual evidence-based grounding in medical question-answering systems.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers introduce OSCAR, a new query-dependent online soft compression method for Retrieval-Augmented Generation (RAG) systems that reduces computational overhead while maintaining performance. The method achieves 2-5x speed improvements in inference with minimal accuracy loss across LLMs from 1B to 24B parameters.
🏢 Hugging Face
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers developed MA-RAG, a Multi-Round Agentic RAG framework that improves medical AI reasoning by iteratively refining responses through conflict detection and external evidence retrieval. The system achieved a substantial +6.8 point accuracy improvement over baseline models across 7 medical Q&A benchmarks by addressing hallucinations and outdated knowledge in healthcare AI applications.
AIBullisharXiv – CS AI · Mar 56/10
🧠Researchers demonstrate that coreference resolution significantly improves Retrieval-Augmented Generation (RAG) systems by reducing ambiguity in document retrieval and enhancing question-answering performance. The study finds that smaller language models benefit more from disambiguation processes, with mean pooling strategies showing superior context capturing after coreference resolution.
AIBullisharXiv – CS AI · Mar 47/102
🧠Researchers have enhanced the Saarthi AI framework for formal verification, achieving 70% better accuracy in generating SystemVerilog assertions and 50% fewer iterations to reach coverage closure. The framework uses multi-agent collaboration and improved RAG techniques to move toward domain-specific AI intelligence for verification tasks.
AIBullisharXiv – CS AI · Feb 277/108
🧠Researchers introduce RAGdb, a revolutionary architecture that consolidates Retrieval-Augmented Generation into a single SQLite container, eliminating the need for cloud infrastructure and GPUs. The system achieves 100% entity retrieval accuracy while reducing disk footprint by 99.5% compared to traditional Docker-based RAG stacks, enabling truly portable AI applications for edge computing and privacy-sensitive environments.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce EngiAI, a multi-agent LLM framework with a comprehensive benchmark suite for evaluating AI systems on complex engineering design tasks combining simulation, retrieval, and manufacturing. The framework reveals significant performance gaps between proprietary models (96-97% task completion) and open-source alternatives (55-78%), with conditional reasoning emerging as a critical failure point.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers introduce LitSeg, a narrative-theory-guided framework for intelligently segmenting literary documents in Retrieval-Augmented Generation systems. The method uses multi-stage prompting to identify plot events and narrative structures, with a lightweight variant (LitSeg-Lite) that distills this complexity into a single inference pass, demonstrating improved retrieval accuracy for literary RAG applications.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers have developed an AI agent framework that automates the translation of legacy finite-difference code into Devito, a modern computational framework. The system combines retrieval-augmented generation (RAG) with large language models and implements reinforcement learning feedback mechanisms to enable dynamic code transformation with validation across correctness, structure, and API compliance.
AIBullisharXiv – CS AI · May 116/10
🧠Researchers have developed an AI Teaching & Learning Assistant, a Moodle plugin using Retrieval-Augmented Generation (RAG) to provide students with Socratic tutoring while enabling educators to supervise content generation. The system grounds LLM responses in teacher-provided materials to minimize hallucinations and misinformation, achieving high faithfulness scores (0.97) and strong user satisfaction (4.00/5.00 rating).
AIBullisharXiv – CS AI · May 96/10
🧠Researchers propose a two-stage approach to improve reliability in retrieval-augmented generation (RAG) systems by using conformal prediction to filter retrieved content and an attention-based classifier to detect factual inconsistencies. The framework achieves up to 6% answer quality improvement and 77% inconsistency detection, advancing toward certified RAG systems for production AI applications.