AIBullisharXiv – CS AI · Apr 136/10
🧠Researchers introduce RecaLLM, a post-trained language model that addresses the 'lost-in-thought' phenomenon where retrieval performance degrades during extended reasoning chains. The model interleaves explicit in-context retrieval with reasoning steps and achieves strong performance on long-context benchmarks using training data significantly shorter than existing approaches.
AIBullisharXiv – CS AI · Apr 136/10
🧠Researchers introduce VISOR, a new agentic visual retrieval-augmented generation system that improves how AI models reason over multi-page visual documents. By addressing key technical challenges in evidence gathering and context management, VISOR achieves state-of-the-art results on complex visual reasoning tasks.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers have developed Luwen, an open-source Chinese legal language model built on Baichuan that uses continual pre-training, supervised fine-tuning, and retrieval-augmented generation to excel at legal tasks. The model outperforms baselines on five legal benchmarks including judgment prediction, judicial examination, and legal reasoning, demonstrating effective domain adaptation for specialized legal applications.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers propose G-Defense, a graph-enhanced framework that uses large language models and retrieval-augmented generation to detect fake news while providing explainable, fine-grained reasoning. The system decomposes news claims into sub-claims, retrieves competing evidence, and generates transparent explanations without requiring verified fact-checking databases.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers introduce chain-of-illocution (CoI) prompting to improve source faithfulness in retrieval-augmented language models, achieving up to 63% gains in source adherence for programming education tasks. The study reveals that standard RAG systems exhibit low fidelity to source materials, with non-RAG models performing worse, while a user study confirms improved faithfulness does not compromise user satisfaction.
AIBullisharXiv – CS AI · Apr 76/10
🧠Researchers propose ScalDPP, a new retrieval mechanism for RAG systems that uses Determinantal Point Processes to optimize both density and diversity in context selection. The approach addresses limitations in current RAG pipelines that ignore interactions between retrieved information chunks, leading to redundant contexts that reduce effectiveness.
AIBullisharXiv – CS AI · Mar 276/10
🧠Researchers have developed UniAI-GraphRAG, an enhanced framework that improves upon existing GraphRAG systems for complex reasoning and multi-hop queries. The framework introduces three key innovations including ontology-guided extraction, multi-dimensional clustering, and dual-channel fusion, showing superior performance over mainstream solutions like LightRAG on benchmark tests.
AIBullisharXiv – CS AI · Mar 276/10
🧠CodeRefine is a new AI framework that automatically converts research paper methodologies into functional code using Large Language Models. The system creates knowledge graphs from papers and uses retrieval-augmented generation to produce more accurate code implementations than traditional zero-shot prompting methods.
AIBullisharXiv – CS AI · Mar 266/10
🧠Researchers propose MixDemo, a new GraphRAG framework that uses a Mixture-of-Experts mechanism to select high-quality demonstrations for improving large language model performance in domain-specific question answering. The framework includes a query-specific graph encoder to reduce noise in retrieved subgraphs and significantly outperforms existing methods across multiple textual graph benchmarks.
AIBearisharXiv – CS AI · Mar 266/10
🧠Research reveals that Retrieval-Augmented Generation (RAG) systems exhibit fairness issues, with queries from certain demographic groups systematically receiving higher accuracy than others. The study identifies three key factors affecting fairness: group exposure in retrieved documents, utility of group-specific documents, and attribution bias in how generators use different group documents.
🏢 Meta
AIBullisharXiv – CS AI · Mar 266/10
🧠Researchers propose a new four-phase architecture to reduce AI hallucinations using domain-specific retrieval and verification systems. The framework achieved win rates up to 83.7% across multiple benchmarks, demonstrating significant improvements in factual accuracy for large language models.
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers propose a new framework for large language models that separates planning from factual retrieval to improve reliability in fact-seeking question answering. The modular approach uses a lightweight student planner trained via teacher-student learning to generate structured reasoning steps, showing improved accuracy and speed on challenging benchmarks.
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers have developed QA-Dragon, a new Query-Aware Dynamic RAG System that significantly improves knowledge-intensive Visual Question Answering by combining text and image retrieval strategies. The system achieved substantial performance improvements of 5-6% across different tasks in the Meta CRAG-MM Challenge at KDD Cup 2025.
AIBullisharXiv – CS AI · Mar 166/10
🧠Researchers improved agentic Retrieval-Augmented Generation (RAG) systems by introducing contextualization and de-duplication modules to address inefficiencies in complex question-answering. The enhanced Search-R1 pipeline achieved 5.6% better accuracy and 10.5% fewer retrieval turns using GPT-4.1-mini.
🧠 GPT-4
AIBullisharXiv – CS AI · Mar 116/10
🧠Researchers propose TaSR-RAG, a new framework that improves Retrieval-Augmented Generation systems by using taxonomy-guided structured reasoning for better evidence selection. The system decomposes complex questions into triple sub-queries and performs step-wise evidence matching, achieving up to 14% performance improvements over existing RAG baselines on multi-hop question answering benchmarks.
AIBullisharXiv – CS AI · Mar 96/10
🧠Researchers introduce CBR-to-SQL, a new framework using Case-Based Reasoning to improve natural language-to-SQL translation for healthcare databases. The system addresses limitations of standard RAG approaches by using two-stage retrieval and abstract case templates, achieving state-of-the-art results on medical datasets.
AIBullisharXiv – CS AI · Mar 96/10
🧠Researchers introduce EpisTwin, a neuro-symbolic AI framework that creates Personal Knowledge Graphs from fragmented user data across applications. The system combines Graph Retrieval-Augmented Generation with visual refinement to enable complex reasoning over personal semantic data, addressing current limitations in personal AI systems.
AINeutralarXiv – CS AI · Mar 55/10
🧠Researchers propose RAGNav, a new AI framework that combines semantic reasoning with physical spatial modeling to solve multi-goal visual-language navigation tasks. The system uses a Dual-Basis Memory system integrating topological maps and semantic forests to eliminate spatial hallucinations and improve navigation planning efficiency.
AIBullisharXiv – CS AI · Mar 36/108
🧠Researchers have developed MED-COPILOT, an AI-powered clinical decision-support system that combines GraphRAG retrieval with similar patient case analysis to assist healthcare professionals. The system uses structured knowledge graphs from WHO and NICE guidelines along with a 36,000-case patient database to outperform standard AI models in clinical reasoning accuracy.
AINeutralarXiv – CS AI · Mar 36/107
🧠Researchers introduce MC-Search, the first benchmark for evaluating agentic multimodal retrieval-augmented generation (MM-RAG) systems with long, structured reasoning chains. The benchmark reveals systematic issues in current multimodal large language models and introduces Search-Align, a training framework that improves planning and retrieval accuracy.
AIBullisharXiv – CS AI · Mar 36/109
🧠Researchers introduce GAM-RAG, a training-free framework that improves Retrieval-Augmented Generation by building adaptive memory from past queries instead of relying on static indices. The system uses uncertainty-aware updates inspired by cognitive neuroscience to balance stability and adaptability, achieving 3.95% better performance while reducing inference costs by 61%.
AIBearisharXiv – CS AI · Mar 37/109
🧠Researchers have discovered MM-MEPA, a new attack method that can poison multimodal AI systems by manipulating only metadata while leaving visual content unchanged. The attack achieves up to 91% success rate in disrupting AI retrieval systems and proves resistant to current defense strategies.
AIBullisharXiv – CS AI · Mar 37/1010
🧠Researchers developed a new inference-time safety mechanism for code-generating AI models that uses retrieval-augmented generation to identify and fix security vulnerabilities in real-time. The approach leverages Stack Overflow discussions to guide AI code revision without requiring model retraining, improving security while maintaining interpretability.
AINeutralarXiv – CS AI · Mar 36/103
🧠Research on production RAG systems reveals that retrieval fusion techniques like multi-query retrieval and reciprocal rank fusion increase raw document recall but fail to improve end-to-end performance due to re-ranking limits and context constraints. The study found fusion variants actually decreased accuracy from 0.51 to 0.48 while adding latency overhead without corresponding benefits.
AIBullisharXiv – CS AI · Mar 36/105
🧠Researchers have developed REMem, a new framework that enables AI language agents to form and reason with episodic memory similar to humans. The system uses a two-phase approach with offline memory graph indexing and online agentic retrieval, showing significant improvements over existing memory systems like Mem0 and HippoRAG 2.