#knowledge-graphs News & Analysis

191 articles tagged with #knowledge-graphs. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

191 articles

AIBullisharXiv – CS AI · Jun 257/10

🧠

TheoremGraph: Bridging Formal and Informal Mathematics

Researchers introduce TheoremGraph, a unified dependency graph linking 11.7M informal mathematical statements from arXiv with 388,105 formal Lean 4 declarations through semantic embeddings. The infrastructure bridges the historically fragmented landscape of mathematical knowledge representation, enabling improved discovery and reasoning across both informal academic papers and formally verified mathematics.

🏢 Hugging Face

AIBullisharXiv – CS AI · Jun 237/10

🧠

VISTA Architect: A graph database-oriented health AI system demonstrated in multidisciplinary tumor boards

Stanford Medicine researchers unveiled VISTA Architect, a graph database-powered AI system that integrates large language models with electronic health records to achieve 96.4% accuracy in clinical data extraction for tumor board preparation. The architecture precomputes patient histories into organized knowledge graphs, reducing processing time and latency compared to traditional RAG approaches while maintaining full data provenance.

AIBullisharXiv – CS AI · Jun 237/10

🧠

TTFT-Aware Graph Chain-of-Thought:Distance-Indexed Neural A* for Low-Hallucination Multi-Hop Medical Reasoning

Researchers present GraphRAG, a production-grade system for medical LLMs that reduces hallucinations by constraining answers to verifiable paths within a 700K-node medical knowledge graph. Using Pruned Landmark Labeling and AStarNet heuristics, the system improves clinical reasoning accuracy while reducing latency and hallucination rates in fertility assistant applications.

AIBullisharXiv – CS AI · Jun 237/10

🧠

ARIA: A Causal-Aware Framework for Rescuing LLM Reasoning in Trustworthy Materials Discovery

Researchers introduce ARIA, a causal-aware framework that improves how Large Language Models reason about materials discovery by addressing 'contextual tunneling'—a bias where models over-rely on narrow retrieved evidence. ARIA uses a three-tier approach combining direct causal reasoning, physics-informed analogies, and parametric fallbacks, validated on a knowledge graph of 2,839 materials relations, enabling more trustworthy and auditable AI-assisted scientific discovery.

AIBullisharXiv – CS AI · Jun 197/10

🧠

Detecting Hallucinations for Large Language Model-based Knowledge Graph Reasoning

Researchers introduce LUCID, a novel hallucination detection method for large language models used in knowledge graph reasoning tasks. By combining LLM attention scores, knowledge graph semantics, and structural information through graph neural networks, LUCID achieves state-of-the-art performance across nine datasets, addressing a critical reliability gap in AI-driven knowledge systems.

AIBullisharXiv – CS AI · Jun 197/10

🧠

Beyond Accuracy: Measuring Logical Compliance of Predictive Models

Researchers introduce the Rule Violation Score (RVS), a new evaluation metric that measures whether predictive models respect logical and domain-specific constraints independently of accuracy. Unlike traditional metrics focused on prediction performance, RVS distinguishes between hard rules (strict constraints) and soft rules (statistical regularities), enabling assessment of logical consistency in high-stakes applications like finance and healthcare.

AIBullisharXiv – CS AI · Jun 117/10

🧠

LLMs+Graphs: Toward Graph-Native, Synergistic AI Systems

A research paper proposes synergistic AI systems that combine Large Language Models with graph computation and knowledge graphs to overcome LLMs' limitations in structured reasoning and multi-hop inference. The work outlines three complementary approaches: augmenting LLMs with graph computation, bidirectional integration between LLMs and knowledge graphs, and strengthening AI agents with graph algorithms for complex decision-making.

AIBullisharXiv – CS AI · Jun 107/10

🧠

Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents Where a Lean Retrieved Context Beats the Full History

Researchers introduce Engram, an open-source memory engine for LLM agents that achieves 83.6% accuracy on long-context tasks using only 9.6k tokens versus 79k for full-history baselines, demonstrating that selective retrieval outperforms exhaustive context replay while reducing computational costs by 8x.

AINeutralarXiv – CS AI · Jun 97/10

🧠

ANNEAL: Adapting LLM Agents via Governed Symbolic Patch Learning

Researchers introduce ANNEAL, a neuro-symbolic AI system that fixes recurring failures in LLM-based agents by directly repairing symbolic knowledge structures rather than adjusting prompts or weights. The system uses constrained generation and multi-dimensional validation to make persistent, auditable repairs, achieving zero failure rates on recurring faults where baseline approaches like ReAct and Reflexion retain 72-100% failure rates.

AIBullisharXiv – CS AI · Jun 97/10

🧠

A Multi-modal Agentic Co-pilot for Evidence Grounded Computational Pathology

PathPocket is a multimodal AI co-pilot system designed to assist pathologists by grounding diagnostic recommendations in verifiable medical evidence. Built on a comprehensive pathology knowledge base of 110,472 documents and 4.55 million entities, the system demonstrates significant improvements in diagnostic accuracy and pathologist confidence across 200,000+ real-world cases.

AIBullisharXiv – CS AI · Jun 57/10

🧠

Harnessing Structural Context for Entity Alignment Foundation Models

Researchers introduce ContextEA, an advanced foundation model for entity alignment across knowledge graphs that significantly improves upon existing approaches by better leveraging structural context. The model demonstrates superior transfer capabilities to unseen knowledge graph pairs, outperforming finetuned baselines without requiring task-specific adaptation.

AIBullisharXiv – CS AI · Jun 57/10

🧠

Reducing Hallucinations in Complex Question Answering using Simple Graph-based Retrieval-Augmented Generation (long version)

Researchers present a graph-based retrieval-augmented generation (RAG) system that reduces AI hallucinations by integrating lightweight graph structures with vector search tools. Testing on Wikipedia QA benchmarks shows the approach halves hallucinated answers while improving factual precision and recall with minimal token overhead.

AIBullisharXiv – CS AI · Jun 37/10

🧠

SkillDAG: Self-Evolving Typed Skill Graphs for LLM Skill Selection at Scale

SkillDAG introduces a typed directed graph system that models inter-skill relationships for LLM agents, enabling dynamic skill selection and structural learning during execution. The approach significantly outperforms existing baselines on ALFWorld and SkillsBench benchmarks, achieving 67.1% success and 27.3% reward by treating skill selection as a structural problem rather than a similarity-matching one.

🧠 GPT-5

AIBullisharXiv – CS AI · Jun 27/10

🧠

MemGraphRAG: Memory-based Multi-Agent System for Graph Retrieval-Augmented Generation

Researchers introduce MemGraphRAG, a memory-based multi-agent system that improves graph-based retrieval-augmented generation by maintaining global context across document corpora. The framework addresses limitations in existing GraphRAG methods by resolving logical conflicts and maintaining structural consistency, demonstrating superior performance on multiple benchmarks.

AIBullisharXiv – CS AI · Jun 27/10

🧠

SHERLOCK: Towards Dynamic Knowledge Adaptation in LLM-enhanced E-commerce Risk Management

Sherlock is an AI framework that combines Large Language Models with structured domain knowledge to automate e-commerce fraud investigation and risk management. Deployed at JD.com, it achieved an 82% expert acceptance rate and 386.7% throughput increase while continuously adapting to evolving fraud tactics through a self-improving data flywheel.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Grokers: Bottom-Up Inductive Comprehension and Write-Time Intelligence over Typed Knowledge Graphs

Grokers introduces an architecture that shifts AI comprehension costs from query time to write time by using autonomous agents to pre-analyze and enrich typed knowledge graphs, eliminating repeated language model calls through inductive dependency traversal. The system proves three formal theorems about cache efficiency, interaction resolution, and correct traversal ordering while providing a deterministic alternative to embedding-based search.

AIBullisharXiv – CS AI · Jun 17/10

🧠

MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation

Researchers propose MedCoG, a meta-cognitive agent that improves Large Language Model efficiency in medical reasoning by dynamically regulating knowledge utilization based on self-assessed task complexity and familiarity. The approach achieves 6.2x inference density improvement while reducing computational costs and improving accuracy on medical benchmarks.

AIBullisharXiv – CS AI · Jun 17/10

🧠

Graph Machine Learning in the Era of Large Language Models (LLMs)

A comprehensive survey examines the convergence of Graph Machine Learning and Large Language Models, exploring how LLMs can enhance graph neural networks while graphs provide factual knowledge to improve LLM reasoning and reduce hallucinations. This bidirectional relationship addresses key challenges in both domains, including data labeling, heterophily, and out-of-distribution generalization.

AIBullisharXiv – CS AI · May 297/10

🧠

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

OmniRetrieval is a new framework that enables unified retrieval across heterogeneous knowledge sources—including unstructured text, relational databases, knowledge graphs, and property graphs—by translating natural language queries into source-native queries rather than forcing all data into a homogenized format. The system demonstrates superior performance compared to single-source retrievers across 13 datasets and 309 knowledge bases, positioning it as a general-purpose interface that preserves the structural advantages of each knowledge source.

AIBullisharXiv – CS AI · May 297/10

🧠

Compass: Navigating Global Marine Lead Data Integration through Expert-Guided LLM Agent

Researchers introduced Compass, an LLM agent framework that extracts marine lead data from 230,000+ academic papers without fine-tuning, successfully creating the largest integrated marine lead database with 3,751 previously uncatalogued records and 92% accuracy. The expert-guided approach demonstrates how domain-specific knowledge can overcome LLM hallucinations in high-stakes scientific applications.

AIBullisharXiv – CS AI · May 287/10

🧠

FD-RAG: Federated Dual-System Retrieval-Augmented Generation

FD-RAG introduces a federated framework for retrieval-augmented generation that enables decentralized LLM deployment across edge devices without centralizing sensitive data. The system achieves 7.8% accuracy improvements and 8.4x latency reductions by splitting lightweight memory access from expensive LLM reasoning, while aggregating anonymized knowledge across fragmented device networks.

AIBullisharXiv – CS AI · May 287/10

🧠

Knowledge Graph-Driven Expert-Level Reasoning for Neuroscience

Researchers demonstrate that knowledge graphs extracted from a single neuroscience textbook can be converted into high-quality training data to fine-tune language models, enabling expert-level reasoning that outperforms larger LLMs while using far fewer parameters. This approach challenges the prevailing assumption that domain expertise requires massive, diverse datasets, showing instead that structured, curated knowledge can produce superior specialized AI systems.

AIBullisharXiv – CS AI · May 287/10

🧠

FundaPod: A Multi-Persona Agent Pod Platform with Knowledge Graph Memory for AI-Assisted Fundamental Investment Research

FundaPod introduces a multi-persona AI agent platform designed to assist institutional investors in fundamental research by enabling independent agents with different investment perspectives to conduct analysis and surface disagreements for human portfolio manager review. The system uses knowledge graphs and grounded evidence models to create transparent, verifiable investment memos that prioritize human-centric decision-making over automated trading signals.

AIBullisharXiv – CS AI · May 277/10

🧠

GraphDancer: Training LLMs to Explore and Reason over Graphs via Two-Stage Curriculum Post-Training

GraphDancer is a new post-training framework that enables large language models to reason over heterogeneous graph-structured data by combining natural-language reasoning with graph function execution. The two-stage curriculum approach uses structural complexity ordering to teach models to explore and reason over graphs, achieving strong cross-domain generalization with only a 3B parameter backbone.

AIBullisharXiv – CS AI · May 127/10

🧠

EpiGraph: A Knowledge Graph and Benchmark for Evidence-Intensive Reasoning in Epilepsy

Researchers have developed EpiGraph, a comprehensive knowledge graph containing 24,324 entities and 32,009 evidence-grounded triplets from 48,166 peer-reviewed papers to improve AI-driven epilepsy diagnosis and treatment. The accompanying EpiBench benchmark demonstrates that integrating structured clinical knowledge into large language models significantly enhances clinical reasoning, with improvements up to 41% in pharmacogenomic applications.

Page 1 of 8Next →