#structured-data News & Analysis

20 articles tagged with #structured-data. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

20 articles

AINeutralarXiv – CS AI · Jun 197/10

🧠

LLM Doesn't Know What It Doesn't Know: Detecting Epistemic Blind Spots via Cross-Model Attribution Divergence on Clinical Tabular Data

Researchers demonstrate that Large Language Models lack genuine self-awareness regarding their knowledge limitations when applied to clinical tabular data, using cross-model attribution divergence to detect epistemic blind spots. LLM confidence scores remain constant regardless of actual accuracy, while a novel cross-model calibrator achieves reliable uncertainty quantification without model access or retraining.

AIBearisharXiv – CS AI · Jun 117/10

🧠

Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data

Researchers identify a fundamental limitation in large language models' ability to adapt to structured data through in-context learning, discovering that LLMs fail to update their categorical token distributions learned during pre-training even with additional examples. While parameter-efficient fine-tuning overcomes this constraint, it introduces memorization risks and potential instability in structured output generation.

AIBullisharXiv – CS AI · Jun 57/10

🧠

Synthetic Contrastive Reasoning for Multi-Table Q&A

Researchers have developed a synthetic dataset and training method that significantly improves multi-table question-answering systems. By generating contrastive reasoning traces and fine-tuning open-weight language models with Contrastive Preference Optimization, the approach achieves 9.7-21 percentage point improvements over standard supervised fine-tuning methods.

🧠 Llama

AIBullisharXiv – CS AI · May 287/10

🧠

Knowledge Graph-Driven Expert-Level Reasoning for Neuroscience

Researchers demonstrate that knowledge graphs extracted from a single neuroscience textbook can be converted into high-quality training data to fine-tune language models, enabling expert-level reasoning that outperforms larger LLMs while using far fewer parameters. This approach challenges the prevailing assumption that domain expertise requires massive, diverse datasets, showing instead that structured, curated knowledge can produce superior specialized AI systems.

AIBullisharXiv – CS AI · May 277/10

🧠

GraphDancer: Training LLMs to Explore and Reason over Graphs via Two-Stage Curriculum Post-Training

GraphDancer is a new post-training framework that enables large language models to reason over heterogeneous graph-structured data by combining natural-language reasoning with graph function execution. The two-stage curriculum approach uses structural complexity ordering to teach models to explore and reason over graphs, achieving strong cross-domain generalization with only a 3B parameter backbone.

AIBullishcrypto.news · May 47/10

🧠

SAP to acquire Prior Labs in push to scale tabular AI capabilities

SAP is acquiring AI startup Prior Labs for over €1 billion to expand its tabular AI capabilities for structured business data processing. The acquisition strengthens SAP's position in enterprise AI by adding specialized models designed to work with the types of data most common in business applications.

AINeutralarXiv – CS AI · Jun 46/10

🧠

QO-Bench: Diagnosing Query-Operator-Preserving Retrieval over Typed Event Tuples

Researchers introduce QO-Bench, a diagnostic benchmark for evaluating retrieval-augmented generation (RAG) systems on structured database-style queries over text. The benchmark reveals that current RAG systems excel at finding relevant passages but fail to preserve typed values needed for query operators like joins and counting, identifying operator execution rather than retrieval as the core bottleneck.

AINeutralarXiv – CS AI · May 296/10

🧠

Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems

Researchers benchmark token-optimized data formats (TRON and TOON) against JSON in agentic AI systems, finding TRON reduces token consumption by up to 27% with acceptable accuracy trade-offs. The study reveals that while these alternatives show promise in isolated tasks, their real-world performance in multi-turn agent loops exposes limitations, particularly with TOON's parsing cascades and parallel tool-call handling.

AINeutralarXiv – CS AI · May 296/10

🧠

Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?

Researchers propose replacing LLM-based triggers in proactive agent systems with a lightweight temporal graph learning (TGL) model that processes structured event streams directly. The approach achieves 16.7% mean F1 improvement while running 4-7x faster on GPUs and 12-83x faster on consumer hardware, with a 220 MiB footprint suitable for on-device deployment.

AIBullisharXiv – CS AI · May 276/10

🧠

Knowledge Graphs as the Missing Data Layer for LLM-Based Industrial Asset Operations

Researchers demonstrate that knowledge graphs significantly outperform traditional document stores for LLM-based industrial asset operations, achieving 100% accuracy on 467 maintenance scenarios compared to 65% with flat data structures. The study reveals that data architecture, not LLM orchestration design, is the primary performance bottleneck in structured operational domains.

🏢 Hugging Face🧠 GPT-4

AINeutralarXiv – CS AI · May 276/10

🧠

Practical Anonymous Two-Party Gradient Boosting Decision Tree

Researchers introduce an anonymous gradient-boosted decision tree (GBDT) protocol enabling secure training on vertically partitioned data between two parties while hiding record identifiers. The approach uses dual circuit-PSI and oblivious pseudorandom functions to eliminate ID exposure risks inherent in standard private set intersection methods, while achieving computational efficiency comparable to non-private approaches.

AINeutralarXiv – CS AI · May 126/10

🧠

Key Coverage Matters: Semi-Structured Extraction of OCR Clinical Reports

Researchers developed a semi-structured extraction method for digitizing fragmented clinical reports using OCR and question-answering models, introducing 'key coverage' as a metric to measure data completeness. The approach achieves F1 scores above 0.83 on real-world hospital data from 20+ institutions using a lightweight BERT model, demonstrating that canonical key inventory completeness drives extraction performance.

AINeutralarXiv – CS AI · May 16/10

🧠

When 2D Tasks Meet 1D Serialization: On Serialization Friction in Structured Tasks

Researchers demonstrate that Large Language Models perform significantly better on 2D structured tasks when given visual representations rather than serialized text inputs. The study reveals that converting 2D data into 1D token sequences creates representational friction that degrades model performance, with gaps widening as task complexity increases.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Integrating Graphs, Large Language Models, and Agents: Reasoning and Retrieval

A comprehensive survey examines how Large Language Models can be effectively integrated with graph-based data structures to improve reasoning, retrieval, and decision-making across domains. The research categorizes integration approaches by purpose, graph type, and strategy, providing practitioners with guidance on selecting appropriate techniques for specific applications in healthcare, finance, robotics, and other fields.

AINeutralarXiv – CS AI · Apr 136/10

🧠

ASTRA: Adaptive Semantic Tree Reasoning Architecture for Complex Table Question Answering

Researchers introduce ASTRA, a new architecture designed to improve how large language models process and reason about complex tables through adaptive semantic tree structures. The method combines tree-based navigation with symbolic code execution to achieve state-of-the-art performance on table question-answering benchmarks, addressing fundamental limitations in how tables are currently serialized for LLMs.

AINeutralarXiv – CS AI · Mar 96/10

🧠

Towards Neural Graph Data Management

Researchers introduce NGDBench, a comprehensive benchmark for evaluating neural networks' ability to work with graph databases across five domains including finance and medicine. The benchmark supports full Cypher query language capabilities and reveals significant limitations in current AI models when handling structured graph data, noise, and complex analytical tasks.

AINeutralarXiv – CS AI · Mar 55/10

🧠

Knowledge Graph and Hypergraph Transformers with Repository-Attention and Journey-Based Role Transport

Researchers present a new transformer architecture that jointly trains on natural language and structured data by maintaining separate knowledge and language representations. The model uses a key-value repository system with journey-based role transport to enable cross-attention between linguistic context and structured knowledge graphs.

AINeutralarXiv – CS AI · Mar 54/10

🧠

Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation

A benchmark study compares Token-Oriented Object Notation (TOON) with JSON for structured data serialization in LLMs, finding that while TOON reduces token usage, plain JSON shows better accuracy overall. The research reveals that TOON's efficiency benefits may only emerge at scale where syntax savings offset the initial prompt overhead.

AINeutralarXiv – CS AI · Mar 34/106

🧠

LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

Researchers developed LexChronos, an AI framework that extracts structured event timelines from Indian Supreme Court judgments using a dual-agent architecture. The system achieved 0.8751 F1 score on synthetic data and showed 75% preference over unstructured approaches in legal text summarization tasks.

AINeutralarXiv – CS AI · Mar 24/106

🧠

Heterogeneous Multi-Agent Reinforcement Learning with Attention for Cooperative and Scalable Feature Transformation

Researchers propose a new multi-agent reinforcement learning framework that uses three cooperative agents with attention mechanisms to automate feature transformation for machine learning models. The approach addresses key limitations in existing automated feature engineering methods, including dynamic feature expansion instability and insufficient agent cooperation.