y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#ai-transparency News & Analysis

60 articles tagged with #ai-transparency. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

60 articles
AINeutralarXiv – CS AI · 4d ago6/10
🧠

Integrated and Cross-Architecture Interpretation of LLM Reasoning

Researchers present the Integrated cross-Architecture Reasoning (IAR) framework, a novel methodology for interpreting how large language models perform reasoning tasks by combining multiple analytical probes—bandwidth-calibrated Mutual Information Peak, Deep-Thinking Ratio analysis, and Jaccard stability metrics—across model layers and architectures. Testing on Qwen and Llama models across mathematics, code, logic, and common sense domains demonstrates that this multi-metric approach provides more reliable insights into LLM reasoning patterns than single-probe methods.

🧠 Llama
AIBullisharXiv – CS AI · 4d ago6/10
🧠

Tell Me a Story! Narrative-Driven XAI with Large Language Models

Researchers introduce XAIstories, a framework that uses Large Language Models to convert complex AI explanations (SHAP values and counterfactual explanations) into human-readable narratives. User studies show over 90% of general audiences find these AI-generated stories convincing, with data scientists viewing them as valuable for explaining AI decisions to non-technical stakeholders.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Explaining is Harder Than Predicting Alone: Evaluating Concept-based Explanations of MLLMs as ICL Visual Classifiers

Researchers evaluated how multimodal large language models (MLLMs) explain their image classification decisions in few-shot learning scenarios. The study found that forcing models to generate formal, concept-based explanations actually reduces their predictive accuracy from 93.8% to 90.1%, suggesting that explicit reasoning doesn't universally improve performance despite being widely assumed to do so.

AINeutralThe Verge – AI · 5d ago6/10
🧠

YouTube is putting AI labels where you’ll actually see them

YouTube is making AI-generated content labels more prominent and visible to users by relocating them directly below video players instead of hiding them in expanded descriptions. The platform is also implementing automatic detection and labeling of AI-generated content across Shorts and long-form videos, marking a significant shift in content transparency following Google's broader AI verification initiatives announced at I/O.

YouTube is putting AI labels where you’ll actually see them
AINeutralarXiv – CS AI · 5d ago6/10
🧠

How Reliable are LLMs for Reasoning on the Re-ranking task?

Researchers investigate whether Large Language Models reliably perform re-ranking tasks by analyzing how different training methods affect semantic understanding and reasoning transparency. The study reveals that some training approaches produce better explainability than others, suggesting LLMs may optimize for evaluation metrics rather than genuine semantic comprehension, raising concerns about their actual reliability in ranking applications.

GeneralNeutralOpenAI News · 5d ago6/10
📰

Election information and safeguards in 2026

A technology platform is implementing measures to combat election misinformation ahead of 2026 global elections, focusing on information access, cybersecurity support, and AI transparency. The initiative addresses growing concerns about digital threats to electoral integrity and AI-generated disinformation during critical political events.

AIBearishSimon Willison Blog · May 226/10
🧠

FTC to Require Cox Media Group, Two Other Firms to Pay Nearly $1 Million to Settle Charges They Deceived Customers About “Active Listening” AI-Powered Marketing Service

The FTC has ordered Cox Media Group and two other companies to pay nearly $1 million to settle charges that they deceived customers about an 'active listening' AI-powered marketing service. The settlement addresses false claims regarding how the technology actually functioned and what customer data it collected.

AINeutralarXiv – CS AI · May 126/10
🧠

From Historical Tabular Image to Knowledge Graphs: A Provenance-Aware Modular Pipeline

Researchers present a modular, provenance-aware pipeline that converts handwritten archival tables into Knowledge Graphs while maintaining transparency through intermediate inspection points. The approach combines table structure recognition, handwriting recognition, and semantic interpretation while tracking data lineage to ensure all extracted information remains traceable to its source, addressing the opacity problem in end-to-end AI systems.

AINeutralarXiv – CS AI · May 96/10
🧠

DataDignity: Training Data Attribution for Large Language Models

Researchers introduce DataDignity, a new framework for attributing large language model outputs to specific training documents. The study presents FakeWiki, a benchmark of 3,537 fabricated Wikipedia articles designed to test provenance tracking, and proposes ScoringModel, a supervised contrastive ranker that improves document attribution accuracy from 35% to 52.2% recall compared to existing baselines.

AINeutralarXiv – CS AI · May 96/10
🧠

Visual Fingerprints for LLM Generation Comparison

Researchers have developed a visual fingerprinting method to compare Large Language Model outputs across different generation conditions by analyzing linguistic choices in content, expression, and structure. This approach enables pattern recognition in LLM behavior that is difficult to detect through individual responses or standard metrics, advancing model evaluation and prompt optimization techniques.

AINeutralarXiv – CS AI · May 96/10
🧠

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

Researchers propose concept-based abductive and contrastive explanations that identify minimal sets of high-level concepts causally relevant to vision model predictions. The approach combines human-interpretable concept-based explanations with formal causal reasoning, enabling better understanding of both individual predictions and common model behaviors across image collections.

AINeutralThe Register – AI · May 46/10
🧠

Shadow IT has given way to shadow AI. Enter AI-BOMs

The article discusses the emergence of AI-BOMs (AI Bills of Materials) as organizations struggle to manage uncontrolled AI deployments across their enterprises, similar to how shadow IT once operated outside official channels. This represents a critical shift in how companies must track, govern, and secure AI systems to mitigate compliance, security, and operational risks.

AINeutralarXiv – CS AI · Apr 206/10
🧠

AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency

Researchers introduce AtManRL, a method that combines differentiable attention manipulation with reinforcement learning to improve the faithfulness of chain-of-thought reasoning in large language models. By training attention masks to identify which tokens genuinely influence model predictions, the approach demonstrates that LLM reasoning traces can be made more interpretable and transparent.

🧠 Llama
AINeutralarXiv – CS AI · Apr 206/10
🧠

Towards Rigorous Explainability by Feature Attribution

A new research paper challenges the rigor of popular explainability methods in machine learning, particularly Shapley values and SHAP, arguing that non-symbolic approaches lack the mathematical foundation needed for high-stakes applications. The work advocates for symbolic methods as a more reliable alternative for determining feature importance in AI models.

AINeutralarXiv – CS AI · Apr 206/10
🧠

LLMbench: A Comparative Close Reading Workbench for Large Language Models

LLMbench is a new browser-based tool that enables detailed comparative analysis of large language model outputs through side-by-side visualization and token-level probability inspection. Unlike existing quantitative comparison tools, it applies digital humanities methodology to make the probabilistic structure of LLM-generated text legible through multiple analytical overlays and visualization modes.

AINeutralarXiv – CS AI · Apr 156/10
🧠

TRUST Agents: A Collaborative Multi-Agent Framework for Fake News Detection, Explainable Verification, and Logic-Aware Claim Reasoning

TRUST Agents is a multi-agent AI framework designed to improve fake news detection and fact verification by combining claim extraction, evidence retrieval, verification, and explainable reasoning. Unlike binary classification approaches, the system generates transparent, human-inspectable reports with logic-aware reasoning for complex claims, though it shows that retrieval quality and uncertainty calibration remain significant challenges in automated fact verification.

AINeutralarXiv – CS AI · Apr 156/10
🧠

Reasoning about Intent for Ambiguous Requests

Researchers propose a method for large language models to handle ambiguous user requests by generating structured responses that enumerate multiple valid interpretations with corresponding answers, trained via reinforcement learning with dual reward objectives for coverage and precision.

AINeutralarXiv – CS AI · Apr 146/10
🧠

Explainable Planning for Hybrid Systems

A new thesis examines explainable AI planning (XAIP) for hybrid systems, addressing the critical challenge of making autonomous planning decisions interpretable in safety-critical applications. As AI automation expands into domains like autonomous vehicles, energy grids, and healthcare, the ability to explain system reasoning becomes essential for trust and regulatory compliance.

AIBullisharXiv – CS AI · Apr 146/10
🧠

AdaQE-CG: Adaptive Query Expansion for Web-Scale Generative AI Model and Data Card Generation

Researchers introduce AdaQE-CG, a framework that automatically generates model and data cards for AI systems with improved accuracy and completeness. The approach combines dynamic query expansion to extract information from papers with cross-card knowledge transfer to fill gaps, accompanied by MetaGAI-Bench, a new benchmark for evaluating documentation quality.

🏢 Meta🏢 Hugging Face
AINeutralarXiv – CS AI · Apr 146/10
🧠

Measuring the Authority Stack of AI Systems: Empirical Analysis of 366,120 Forced-Choice Responses Across 8 AI Models

Researchers conducted the first large-scale empirical analysis of AI decision-making across 366,120 responses from 8 major models, revealing measurable but inconsistent value hierarchies, evidence preferences, and source trust patterns. The study found significant framing sensitivity and domain-specific value shifts, with critical implications for deploying AI systems in professional contexts.

AIBearisharXiv – CS AI · Apr 136/10
🧠

How Similar Are Grokipedia and Wikipedia? A Multi-Dimensional Textual and Structural Comparison

Researchers conducted a large-scale computational analysis comparing 17,790 articles from Grokipedia, Elon Musk's AI-generated encyclopedia, against Wikipedia. The study found that Grokipedia articles are longer but contain fewer citations, with some entries showing systematic rightward political bias in media sources, particularly in history, religion, and arts sections.

🏢 xAI🧠 Grok
AIBearishCrypto Briefing · Apr 107/10
🧠

Mark Suman: AI systems can understand human thought patterns better than we do, the rapid pace of AI development outstrips ethical considerations, and the opacity of AI companies raises serious privacy concerns | The Peter McCormack Show

Mark Suman discusses concerns that AI systems may understand human thought patterns better than humans themselves understand them, while the rapid pace of AI development outpaces ethical frameworks and regulatory considerations. The opacity of AI companies raises significant privacy concerns that demand urgent attention from policymakers and industry stakeholders.

Mark Suman: AI systems can understand human thought patterns better than we do, the rapid pace of AI development outstrips ethical considerations, and the opacity of AI companies raises serious privacy concerns | The Peter McCormack Show
AINeutralarXiv – CS AI · Apr 106/10
🧠

Front-End Ethics for Sensor-Fused Health Conversational Agents: An Ethical Design Space for Biometrics

Researchers propose an ethical framework for sensor-fused health AI agents that combine biometric data with large language models. The paper identifies critical risks at the user-facing layer where sensor data is translated into health guidance, arguing that the perceived objectivity of biometrics can mask AI errors and turn them into harmful medical directives.

← PrevPage 2 of 3Next →