#ai-transparency News & Analysis

93 articles tagged with #ai-transparency. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

93 articles

AINeutralarXiv – CS AI · Jun 26/10

🧠

Bridging the Last Mile of Time Series Forecasting with LLM Agents

Researchers present an LLM-agent framework that enhances time series forecasting by incorporating business context and expert judgment into statistical predictions. The system bridges the gap between raw forecasts and decision-ready outputs through structured reasoning, contextual evidence retrieval, and auditable revision mechanisms.

AINeutralarXiv – CS AI · Jun 26/10

🧠

GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing

Researchers introduce GenPT (Generative Projective Testing), a novel psychometric methodology that uses AI-generated stimuli to assess the psychological states of language models more reliably than traditional self-report questionnaires. The approach mitigates contamination from training data and social-desirability bias, showing significantly greater sensitivity to contextual changes in depression assessment compared to conventional methods.

AINeutralarXiv – CS AI · Jun 16/10

🧠

Rationalize: Shared Semantic Reasoning for Human-AI Alignment

Researchers introduce Rationalize, a framework enabling shared semantic reasoning between humans and AI models through complementary role pairs (Explorer-Guide, Investigator-Informant, Teacher-Student, Judge-Advocate). The framework aims to align AI systems not just at the output level but by making purposes, questions, assumptions, and evidence explicit during human-AI collaboration, addressing bidirectional alignment challenges.

AINeutralarXiv – CS AI · Jun 16/10

🧠

Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems

Researchers propose a framework to attribute AI model behavior to specific development stages (pretraining, fine-tuning, alignment), enabling accountability tracking without model retraining. The method quantifies how each stage contributes to model outputs and can identify spurious correlations, advancing transparency in AI development.

AIBearishWired – AI · May 296/10

🧠

We Asked the ‘Future of Truth’ Author to Explain How He Used AI. It Didn’t Go Well

A book about AI's impact on truth and reality was criticized for using AI-generated quotes without disclosure, raising questions about the author's credibility and the broader issue of AI-generated content misrepresenting itself as authentic. The incident highlights the irony and risks when AI tools are deployed without transparency, particularly in works examining AI's societal implications.

AINeutralarXiv – CS AI · May 296/10

🧠

JMed48k: A Multi-Profession Japanese Medical Licensing Benchmark for Vision-Language Model Evaluation

Researchers introduce JMed48k, a comprehensive Japanese medical licensing benchmark containing 48,862 exam questions and 20,142 images to evaluate vision-language models across 11 healthcare professions. Testing 21 models reveals significant disparities in how effectively different AI systems leverage visual information, with proprietary models gaining substantially from images while medical-specific systems show limited visual utilization.

AINeutralarXiv – CS AI · May 286/10

🧠

Explaining is Harder Than Predicting Alone: Evaluating Concept-based Explanations of MLLMs as ICL Visual Classifiers

Researchers evaluated how multimodal large language models (MLLMs) explain their image classification decisions in few-shot learning scenarios. The study found that forcing models to generate formal, concept-based explanations actually reduces their predictive accuracy from 93.8% to 90.1%, suggesting that explicit reasoning doesn't universally improve performance despite being widely assumed to do so.

AINeutralarXiv – CS AI · May 286/10

🧠

AI, Take the Wheel: What Drives Delegation and Trust in Human-Computer Cooperative Question Answering?

A research study examines how humans decide to trust and rely on AI systems in collaborative question-answering tasks, identifying two distinct reliance patterns: delegation (autonomous AI action) and adoption (evaluating AI suggestions). The findings reveal humans make suboptimal trust decisions, both under-utilizing correct AI suggestions and over-relying on misleading AI outputs, with confirmation bias playing a significant role in trust calibration failures.

AINeutralarXiv – CS AI · May 286/10

🧠

Measuring Progress Toward AGI: A Cognitive Framework

Researchers propose a Cognitive Taxonomy framework to measure progress toward AGI by evaluating systems against 10 key cognitive faculties derived from psychology and neuroscience research. The framework aims to address the lack of standardized metrics for AGI advancement and provide empirical evaluation methods to support responsible AI governance.

AINeutralarXiv – CS AI · May 286/10

🧠

Integrated and Cross-Architecture Interpretation of LLM Reasoning

Researchers present the Integrated cross-Architecture Reasoning (IAR) framework, a novel methodology for interpreting how large language models perform reasoning tasks by combining multiple analytical probes—bandwidth-calibrated Mutual Information Peak, Deep-Thinking Ratio analysis, and Jaccard stability metrics—across model layers and architectures. Testing on Qwen and Llama models across mathematics, code, logic, and common sense domains demonstrates that this multi-metric approach provides more reliable insights into LLM reasoning patterns than single-probe methods.

🧠 Llama

AIBullisharXiv – CS AI · May 286/10

🧠

Tell Me a Story! Narrative-Driven XAI with Large Language Models

Researchers introduce XAIstories, a framework that uses Large Language Models to convert complex AI explanations (SHAP values and counterfactual explanations) into human-readable narratives. User studies show over 90% of general audiences find these AI-generated stories convincing, with data scientists viewing them as valuable for explaining AI decisions to non-technical stakeholders.

AINeutralThe Verge – AI · May 276/10

🧠

YouTube is putting AI labels where you’ll actually see them

YouTube is making AI-generated content labels more prominent and visible to users by relocating them directly below video players instead of hiding them in expanded descriptions. The platform is also implementing automatic detection and labeling of AI-generated content across Shorts and long-form videos, marking a significant shift in content transparency following Google's broader AI verification initiatives announced at I/O.

AINeutralarXiv – CS AI · May 276/10

🧠

How Reliable are LLMs for Reasoning on the Re-ranking task?

Researchers investigate whether Large Language Models reliably perform re-ranking tasks by analyzing how different training methods affect semantic understanding and reasoning transparency. The study reveals that some training approaches produce better explainability than others, suggesting LLMs may optimize for evaluation metrics rather than genuine semantic comprehension, raising concerns about their actual reliability in ranking applications.

GeneralNeutralOpenAI News · May 276/10

📰

Election information and safeguards in 2026

A technology platform is implementing measures to combat election misinformation ahead of 2026 global elections, focusing on information access, cybersecurity support, and AI transparency. The initiative addresses growing concerns about digital threats to electoral integrity and AI-generated disinformation during critical political events.

AIBearishSimon Willison Blog · May 226/10

🧠

FTC to Require Cox Media Group, Two Other Firms to Pay Nearly $1 Million to Settle Charges They Deceived Customers About “Active Listening” AI-Powered Marketing Service

The FTC has ordered Cox Media Group and two other companies to pay nearly $1 million to settle charges that they deceived customers about an 'active listening' AI-powered marketing service. The settlement addresses false claims regarding how the technology actually functioned and what customer data it collected.

AINeutralarXiv – CS AI · May 126/10

🧠

From Historical Tabular Image to Knowledge Graphs: A Provenance-Aware Modular Pipeline

Researchers present a modular, provenance-aware pipeline that converts handwritten archival tables into Knowledge Graphs while maintaining transparency through intermediate inspection points. The approach combines table structure recognition, handwriting recognition, and semantic interpretation while tracking data lineage to ensure all extracted information remains traceable to its source, addressing the opacity problem in end-to-end AI systems.

AINeutralarXiv – CS AI · May 96/10

🧠

DataDignity: Training Data Attribution for Large Language Models

Researchers introduce DataDignity, a new framework for attributing large language model outputs to specific training documents. The study presents FakeWiki, a benchmark of 3,537 fabricated Wikipedia articles designed to test provenance tracking, and proposes ScoringModel, a supervised contrastive ranker that improves document attribution accuracy from 35% to 52.2% recall compared to existing baselines.

AINeutralarXiv – CS AI · May 96/10

🧠

Visual Fingerprints for LLM Generation Comparison

Researchers have developed a visual fingerprinting method to compare Large Language Model outputs across different generation conditions by analyzing linguistic choices in content, expression, and structure. This approach enables pattern recognition in LLM behavior that is difficult to detect through individual responses or standard metrics, advancing model evaluation and prompt optimization techniques.

AINeutralarXiv – CS AI · May 96/10

🧠

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

Researchers propose concept-based abductive and contrastive explanations that identify minimal sets of high-level concepts causally relevant to vision model predictions. The approach combines human-interpretable concept-based explanations with formal causal reasoning, enabling better understanding of both individual predictions and common model behaviors across image collections.

AINeutralThe Register – AI · May 46/10

🧠

Shadow IT has given way to shadow AI. Enter AI-BOMs

The article discusses the emergence of AI-BOMs (AI Bills of Materials) as organizations struggle to manage uncontrolled AI deployments across their enterprises, similar to how shadow IT once operated outside official channels. This represents a critical shift in how companies must track, govern, and secure AI systems to mitigate compliance, security, and operational risks.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies

A research study comparing simulated AI interactions with real human subjects reveals that AI transparency significantly outweighs personality factors in determining interaction quality, with findings diverging notably between pure simulation and actual human experiments across hiring and transactional scenarios.

AINeutralarXiv – CS AI · Apr 206/10

🧠

AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency

Researchers introduce AtManRL, a method that combines differentiable attention manipulation with reinforcement learning to improve the faithfulness of chain-of-thought reasoning in large language models. By training attention masks to identify which tokens genuinely influence model predictions, the approach demonstrates that LLM reasoning traces can be made more interpretable and transparent.

🧠 Llama

AINeutralarXiv – CS AI · Apr 206/10

🧠

Towards Rigorous Explainability by Feature Attribution

A new research paper challenges the rigor of popular explainability methods in machine learning, particularly Shapley values and SHAP, arguing that non-symbolic approaches lack the mathematical foundation needed for high-stakes applications. The work advocates for symbolic methods as a more reliable alternative for determining feature importance in AI models.

AINeutralarXiv – CS AI · Apr 206/10

🧠

LLMbench: A Comparative Close Reading Workbench for Large Language Models

LLMbench is a new browser-based tool that enables detailed comparative analysis of large language model outputs through side-by-side visualization and token-level probability inspection. Unlike existing quantitative comparison tools, it applies digital humanities methodology to make the probabilistic structure of LLM-generated text legible through multiple analytical overlays and visualization modes.

AINeutralarXiv – CS AI · Apr 156/10

🧠

TRUST Agents: A Collaborative Multi-Agent Framework for Fake News Detection, Explainable Verification, and Logic-Aware Claim Reasoning

TRUST Agents is a multi-agent AI framework designed to improve fake news detection and fact verification by combining claim extraction, evidence retrieval, verification, and explainable reasoning. Unlike binary classification approaches, the system generates transparent, human-inspectable reports with logic-aware reasoning for complex claims, though it shows that retrieval quality and uncertainty calibration remain significant challenges in automated fact verification.

← PrevPage 3 of 4Next →