#explainability News & Analysis

34 articles tagged with #explainability. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

34 articles

AIBullisharXiv – CS AI · 3d ago7/10

🧠

Beyond Binary Moral Judgment: Modeling Ethical Pluralism in AI

Researchers propose a framework for modeling AI moral reasoning as a probabilistic distribution across multiple ethical theories rather than binary judgments. The approach achieves 88.89% accuracy in classifying ethical dilemmas by integrating consequentialism, virtue ethics, and deontology, advancing AI alignment and accountability in decision-making systems.

AIBearisharXiv – CS AI · 3d ago7/10

🧠

From Accuracy to Auditability: A Survey of Determinism in Financial AI Systems

A comprehensive survey reveals that machine learning systems deployed in regulated financial sectors—credit risk, fraud detection, and anti-money laundering—suffer from reproducibility failures caused by hardware-level nondeterminism in neural networks and generative AI. The research quantifies specific vulnerabilities across tabular models, graph networks, and LLM-based workflows, proposing evaluation frameworks to improve auditability in financial AI systems.

AIBullisharXiv – CS AI · May 127/10

🧠

Deep Arguing

Researchers introduce Deep Arguing, a neurosymbolic method that combines deep learning with argumentation reasoning to create interpretable AI classification models. The approach constructs argumentative structures where data points support or attack predictions, enabling end-to-end learning while providing human-understandable explanations for model decisions.

AIBullisharXiv – CS AI · May 127/10

🧠

Hierarchical Attention-based Graph Neural Network with Relevance-driven Pruning

Researchers introduce HA-HeteroGNN, a Graph Neural Network framework that improves both interpretability and efficiency through hierarchical attention mechanisms and relevance-driven pruning. The approach achieves a 27% reduction in graph edges while improving classification accuracy by up to 2.46%, alongside 43.9% training time reductions.

AIBullisharXiv – CS AI · Apr 207/10

🧠

EVIL: Evolving Interpretable Algorithms for Zero-Shot Inference on Event Sequences and Time Series with LLMs

Researchers introduce EVIL, an LLM-guided evolutionary approach that discovers interpretable Python algorithms for zero-shot inference on time series and event sequences without traditional neural network training. The evolved algorithms match or exceed deep learning performance while remaining transparent and significantly faster, demonstrating a novel paradigm for dynamical systems inference.

AIBullisharXiv – CS AI · Apr 207/10

🧠

Prototype-Grounded Concept Models for Verifiable Concept Alignment

Researchers introduce Prototype-Grounded Concept Models (PGCMs), a new approach to interpretable AI that grounds abstract concepts in visual prototypes—concrete image parts that serve as evidence. Unlike previous Concept Bottleneck Models, PGCMs enable direct verification of whether learned concepts match human intentions, substantially improving transparency and allowing targeted corrections without sacrificing predictive performance.

AINeutralarXiv – CS AI · Apr 77/10

🧠

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Researchers introduce 'error verifiability' as a new metric to measure whether AI-generated justifications help users distinguish correct from incorrect answers. The study found that common AI improvement methods don't enhance verifiability, but two new domain-specific approaches successfully improved users' ability to assess answer correctness.

AINeutralarXiv – CS AI · 2d ago6/10

🧠

Xetrieval: Mechanistically Explaining Dense Retrieval

Researchers introduce Xetrieval, a mechanistic framework that explains how dense retrieval models assign relevance scores by decomposing high-dimensional embeddings into interpretable features. The method uses a lightweight reasoning internalizer to enrich embeddings with reasoning information and provides human-readable feature-level explanations of retrieval decisions, advancing transparency in neural information retrieval systems.

AINeutralarXiv – CS AI · 2d ago6/10

🧠

Structured Prompt Optimization Meets Reinforcement Learning for Global and Local Interpretability over Complex Text

Researchers introduce eXTC, a new framework combining structured prompt optimization with reinforcement learning to create interpretable text classifiers that balance performance with explainability. The system generates human-readable domain rules while maintaining inference speed through knowledge distillation, addressing a longstanding trade-off in AI transparency.

AINeutralarXiv – CS AI · 2d ago6/10

🧠

SafeRx-Agent: A Knowledge-Grounded Multi-Agent Framework for Safe and Explainable Medication Recommendation

Researchers introduce SafeRx-Agent, a multi-agent AI framework designed to improve medication recommendation systems by integrating clinical knowledge, safety verification, and explainability. The system addresses limitations in existing approaches by using fine-grained drug classification (ATC codes) and demonstrating improved accuracy while controlling for drug interactions and contraindications on MIMIC datasets.

AINeutralarXiv – CS AI · 3d ago6/10

🧠

Show, Don't TELL: Explainable AI-Generated Text Detection

Researchers have developed TELL, an AI-generated text detector that prioritizes explainability by showing users the specific linguistic markers indicating AI or human authorship rather than just providing an opaque numerical score. The system achieves competitive detection performance (AUROC 0.927) while generating human-evaluated explanations with a 72.3% mean win-rate across quality metrics, fundamentally reframing detection as a human-centric interpretability problem.

AINeutralarXiv – CS AI · 3d ago6/10

🧠

Do Models Know Why They Changed Their Mind? Interpretability and Faithfulness of Chain-of-Thought Under Knowledge Conflict

Researchers found that large language models' chain-of-thought reasoning remains remarkably consistent even when reaching opposite conclusions about conflicting information, suggesting CoT explanations don't faithfully reflect the underlying decision mechanism. While model confidence shows weak but genuine predictive signal for decisions, internal reasoning tokens proved more decision-sensitive than user-facing explanations, indicating models may not transparently report how they actually choose between document claims and training knowledge.

🧠 GPT-4🧠 Claude🧠 Sonnet

AINeutralarXiv – CS AI · 3d ago6/10

🧠

Performance and Explainability Requirements of Evolutionary Algorithms in Real-World Physics-Informed Optimization

Researchers identify a significant gap between evolutionary computation research and real-world physics-based optimization applications. Domain experts consistently require fast convergence and algorithm explainability, but existing evolutionary algorithm techniques remain underutilized in complex practical scenarios due to trust and performance concerns.

AINeutralarXiv – CS AI · 4d ago6/10

🧠

READER: Reasoning-Enhanced AI-Generated Text Detection

Researchers have developed READER, a compact AI text detector with only 1.5B parameters that outperforms much larger language models and existing detection systems. READER combines classification with explainable reasoning, providing both AI/human verdicts and structured rationales for its decisions, addressing critical limitations in current detection methods that fail under distribution shifts.

🧠 GPT-5🧠 Gemini

AINeutralarXiv – CS AI · May 126/10

🧠

Attribution-based Explanations for Markov Decision Processes

Researchers have developed attribution techniques that explain decision-making in Markov Decision Processes (MDPs), extending explainability methods beyond static inputs to sequential decision-making systems. The approach assigns importance scores to states and execution paths, enabling more interpretable AI agents in dynamic environments.

AINeutralarXiv – CS AI · May 115/10

🧠

Cognitive Agent Compilation for Explicit Problem Solver Modeling

Researchers propose Cognitive Agent Compilation (CAC), a framework that uses large language models to create explicit, inspectable problem-solving agents for educational applications. The approach separates knowledge representation, problem-solving policy, and verification rules to make AI systems more controllable and transparent than standard LLMs, though it reveals trade-offs between interpretability and scalability.

AINeutralarXiv – CS AI · May 116/10

🧠

Why Self-Inconsistency Arises in GNN Explanations and How to Exploit It

Researchers identify why Graph Neural Network explanations produce inconsistent results when re-applied to their own outputs, attributing this to context perturbation during re-explanation. They propose Self-Denoising, a training-free post-processing method that improves explanation quality with minimal computational overhead.

AINeutralarXiv – CS AI · May 116/10

🧠

Graph-Structured Hyperdimensional Computing for Data-Efficient and Explainable Process-Structure-Property Prediction

Researchers developed PSP-HDC, a graph-structured hyperdimensional computing framework for predicting material properties in 3D microstructure fabrication with sparse, heterogeneous data. The approach achieves 91% accuracy while providing inherent explainability—a critical advantage over conventional machine learning models that struggle with limited datasets and poor generalization.

AINeutralarXiv – CS AI · May 116/10

🧠

Frequency-Aware Model Parameter Explorer: A new attribution method for improving explainability

Researchers introduce FAMPE, a novel attribution method that uses frequency-domain analysis to improve explainability in deep neural networks. By separately perturbing high and low-frequency components through FFT-based techniques, the method outperforms existing attribution approaches on ImageNet across multiple architectures without requiring manual baseline selection.

AINeutralarXiv – CS AI · May 96/10

🧠

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

Researchers propose concept-based abductive and contrastive explanations that identify minimal sets of high-level concepts causally relevant to vision model predictions. The approach combines human-interpretable concept-based explanations with formal causal reasoning, enabling better understanding of both individual predictions and common model behaviors across image collections.

AIBullisharXiv – CS AI · May 76/10

🧠

The Tsetlin Machine Goes Deep: Logical Learning and Reasoning With Graphs

Researchers introduce Graph Tsetlin Machine (GraphTM), an interpretable deep learning approach that processes graph-structured data while maintaining logical explainability. The system demonstrates competitive or superior performance across image classification, action tracking, recommendation systems, and genomic sequence analysis, while training significantly faster than comparable methods like GCNs.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Towards Rigorous Explainability by Feature Attribution

A new research paper challenges the rigor of popular explainability methods in machine learning, particularly Shapley values and SHAP, arguing that non-symbolic approaches lack the mathematical foundation needed for high-stakes applications. The work advocates for symbolic methods as a more reliable alternative for determining feature importance in AI models.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Applied Explainability for Large Language Models: A Comparative Study

Researchers compare three explainability techniques—Integrated Gradients, Attention Rollout, and SHAP—for interpreting LLM decisions on sentiment classification tasks. The study reveals that gradient-based methods offer stability and interpretability, while attention-based approaches are faster but less predictive, highlighting critical trade-offs in choosing explanation methods for transformer models.

AINeutralarXiv – CS AI · Apr 156/10

🧠

TRUST Agents: A Collaborative Multi-Agent Framework for Fake News Detection, Explainable Verification, and Logic-Aware Claim Reasoning

TRUST Agents is a multi-agent AI framework designed to improve fake news detection and fact verification by combining claim extraction, evidence retrieval, verification, and explainable reasoning. Unlike binary classification approaches, the system generates transparent, human-inspectable reports with logic-aware reasoning for complex claims, though it shows that retrieval quality and uncertainty calibration remain significant challenges in automated fact verification.

AINeutralarXiv – CS AI · Apr 156/10

🧠

FaCT: Faithful Concept Traces for Explaining Neural Network Decisions

Researchers introduce FaCT, a new approach for explaining neural network decisions through faithful concept-based explanations that don't rely on restrictive assumptions about how models learn. The method includes a new evaluation metric (C²-Score) and demonstrates improved interpretability while maintaining competitive performance on ImageNet.

Page 1 of 2Next →