AIBullisharXiv – CS AI · 4d ago6/10
🧠Researchers introduced ECSEL, an explainable classification method that learns symbolic equations to create interpretable machine learning models. The approach outperforms competing symbolic regression methods on benchmarks while maintaining computational efficiency and classification accuracy comparable to traditional ML models.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers present Vital Trace, a protocol-constrained multi-agent AI framework designed to improve clinical risk prediction in intensive care units by tracking patient trajectories over extended periods. The system uses compact patient-state memory and structured reasoning agents rather than unbounded text histories, demonstrating better temporal consistency and interpretability on MIMIC-IV and eICU datasets.
AINeutralarXiv – CS AI · 4d ago6/10
🧠AMARIS is a new system that improves how large language models are trained using reinforcement learning by maintaining a persistent memory of past training data and failures. Unlike existing methods that only look at immediate, local information, AMARIS tracks recurring problems and previous rubric adjustments over time, achieving measurable performance improvements across multiple domains.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers demonstrate that language models develop semantic role understanding (who-did-what-to-whom comprehension) primarily during pre-training, though fine-tuning still improves performance. Using linear probes on frozen transformer models, they find semantic role information emerges from language modeling objectives alone, with representation structure becoming more distributed as models scale.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce Probabilistic Logical Knowledge Tracing (PLKT), an interpretable AI framework that uses Beta-distributed probabilistic embeddings to model student knowledge states and predict learning performance. Unlike conventional deep learning approaches that rely on opaque deterministic embeddings, PLKT constructs transparent reasoning paths showing how past interactions influence predictions while maintaining superior accuracy compared to existing methods.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce MarsTSC, a novel framework combining Vision Language Models with agentic reasoning for few-shot multimodal time series classification. The system uses collaborative AI roles—Generator, Reflector, and Modifier—to iteratively refine knowledge and improve classification accuracy across 12 benchmarks while providing interpretable explanations.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce an anchor-projection framework that enables behavioral directions to transfer across different large language model families by mapping their diverse hidden representations into a shared coordinate space. The approach achieves high cross-model alignment (0.83 ten-way detection accuracy) without fine-tuning, demonstrating that interpretability and control mechanisms can be standardized across architecturally different models.
🧠 Llama
AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduce E-TCAV, an optimized version of TCAV that improves the efficiency and stability of neural network interpretability testing by leveraging penultimate layer representations. The method achieves linear speed-ups while maintaining accuracy, advancing practical tools for model debugging and real-time concept-guided training across vision and language tasks.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers present Hierarchical Causal Abduction (HCA), a framework that makes Model Predictive Control decisions interpretable by combining physics-informed reasoning, optimization evidence, and causal discovery. The method achieves 53% higher explanation accuracy than existing approaches across industrial control applications, addressing a critical barrier to deploying AI in safety-critical infrastructure.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers found that machine learning models trained on elite European football leagues lose interpretability and reliability when applied to university-level competition, suggesting that performance insights don't transfer across competition tiers. The study reveals that explanation stability and feature importance hierarchies are domain-dependent, challenging the assumption that ML-derived performance determinants are universally applicable.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers analyzed how Qwen3-VL-8B, a multimodal transformer, encodes visual interestingness—a measure derived from human engagement data—without explicit supervision. Using neuroscience-inspired methods, they found that the model's internal representations align with human-derived interestingness scores, suggesting transformers may capture principles of human attention and perception.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers developed an explainable machine learning framework that uses unsupervised and supervised learning to identify and interpret dietary patterns from UK nutrition survey data. The system discovered four distinct eating patterns and achieved high accuracy in reproducing classifications, with potential applications for dietitian-assisted clinical assessments and personalized nutrition counseling.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers demonstrate that early layers of cohort-trained Implicit Neural Representations (INRs) encode transferable features for signal fitting, identifying optimal freezing points through weight stable rank analysis. Using sparse autoencoders for mechanistic interpretability, they reveal that SIREN and Fourier-feature MLPs learn fundamentally different dictionary representations despite comparable performance, with implications for designing more generalizable neural architectures.
AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduce CAMAL, a method that leverages segmentation masks to improve attention alignment and faithfulness in vision models across deep learning and reinforcement learning paradigms. The approach achieves over 35% improvements in attention faithfulness while maintaining or improving generalization performance without additional inference costs.
AIBullisharXiv – CS AI · May 126/10
🧠Researchers introduce Lattice Deduction Transformers (LDT), a specialized neural architecture that achieves near-perfect accuracy on constraint-solving puzzles like Sudoku and Mazes while remaining logically sound. The approach demonstrates that smaller models with domain-specific architectures can outperform large language models on reasoning tasks.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers propose Bridge Matching, a novel framework that decomposes stochastic generative model dynamics into deterministic transport and diffusion-induced osmotic effects. This decomposition enables more interpretable and controllable generative sampling by separately parameterizing how probability mass moves versus how stochastic fluctuations affect the process.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers propose Shapley regression, a game-theoretic machine learning method for diagnosing APDS, a rare genetic immune disorder. The approach combines interpretability with predictive power by modeling symptom interactions while maintaining transparency, validated on both public datasets and a real-world cohort of 222 patients.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers studying one-layer Transformers discovered that architectural choices in feedforward networks (FFNs)—particularly sparse mixture-of-experts (MoE) routing—fundamentally reshape how attention mechanisms learn to compute, with sparsity rather than learned specialization driving this computational redistribution.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce OracleTSC, an LLM-based traffic signal control system that combines reward hurdle mechanisms and uncertainty regularization to stabilize reinforcement learning training. The approach achieves 75% reduction in travel time while maintaining interpretability through natural language explanations, with strong cross-intersection generalization capabilities.
AINeutralarXiv – CS AI · May 125/10
🧠Researchers establish connections between Consistency-Based Diagnosis (CBD) and Actual Causality frameworks within Explainable AI (XAI), addressing a gap in how diagnosis systems explain their outputs. This theoretical work bridges two previously disconnected areas in AI research, with potential applications for making data management systems more interpretable and trustworthy.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a spectral diagnostic method to detect hidden coalitions in multi-agent AI systems by analyzing mutual information patterns in internal neural representations rather than observable behavior. The technique successfully identifies hierarchical and dynamic coalition structures in reinforcement learning and language models, providing a scalable tool for monitoring emergent organization in distributed AI systems.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers prove that modern neural networks can be represented using a Generalized Singular Value Decomposition that makes them left-invertible before a final linear layer while preserving norm properties. This mathematical framework enables distance calibration between feature space and input space, with demonstrated applications to adversarial perturbation detection and potential future use in addressing model bias and invertibility.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce PLOT (Progressive Localization via Optimal Transport), a new framework for mechanistic interpretability that efficiently identifies causal variables in neural networks through optimal transport coupling rather than computationally expensive searches. The method significantly speeds up causal abstraction analysis while maintaining competitive accuracy, offering practical advantages for large-scale AI interpretability research.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a neurosymbolic framework that combines neural networks with symbolic logic for skeleton-based human action recognition, enabling interpretable AI models that explain their decisions through human-readable logical rules rather than operating as black boxes.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce RelAge-GNN, a graph neural network framework that models complex biological relationships among DNA methylation sites to improve aging clock predictions. The method outperforms existing approaches in estimating biological age and shows enhanced sensitivity for detecting age acceleration in disease cohorts, with interpretability analysis revealing which relationships and CpG sites drive predictions.