AINeutralarXiv โ CS AI ยท 4h ago6/10
๐ง
Empirical Characterization of Rationale Stability Under Controlled Perturbations for Explainable Pattern Recognition
Researchers propose a new metric to assess consistency of AI model explanations across similar inputs, implementing it on BERT models for sentiment analysis. The framework uses cosine similarity of SHAP values to detect inconsistent reasoning patterns and biased feature reliance, providing more robust evaluation of model behavior.