y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#rule-extraction News & Analysis

2 articles tagged with #rule-extraction. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles
AINeutralarXiv – CS AI · 18h ago7/10
🧠

Can Global XAI Methods Reveal Injected Behaviours in LLMs? SHAP vs Rule Extraction vs RuleSHAP

Researchers propose RuleSHAP, a novel explainable AI method that combines SHAP analysis with rule induction to detect injected behavioral triggers in large language models. The approach outperforms existing techniques by 82% in identifying belief-driven heuristics that fuel misinformation, offering a practical pathway for auditing LLM safety.

🧠 Llama
AINeutralarXiv – CS AI · 18h ago6/10
🧠

Neuron-Anchored Rule Extraction for Large Language Models via Contrastive Hierarchical Ablation

Researchers introduce MechaRule, a novel method for extracting interpretable symbolic rules from large language models by identifying and ablating sparse neuron activations that drive specific behaviors. The technique achieves 97% recall of high-impact neurons while requiring only 2.14% of the computational cost of exhaustive ablation, demonstrating effectiveness on arithmetic reasoning and jailbreak detection tasks.