AIBullisharXiv โ CS AI ยท 4h ago5
๐ง
Efficient Discovery of Approximate Causal Abstractions via Neural Mechanism Sparsification
Researchers have developed a new method to extract interpretable causal mechanisms from neural networks using structured pruning as a search technique. The approach reframes network pruning as finding approximate causal abstractions, yielding closed-form criteria for simplifying networks while maintaining their causal structure under interventions.