AI · Neutral · arXiv – CS AI · 6h ago · 6/10
Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems
Researchers propose a framework that attributes an AI model's behavior to specific development stages (pretraining, fine-tuning, alignment), so that accountability can be traced without retraining the model. The method quantifies how much each stage contributes to the model's outputs and can identify spurious correlations, which supports greater transparency in AI development.