AINeutralarXiv – CS AI · 10h ago6/10
🧠
FragileFlow: Spectral Control of Correct-but-Fragile Predictions for Foundation Model Robustness
FragileFlow introduces a theoretical framework and practical regularizer to detect and mitigate a hidden failure mode in large language models and vision-language models where predictions remain technically correct but confidence margins narrow dangerously. The research provides the first PAC-Bayes bounds for margin-aware error flow, addressing robustness gaps that standard accuracy metrics overlook.