🧠 AI🔴 BearishImportance 7/10

Can Vision Models Truly Forget? Mirage: Representation-Level Certification of Visual Unlearning

arXiv – CS AI|Zhenyu Yu, Yangchen Zeng, Chunlei Meng, Guangzhen Yao, Shuigeng Zhou|June 2, 2026 at 04:00 AM

🤖AI Summary

Researchers introduce Mirage, a representation-level auditing framework that reveals existing machine unlearning methods in federated learning fail to truly forget sensitive data despite passing output-level tests. The study demonstrates that current approaches retain substantial class structure in internal representations, exposing a critical gap between certification standards and actual data privacy.

Analysis

The Mirage framework addresses a fundamental vulnerability in machine unlearning research by exposing that certification methods relying solely on output-level metrics provide false assurance of data deletion. When visual models undergo unlearning procedures, their external behavior may change while internal representations preserve the original training data's patterns. This distinction matters significantly because federated learning systems handle sensitive information across distributed networks, and inadequate forgetting mechanisms could compromise user privacy despite apparent compliance with unlearning protocols.

Vertical Federated Learning has emerged as a privacy-preserving approach where different parties contribute different features to the same training samples. The sector has grown because organizations seek to collaborate on machine learning without exposing raw data. However, the Mirage study reveals an unlearning trilemma: no existing method achieves high utility, output-level forgetting, and representation-level forgetting simultaneously. This creates a fundamental tradeoff that researchers and practitioners must navigate.

The asymmetry between class-level and sample-level forgetting proves particularly concerning, with class information persisting strongly across network layers even after unlearning. For developers deploying federated learning systems, this research necessitates immediate evaluation of current unlearning implementations against representation-level standards rather than relying on traditional output metrics. Organizations processing sensitive data must demand transparency about how their information is being forgotten at deeper architectural levels.

Key Takeaways

→Existing federated unlearning methods retain up to 15.4 points higher class structure recovery than retrained baselines despite passing output-level tests
→The unlearning trilemma shows no method simultaneously achieves utility, output forgetting, and representation forgetting—requiring fundamental architectural tradeoffs
→Class-level unlearning leaves 97% representational traces while sample-level forgetting approaches random chance, exposing asymmetric privacy guarantees
→Mirage's four diagnostics (LPR, CKA, separability scoring, layer-wise analysis) establish representation-level evaluation as necessary for privacy certification
→Current industry standards for machine unlearning validation are insufficient and require immediate update to address representation-level data persistence

#machine-unlearning #federated-learning #data-privacy #representation-learning #ai-security #model-auditing

Read Original →via arXiv – CS AI

Act on this with AI

Stay ahead of the market.

Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.

Connect Wallet to AI →How it works

AIMay 6

Your company’s AI could delete everything in 9 seconds. ServiceNow wants to be the kill switch

AIMay 6

Hut 8 (HUT) Stock Soars 37% on Massive $9.8 Billion AI Data Center Agreement

AIMay 6

Can Vision Models Truly Forget? Mirage: Representation-Level Certification of Visual Unlearning

Your company’s AI could delete everything in 9 seconds. ServiceNow wants to be the kill switch

Hut 8 (HUT) Stock Soars 37% on Massive $9.8 Billion AI Data Center Agreement

S&P 500 and NASDAQ hit record highs as AI chip stocks surge