Generalized Category Discovery in Federated Graph Learning
Researchers introduce GCD-FGL, a federated graph learning framework that enables decentralized networks to discover novel categories while preserving knowledge of known ones. The approach addresses critical challenges in distributed graph learning by implementing topology-reliable semantic alignment on client nodes and hierarchical prototype alignment on servers, demonstrating significant performance improvements across multiple datasets.
This research addresses a fundamental limitation in federated graph learning systems: their inability to handle dynamic environments where new categories emerge continuously. Traditional FGL approaches operate under closed-world assumptions, making them brittle when deployed in real-world scenarios requiring ongoing discovery of novel data categories across distributed networks.
The technical contribution tackles two interconnected problems. The Neighborhood Absorption Effect occurs when fragmented graph structures cause bias in how nodes aggregate information from neighbors, leading misclassification of novel nodes as known categories. Simultaneously, Global Semantic Inconsistency emerges when these local biases propagate to central servers, where heterogeneous subgraph distributions amplify inconsistencies across decentralized clients.
GCD-FGL's dual-layer solution proves effective: client-side topology-reliable semantic alignment mitigates local structural biases, while server-side hierarchical prototype alignment reconciles global inconsistencies. The framework achieves an average +4.86 improvement in HRScore across five real-world datasets, indicating meaningful performance gains.
For practitioners building distributed machine learning systems, this work highlights the practical importance of handling open-world scenarios in federated settings. Organizations deploying federated learning across supply chains, IoT networks, or multi-institutional collaborations increasingly encounter novel categories requiring dynamic model adaptation. The research provides a generalizable methodology for improving discovery mechanisms in decentralized environments without compromising privacy guarantees inherent to federated approaches.
- βGCD-FGL framework enables federated graph learning systems to discover novel categories while retaining knowledge of known ones.
- βNeighborhood Absorption Effect causes novel nodes to be misclassified when graph structures fragment across distributed clients.
- βGlobal Semantic Inconsistency amplifies local biases when heterogeneous subgraph data propagates to central servers.
- βClient-side topology alignment and server-side hierarchical prototypes jointly resolve the two fundamental challenges.
- βPerformance gains of +4.86 average HRScore demonstrate significant improvements over state-of-the-art baselines.