Affiliation:
1. School of Information Engineering, East China University of Technology, Nanchang, China
2. School of Software, East China University of Technology, Nanchang, China
Abstract
In practical, real-world complex networks, data distribution is frequently decentralized and Non-Independently and Identically Distributed (Non-IID). Such heterogeneous data poses a significant challenge for federated learning: it can produce biased global models, leave local models with insufficient personalization capability, and make global knowledge difficult for local models to absorb. We propose a Federated Learning approach based on Client Clustering and Generator-based Knowledge Distillation (CGKDFL) for heterogeneous data. First, to reduce global model bias, we propose a clustered federated learning approach that requires each client to transmit only some of the parameters of selected layers, thereby reducing the number of transmitted parameters. Second, to compensate for the loss of global knowledge caused by clustering, a generator designed to strengthen privacy protection and increase diversity is built on the server side. Guided by the label information provided by each client, this generator produces feature-representation data aligned with that client's specific task, without requiring any external dataset, and transfers its global knowledge to the local model, which the client then uses for knowledge distillation. Finally, extensive experiments were conducted on three heterogeneous datasets. The results demonstrate that CGKDFL outperforms the baseline methods in accuracy by a minimum of , , and on the three heterogeneous datasets, respectively, and that it converges faster than the compared methods in all cases.
Funder
National Natural Science Foundation of China