《计算机应用》唯一官方网站

• •    下一篇

基于原型聚类和费舍尔信息矩阵加权融合的联邦学习算法

王星尧,陈学斌   

  1. 华北理工大学
  • 收稿日期:2025-09-04 修回日期:2025-10-16 发布日期:2025-11-07 出版日期:2025-11-07
  • 通讯作者: 陈学斌
  • 基金资助:
    国家自然科学基金资助项目

Federated learning algorithm based on prototype clustering and fisher information matrix weighted aggregation

  • Received:2025-09-04 Revised:2025-10-16 Online:2025-11-07 Published:2025-11-07

摘要: 联邦学习作为人工智能领域的重要研究方向,在保障数据隐私的前提下,通过分布式协同训练构建全局模型,为隐私敏感场景下的联合建模提供了新范式。然而,现有方法在面对客户端数据异构时,存在性能退化、训练振荡和收敛缓慢等问题。为此,提出了一种基于原型聚类与自适应加权融合的联邦学习算法——FedPFA。首先服务器对客户端上传的原型表示进行聚类,以缓解数据分布差异并增强全局模型的一致性与泛化能力,随后结合客户端费舍尔信息矩阵迹估计值为原型分配差异化权重,从而在融合过程中突出高质量客户端的贡献,抑制噪声或训练不足客户端的干扰。在MNIST、Fashion-MNIST和CIFAR-10数据集上的实验结果表明,与FedGH方法相比准确率提高了约19.64、26.14和16.15个百分点。算法在保证性能提升的同时,显著改善了全局模型的收敛速度与稳定性,并在多种数据集上展现出良好的鲁棒性与实际应用价值。

关键词: 联邦学习, 原型聚类, 费舍尔信息矩阵, 加权融合, 客户端选择

Abstract: Federated learning, as an important research direction in the field of artificial intelligence, was utilized to construct global models through distributed collaborative training under the premise of ensuring data privacy, providing a new paradigm for joint modeling in privacy-sensitive scenarios. However, existing methods were challenged by performance degradation, training oscillation, and slow convergence when faced with heterogeneous client data. To address these issues, a federated learning algorithm based on prototype clustering and adaptive weighted fusion, named FedPFA, was proposed. First, the prototypes uploaded by clients were clustered by the server to mitigate data distribution discrepancies and enhance the consistency and generalization capability of the global model. Subsequently, differentiated weights were assigned to the prototypes by incorporating the trace estimates of the Fisher information matrix from each client, thereby highlighting the contributions of high-quality clients and suppressing interference from noisy or inadequately trained clients during the fusion process. Experimental results on the MNIST, Fashion-MNIST, and CIFAR-10 datasets demonstrated that the accuracy was improved by approximately 19.64, 26.14, and 16.15 percentage points, respectively, compared to the FedGH method. The proposed algorithm not only ensured performance enhancement but also significantly improved the convergence speed and stability of the global model, exhibiting strong robustness and practical application value across various datasets.

Key words: federated learning, prototype clustering, fisher information matrix, weighted fusion, client selection

中图分类号: