Journal of Computer Applications ›› 2026, Vol. 46 ›› Issue (6): 1746-1755.DOI: 10.11772/j.issn.1001-9081.2025050661

• Artificial intelligence • Previous Articles    

Federated learning framework integrating dynamic feature alignment and temperature-aware aggregation

Zhijian DONG, Ruichun GU()   

  1. School of Digital and Intelligent Industry,Inner Mongolia University of Science and Technology,Baotou Inner Mongolia 014010,China
  • Received:2025-06-23 Revised:2025-08-18 Accepted:2025-08-26 Online:2025-09-16 Published:2026-06-10
  • Contact: Ruichun GU
  • About author:DONG Zhijian, born in 2000, M. S. candidate. His research interests include federated learning.
    First author contact:GU Ruichun, born in 1982, Ph. D., associate professor, senior engineer. His research interests include federated learning, blockchain, edge intelligence.
  • Supported by:
    Inner Mongolia Natural Science Foundation(2021LHMS06003);Fundamental Research Funds for the Universities in Inner Mongolia(114)

融合动态特征对齐与温度感知聚合的联邦学习框架

董汦楗, 顾瑞春()   

  1. 内蒙古科技大学 数智产业学院,内蒙古 包头 014010
  • 通讯作者: 顾瑞春
  • 作者简介:董汦楗(2000—),男,河南三门峡人,硕士研究生,主要研究方向:联邦学习
    第一联系人:顾瑞春(1982—),男,内蒙古包头人,副教授,高级工程师,博士,CCF会员,研究方向:联邦学习、区块链、边缘智能。
  • 基金资助:
    内蒙古自然科学基金资助项目(2021LHMS06003);内蒙古高校基本科研业务费专项资金资助项目(114)

Abstract:

To address the degradation of model performance caused by statistical heterogeneity under Non-Independent and Identically Distributed (Non-IID) data in federated learning, a Federated learning framework integrating Dynamic feature alignment and Temperature-aware Aggregation (FedDTA) was proposed. In the framework, client drifts were mitigated through dynamic feature alignment and temperature-aware aggregation collaboratively. It has two core components: a dynamic regularization approach based on Sliced Wasserstein Distance (SWD) was used to achieve local-global feature distribution alignment via low-dimensional Monte Carlo projections, thereby reducing computational complexity and suppressing feature drifts; a hierarchical aggregation strategy incorporating a learnable projection network with annealing temperature scheduling was used to allocate client weights dynamically according to parameter differences. Experimental results indicate that under strong heterogeneity (Dirichlet α=0.1) condition, in accuracy, FedDTA outperforms suboptimal FedKTL(Federated Knowledge-Transfer-Loop) and FedCMD (Federated learning with Contrastive cloud-edge Model Decoupling) by 1.698 and 0.714 percentage points on the CIFAR-10 and CIFAR-100 datasets, respectively, demonstrating superior generalization capability in multi-data scenarios. Ablation experimental results confirm that SWD alignment reduces feature drifts significantly, while temperature scheduling optimization balances the exploration with exploitation. Without exposing raw data, FedDTA provides theoretical and methodological supports for privacy-sensitive scenarios such as medical collaboration and the industrial Internet of Things.

Key words: federated learning, statistical heterogeneity, Non-Independent and Identically Distributed (Non-IID) data, feature alignment, Sliced Wasserstein Distance (SWD), hierarchical aggregation strategy

摘要:

为了解决联邦学习中非独立同分布(Non-IID)数据下统计异质性导致的模型性能退化问题,提出一种融合动态特征对齐与温度感知聚合的联邦学习框架(FedDTA)。该框架通过动态特征对齐和温度感知聚合协同优化客户端漂移,包含2个核心组件:基于切片Wasserstein距离(SWD)的动态正则化方法,利用低维蒙特卡洛投影实现局部-全局特征分布对齐,从而降低计算复杂度并抑制特征偏移;结合可学习投影网络与退火温度调度的分层聚合策略,基于参数差异动态分配客户端权重。实验结果表明,在强异质性(Dirichlet α=0.1)条件下,相较于次优的FedKTL(Federated Knowledge-Transfer-Loop)和FedCMD(Federated learning with Contrastive cloud-edge Model Decoupling),FedDTA在CIFAR-10与CIFAR-100数据集上准确率分别提升了1.698与0.714个百分点。可见,FedDTA在多数据场景下具有更优的泛化能力。消融实验结果验证了SWD对齐显著减少了特征漂移,而温度调度优化平衡了探索与利用。FedDTA框架无需暴露原始数据,能为医疗协作和工业物联网等隐私敏感场景提供了理论与技术支持。

关键词: 联邦学习, 统计异质性, 非独立同分布数据, 特征对齐, 切片Wasserstein距离, 分层聚合策略

CLC Number: