《计算机应用》唯一官方网站

• •    下一篇

基于相关性差异化迁移的渐进式神经网络

蔡昌骁1,王士同2   

  1. 1. 江南大学
    2. 江南大学人工智能与计算机学院
  • 收稿日期:2022-06-13 修回日期:2022-08-26 发布日期:2022-09-22 出版日期:2022-09-22
  • 通讯作者: 蔡昌骁
  • 基金资助:
    江苏省自然科学基金

Progressive neural network based on correlation differentiation transfer

  • Received:2022-06-13 Revised:2022-08-26 Online:2022-09-22 Published:2022-09-22

摘要: 经典的渐进式神经网络(PNN)通过获取先前任务的经验知识来提高神经网络在当前任务中的性能,但忽略了在渐进任务较多时渐进任务间的相关性差异对网络性能的影响。针对这种渐进任务数量较多且任务间相关性存在差异的场景,提出了一种基于相关性差异化迁移的渐进式神经网络(CDT-PNN)。首先使用基于余弦相似度的算法评估两个渐进任务的相关性;然后利用当前任务和先前任务之间的相关性来决定神经网络的知识参数传递,删除与当前渐进任务呈负相关的先前渐进任务的知识参数;最后依据任务间相关性按一定比例随机抽取正相关渐进任务的知识参数进行参数迁移。在添加了不同程度噪声的cifar-100数据集和mnist手写识别数据集上进行实验,实验结果表明,在复杂多任务场景下CDT-PNN相比于传统的PNN性能更好,在cifar-100数据集上的实验任务平均分类精度提高6.6个百分点,在mnist数据集上的实验任务平均分类精度提高1.56个百分点。

关键词: 渐进式神经网络, 深度神经网络, 持续学习, 相关性差异, 复杂多任务

Abstract: Classical Progressive Neural Network(PNN) improves the performance of neural networks on the current task by acquiring empirical knowledge of previous tasks,but ignores the influence of the correlation difference between progressive tasks on the performance of the network when there are many progressive tasks. For such a scenario with a large number of progressive tasks and differences in the correlation between tasks, a Progressive Neural Network based on Correlation Differentiation Transfer(CDT-PNN) algorithm was proposed. Firstly,the correlation of the two progressive tasks was first evaluated using a cosine similarity-based algorithm.Then,the knowledge parameter transfer of the neural network was determined by exploiting the correlation between the current task and the previous task.The previous asymptotics that were negatively correlated with the current progressive task were removed.Finally,the knowledge parameters of the tasks were randomly selected according to the correlation between tasks and the knowledge parameters of the progressive tasks were randomly selected to transfer the parameters. Experiments were conducted on the cifar-100 dataset and mnist handwriting recognition dataset with different levels of noise.The experimental results show that CDT-PNN performs better than PNN in complex multi-task scenarios. The average classification accuracy of the experimental tasks on cifar-100 dataset is increased by 6.6 percentage points, and that on mnist dataset is increased by 1.56 percentage points.

Key words: progressive neural network, deep neural network, continual learning, correlation differentiation, complex multi-task

中图分类号: