基于相关性差异化迁移的渐进式神经网络

doi:10.11772/j.issn.1001-9081.2022060851

《计算机应用》唯一官方网站

• • 下一篇

基于相关性差异化迁移的渐进式神经网络

蔡昌骁¹,王士同²

1. 江南大学
2. 江南大学人工智能与计算机学院

收稿日期:2022-06-13 修回日期:2022-08-26 发布日期:2022-09-22 出版日期:2022-09-22
通讯作者: 蔡昌骁
基金资助:
江苏省自然科学基金

Progressive neural network based on correlation differentiation transfer

Received:2022-06-13 Revised:2022-08-26 Online:2022-09-22 Published:2022-09-22

摘要/Abstract

摘要： 经典的渐进式神经网络(PNN)通过获取先前任务的经验知识来提高神经网络在当前任务中的性能，但忽略了在渐进任务较多时渐进任务间的相关性差异对网络性能的影响。针对这种渐进任务数量较多且任务间相关性存在差异的场景，提出了一种基于相关性差异化迁移的渐进式神经网络（CDT-PNN）。首先使用基于余弦相似度的算法评估两个渐进任务的相关性；然后利用当前任务和先前任务之间的相关性来决定神经网络的知识参数传递，删除与当前渐进任务呈负相关的先前渐进任务的知识参数；最后依据任务间相关性按一定比例随机抽取正相关渐进任务的知识参数进行参数迁移。在添加了不同程度噪声的cifar-100数据集和mnist手写识别数据集上进行实验，实验结果表明，在复杂多任务场景下CDT-PNN相比于传统的PNN性能更好，在cifar-100数据集上的实验任务平均分类精度提高6.6个百分点，在mnist数据集上的实验任务平均分类精度提高1.56个百分点。

关键词: 渐进式神经网络, 深度神经网络, 持续学习, 相关性差异, 复杂多任务

Abstract: Classical Progressive Neural Network(PNN) improves the performance of neural networks on the current task by acquiring empirical knowledge of previous tasks,but ignores the influence of the correlation difference between progressive tasks on the performance of the network when there are many progressive tasks. For such a scenario with a large number of progressive tasks and differences in the correlation between tasks, a Progressive Neural Network based on Correlation Differentiation Transfer(CDT-PNN) algorithm was proposed. Firstly,the correlation of the two progressive tasks was first evaluated using a cosine similarity-based algorithm.Then,the knowledge parameter transfer of the neural network was determined by exploiting the correlation between the current task and the previous task.The previous asymptotics that were negatively correlated with the current progressive task were removed.Finally,the knowledge parameters of the tasks were randomly selected according to the correlation between tasks and the knowledge parameters of the progressive tasks were randomly selected to transfer the parameters. Experiments were conducted on the cifar-100 dataset and mnist handwriting recognition dataset with different levels of noise.The experimental results show that CDT-PNN performs better than PNN in complex multi-task scenarios. The average classification accuracy of the experimental tasks on cifar-100 dataset is increased by 6.6 percentage points, and that on mnist dataset is increased by 1.56 percentage points.

Key words: progressive neural network, deep neural network, continual learning, correlation differentiation, complex multi-task

中图分类号:

TP389.1

蔡昌骁王士同. 基于相关性差异化迁移的渐进式神经网络[J]. 计算机应用, DOI: 10.11772/j.issn.1001-9081.2022060851.

[1]	肖斌, 杨模, 汪敏, 秦光源, 李欢. 独立性视角下的相频融合领域泛化方法[J]. 《计算机应用》唯一官方网站, 2024, 44(4): 1002-1009.
[2]	颜梦玫, 杨冬平. 深度神经网络平均场理论综述[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 331-343.
[3]	赵旭剑, 李杭霖. 基于混合机制的深度神经网络压缩算法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2686-2691.
[4]	申云飞, 申飞, 李芳, 张俊. 基于张量虚拟机的深度神经网络模型加速方法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2836-2844.
[5]	李校林, 杨松佳. 基于深度学习的多用户毫米波中继网络混合波束赋形[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2511-2516.
[6]	李淦, 牛洺第, 陈路, 杨静, 闫涛, 陈斌. 融合视觉特征增强机制的机器人弱光环境抓取检测[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2564-2571.
[7]	杨海宇, 郭文普, 康凯. 基于卷积长短时深度神经网络的信号调制方式识别方法[J]. 《计算机应用》唯一官方网站, 2023, 43(4): 1318-1322.
[8]	高媛媛, 余振华, 杜方, 宋丽娟. 基于贝叶斯优化的无标签网络剪枝算法[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 30-36.
[9]	刘小宇, 陈怀新, 刘壁源, 林英, 马腾. 自适应置信度阈值的非限制场景车牌检测算法[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 67-73.
[10]	王晓雨, 王展青, 熊威. 深度非对称离散跨模态哈希方法[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2461-2470.
[11]	杨博, 张恒巍, 李哲铭, 徐开勇. 基于图像翻转变换的对抗样本生成方法[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2319-2325.
[12]	玄英律, 万源, 陈嘉慧. 基于多尺度卷积和注意力机制的LSTM时间序列分类[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2343-2352.
[13]	李坤, 侯庆. 基于注意力机制的轻量型人体姿态估计[J]. 《计算机应用》唯一官方网站, 2022, 42(8): 2407-2414.
[14]	毛文涛, 吴桂芳, 吴超, 窦智. 基于中国写意风格迁移的动漫视频生成模型[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2162-2169.
[15]	陈荣源, 姚剑敏, 严群, 林志贤. 基于深度神经网络的视频播放速度识别[J]. 《计算机应用》唯一官方网站, 2022, 42(7): 2043-2051.

基于相关性差异化迁移的渐进式神经网络

Progressive neural network based on correlation differentiation transfer

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics