《计算机应用》唯一官方网站 ›› 2022, Vol. 42 ›› Issue (5): 1447-1454.DOI: 10.11772/j.issn.1001-9081.2021030516

• 数据科学与技术 • 上一篇    下一篇

基于交叉层级数据共享的多任务模型

陈颖1, 于炯1,2(), 陈嘉颖2, 杜旭升2   

  1. 1.新疆大学 软件学院,乌鲁木齐 830091
    2.新疆大学 信息科学与工程学院,乌鲁木齐 830046
  • 收稿日期:2021-04-06 修回日期:2021-06-22 接受日期:2021-06-22 发布日期:2022-06-11 出版日期:2022-05-10
  • 通讯作者: 于炯
  • 作者简介:陈颖(1999—),女,湖南娄底人,硕士研究生,主要研究方向:数据挖掘、机器学习
    于炯(1964—),男,北京人,教授,博士生导师,博士,主要研究方向:绿色计算、机器学习、数据挖掘 yujiong@xju.edu.cn
    陈嘉颖(1988—),女,新疆沙湾人,博士研究生,主要研究方向:推荐系统、数据挖掘
    杜旭升(1995—),男,甘肃庆阳人,博士研究生,CCF会员,主要研究方向:机器学习、数据挖掘。
  • 基金资助:
    国家自然科学基金资助项目(61862060)

Cross-layer data sharing based multi-task model

Ying CHEN1, Jiong YU1,2(), Jiaying CHEN2, Xusheng DU2   

  1. 1.School of Software,Xinjiang University,Urumqi Xinjiang 830091,China
    2.College of Information Science and Engineering,Xinjiang University,Urumqi Xinjiang 830046,China
  • Received:2021-04-06 Revised:2021-06-22 Accepted:2021-06-22 Online:2022-06-11 Published:2022-05-10
  • Contact: Jiong YU
  • About author:CHEN Ying, born in 1999,M. S. candidate. Her research interestsinclude data mining,machine learning.
    YU Jiong, born in 1964,Ph. D.,professor. His research interestsinclude green computing,machine learning,data mining.
    CHEN Jiayingborn in 1988,Ph. D. candidate. Her researchinterests include recommender system,data mining.,
    DU Xusheng,born in 1995, Ph. D. candidate. His researchinterests include machine learning,data mining.
  • Supported by:
    National Natural Science Foundation of China(61862060)

摘要:

针对多任务学习模型中相关度低的任务之间存在的负迁移现象和信息共享困难问题,提出了一种基于交叉层级数据共享的多任务模型。该模型关注细粒度的知识共享,且能保留浅层共享专家的记忆能力和深层特定任务专家的泛化能力。首先,统一多层级共享专家,以获取复杂相关任务间的公共知识;然后,将共享信息分别迁移到不同层级的特定任务专家之中,从而在上下层之间共享部分公共知识;最后,利用基于数据样本的门控网络自主选择不同任务所需信息,从而减轻样本依赖性对模型的不利影响。相较于多门控混合专家(MMOE)模型,所提模型在UCI census-income数据集上对两个任务的F1值分别提高了7.87个百分点和1.19个百分点;且在MovieLens数据集上的回归任务的均方误差(MSE)值降低到0.004 7,分类任务的AUC值提高到0.642。实验结果表明,所提出的模型适用于改善负迁移现象的影响,且能更高效地学习复杂相关任务之间的公共信息。

关键词: 多任务学习, 信息共享, 负迁移, 神经网络, 迁移学习

Abstract:

To address the issues of negative transfer and difficulty of information sharing between loosely correlated tasks in multi-task learning model, a cross-layer data sharing based multi-task model was proposed. The proposed model pays attention to fine-grained knowledge sharing, and is able to retain the memory ability of shallow layer shared experts and generalization ability of deep layer specific task experts. Firstly, multi-layer shared experts were unified to obtain public knowledge among complicatedly correlated tasks. Then, the shared information was transferred to specific task experts at different layers for sharing partial public knowledge between the upper and lower layers. Finally, the data sample based gated network was used to select the needed information for different tasks autonomously, thereby alleviating the harmful effects of sample dependence to the model. Compared with the Multi-gate Mixture-Of-Experts (MMOE) model, the proposed model improved the F1-score of two tasks by 7.87 percentage points and 1.19 percentage points respectively on UCI census-income dataset. The proposed model also decreased the Mean Square Error (MSE) value of regression task to 0.004 7 and increased the Area Under Curve (AUC) value of classification task to 0.642 on MovieLens dataset. Experimental results demonstrate that the proposed model is suitable to improve the influence of negative transfer and can learn public information among complicated related tasks more efficiently.

Key words: multi-task learning, information sharing, negative transfer, neural network, transfer learning

中图分类号: