《计算机应用》唯一官方网站 ›› 2023, Vol. 43 ›› Issue (10): 3107-3113.DOI: 10.11772/j.issn.1001-9081.2022091454

• 人工智能 • 上一篇    

TenrepNN:集成学习的新范式在企业自律性评价中的实践

赵敬涛1,2, 赵泽方1,2, 岳兆娟1, 李俊1,2()   

  1. 1.中国科学院 计算机网络信息中心,北京 100083
    2.中国科学院大学 计算机科学与技术学院,北京 100049
  • 收稿日期:2022-09-30 修回日期:2022-12-15 接受日期:2023-01-05 发布日期:2023-03-17 出版日期:2023-10-10
  • 通讯作者: 李俊
  • 作者简介:赵敬涛(1998—),男,山东聊城人,硕士研究生,主要研究方向:推荐系统、机器学习
    赵泽方(1996—),男,山西临汾人,博士研究生,主要研究方向:自然语言处理、情感分析
    岳兆娟(1984—),女,河南驻马店人,高级工程师,博士,主要研究方向:计算传播、数据挖掘;
  • 基金资助:
    国家重点研发计划项目(2019YFB1405801)

TenrepNN:practice of new ensemble learning paradigm in enterprise self-discipline evaluation

Jingtao ZHAO1,2, Zefang ZHAO1,2, Zhaojuan YUE1, Jun LI1,2()   

  1. 1.Computer Network Information Center,Chinese Academy of Sciences,Beijing 100083,China
    2.School of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049,China
  • Received:2022-09-30 Revised:2022-12-15 Accepted:2023-01-05 Online:2023-03-17 Published:2023-10-10
  • Contact: Jun LI
  • About author:ZHAO Jingtao, born in 1998, M. S. candidate. His research interests include recommendation system, machine learning.
    ZHAO Zefang, born in 1996, Ph. D. candidate. His research interests include natural language processing, sentiment analysis.
    YUE Zhaojuan, born in 1984, Ph. D., senior engineer. Her research interests include computing propagation, data mining.
  • Supported by:
    National Key Research and Development Program of China(2019YFB1405801)

摘要:

为了应对互联网环境中企业自律性低、违规事件频发、政府监管困难的现状,提出一种针对企业自律性评价的双层集成残差预测神经网络(TenrepNN)模型,并融合Stacking和Bagging集成学习的思想提出一种集成学习的新范式Adjusting。TenrepNN模型具有两层结构:第1层使用3种基学习器初步预测企业评分;第2层采用残差修正的思想,提出残差预测神经网络以预测每个基学习器的输出偏差。最后,将偏差与基学习器评分相加得到最终输出。在企业自律性评价数据集上,相较于传统的神经网络,TenrepNN模型的均方根误差(RMSE)降低了2.7%,企业自律性等级分类准确率达到了94.51%。实验结果表明,TenrepNN模型集成不同的基学习器降低预测方差,并使用残差预测神经网络显式地降低偏差,从而能够准确评价企业自律性以实现差异化的动态监管。

关键词: 企业自律性评价, 集成学习范式, 残差预测神经网络, 显式偏差修正, 互联网企业监管

Abstract:

In order to cope with the current situations of low self-discipline, frequent violation events and difficult government supervision of enterprises in the internet environment, a Two-layer ensemble residual prediction Neural Network (TenrepNN) model was proposed to evaluate the self-discipline of enterprises. And by integrating the ideas of Stacking and Bagging ensemble learning, a new paradigm of integrated learning was designed, namely Adjusting. TenrepNN model has a two-layer structure. In the first layer, three base learners were used to predict the enterprise score preliminarily. In the second layer, the idea of residual correction was adopted, and a residual prediction neural network was proposed to predict the output deviation of each base learner. Finally, the final output was obtained by adding the deviations and the base learner scores together. On the enterprise self-discipline evaluation dataset, compared with the traditional neural network, the proposed model has the Root Mean Square Error (RMSE) reduced by 2.7%, and the classification accuracy in the self-discipline level reached 94.51%. Experimental results show that by integrating different base learners to reduce the variance and using residual prediction neural network to decrease the deviation explicitly, TenrepNN model can accurately evaluate enterprise self-discipline to achieve differentiated dynamic supervision.

Key words: enterprise self-discipline evaluation, ensemble learning paradigm, residual prediction neural network, explicit deviation correction, internet enterprise supervision

中图分类号: