• •    

一种基于用户身份特征的多标签分类算法研究及应用

郑晓雪1,张大方2,刁祖龙1   

  1. 1. 湖南省长沙市岳麓区湖南大学
    2. 湖南大学
  • 收稿日期:2016-11-14 修回日期:2017-01-10 发布日期:2017-01-10
  • 通讯作者: 郑晓雪

Research and application of a multi-lable classification algorithm based on user identity

  • Received:2016-11-14 Revised:2017-01-10 Online:2017-01-10

摘要: 摘 要:家校沟通是智慧校园中重要的组成部分,对智慧校园的建设起着重要的作用。然而目前对于智慧校园中的家校沟通,缺乏一种衡量和参考的方法。针对智慧校园中特有的聊天特点即存在明显的身份特征,提出了一种基于 Adaboost.mh的多标签分类算法Adaboost.ml算法,该算法新增加了启发式规则,提高了判断准确率,同时摒弃了把数据集进行分片的概念,直接利用单条数据作为分析的焦点,减少了由于时间片边缘带来的误差和推断时间,综合决策出聊天用户之间的关联关系。实验结果表明,比传统的算法相比误报率、漏报率均有明显的改善,同时在基于微信数据集上也具有良好的分类效果。

关键词: 社会网络, 智慧校园, 启发式规则, 多标签判断, 集成学习

Abstract: Abstract: Home-school communication plays an important role in the construction of smart campus. At present there is a lack of a way to measure home-school communication on a smart campus. Aiming at the obvious identity characteristics of chatting in intelligent campus, a new Adaboost.ml algorithm is proposed based on Adaboost.mh. The heuristic rule is added to improve the accuracy of judgment, and the concept of time slice is discarded. Single data as the focus of analysis, reducing the inference time and the error caused by the edge of the time slice, integrated decision-making relationship between the chat users. The experimental results show that the false positive rate and false negative rate are obviously improved compared with the traditional algorithm, and also have good classification results based on the micro - data sets.

Key words: social network, smart campus, heuristic rules, multi label judgment, semble?learning