Journal of Computer Applications ›› 2011, Vol. 31 ›› Issue (12): 3268-3270.

• Database Technology •

Network User Identification Based on a Feature-Weighted Naive Bayes Classification Algorithm

LIU Lei1, CHEN Xing-shu1, YIN Xue-yuan1, DUAN Yi2, LV Zhao1

  1. College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
  2. College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
  • Received: 2011-06-24 Revised: 2011-08-08 Online: 2011-12-12 Published: 2011-12-01
  • Contact: LIU Lei
  • Supported by: National 973 Program project; National 242 Information Security Special Project

Abstract: A feature-weighted naive Bayes classification (FWNBC) algorithm is proposed to identify network users from their access records. First, a data acquisition system built on the WinPcap framework collects the users' access records. The records are then analyzed and user features are computed along five dimensions, after which a screening step selects the features to retain. Finally, the FWNBC algorithm is applied to 3300 test samples and reaches a recognition rate of 85.73%. The experimental results show that the algorithm can effectively identify network users.

Key words: user identification, naive Bayes classifier, feature weighting, feature selection, data acquisition
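
To make the classification step concrete, the following is a minimal sketch of one common form of feature-weighted naive Bayes, in which each feature's log-likelihood is multiplied by a weight before the per-class scores are compared. It is not the authors' implementation: the abstract does not state how the weights are derived or which five feature groups are used, so the class name `FeatureWeightedNB`, the weight values, the Laplace smoothing, and the toy features (most-visited site category, active period) are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


class FeatureWeightedNB:
    """Categorical naive Bayes with a per-feature weight on each log-likelihood.

    Decision rule: argmax_c  log P(c) + sum_i w_i * log P(x_i | c)
    """

    def __init__(self, weights, alpha=1.0):
        self.weights = weights  # one weight per feature (hypothetical values)
        self.alpha = alpha      # Laplace smoothing constant (an assumption)

    def fit(self, X, y):
        self.class_counts = Counter(y)
        self.n = len(y)
        # counts[c][i][v] = number of class-c samples whose feature i equals v
        self.counts = defaultdict(lambda: defaultdict(Counter))
        self.values = defaultdict(set)  # distinct values seen for each feature
        for row, c in zip(X, y):
            for i, v in enumerate(row):
                self.counts[c][i][v] += 1
                self.values[i].add(v)
        return self

    def log_score(self, row, c):
        # Start from the log prior, then add the weighted log-likelihoods.
        score = math.log(self.class_counts[c] / self.n)
        for i, v in enumerate(row):
            num = self.counts[c][i][v] + self.alpha
            den = self.class_counts[c] + self.alpha * len(self.values[i])
            score += self.weights[i] * math.log(num / den)
        return score

    def predict(self, row):
        return max(self.class_counts, key=lambda c: self.log_score(row, c))


# Toy access-record features: (most-visited site category, active period).
X = [("news", "morning"), ("news", "evening"), ("news", "morning"),
     ("video", "evening"), ("video", "evening"), ("video", "morning")]
y = ["u1", "u1", "u1", "u2", "u2", "u2"]

# Emphasize the site-category feature over the time-of-day feature.
model = FeatureWeightedNB(weights=[2.0, 0.5]).fit(X, y)
pred_weighted = model.predict(("news", "evening"))

# Ignore the site category entirely: the decision now rests on time of day.
time_only = FeatureWeightedNB(weights=[0.0, 1.0]).fit(X, y)
pred_time_only = time_only.predict(("news", "evening"))
```

In this toy data the two weightings disagree on the same record, which shows how the weights shift influence between features. Setting every weight to 1 recovers ordinary naive Bayes.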