Journal of Computer Applications, 2012, Vol. 32, Issue (08): 2245-2249. DOI: 10.3724/SP.J.1087.2012.02245

• Artificial Intelligence •

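Text feature selection based on the invasive weed optimization algorithm

LIU Kui, ZHOU Zhu-rong

  1. College of Computer and Information Science, Southwest University, Chongqing 400715, China
  • Received: 2012-02-24  Revised: 2012-03-20  Online: 2012-08-28  Published: 2012-08-01
  • Corresponding author: LIU Kui
  • About the authors: LIU Kui (born 1988), male, from Gao'an, Jiangxi, master's student; main research interests: Semantic Web, service-oriented computing;
    ZHOU Zhu-rong (born 1970), male, from Dazhu, Sichuan, associate professor, Ph.D.; main research interests: Semantic Web, service-oriented computing.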

Text feature selection method based on invasive weed optimization

LIU Kui,ZHOU Zhu-rong   

  1. College of Computer and Information Science, Southwest University, Chongqing 400715, China
  • Received:2012-02-24 Revised:2012-03-20 Online:2012-08-28 Published:2012-08-01
  • Contact: LIU Kui

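Abstract: To select text features more comprehensively and to improve the accuracy of text feature selection, a text feature selection method based on the invasive weed optimization algorithm is proposed. In this algorithm, offspring individuals are distributed around their parent individuals according to a normal distribution. By dynamically adjusting the standard deviation of this distribution during evolution, the algorithm fully retains the advantage of population diversity in its early and middle stages and so performs a fairly comprehensive feature selection over the text; in its late stage it strengthens feature selection around the best individuals, which ensures steady convergence to the global optimum and improves the accuracy of text feature selection. Experimental results show that the method gives terms with low weight values a chance to be selected as features while preserving the selection advantage of terms with high weight values, thereby improving both the comprehensiveness and the accuracy of text feature selection.

Keywords: text feature, feature selection, invasive weed optimization algorithm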

Abstract: To select text features more comprehensively and improve the accuracy of text feature selection, a new text feature selection method based on Invasive Weed Optimization (IWO) is proposed. The main advantage of IWO is that offspring individuals are spread randomly around their parents according to a Gaussian (normal) distribution whose standard deviation is adjusted dynamically during evolution. As a result, the algorithm explores new areas aggressively in the early and middle iterations, maintaining population diversity, and strengthens feature selection around the best individuals in the final iterations. This mechanism ensures steady convergence of the algorithm to the global optimal solution and improves the accuracy of text feature selection. Experimental results indicate that the method gives terms with low weight values a chance to be selected as features while preserving the selection advantage of terms with high weight values, thereby improving the completeness and accuracy of text feature selection.

Key words: text feature, feature selection, Invasive Weed Optimization (IWO)
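
The mechanism summarized in the abstract (offspring scattered around parents with a normally distributed offset whose standard deviation shrinks over the iterations) can be sketched roughly as follows. This is a minimal illustration of the commonly used IWO scheme, not the authors' implementation: the decay formula, the linear seed-count rule, the parameter values, and the names `sigma_at`, `seed_count`, and `iwo_maximize` are all assumptions, and the toy fitness function stands in for whatever term-weight-based fitness the paper actually uses.

```python
import random


def sigma_at(iteration, max_iter, sigma_init, sigma_final, n=3):
    """Standard deviation for one iteration (assumed nonlinear decay schedule)."""
    frac = (max_iter - iteration) / max_iter
    return frac ** n * (sigma_init - sigma_final) + sigma_final


def seed_count(fitness, worst, best, s_min=1, s_max=5):
    """Number of offspring for a weed, growing linearly with its fitness."""
    if best == worst:
        return s_max
    ratio = (fitness - worst) / (best - worst)
    return s_min + int(ratio * (s_max - s_min))


def iwo_maximize(fitness_fn, dim, bounds, pop_init=5, pop_max=20,
                 max_iter=50, sigma_init=1.0, sigma_final=0.01, seed=0):
    """Toy IWO loop: spread seeds, then keep only the fittest pop_max weeds."""
    rng = random.Random(seed)
    lo, hi = bounds
    population = [[rng.uniform(lo, hi) for _ in range(dim)]
                  for _ in range(pop_init)]
    for it in range(max_iter):
        # Large sigma early (exploration), small sigma late (refinement).
        sigma = sigma_at(it, max_iter, sigma_init, sigma_final)
        scores = [fitness_fn(w) for w in population]
        worst, best = min(scores), max(scores)
        offspring = []
        for weed, score in zip(population, scores):
            for _ in range(seed_count(score, worst, best)):
                # Offspring = parent + N(0, sigma) noise, clipped to the bounds.
                child = [min(hi, max(lo, x + rng.gauss(0.0, sigma)))
                         for x in weed]
                offspring.append(child)
        # Competitive exclusion: parents and offspring compete for pop_max slots.
        population = sorted(population + offspring,
                            key=fitness_fn, reverse=True)[:pop_max]
    return population[0]


def toy_fitness(x):
    """Placeholder fitness (hypothetical): highest at the origin."""
    return -sum(v * v for v in x)


best_weed = iwo_maximize(toy_fitness, dim=3, bounds=(-5.0, 5.0))
```

Because even the least fit weed still produces `s_min` seeds, low-scoring candidates keep a chance to contribute while the population is below its cap, which mirrors the abstract's point about low-weight terms. How a weed's position is mapped to a subset of terms is not stated in the abstract and is left out of this sketch.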

CLC number: