Journal of Computer Applications ›› 2012, Vol. 32 ›› Issue (06): 1738-1740.DOI: 10.3724/SP.J.1087.2012.01738

• Computer software technology • Previous Articles     Next Articles

Mutual information-based feature selection approach for software defect prediction

WANG Pei,JIN Cong,GE He-he   

  1. Department of Computer Science, Central China Normal University, Wuhan Hubei 430079, China
  • Received:2011-11-09 Revised:2012-02-02 Online:2012-06-04 Published:2012-06-01
  • Contact: WANG Pei

面向软件缺陷预测的互信息属性选择方法

王培,金聪,葛贺贺   

  1. 华中师范大学 计算机科学系,武汉 430079
  • 通讯作者: 王培
  • 作者简介:王培(1988-),男,安徽合肥人,硕士研究生,主要研究方向:信息安全、软件质量预测;〓金聪(1960-),女,上海人,教授,博士,主要研究方向:数字水印、图像处理、信息安全;〓葛贺贺(1986-),男,河南商丘人,硕士研究生,主要研究方向:软件质量工程。
  • 基金资助:
    教育部人文社会科学研究规划基金资助项目

Abstract: Predicting defect-prone software modules accurately and effectively is an important way to control the quality of a software system during software development. Feature selection can highly improve the accuracy and efficiency of the software defect prediction model. A mutual information-based feature selection method for software defect prediction is proposed. The optimal feature subsets generated by the proposed approach were applied to train and test various prediction models. The experiment results show that all the classifiers achieve higher accuracy and performance by using the feature subset provided by proposed approach.

Key words: Software quality, Mutual information, Feature selection, Optimal feature subset, Defect prediction

摘要: 软件开发过程中准确有效地预测具有缺陷倾向的软件模块是提高软件质量的重要方法。属性选择能够显著地提高软件缺陷预测模型的精确度和效率。提出了一种基于互信息的属性选择方法,将选择出的最优属性子集用于软件缺陷预测模型。方法采用了前向搜索策略,并在评价函数中引入非线性平衡系数。实验结果表明,基于互信息的属性选择方法提供的属性子集能提高各类软件缺陷预测模型的预测精度和效率。

关键词: 软件质量, 互信息, 属性选择, 最优属性子集, 缺陷预测

CLC Number: