首页 | 本学科首页   官方微博 | 高级检索  
     

面向软件缺陷个数预测的混合式特征选择方法*
引用本文:马子逸,马传香,余啸,刘瑞奇.面向软件缺陷个数预测的混合式特征选择方法*[J].计算机应用研究,2018,35(2).
作者姓名:马子逸  马传香  余啸  刘瑞奇
作者单位:湖北大学计算机与信息工程学院,湖北大学计算机与信息工程学院,武汉大学软件工程国家重点实验室,武汉大学国际软件学院
基金项目:湖北大学精品课程(No:013665,No:150145)
摘    要:针对软件缺陷数据集中不相关特征和冗余特征会降低软件缺陷个数预测模型的性能的问题,提出了一种面向软件缺陷个数预测的混合式特征选择方法-HFSNFP。首先,利用ReliefF算法计算每个特征与缺陷个数之间的相关性,选出相关性最高的m个特征;然后,基于特征之间的关联性利用谱聚类对这m个特征进行聚类;最后,利用基于包裹式特征选择思想从每个簇中依次挑选最相关的特征形成最终的特征子集。实验结果表明,相比于已有的五种过滤式特征选择方法,HFSNFP方法在提高预测率的同时降低了误报率,且G-measure与RMSE度量值更佳;相比于已有的两种包裹式特征选择方法,HFSNFP方法在保证了缺陷个数预测性能的同时可以显著降低特征选择的时间。

关 键 词:软件缺陷个数预测    特征选择  谱聚类  包裹式特征选择
收稿时间:2017/4/24 0:00:00
修稿时间:2018/1/2 0:00:00

A hybrid feature selection method for the number of software faults prediction
Ma Ziyi,Ma Chuanxiang,Yu Xiao and Liu Ruiqi.A hybrid feature selection method for the number of software faults prediction[J].Application Research of Computers,2018,35(2).
Authors:Ma Ziyi  Ma Chuanxiang  Yu Xiao and Liu Ruiqi
Affiliation:School of Computer Science and Information Engineering, HuBei Universitya,,,
Abstract:Focused on the issue that the irrelevant and redundant features in software defect data degrade the performance of the number of software faults prediction models, this paper proposed a Hybrid Feature Selection method for the Number of Faults Prediction (HFSNFP) . Firstly, HFSNFP computes the relevance between every feature and the number of fault with ReliefF algorithm and selects the top m most relevant features. Then, HFSNFP groups the m features with spectral clustering algorithm according to the correlation between every two features; Finally, HFSNFP selects the most relevant features from each resulted cluster to form the final feature subset using a wrapper search. Compared with five existing filter-based feature selection methods, the experimental results show that HFSNFP increases PD value, reduces PF value and achieves better G-measure and RMSE values. Comparison with two wrapper-based feature selection methods demonstrates that HFSNFP can achieve the high performance of the number of faults prediction and reduce the running time of feature selection.
Keywords:the number of software faults prediction  feature selection  spectral clustering  wrapper-based feature selection
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号