首页 | 本学科首页   官方微博 | 高级检索  
     

回归算法对软件缺陷个数预测模型性能的影响
引用本文:付忠旺,肖蓉,余啸,谷懿.回归算法对软件缺陷个数预测模型性能的影响[J].计算机应用,2018,38(3):824-828.
作者姓名:付忠旺  肖蓉  余啸  谷懿
作者单位:1. 湖北大学 计算机与信息工程学院, 武汉 430062;2. 软件工程国家重点实验室(武汉大学), 武汉 430072;3. 湖北省教育信息化工程技术研究中心, 武汉 430062
摘    要:针对已有研究在评价软件缺陷个数预测模型性能时没有考虑到软件缺陷数据集存在数据不平衡的问题而采用了评估回归模型的不合适的评价指标的问题,提出以平均缺陷百分比作为评价指标,讨论不同回归算法对软件缺陷个数预测模型性能的影响程度。利用PROMISE提供的6个开源数据集,分析了10个回归算法对软件缺陷个数预测模型预测结果的影响以及各种回归算法之间的差异。研究结果表明:使用不同的回归算法建立的软件缺陷个数预测模型具有不同的预测效果,其中梯度Boosting回归算法和贝叶斯岭回归算法预测效果更好。

关 键 词:软件缺陷个数预测  数据不平衡  回归算法  
收稿时间:2017-08-07
修稿时间:2017-09-22

Impact of regression algorithms on performance of defect number prediction model
FU Zhongwang,XIAO Rong,YU Xiao,GU Yi.Impact of regression algorithms on performance of defect number prediction model[J].journal of Computer Applications,2018,38(3):824-828.
Authors:FU Zhongwang  XIAO Rong  YU Xiao  GU Yi
Affiliation:1. School of Computer Science and Information Engineering, Hubei University, Wuhan Hubei 430062, China;2. State Key Laboratory of Software Engineering(Wuhan University), Wuhan Hubei 430072, China;3. Educational Informationalization Engineering Research Center of HuBei Province, Wuhan Hubei 430062, China
Abstract:Focusing on the issue that the existing studies do not consider the imbalanced data distribution problem in defect datasets and employ improper performance measures to evaluate the performance of regression models for predicting the number of defects, the impact of different regression algorithms on models for predicting the number of defects were explored by using Fault-Percentile-Average (FPA) as the performance measure. Experiments were conducted on six datasets from PROMISE repository to analyze the impact on the models and the difference of ten regression algorithms for predicting the number of defects. The results show that the forecast results of models for predicting the number of defects built by different regression algorithms are various, and gradient boosting regression algorithm and Bayesian ridge regression algorithm can achieve better performance as a whole.
Keywords:defect number prediction                                                                                                                        imbalanced data distribution                                                                                                                        regression algorithm
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号