首页 | 本学科首页   官方微博 | 高级检索  
     

基于代价极速学习机的软件缺陷报告分类方法
引用本文:张天伦,陈荣,杨溪,祝宏玉.基于代价极速学习机的软件缺陷报告分类方法[J].软件学报,2019,30(5):1386-1406.
作者姓名:张天伦  陈荣  杨溪  祝宏玉
作者单位:大连海事大学 信息科学技术学院, 辽宁 大连 116026,大连海事大学 信息科学技术学院, 辽宁 大连 116026,大连海事大学 信息科学技术学院, 辽宁 大连 116026,深圳大学 计算机与软件学院, 广东 深圳 518060
基金项目:国家自然科学基金(61672122,61602077,61732011)
摘    要:在所有的软件系统开发过程中,Bug的存在是不可避免的问题.对于软件系统的开发者来说,修复Bug最有利的工具就是Bug报告.但是人工识别Bug报告会给开发人员带来新的负担,因此,自动对Bug报告进行分类是一项很有必要的工作.基于此,提出用基于极速学习机的方法来对Bug报告进行分类.具体而言,主要解决Bug报告自动分类的3个问题:第1个是Bug报告数据集里不同类别的样本数量不平衡问题;第2个是Bug报告数据集里被标注的样本不充足问题;第3个是Bug报告数据集总体样本量不充足问题.为了解决这3个问题,分别引入了基于代价的有监督分类方法、基于模糊度的半监督学习方法以及样本迁移方法.通过在多个Bug报告数据集上进行实验,验证了这些方法的可行性和有效性.

关 键 词:软件Bug报告  有监督分类方法  半监督学习方法  样本迁移方法  极速学习机
收稿时间:2018/8/31 0:00:00
修稿时间:2018/10/31 0:00:00

Approach of Bug Reports Classification Based on Cost Extreme Learning Machine
ZHANG Tian-Lun,CHEN Rong,YANG Xi and ZHU Hong-Yu.Approach of Bug Reports Classification Based on Cost Extreme Learning Machine[J].Journal of Software,2019,30(5):1386-1406.
Authors:ZHANG Tian-Lun  CHEN Rong  YANG Xi and ZHU Hong-Yu
Affiliation:Information Science and Technology College, Dalian Maritime University, Dalian 116026, China,Information Science and Technology College, Dalian Maritime University, Dalian 116026, China,Information Science and Technology College, Dalian Maritime University, Dalian 116026, China and College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
Abstract:Bug is an unavoidable problem in the development of all software systems. For developers of software system, bug report is a powerful tool for fixing bugs. However, manual recognition on bug reports tends to be time-consuming and not economical. It thus becomes significant to advance the automated classification approach to provide clear guidelines on how to assign a reasonable severity to a reported bug. In this study, several algrithoms are proposed based on extreme learning machine to automatically classify bug reports. Concretely, this study focuses on three problems in the field of bug report classification. The first one is the imbalanced class distribution in bug report dataset; the second is the insufficient labeled sample in bug report dataset; the last is the limited training data available. In order to solve these issues, three methods are proposed based on cost-sensitive supervised classification, semi-supervised learning, and sample transferring, respectively. Extensive experiments on real bug report datasets are conducted, and the results demonstrate the practicability and effectiveness of the proposed methods.
Keywords:software bug report  supervised classification method  semi-supervised learning approach  sample transferring approach  extreme learning machine
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号