Automated classification of software issue reports using machine learning techniques: an empirical study期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Automated classification of software issue reports using machine learning techniques: an empirical study

Authors:	Nitish Pandey Debarshi Kumar Sanyal Abir Hudait Amitava Sen

Affiliation:	1.School of Computer Engineering,KIIT University,Bhubaneswar,India;2.Indian Institute of Technology Kharagpur,Kharagpur,India;3.Department of Computer Science and Engineering,JIS University,Kolkata,India

Abstract:	Software developers, testers and customers routinely submit issue reports to software issue trackers to record the problems they face in using a software. The issues are then directed to appropriate experts for analysis and fixing. However, submitters often misclassify an improvement request as a bug and vice versa. This costs valuable developer time. Hence automated classification of the submitted reports would be of great practical utility. In this paper, we analyze how machine learning techniques may be used to perform this task. We apply different classification algorithms, namely naive Bayes, linear discriminant analysis, k-nearest neighbors, support vector machine (SVM) with various kernels, decision tree and random forest separately to classify the reports from three open-source projects. We evaluate their performance in terms of F-measure, average accuracy and weighted average F-measure. Our experiments show that random forests perform best, while SVM with certain kernels also achieve high performance.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏