首页 | 本学科首页   官方微博 | 高级检索  
     


Automated classification of software issue reports using machine learning techniques: an empirical study
Authors:Nitish Pandey  Debarshi Kumar Sanyal  Abir Hudait  Amitava Sen
Affiliation:1.School of Computer Engineering,KIIT University,Bhubaneswar,India;2.Indian Institute of Technology Kharagpur,Kharagpur,India;3.Department of Computer Science and Engineering,JIS University,Kolkata,India
Abstract:Software developers, testers and customers routinely submit issue reports to software issue trackers to record the problems they face in using a software. The issues are then directed to appropriate experts for analysis and fixing. However, submitters often misclassify an improvement request as a bug and vice versa. This costs valuable developer time. Hence automated classification of the submitted reports would be of great practical utility. In this paper, we analyze how machine learning techniques may be used to perform this task. We apply different classification algorithms, namely naive Bayes, linear discriminant analysis, k-nearest neighbors, support vector machine (SVM) with various kernels, decision tree and random forest separately to classify the reports from three open-source projects. We evaluate their performance in terms of F-measure, average accuracy and weighted average F-measure. Our experiments show that random forests perform best, while SVM with certain kernels also achieve high performance.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号