Automated bug assignment: Ensemble-based machine learning in large scale industrial contexts

Authors:	Leif Jonsson (leif.jonsson@ericsson.com), Markus Borg, David Broman, Kristian Sandahl, Sigrid Eldh, Per Runeson

Affiliation:	1. Ericsson AB, Stockholm, Sweden; 2. Department of Computer and Information Science, Linköping University, Linköping, Sweden; 3. Department of Computer Science, Lund University, Lund, Sweden; 4. KTH Royal Institute of Technology, Kista, Sweden; 5. UC Berkeley, Berkeley, USA

Abstract:	Bug report assignment is an important part of software maintenance. In particular, incorrect assignments of bug reports to development teams can be very expensive in large software development projects. Several studies propose automating bug assignment using machine learning in open source software contexts, but no study exists for large-scale proprietary projects in industry. The goal of this study is to evaluate automated bug assignment techniques that are based on machine learning classification. In particular, we study the state-of-the-art ensemble learner Stacked Generalization (SG), which combines several classifiers. We collect more than 50,000 bug reports from five development projects at two companies in different domains. We implement automated bug assignment and evaluate its performance in a set of controlled experiments. We show that SG scales to large-scale industrial applications and that it outperforms individual classifiers for bug assignment, reaching prediction accuracies from 50% to 89% when large training sets are used. In addition, we show how old training data can decrease the prediction accuracy of bug assignment. We advise industry to use SG for bug assignment in proprietary contexts, using at least 2,000 bug reports for training. Finally, we highlight the importance of not relying solely on results from cross-validation when evaluating automated bug assignment.
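The abstract describes SG only as an ensemble learner that combines several classifiers. As a rough illustration of that idea, the sketch below trains two simple level-0 classifiers on toy bug reports, collects their out-of-fold class probabilities, and fits a level-1 classifier on those probabilities. Everything in it is an assumption made for illustration: the reports, the team names `ui` and `network`, the choice of learners (`NaiveBayes`, `OverlapScorer`, `NearestCentroid`), and the fold scheme. It is not the classifier set or configuration used in the study.

```python
import math
from collections import Counter, defaultdict

def tokenize(text):
    return text.lower().split()

class NaiveBayes:
    """Level-0 learner (illustrative): multinomial naive Bayes, add-one smoothing."""
    def __init__(self, classes):
        self.classes = classes
    def fit(self, texts, labels):
        self.prior = Counter(labels)
        self.n = len(labels)
        self.counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.counts[label].update(tokenize(text))
        self.vocab = {t for c in self.counts.values() for t in c}
        return self
    def predict_proba(self, text):
        logp = []
        for c in self.classes:
            total = sum(self.counts[c].values()) + len(self.vocab)
            lp = math.log((self.prior[c] + 1) / (self.n + len(self.classes)))
            for t in tokenize(text):
                lp += math.log((self.counts[c][t] + 1) / total)
            logp.append(lp)
        m = max(logp)
        exp = [math.exp(v - m) for v in logp]
        return [e / sum(exp) for e in exp]

class OverlapScorer:
    """Level-0 learner (illustrative): scores a team by shared vocabulary."""
    def __init__(self, classes):
        self.classes = classes
    def fit(self, texts, labels):
        self.vocab = {c: set() for c in self.classes}
        for text, label in zip(texts, labels):
            self.vocab[label].update(tokenize(text))
        return self
    def predict_proba(self, text):
        tokens = set(tokenize(text))
        scores = [len(tokens & self.vocab[c]) + 1 for c in self.classes]
        return [s / sum(scores) for s in scores]

class NearestCentroid:
    """Level-1 learner (illustrative): nearest class centroid in meta-feature space."""
    def fit(self, rows, labels):
        grouped = defaultdict(list)
        for row, label in zip(rows, labels):
            grouped[label].append(row)
        self.centroids = {label: [sum(col) / len(members) for col in zip(*members)]
                          for label, members in grouped.items()}
        return self
    def predict(self, row):
        def dist(label):
            return sum((a - b) ** 2 for a, b in zip(row, self.centroids[label]))
        return min(self.centroids, key=dist)

def meta_features(models, text):
    # Concatenate the class probabilities of every level-0 model.
    return [p for m in models for p in m.predict_proba(text)]

def fit_stack(texts, labels, factories, k=3):
    classes = sorted(set(labels))
    rows = [None] * len(texts)
    for fold in range(k):
        train = [i for i in range(len(texts)) if i % k != fold]
        held = [i for i in range(len(texts)) if i % k == fold]
        models = [f(classes).fit([texts[i] for i in train],
                                 [labels[i] for i in train]) for f in factories]
        for i in held:  # level-1 training rows come only from held-out reports
            rows[i] = meta_features(models, texts[i])
    level1 = NearestCentroid().fit(rows, labels)
    level0 = [f(classes).fit(texts, labels) for f in factories]  # refit on all data
    return level0, level1

def predict_stack(stack, text):
    level0, level1 = stack
    return level1.predict(meta_features(level0, text))

# Hypothetical toy reports; a real training set would be far larger.
TEXTS = ["login page button misaligned", "packet loss on uplink interface",
         "button color wrong on settings page", "uplink timeout packet dropped",
         "page layout broken button hidden", "interface reset causes packet loss"]
LABELS = ["ui", "network", "ui", "network", "ui", "network"]

if __name__ == "__main__":
    stack = fit_stack(TEXTS, LABELS, [NaiveBayes, OverlapScorer], k=3)
    print(predict_stack(stack, "button missing on page"))
```

The level-1 learner sees only predictions made on reports that the level-0 models were not trained on, which is what lets it learn how far to trust each base classifier. Note that the folds here ignore submission dates; given the abstract's caution about cross-validation and old training data, an evaluation on real reports would presumably also hold out the most recent reports in time order.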

Keywords:
This article is indexed in SpringerLink and other databases.
