首页 | 本学科首页   官方微博 | 高级检索  
     


A component recommender for bug reports using Discriminative Probability Latent Semantic Analysis
Affiliation:1. State Key laboratory of Coal Mine Disaster Dynamics and Control, Chongqing 400044, PR China;2. School of Software Engineering, Chongqing University, Chongqing 401331, PR China;3. Key Laboratory of Dependable Service Computing in Cyber Physical, Society Ministry of Education, Chongqing 400044, PR China;1. Jordan University of Science and Technology, Irbid, Jordan, 22110;2. Jadara University, Irbid, Jordan;3. The University of Jordan, Amman, Jordan;4. Yarmouk University, Irbid, Jordan;5. The Hashemite University,Zarqa, Jordan;1. Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur, Malaysia;2. Department of Computer Science, Central Washington University, Ellensburg, WA 98926, USA
Abstract:ContextThe component field in a bug report provides important location information required by developers during bug fixes. Research has shown that incorrect component assignment for a bug report often causes problems and delays in bug fixes. A topic model technique, Latent Dirichlet Allocation (LDA), has been developed to create a component recommender for bug reports.ObjectiveWe seek to investigate a better way to use topic modeling in creating a component recommender.MethodThis paper presents a component recommender by using the proposed Discriminative Probability Latent Semantic Analysis (DPLSA) model and Jensen–Shannon divergence (DPLSA-JS). The proposed DPLSA model provides a novel method to initialize the word distributions for different topics. It uses the past assigned bug reports from the same component in the model training step. This results in a correlation between the learned topics and the components.ResultsWe evaluate the proposed approach over five open source projects, Mylyn, Gcc, Platform, Bugzilla and Firefox. The results show that the proposed approach on average outperforms the LDA-KL method by 30.08%, 19.60% and 14.13% for recall @1, recall @3 and recall @5, outperforms the LDA-SVM method by 31.56%, 17.80% and 8.78% for recall @1, recall @3 and recall @5, respectively.ConclusionOur method discovers that using comments in the DPLSA-JS recommender does not always make a contribution to the performance. The vocabulary size does matter in DPLSA-JS. Different projects need to adaptively set the vocabulary size according to an experimental method. In addition, the correspondence between the learned topics and components in DPLSA increases the discriminative power of the topics which is useful for the recommendation task.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号