首页 | 本学科首页   官方微博 | 高级检索  
     

基于信息检索的软件缺陷定位技术研究进展
引用本文:张芸,刘佳琨,夏鑫,吴明晖,颜晖. 基于信息检索的软件缺陷定位技术研究进展[J]. 软件学报, 2020, 31(8): 2432-2452
作者姓名:张芸  刘佳琨  夏鑫  吴明晖  颜晖
作者单位:浙江大学城市学院计算机与计算科学学院,浙江杭州310015;浙江大学计算机科学与技术学院,浙江杭州310007;3Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
基金项目:2019年度高层次留学回国人员在杭创新项目(多元融合的缺陷定位技术研究)
摘    要:缺陷定位是软件工程研究最活跃的领域之一.大部分软件缺陷都会被提交到类似于Bugzilla和Jira的缺陷追踪系统中.由于提交的缺陷报告数量过多,开发人员不能及时处理,因而迫切需要一个自动化工具来帮助开发人员识别缺陷相关源代码文件.研究人员已提出了大量缺陷定位技术.基于信息检索的软件缺陷定位技术(information retrieval-based bug localization,简称IRBL)利用了缺陷报告的文本特性,并且由于计算成本低、对不同的程序语言更具普适性而成为缺陷定位领域的研究热点,取得了一系列研究成果.然而,IRBL技术也在数据预处理、相似度计算和工程应用等方面存在诸多挑战.鉴于此,对现有的IRBL技术进行梳理总结,主要内容包括:(1)梳理了IRBL中数据预处理的过程和信息检索通用方法;(2)对IRBL技术中利用的数据特征进行了详细的分类和总结;(3)总结了技术评估中使用的性能评估指标;(4)归纳出了IRBL技术的关键问题;(5)展望了IRBL技术的未来发展.

关 键 词:缺陷报告定位  信息检索  软件工程
收稿时间:2020-02-16
修稿时间:2020-04-07

Research Progress on Software Bug Localization Technology Based on Information Retrieval
ZHANG Yun,LIU Jia-Kun,XIA Xin,WU Ming-Hui,YAN Hui. Research Progress on Software Bug Localization Technology Based on Information Retrieval[J]. Journal of Software, 2020, 31(8): 2432-2452
Authors:ZHANG Yun  LIU Jia-Kun  XIA Xin  WU Ming-Hui  YAN Hui
Affiliation:School of Computer & Computing Science, Zhejiang University City College, Hangzhou 310015, China;College of Computer Science and Technology, Zhejiang University, Hangzhou 310007, China;Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia
Abstract:Bug localization is one of the most active domains in software engineering. Most of the bugs are submitted to bug tracker systems, e.g., Bugzilla and Jira. Because of the large number of the submitted bug reports, it is difficult for developers to resolve these defects in time. Therefore, an automatic tool to help developers to identify bug related files is needed. Many bug localization technologies have been proposed by researchers. Taking advantages of the text nature of bug report, information retrieval technologies are adopted to solve bug localization problems. Due to the low computing cost and the applicability to various programming languages, information Retrieval-based Bug Localization (IRBL) technologies become hot spots in bug localization and acquire a series of achievements. However, challenges still exist in data preprocessing, similarity calculation and engineering application. Therefore, we summarize current IRBL technologies. The contributions of this paper are:(1) We summarize the data preprocess methods and general information retrieval algorithms. (2) We conclude and classify the feature categories. (3) We conclude the performance measures. (4) We highlight the current problems in IRBL technologies. (5)We outlook the trends of IRBL technologies.
Keywords:Bug report localization  Information retrieval  Software engineering
本文献已被 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号