首页 | 本学科首页   官方微博 | 高级检索  
     


Augmenting and structuring user queries to support efficient free-form code search
Authors:Raphael Sirres  Tegawendé F Bissyandé  Dongsun Kim  David Lo  Jacques Klein  Kisub Kim  Yves Le Traon
Affiliation:1.National Library of Luxembourg,Roosevelt,Luxembourg;2.Interdisciplinary Centre for Security, Reliability and Trust,University of Luxembourg,Kennedy,Luxembourg;3.School of Information Systems,Singapore Management University,Singapore,Singapore
Abstract:Source code terms such as method names and variable types are often different from conceptual words mentioned in a search query. This vocabulary mismatch problem can make code search inefficient. In this paper, we present COde voCABUlary (CoCaBu), an approach to resolving the vocabulary mismatch problem when dealing with free-form code search queries. Our approach leverages common developer questions and the associated expert answers to augment user queries with the relevant, but missing, structural code entities in order to improve the performance of matching relevant code examples within large code repositories. To instantiate this approach, we build GitSearch, a code search engine, on top of GitHub and Stack Overflow Q&A data. We evaluate GitSearch in several dimensions to demonstrate that (1) its code search results are correct with respect to user-accepted answers; (2) the results are qualitatively better than those of existing Internet-scale code search engines; (3) our engine is competitive against web search engines, such as Google, in helping users solve programming tasks; and (4) GitSearch provides code examples that are acceptable or interesting to the community as answers for Stack Overflow questions.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号