首页 | 本学科首页   官方微博 | 高级检索  
     


An information retrieval process to aid in the analysis of code clones
Authors:Robert Tairas  Jeff Gray
Affiliation:(1) Department of Computer and Information Sciences, University of Alabama at Birmingham, 1300 University Boulevard, Birmingham, AL 35294, USA
Abstract:The advent of new static analysis tools has automated the searching for code clones, which are duplicated or similar code fragments in a program. However, clone detection tools can report many clones if the source code that is being searched is large. Programmers may have difficulty comprehending the extensive results from the detection tool, which may inhibit the ability to maintain the identified clones. Latent Semantic Indexing (LSI) is an information retrieval technique that attempts to find relationships in a corpus based on the analysis of the documents in the corpus and the terms in the documents. In this paper, LSI is used to cluster clone classes that have been identified initially by a clone detection tool. The goal of this paper is to detect trends and associations among the clustered clone classes and determine if they provide further comprehension to assist in the maintenance of clones. Experimental evaluation of the approach is reported from a sequence of tools that are chained together to perform an analysis of clones detected in the Microsoft Windows NT kernel source code.
Contact Information Jeff GrayEmail:

Robert Tairas   is a Ph.D. student in the Department of Computer and Information Sciences at the University of Alabama at Birmingham (UAB) and a member of the Software Composition and Modeling (SoftCom) laboratory. His research interests include code clone analysis and model-driven engineering. He received an MS in Computer Science from UAB in 2005. MediaObjects/10664_2008_9089_Figa_HTML.jpg Jeff Gray   is an Associate Professor in the Department of Computer and Information Sciences at UAB where he co-directs the Software Composition and Modeling (SoftCom) laboratory. He received the Ph.D. in Computer Science from Vanderbilt University, and a MS and BS in Computer Science from West Virginia University. Jeff’s research interests include model-driven engineering, aspect-oriented software development, and generative programming. He is a 2007 NSF CAREER award winner and current Chair of the Alabama IEEE Computer Society. MediaObjects/10664_2008_9089_Figb_HTML.jpg
Keywords:Information retrieval  Program comprehension  Latent semantic indexing  Code clone analysis
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号