An information retrieval process to aid in the analysis of code clones |
| |
Authors: | Robert Tairas, Jeff Gray |
| |
Affiliation: | (1) Department of Computer and Information Sciences, University of Alabama at Birmingham, 1300 University Boulevard, Birmingham, AL 35294, USA |
| |
Abstract: | The advent of new static analysis tools has automated the searching for code clones, which are duplicated or similar code fragments in a program. However, clone detection tools can report many clones if the source code that is being searched is large. Programmers may have difficulty comprehending the extensive results from the detection tool, which may inhibit the ability to maintain the identified clones. Latent Semantic Indexing (LSI) is an information retrieval technique that attempts to find relationships in a corpus based on the analysis of the documents in the corpus and the terms in the documents. In this paper, LSI is used to cluster clone classes that have been identified initially by a clone detection tool. The goal of this paper is to detect trends and associations among the clustered clone classes and determine if they provide further comprehension to assist in the maintenance of clones. Experimental evaluation of the approach is reported from a sequence of tools that are chained together to perform an analysis of clones detected in the Microsoft Windows NT kernel source code. |
| |
Author biographies: | Robert Tairas is a Ph.D. student in the Department of Computer and Information Sciences at the University of Alabama at Birmingham (UAB) and a member of the Software Composition and Modeling (SoftCom) laboratory. His research interests include code clone analysis and model-driven engineering. He received an MS in Computer Science from UAB in 2005. Jeff Gray is an Associate Professor in the Department of Computer and Information Sciences at UAB, where he co-directs the Software Composition and Modeling (SoftCom) laboratory. He received a Ph.D. in Computer Science from Vanderbilt University and an MS and a BS in Computer Science from West Virginia University. His research interests include model-driven engineering, aspect-oriented software development, and generative programming. He is a 2007 NSF CAREER award winner and current Chair of the Alabama IEEE Computer Society. |
| |
Keywords: | Information retrieval, Program comprehension, Latent semantic indexing, Code clone analysis |
This article is indexed in SpringerLink and other databases. |
|