Source code fragment summarization with small-scale crowdsourcing based features
Authors:Najam Nazar  He Jiang  Guojun Gao  Tao Zhang  Xiaochen Li  Zhilei Ren
Affiliation:1. Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, School of Software, Dalian University of Technology, Dalian 116621, China; 2. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China; 3. Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China
Abstract:Recent studies have applied different approaches to summarizing software artifacts, yet very few efforts have been made to summarize the source code fragments available on the web. This paper investigates the feasibility of generating code fragment summaries by using supervised learning algorithms. We hire a crowd of ten individuals from the same workplace to extract source code features on a corpus of 127 code fragments retrieved from the Eclipse and NetBeans official frequently asked questions (FAQs). Human annotators suggest summary lines. Our machine learning algorithms achieve a precision of 82% and perform statistically better than existing code fragment classifiers. Evaluation of the algorithms on several statistical measures supports this result. The result is promising, since mechanisms such as data-driven crowd enlistment may improve the efficacy of existing code fragment classifiers.
Keywords:summarizing code fragments  supervised learning  crowdsourcing  
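Note on the precision figure: the abstract reports precision for selecting summary lines but does not give the computation. The following is a minimal sketch of how precision is conventionally computed for a line-selection task, assuming the classifier and the annotators each produce a set of line indices per code fragment. The function name `precision` and the sample data are hypothetical and are not taken from the paper.

```python
def precision(predicted, actual):
    """Fraction of predicted summary lines that annotators also selected."""
    predicted, actual = set(predicted), set(actual)
    if not predicted:
        # No lines predicted: define precision as 0.0 to avoid dividing by zero.
        return 0.0
    return len(predicted & actual) / len(predicted)


# Hypothetical data: line indices within a single code fragment.
annotator_lines = {0, 2, 5}      # lines the human annotators marked as summary lines
classifier_lines = {0, 2, 3, 5}  # lines a classifier predicted as summary lines

score = precision(classifier_lines, annotator_lines)
print(f"precision = {score:.2f}")  # prints "precision = 0.75"
```

Here three of the four predicted lines agree with the annotators, so precision is 3/4. How the paper aggregates scores across its 127 fragments is not stated in the abstract.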
This article is indexed in SpringerLink and other databases.
Source: Frontiers of Computer Science