Source code fragment summarization with small-scale crowdsourcing based features |
| |
Authors: | Najam Nazar, He Jiang, Guojun Gao, Tao Zhang, Xiaochen Li, Zhilei Ren |
| |
Affiliation: | 1. Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, School of Software, Dalian University of Technology, Dalian 116621, China; 2. State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China; 3. Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China |
| |
Abstract: | Recent studies have applied different approaches to summarizing software artifacts, yet very few efforts have been made to summarize the source code fragments available on the web. This paper investigates the feasibility of generating code fragment summaries by using supervised learning algorithms. We hire a crowd of ten individuals from the same workplace to extract source code features on a corpus of 127 code fragments retrieved from the Eclipse and NetBeans official frequently asked questions (FAQs). Human annotators suggest summary lines. Our machine learning algorithms achieve a precision of 82% and perform statistically better than existing code fragment classifiers. Evaluation of the algorithms on several statistical measures supports this result. The result is promising: employing mechanisms such as data-driven crowd enlistment can improve the efficacy of existing code fragment classifiers. |
| |
Keywords: | summarizing code fragments; supervised learning; crowdsourcing |
This article is indexed in SpringerLink and other databases. |
| The original abstract is available from Frontiers of Computer Science |