Effective approaches to combining lexical and syntactical information for code summarization

Authors:	Ziyi Zhou, Huiqun Yu, Guisheng Fan

Affiliation:	Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, China

Abstract:	Natural language summaries of source code are important during software development and maintenance. Recently, deep learning based models, which encode the token sequence or the abstract syntax tree (AST) of code with neural networks, have achieved good performance on automatic code summarization. However, there has been little work on efficiently combining the lexical and syntactical information of code to improve summarization quality. In this paper, we propose two general and effective approaches to leveraging both types of information: a convolutional neural network that aims to extract better vector representations of AST nodes for downstream models; and a Switch Network that learns an adaptive weight vector to combine different code representations for summary generation. We integrate these approaches into a comprehensive code summarization model, which includes a sequential encoder for the token sequence of code and a tree-based encoder for its AST. We evaluate our model on a large Java dataset. The experimental results show that our model outperforms several state-of-the-art models on various metrics, and that the proposed approaches contribute substantially to the improvements.

Keywords:	code summarization, deep learning, program comprehension
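
Illustration:	The abstract describes the Switch Network only as learning "an adaptive weight vector to combine different code representations" and gives no formula. The sketch below shows one common way such a combination can be realized: a per-dimension sigmoid gate that blends a token-sequence vector with an AST vector. The gate form, the function name `switch_combine`, and the parameters `weights` and `bias` are assumptions made for illustration and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def switch_combine(h_seq, h_tree, weights, bias):
    """Blend two equal-length vectors with a per-dimension gate.

    ASSUMED formulation (not from the paper):
        gate     = sigmoid(W [h_seq ; h_tree] + b)
        combined = gate * h_seq + (1 - gate) * h_tree

    h_seq   -- hypothetical representation from a sequential encoder
    h_tree  -- hypothetical representation from a tree-based encoder
    weights -- d rows, each of length 2 * d (learned in a real model)
    bias    -- d values (learned in a real model)
    """
    d = len(h_seq)
    if not (len(h_tree) == d and len(weights) == d and len(bias) == d):
        raise ValueError("dimension mismatch")

    concat = list(h_seq) + list(h_tree)

    # One gate value per output dimension: the adaptive weight vector.
    gate = [
        sigmoid(sum(w * x for w, x in zip(row, concat)) + b)
        for row, b in zip(weights, bias)
    ]

    # Convex combination of the two representations, dimension by dimension.
    combined = [g * s + (1.0 - g) * t for g, s, t in zip(gate, h_seq, h_tree)]
    return combined, gate
```

With all-zero `weights` and `bias`, every gate value is 0.5 and the result is the plain average of the two vectors. In a trained model the parameters would be learned, so the gate could favor the lexical or the syntactical representation differently for each dimension and each input.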
