首页 | 本学科首页   官方微博 | 高级检索  
     

结合句法特征和卷积神经网络的多意图识别模型
引用本文:杨春妮,冯朝胜. 结合句法特征和卷积神经网络的多意图识别模型[J]. 计算机应用, 2018, 38(7): 1839-1845. DOI: 10.11772/j.issn.1001-9081.2017122996
作者姓名:杨春妮  冯朝胜
作者单位:1. 四川师范大学 计算机科学学院, 成都 610101;2. 电子科技大学 信息与软件工程学院, 成都 610054
基金项目:国家自然科学基金资助项目(61373163);国家科技支撑计划项目(2014BAH11F02,2014BAH11F01)。
摘    要:短文本的多意图识别是口语理解(SLU)中的难题,因短文本的特征稀疏、字数少但包含信息量大,在分类问题中难以提取其有效特征。为解决该问题,将句法特征和卷积神经网络(CNN)进行结合,提出一种多意图识别模型。首先,将句子进行依存句法分析以确定是否包含多意图;然后,利用词频-逆文档频率(TF-IDF)和训练好的词向量计算距离矩阵,以确定意图的个数;其次,把该距离矩阵作为CNN模型的输入,进行意图分类;最后,判断每个意图的情感极性,计算用户的真实意图。采用现有的智能客服系统的真实数据进行实验,实验结果表明,结合句法特征的CNN模型在10个意图上的单分类精准率达到93.5%,比未结合句法特征的CNN模型高1.4个百分点;而在多意图识别上,精准率比其他模型提高约30个百分点。

关 键 词:口语理解  多意图识别  句法特征  卷积神经网络  自然语言  
收稿时间:2017-12-21
修稿时间:2018-02-08

Multi-intention recognition model with combination of syntactic feature and convolution neural network
YANG Chunni,FENG Chaosheng. Multi-intention recognition model with combination of syntactic feature and convolution neural network[J]. Journal of Computer Applications, 2018, 38(7): 1839-1845. DOI: 10.11772/j.issn.1001-9081.2017122996
Authors:YANG Chunni  FENG Chaosheng
Affiliation:1. School of Computer Science, Sichuan Normal University, Chengdu Sichuan 610101, China;2. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu Sichuan 610054, China
Abstract:Multi-Intention (MI) recognition of short texts is a problem in Spoken Language Understanding (SLU). The effective features of short texts are difficult to extract in classification problems because of sparse features of short texts and few words containing many information. To solve the problem, by combining syntactic features and Convolution Neural Network (CNN), a multi-intention recognition model was proposed. Firstly, the sentence was syntactically analyzed to determine whether it contains multi-intention. Secondly, the number of intentions and matrix of distance were calculated by using Term Frequency-Inverse Document Frequency (TF-IDF) and word embedding. Then the matrix of distance was acted as the input of CNN model to classify intentions. Finally, the emotional polarity of each intention was judged to return to the user's true intentions. The experiment was carried out by using the real data of the existing intelligent customer service system. The experimental results show that, the single classification precision of the combination model of syntactic features and CNN is 93.5% in 10 intentions, which is 1.4 percentage points higher than the original CNN model without syntactic features. And in mutil-intention recognition, the classification precision is 30 percentage points higher than others.
Keywords:Spoken Language Understanding (SLU)  Multi-Intention (MI) recognition  syntactic feature  Convolution Neural Network (CNN)  natural language  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号