首页 | 官方网站   微博 | 高级检索  
     

结合多头注意力机制的旅游问句分类研究
引用本文:吴迪,姜丽婷,王路路,吐尔根·依布拉音,艾山·吾买尔,早克热·卡德尔.结合多头注意力机制的旅游问句分类研究[J].计算机工程与应用,2022,58(3):165-171.
作者姓名:吴迪  姜丽婷  王路路  吐尔根·依布拉音  艾山·吾买尔  早克热·卡德尔
作者单位:1.新疆大学 软件学院,乌鲁木齐 830046 2.新疆大学 信息科学与工程学院,乌鲁木齐 830046
基金项目:国家重点研发计划子课题(2017YFB1002103);国家自然科学基金(61762084)。
摘    要:旅游问句具有长度较短,不严格按照语法规则的特点,导致该文本数据信息容量过少、口语化严重。充分理解问句表达的语义是提高旅游问句分类器性能面临的重要挑战,基于此,提出一个融合Bi-GRU、CNN与Multi-Head-Attention的旅游问句分类模型。该模型将预先训练的词向量和经Bi-GRU处理得到的语义信息进行融合,进行问句依赖关系学习,通过CNN和Multi-Head-Attention进行特征提取,以加强局部特征的学习,通过Softmax完成分类。实验结果表明,该模型在文本信息少、表述不规范的旅游问句分类任务中F1值达到了92.11%,优于现有的主流分类模型。

关 键 词:自然语言处理  旅游问句分类  双向门控循环单元(Bi-GRU)  卷积神经网络(CNN)  多头注意力机制  

Research on Classification of Tourist Questions Combined with Multi-head Attention Mechanism
WU Di,JIANG Liting,WANG Lulu,Tuergen Yibulayin,Aishan Wumaier,Zaokere Kadder.Research on Classification of Tourist Questions Combined with Multi-head Attention Mechanism[J].Computer Engineering and Applications,2022,58(3):165-171.
Authors:WU Di  JIANG Liting  WANG Lulu  Tuergen Yibulayin  Aishan Wumaier  Zaokere Kadder
Affiliation:1.College of Software, Xinjiang University, Urumqi 830046, China 2.College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
Abstract:Tourism questions have the characteristics of short length and not strictly following grammatical rules, which leads to too little information capacity of the text data and serious colloquial. It is an important challenge to fully understand the semantics of question expression to improve the performance of tourist question classifiers. To this end, a tourist question classification model combining Bi-GRU, CNN, and Multi-Head-Attention is proposed. The model fuses pre-trained word vectors and semantic information processed by Bi-GRU to learn question dependency, then extracts features through CNN and Multi-Head-Attention to strengthen the learning of local features, and finally completes classification through Softmax. The F1 score of this model achieves 92.11% in the classification task of tourism questions with less text information and irregular expression, which is superior to the existing mainstream classification model.
Keywords:natural language processing  tourism question classification  bidirectional gated recurrent unit(Bi-GRU)  convolutional neural network(CNN)  Multi-Head-Attention
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号