首页 | 本学科首页   官方微博 | 高级检索  
     

自注意力机制的短文本分类算法
引用本文:姚苗,杨文忠,袁婷婷,马国祥.自注意力机制的短文本分类算法[J].计算机工程与设计,2020,41(6):1592-1598.
作者姓名:姚苗  杨文忠  袁婷婷  马国祥
作者单位:新疆大学软件学院,新疆乌鲁木齐830046;新疆大学软件学院,新疆乌鲁木齐830046;新疆大学信息科学与工程学院,新疆乌鲁木齐830046;新疆大学信息科学与工程学院,新疆乌鲁木齐830046
基金项目:国家自然科学基金;新疆维吾尔自治区自然科学基金
摘    要:分析目前的短文本分类算法没有综合考虑文本中隐含的依赖关系和局部关键信息这一问题,提出基于自注意力机制(self-attention mechanism)的堆叠双向长短时记忆网络(stack bidirectional long short term memory)模型(简称Att-BLSTMs)。利用stack Bi-LSTMs捕获上下文隐藏依赖关系,优化短文本特征稀疏的问题;利用自注意力机制加大对短文本中局部关键信息的注意力,优化文本表示。在公开AG-news网页新闻的语料和DBpedia分类数据集中,进行丰富的对比实验。实验结果表明,该模型将文本中隐含依赖关系与局部关键信息综合考虑后,有效提高了短文本分类的准确性。

关 键 词:短文本分类  深度学习  自注意力机制  堆叠双向长短时记忆网络模型  微平均  宏平均

Short text classification algorithm of self-attention mechanism
YAO Miao,YANG Wen-zhong,YUAN Ting-ting,MA Guo-xiang.Short text classification algorithm of self-attention mechanism[J].Computer Engineering and Design,2020,41(6):1592-1598.
Authors:YAO Miao  YANG Wen-zhong  YUAN Ting-ting  MA Guo-xiang
Affiliation:(College of Software Engineering,Xinjiang University,Urumqi 830046,China;College of Information Science and Engineering,Xinjiang University,Urumqi 830046,China)
Abstract:The implicit dependencies and local key information in the current short text classification algorithm are not comprehensively considered.A stack bidirectional long short term memory model based on the self-attention mechanism was proposed.Stack Bi-LSTMs was used to mine the contextual semantic dependencies information to optimize feature representation.The attention mechanism was used to focus on key information of text to optimize the text representation.The public corpus of the AG-news web news and DBpedia were used to conduct a rich comparative experiment.It is pointed out that the accuracy of the short text classification is improved a lot by considering the implicit dependencies and local key information.
Keywords:short text classification  deep learning  self-attention mechanism  stack bidirectional long short term memory mo-del  micro-average  macro-average
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号