首页 | 本学科首页   官方微博 | 高级检索  
     

改进卷积神经网络的文本主题识别算法研究
引用本文:邱宁佳,杨长庚,王鹏,任涛.改进卷积神经网络的文本主题识别算法研究[J].计算机工程与应用,2022,58(2):161-168.
作者姓名:邱宁佳  杨长庚  王鹏  任涛
作者单位:长春理工大学 计算机科学技术学院,长春 130022
基金项目:吉林省教育厅“十三五”科学技术项目(JJKH20190600KJ)。
摘    要:针对于传统方法中存在的文本特征表示能力差、模型主题识别准确率低等问题,提出一种融合SENet与卷积神经网络的文本主题识别方法.将每个词对应的Word2vec词向量与LDA主题向量进行融合,并依据词语对主题的贡献度完成文档加权向量化处理;构建SECNN主题识别模型,使用SENet对卷积层输出的特征图进行权值的重标定,依靠...

关 键 词:主题识别  SENet  卷积神经网络  Word2vec  隐含狄利克雷分布(LDA)

Research on Text Topic Recognition Algorithm Based on Improved Convolutional Neural Network
QIU Ningjia,YANG Changgeng,WANG Peng,REN Tao.Research on Text Topic Recognition Algorithm Based on Improved Convolutional Neural Network[J].Computer Engineering and Applications,2022,58(2):161-168.
Authors:QIU Ningjia  YANG Changgeng  WANG Peng  REN Tao
Affiliation:College of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China
Abstract:Aiming at the problems of poor text feature representation ability and low model topic recognition accuracy in traditional methods, a text topic recognition method combining SENet and convolutional neural network is proposed. First, the Word2vec word vector corresponding to each word is fused with the LDA topic vector, and the document weighted vectorization process is completed according to the contribution of the word to the topic. Then the SECNN topic recognition model is constructed, and the SENet is used to perform the feature map output of the convolutional layer. The recalibration of weights relies on the performance of improving important features and suppressing useless features to efficiently perform topic identification. Finally, the FDA is used to evaluate the category representation ability of the sample, and the FDA-SGD algorithm is proposed to optimize the model parameters and complete the text topic recognition task. The news text data set is used to verify the effectiveness of the improved algorithm. The comparison with the traditional model shows that the improved algorithm can effectively improve the convergence speed of the model and has better topic recognition capabilities.
Keywords:topic recognition  SENet  convolutional neural network  Word2vec  latent Dirichlet allocation(LDA)
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号