首页 | 本学科首页   官方微博 | 高级检索  
     

字符级卷积神经网络短文本分类算法
引用本文:刘敬学,孟凡荣,周勇,刘兵.字符级卷积神经网络短文本分类算法[J].计算机工程与应用,2019,55(5):135-142.
作者姓名:刘敬学  孟凡荣  周勇  刘兵
作者单位:中国矿业大学 计算机科学与技术学院,江苏 徐州,221116;中国矿业大学 计算机科学与技术学院,江苏 徐州 221116;中国科学院 电子研究所,北京 100080
摘    要:由于短文本具有长度短、特征稀疏以及上下文依赖性强等特点,传统方法对其直接进行分类精度有限。针对该问题,提出了一种基于字符级嵌入的卷积神经网络(CNN)和长短时记忆网络(LSTM)相结合的神经网络模型进行短文本的分类。该模型同时包括了高速公路网络(Highway networks)框架,用于缓解深度神经网络训练时的困难,提高分类的准确性。通过对几种数据集的测试,结果表明提出的模型在短文本分类任务中优于传统模型和其他基于CNN的分类模型。

关 键 词:字符级  神经网络  文本分类  高速公路网络

Character-Level Convolutional Neural Networks for Short Text Classification
LIU Jingxue,MENG Fanrong,ZHOU Yong,LIU Bing.Character-Level Convolutional Neural Networks for Short Text Classification[J].Computer Engineering and Applications,2019,55(5):135-142.
Authors:LIU Jingxue  MENG Fanrong  ZHOU Yong  LIU Bing
Affiliation:1.College of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China 2.Insititute of Electrics, Chinese Academy of Sciences, Beijing 100080, China
Abstract:Since short text is characterized of the short length, sparse features and strong context dependency, the traditional models have a limited precision. Motivated by this, this article offers an empirical exploration on a character-level model which implements a combination of Convolutional Neural Network(CNN) and Long Short-Term Memory neural networks(LSTM) for short text classification. Including the highway networks framework so that it can address the difficult of training and improve the accuracy of classification. The evaluations on several datasets show that the proposed model outperforms the traditional and CNN-based models on short text classification mission.
Keywords:character-level  neural network  text classification  highway networks  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号