首页 | 本学科首页   官方微博 | 高级检索  
     

融合CNN和EWC算法的不平衡文本情绪分类方法
引用本文:程艳,朱海,项国雄,唐天伟,钟林辉,王国玮.融合CNN和EWC算法的不平衡文本情绪分类方法[J].中文信息学报,2020,34(4):92-100.
作者姓名:程艳  朱海  项国雄  唐天伟  钟林辉  王国玮
作者单位:1.江西师范大学 计算机信息工程学院,江西 南昌 330022;
2.江西师范大学 新闻与传播学院,江西 南昌 330022;
3.江西师范大学 管理决策评价研究中心,江西 南昌 330022
基金项目:国家自然科学基金(61967011);江西省科技重点项目(20161BBE50086);江西省教育厅科技重点项目(GJJ150299);江西省教育厅人文社科重点(重大)项目(文)(JD19056);江西省教育厅科学技术项目(GJJ170207)
摘    要:文本情绪分类是自然语言处理领域的一个基本任务。然而,基于不平衡数据的学习使得传统文本情绪分类方法的分类性能降低。针对这个问题,该文提出了一种融合CNN和EWC算法的不平衡文本情绪分类方法。首先,该方法使用随机欠采样方法得到多组平衡数据;其次,按顺序单独使用每一组平衡数据输入CNN训练,同时在训练过程中引入EWC算法用以克服CNN中的灾难性遗忘;最后,把使用最后一组平衡数据输入CNN训练得到的模型作为最终分类模型。实验结果表明,该方法在分类性能上明显优于基于欠采样和多分类算法的集成学习框架,且该方法比基于多通道LSTM神经网络的不平衡情绪分类方法在Accuracy和G-mean上分别提高了1.9%和2.1%。

关 键 词:情绪分类  不平衡分类  CNN  EWC算法  

Emotion Classification Based on CNN and EWC Algorithm for Unbalanced Texts
CHENG Yan,ZHU Hai,XIANG Guoxiong,TANG Tianwei,ZHONG Linhui,WANG Guowei.Emotion Classification Based on CNN and EWC Algorithm for Unbalanced Texts[J].Journal of Chinese Information Processing,2020,34(4):92-100.
Authors:CHENG Yan  ZHU Hai  XIANG Guoxiong  TANG Tianwei  ZHONG Linhui  WANG Guowei
Affiliation:1.School of Computer Information Engineering, Jiangxi Normal University, Nanchang, Jiangxi 330022, China;
2.School of Journalism and Communication, Jiangxi Normal University, Nanchang, Jiangxi 330022, China;
3.Management Decision Evaluation Research Center, Jiangxi Normal University, Nanchang, Jiangxi 330022, China
Abstract:Text emotion classification is a well-addressed task in the field of natural language processing. To deal with the unbalanced data which hurt the classification performance, this paper proposes an emotion classification method combining CNN and EWC algorithms. First, the method uses the random under-sampling method to obtain multiple sets of balanced data for training. Then it feeds each balanced dataset to CNN training in sequence, introducing EWC algorithm in the training process to overcome the catastrophic forgetting issue in CNN. Finally, the CNN model trained by the last data set is treated as the final classification model. The experimental results show that the proposed method is superior to the ensemble learning framework based on under-sampling and multi-classification algorithms, and outperforms the multi-channel LSTM neural network with 1.9% and 2.1% improvements in accuracy and G-mean, respectively.
Keywords:emotion classification  imbalanced classification  CNN  EWC algorithm  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号