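Sentiment Analysis of Tang and Song Poetry Based on Transfer Learning
Cite this article: WU Bin, JI Jia, MENG Lin, SHI Chuan, ZHAO Hui-dong, LI Yi-qing. Sentiment Analysis of Tang and Song Poetry Based on Transfer Learning[J]. Acta Electronica Sinica, 2016, 44(11): 2780-2787. DOI: 10.3969/j.issn.0372-2112.2016.11.030
Authors: WU Bin  JI Jia  MENG Lin  SHI Chuan  ZHAO Hui-dong  LI Yi-qing
Affiliation: Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876
Funding: National Basic Research Program of China (973 Program) (No. 2013CB329606); National Natural Science Foundation of China (Nos. 71231002, 61375058)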
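Abstract: With the rise of computational social science, using data mining to analyze social sentiment has become a recent research focus. Current research mainly targets modern text, and sentiment analysis of short texts such as ancient poetry is comparatively rare. This paper proposes CATL-PCO, a transfer learning model based on short-text feature expansion, which analyzes the sentiment of poems to gain further insight into the society and culture of their time. The model first expands the feature vectors of classical texts using frequent word pairs, then uses transfer learning to build three classifiers whose votes determine the final sentiment result. CATL-PCO first addresses the feature sparsity of classical short texts and, on that basis, addresses the difficulty of analyzing sentiment in ancient poetry caused by the scarcity of modern translations. This allows the sentiment orientation of classical poems to be analyzed accurately and, from the perspective of computational social science, improves the understanding of Chinese history. Experiments show that when the training set consists of Chinese Tang poems, the proposed method classifies the sentiment of Tang Dynasty poems accurately and can be applied to sentiment analysis of the individual periods of the Tang and Song Dynasties and to the analysis of representative schools.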

Keywords: sentiment analysis  social computing  Tang poetry and Song ci  transfer learning
Received: 2015-02-13
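
The feature-expansion step mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything here is an assumption made for illustration: the function names `frequent_pairs` and `expand`, the `min_support` threshold, the use of a co-occurrence probability as the weight of an added feature, and the toy English tokens standing in for segmented poems. It is not the authors' CATL-PCO implementation.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(docs, min_support=2):
    """Return unordered word pairs that co-occur in at least min_support docs."""
    pair_counts = Counter()
    for doc in docs:
        # Count each pair once per document, in a canonical (sorted) order.
        for pair in combinations(sorted(set(doc)), 2):
            pair_counts[pair] += 1
    return {pair: n for pair, n in pair_counts.items() if n >= min_support}


def expand(doc, pairs, doc_freq):
    """Build a sparse feature dict for one tokenized text (assumed scheme).

    Words in the text get weight 1.0. A word absent from the text but linked
    to a present word by a frequent pair gets the estimated probability
    P(partner | present) = pair_count / doc_freq[present].
    """
    present_words = set(doc)
    features = {w: 1.0 for w in present_words}
    for (a, b), n in pairs.items():
        for present, partner in ((a, b), (b, a)):
            if present in present_words and partner not in present_words:
                weight = n / doc_freq[present]
                features[partner] = max(features.get(partner, 0.0), weight)
    return features


# Toy, invented token lists standing in for segmented poems.
docs = [
    ["moon", "frost", "home"],
    ["moon", "home", "autumn"],
    ["moon", "wine"],
    ["wine", "flower"],
]
doc_freq = Counter(w for doc in docs for w in set(doc))
pairs = frequent_pairs(docs, min_support=2)
expanded = expand(["moon", "wine"], pairs, doc_freq)
print(expanded)
```

In this toy corpus only the pair ("home", "moon") reaches the support threshold, so the short text ["moon", "wine"] gains a "home" feature with weight 2/3. A sparse text therefore picks up extra, lower-weighted features from words that frequently accompany the ones it contains.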

Transfer Learning Based Sentiment Analysis for Poetry of the Tang Dynasty and Song Dynasty
WU Bin, JI Jia, MENG Lin, SHI Chuan, ZHAO Hui-dong, LI Yi-qing. Transfer Learning Based Sentiment Analysis for Poetry of the Tang Dynasty and Song Dynasty[J]. Acta Electronica Sinica, 2016, 44(11): 2780-2787. DOI: 10.3969/j.issn.0372-2112.2016.11.030
Authors: WU Bin  JI Jia  MENG Lin  SHI Chuan  ZHAO Hui-dong  LI Yi-qing
Affiliation: Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China
Abstract: With the rise of computational social science, analyzing social sentiment with data mining methods has attracted widespread attention and become a research hot spot in recent years. Existing work on sentiment analysis focuses mainly on modern text and rarely addresses ancient short-text literature. This paper proposes CATL-PCO (Correlation Analysis Transfer Learning-Probability Co-occurrence), a transfer learning model based on short-text feature extension. By analyzing sentiment in ancient literature, the paper aims to uncover the social and cultural development of the ancient era. CATL-PCO expands the feature vectors of ancient texts using frequent word pairs and uses transfer learning to train three sentiment classifiers. It addresses both the sparsity of short-text feature vectors and the scarcity of modern translations, which improves the understanding of Chinese history. Experiments demonstrate the effectiveness of the proposed method on a dataset of Chinese poems from the Tang Dynasty. In addition, different periods of the Tang and Song Dynasties and different genres are analyzed in detail.
Keywords: sentiment analysis  computational social science  poetry of the Tang Dynasty and Song Dynasty  transfer learning
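
The abstract states that three sentiment classifiers are trained and their outputs combined. A minimal Python sketch of one way to combine three predictions by majority vote follows. The function `majority_vote`, the tie-breaking rule, and the three keyword rules (`by_wine`, `by_frost`, `by_home`) are hypothetical stand-ins invented for illustration; the paper's actual classifiers and its transfer learning procedure are not reproduced here.

```python
from collections import Counter


def majority_vote(classifiers, features):
    """Return the label chosen by more than half of the classifiers.

    If no label has a strict majority (possible with more than two labels),
    fall back to the first classifier's vote. This tie rule is an assumption.
    """
    votes = [clf(features) for clf in classifiers]
    label, count = Counter(votes).most_common(1)[0]
    return label if count > len(votes) / 2 else votes[0]


# Hypothetical stand-in classifiers: simple keyword rules over a sparse
# feature dict, used only so the voting step can be run end to end.
def by_wine(features):
    return "positive" if features.get("wine", 0.0) > 0 else "negative"


def by_frost(features):
    return "negative" if features.get("frost", 0.0) > 0 else "positive"


def by_home(features):
    return "negative" if features.get("home", 0.0) > 0.5 else "positive"


sample = {"moon": 1.0, "wine": 1.0, "home": 0.67}
result = majority_vote([by_wine, by_frost, by_home], sample)
print(result)  # prints "positive": two of the three rules vote positive
```

With three voters and two labels a strict majority always exists, so the fallback only matters if more sentiment classes are used.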
This article is indexed in Wanfang Data and other databases.