首页 | 本学科首页   官方微博 | 高级检索  
     

基于连词的维吾尔语情感词库扩展研究
引用本文:刘若兰,年梅,玛尔哈巴·艾赛提. 基于连词的维吾尔语情感词库扩展研究[J]. 中文信息学报, 2018, 32(3): 49-54
作者姓名:刘若兰  年梅  玛尔哈巴·艾赛提
作者单位:1.新疆师范大学 计算机科学技术学院,新疆 乌鲁木齐 830054;
2.新疆师范大学 文学院,新疆 乌鲁木齐 830054
基金项目:国家自然科学基金(61163064);教育部人文社会科学工程科技人才培养专项(15JDGC022);新疆师范大学数据安全重点实验室资助;新疆师范大学计算机应用技术重点学科资助
摘    要:极性情感词是准确分析维吾尔文倾向性的基础资源。该文在前期构建的维吾尔语褒贬情感词典基础上进行网络情感词的自动扩展研究。首先分析维吾尔语情感表达的语言特征,总结了连词、程度副词与情感词的搭配规律,并基于此规律设计从情感语料库中获取候选情感词的算法,形成候选情感词库;最后再利用维吾尔语连词的特性,结合已创建的情感词典和维吾尔语反义词词典,以互联网作为超大规模语料库,设计基于搜索引擎的情感词极性判别算法,根据算法得分判别候选情感词的极性,再将其扩展到已构建的褒贬情感词库。实验结果表明,与扩展前的情感词库相比,使用互联网文本语料扩展后的情感词库后进行维吾尔语句子倾向性测评的准确率和召回率均有明显提高。

关 键 词:维吾尔语  情感词扩展  连词  程度副词  极性判断  

The Uyghur Emotional Lexicon Extension Based on Conjunctions
LIU Ruolan,NIAN Mei,Maierhaba Aisaiti. The Uyghur Emotional Lexicon Extension Based on Conjunctions[J]. Journal of Chinese Information Processing, 2018, 32(3): 49-54
Authors:LIU Ruolan  NIAN Mei  Maierhaba Aisaiti
Affiliation:1.Department of Computer Science and Technology, Xinjiang Normal University,Urumqi, Xinjiang 830054, China;
2.Department of Literature, Xinjiang Normal University, Urumqi, Xinjiang 830054,China
Abstract:Emotion words are the fundamental resource for accurately analysis the opinions of the Uighur language. We investigates the automatic expansion of the web emotional words on the basis of an existing Uighur sentiment lexicon. First, we summarize the collocation rules of the conjunctions, degree adverbs and sentiment words by analyzing the linguistic features of Uighur emotional expression. Based on the rules, we design an algorithm to obtain the candidate emotional words from emotional corpus, forming the candidate sentiment lexicon. Finally, we use the Internet as a super-large corpus to design the emotional discriminant algorithm based on search engine by reusing the characteristics of Uighur conjunctions and combining with the established emotional lexicon and Uighur antonyms dictionary. The polarity of candidate emotional words is decided according to the score calculated by the algorithm, and then add them to the emotional lexicon. Compared with the emotional lexicon that was not expanded, the experimental results showed that the accuracy and recall rate of Uyghur sentence‘s tendency are significantly improved by our extended dictionary.
Keywords:Uyghur    expansion of the emotional words    conjunctions    degree adverbs    polarity judgment  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号