首页 | 本学科首页   官方微博 | 高级检索  
     

基于词项共现关系图模型的中文观点句识别研究
引用本文:王明文,付翠琴,徐 凡,洪 欢.基于词项共现关系图模型的中文观点句识别研究[J].中文信息学报,2015,29(6):185-192.
作者姓名:王明文  付翠琴  徐 凡  洪 欢
作者单位:江西师范大学,计算机信息工程学院,江西 南昌 330022
基金项目:国家自然科学基金(61272212,61163006,61203313,61365002,61402208)
摘    要:不同于传统的词项间强独立性假设的词袋模型驱动的观点句识别方法,该文提出了一种新型的基于词项共现关系的图模型方法。该方法通过构建词项共现关系图模型,利用词项与词项之间的共现性和句法关系来描述词项在观点句和非观点句集合中的分布差异,同时采用基于入度的词项权重计算方法来计算词项特征值。上述研究在基准语料上进行实验,实验表明采用基于词项关系图模型方法后,中文观点句识别准确率相比目前基于词袋的方法得到显著提升。


关 键 词:词项共现  图模型  观点句识别  特征值  有监督学习  
  

A New Chinese Subjective Sentences Recognition Method Based on Word Co-occurrence Relationship Graphic Model
WANG Mingwen,FU Cuiqin,XU Fan,HONG Huan.A New Chinese Subjective Sentences Recognition Method Based on Word Co-occurrence Relationship Graphic Model[J].Journal of Chinese Information Processing,2015,29(6):185-192.
Authors:WANG Mingwen  FU Cuiqin  XU Fan  HONG Huan
Affiliation:School of Computer Information Engineering, Jiangxi Normal University, Nanchang, Jiangxi 330022, China)
Abstract:Different from the traditional term independence assumption-based bag-of-words model, we present a new word co-occurrence relationship-based graphic model. Our model describes the distribution difference among the terms within both subjective and non-subjective sentences sets via the term co-occurrence and syntactic information, also integrates an indegree-based term weighting calculation method. Evaluation on the benchmark dataset shows the importance of the term co-occurrence graphic model. It also shows that our model significantly outperforms the bag-of-words model currently in the subjective sentence identification field.
Key words word co-occurrence; graphic model; subjective sentence identification; feature value; supervised learning


Keywords:word co-occurrence  graphic model  subjective sentence identification  feature value  supervised learning  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号