首页 | 本学科首页   官方微博 | 高级检索  
     

用于细颗粒度挖掘的产品评论语料库构建技术
引用本文:刘远超,宋明凯,刘铭,张想.用于细颗粒度挖掘的产品评论语料库构建技术[J].哈尔滨工业大学学报,2012,44(3):64-68.
作者姓名:刘远超  宋明凯  刘铭  张想
作者单位:哈尔滨工业大学计算机科学与技术学院;哈尔滨工业大学计算机科学与技术学院;哈尔滨工业大学计算机科学与技术学院;哈尔滨工业大学计算机科学与技术学院
基金项目:教育部人文社会科学研究青年基金资助项目(10YJCZH099);中央高校基本科研业务费专项资金资助项目(HIT.NSRIF.2009065);语言语音教育部-微软重点实验室开放基金资助项目(HIT.KLOF.2009022)
摘    要:为了辅助进行产品评论中特征-观点对识别的挖掘工作,对细颗粒度产品评论语料库的构建技术进行了研究.介绍了用于产品评论细颗粒度挖掘的语料库构建方法,以及目前初步进行的语料标注工作.标注数据可以数据库形式存储,从而实现了无结构化到结构化的转变,为自动查询等处理提供了极大方便.实验结果表明:虽然文中的标注方法以手机产品为例,但具有良好的移植性,可以应用到其他产品评论的细颗粒度语料库构建.相应的语料库构建对于高性能机器学习方法的应用、特征-观点对识别算法的性能提高以及自动评价等具有重要意义.

关 键 词:产品意见挖掘  细颗粒度语料库构建  语料标注

Research of product review corpus constructing technology for fine-granularity mining
LIU Yuan-chao,SONG Ming-kai,LIU Ming and ZHANG Xiang.Research of product review corpus constructing technology for fine-granularity mining[J].Journal of Harbin Institute of Technology,2012,44(3):64-68.
Authors:LIU Yuan-chao  SONG Ming-kai  LIU Ming and ZHANG Xiang
Affiliation:School of Computer Science And Technology,Harbin Institute of Technology,150001 Harbin,China;School of Computer Science And Technology,Harbin Institute of Technology,150001 Harbin,China;School of Computer Science And Technology,Harbin Institute of Technology,150001 Harbin,China;School of Computer Science And Technology,Harbin Institute of Technology,150001 Harbin,China
Abstract:Quantitative analysis and mining of product reviews posted by users are helpful for both manufacturers and consumers.During the work of fine-granularity product review mining,extracting feature-opinion pair is one of the core works.The corresponding corpus construction is of great significance for the application of high performance machine learning methods,improving the performance of feature-opinion extraction algorithm and automatic evaluation.This article introduces corpus constructing technology for fine-granularity product review mining and the initial corpus labeling work,thus realizing non-structured to structural changes.The corpus can be stored in database and thus provide great convenience for automatic query processing.Although current labeling work was performed in mobile phone products,it can be applied also to other product types for fine granularity corpus construction.So our work has good transplantation ability.
Keywords:product review mining  fine-granularity corpus construction  corpus annotation
本文献已被 CNKI 等数据库收录!
点击此处可从《哈尔滨工业大学学报》浏览原始摘要信息
点击此处可从《哈尔滨工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号