首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于弱监督学习的论坛帖子对话行为分类方法
引用本文:孙承杰,林磊,刘秉权.一种基于弱监督学习的论坛帖子对话行为分类方法[J].中文信息学报,2014,28(6):156-161.
作者姓名:孙承杰  林磊  刘秉权
作者单位:哈尔滨工业大学 计算机科学与技术学院,黑龙江 哈尔滨 150001
基金项目:国家自然科学基金(61100094, 61300114)
摘    要:论坛帖子对话行为分类可以明确每个帖子在当前线索中的角色,有助于重构论坛线索中的对话关系,提高论坛信息检索的效果。该文提出了一种基于弱监督学习的论坛帖子对话行为分类方法,把帖子的对话行为分类作为线索的序列标注问题来解决。该方法的特点是只要指定合理的特征约束,就可以训练对话行为分类模型。方法在CNET和edX数据集上的分类精确率分别达到75.6%和60.7%,优于有监督的条件随机域方法。

关 键 词:弱监督学习  特征约束  对话行为分类  论坛线索结构分析  

A Weakly Supervised Learning Method for Forum Posts Dialogue Act Classification
SUN Chengjie,LIN Lei,LIU Bingquan.A Weakly Supervised Learning Method for Forum Posts Dialogue Act Classification[J].Journal of Chinese Information Processing,2014,28(6):156-161.
Authors:SUN Chengjie  LIN Lei  LIU Bingquan
Affiliation:School of lompater Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China
Abstract:Dialogue act classification for online forum post can indicate the role of a post in a thread, which is helpful for reconstructing the conversation relation in a thread and improving the performance of forum retrieval. This paper proposes a weakly supervised learning method for online forum post dialogue act classification, which trests the posts dialogue act classification as sequential labeling problem for threads. The proposed approach can lean the model for dialogue act classification with feature constrains and unlabeled data. It achieved an accuracy of 75.6% and 60.7% in CNET data set and edX data set respectively, which are better that the performances of supervised CRF model.
Keywords:weakly supervised learning  feature constrains  dialogue act classification  forum thread structure analysis  
本文献已被 CNKI 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号