首页 | 本学科首页   官方微博 | 高级检索  
     

基于差异性和重要性的问句特征组合
引用本文:杨思春,高超,戴新宇,尹存燕,陈家骏.基于差异性和重要性的问句特征组合[J].电子学报,2014,42(5):918-924.
作者姓名:杨思春  高超  戴新宇  尹存燕  陈家骏
作者单位:1. 南京大学计算机软件新技术国家重点实验室, 江苏南京 210023;2. 安徽工业大学计算机学院, 安徽马鞍山 243002;3. 安徽工程大学机电学院, 安徽芜湖 241000
基金项目:国家自然科学基金(No.61170181);江苏省自然科学基金(No.BK2011192);安徽省高校省级自然科学研究重点项目(No.KJ2011A048)
摘    要:在问答系统问句分类研究中,对问句特征进行组合有助于构造高效的问句分类器.针对当前问句分类中的特征组合问题,提出一种基于差异性和重要性的特征组合 (Diversity and Importance based Feature Combination,DIFC)方法.通过计算待组合特征与当前特征组合的错分差异度和正分差异度,以及待组合特征本身的重要度,从候选特征集中动态获取优化的特征组合.在哈工大中文问句集上对词袋绑定特征进行组合的实验结果表明,与其他特征组合方法相比,DIFC方法灵活高效,准确率更高.

关 键 词:问句分类  特征组合  差异性  重要性  
收稿时间:2013-04-17

Combining Features of Question Based on Diversity and Importance
YANG Si-chun,GAO Chao,DAI Xin-yu,YIN Cun-yan,CHEN Jia-jun.Combining Features of Question Based on Diversity and Importance[J].Acta Electronica Sinica,2014,42(5):918-924.
Authors:YANG Si-chun  GAO Chao  DAI Xin-yu  YIN Cun-yan  CHEN Jia-jun
Affiliation:1. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210023, China;2. School of Computer Science, Anhui University of Technology, Maanshan, Anhui 243002, China;3. College of Mechanical & Electrical Engineering, Anhui Polytechnic University, Wuhu, Anhui 241000, China
Abstract:In research on question classification in question answering system,combining features can greatly help construct efficient question classifier.In order to deal with the problem of low performance of existing methods,a new method of diversity and importance based feature combination(DIFC) is proposed.By calculating the diversity between candidate feature and current combination for error and correct classification respectively,and the importance of candidate feature,features can be dynamically selected from candidate feature set.The experimental results of bag-of-words binding features on the HIT Chinese question set show that,compared with other methods,the new method is flexible and efficient,and gets more optimal feature combination.
Keywords:question answering system  question classification  feature combination  diversity  importance  
本文献已被 CNKI 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号