首页 | 本学科首页   官方微博 | 高级检索  
     

口语对话中冗余词汇识别方法研究
引用本文:翟飞飞,宗成庆.口语对话中冗余词汇识别方法研究[J].中文信息学报,2011,25(3):104-112.
作者姓名:翟飞飞  宗成庆
作者单位:中国科学院 自动化研究所 模式识别国家重点实验室,北京 100190
基金项目:国家自然科学基金,中国科学院对外合作交流项目,中国新加坡数字媒体研究院资助
摘    要:冗余现象是口语对话中普遍存在的特殊语言现象之一,它的存在常常会影响口语句子的理解和翻译。该文基于真实口语对话语料对冗余现象进行了分析,并在词汇层面对冗余现象进行了分类,然后对口语中的冗余词汇进行了统计识别方法研究。通过对冗余词汇处理前后的口语句子翻译实验,结果表明,预先对冗余现象进行处理,能够改善口语翻译的译文质量。

关 键 词:冗余现象    最大熵分类器    支持向量机    条件随机场  

Research on Recognition of Fillers in Spoken Dialogs
ZHAI Feifei,ZONG Chengqing.Research on Recognition of Fillers in Spoken Dialogs[J].Journal of Chinese Information Processing,2011,25(3):104-112.
Authors:ZHAI Feifei  ZONG Chengqing
Affiliation:National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Abstract:Fillers and redundancy are the most common phenomena in spoken dialogs. It always influences the results of spoken language understanding and translation system. Based on the analysis and statistical classification of fillers in lexical level of spoken dialog corpus, we propose statistical methods to recognize the fillers. Experiments on translation of the spoken sentences before and after processing of the fillers have been conducted. The experimental results have shown that the performance of spoken language translation system is significantly improved if the fillers are processed before translating.
Key wordsfillers; maximum entropy classifier; SVM; CRFs
Keywords:fillers  maximum entropy classifier  SVM  CRFs  
本文献已被 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号