首页 | 本学科首页   官方微博 | 高级检索  
     

基于音素混淆模型的集外词查询项扩展方法
引用本文:陆梨花,张连海.基于音素混淆模型的集外词查询项扩展方法[J].信息工程大学学报,2014,15(4):459-465.
作者姓名:陆梨花  张连海
作者单位:信息工程大学,河南郑州450001
基金项目:国家自然科学基金资助项目(61175017)
摘    要:为了提高语音查询项检索系统中集外词检索的性能,在加权有限状态转换器(weightedfinite-state transducer,WFST)框架下提出了一种基于音素混淆模型的集外词查询项扩展技术,将查询项扩展成多发音序列来解决集外词问题.首先由G2P(grapheme-to-phoneme)模型生成查询项的发音序列,然后利用音素混淆模型将发音序列扩展成N-best发音,以补偿识别错误造成Lattice建立的索引与查询项发音序列之间音素表示差异带来的影响,从而有效降低漏警率.实验结果表明,加入音素混淆模型之后,系统集外词检索性能有明显提升.

关 键 词:集外词查询项扩展  音素混淆模型  加权有限状态转换器  语音查询项检索

Query Expansion Method for Out-of-Vocabulary Based on Phonetic Confusion Model
LU Li-hua,ZHANG Lian-hai.Query Expansion Method for Out-of-Vocabulary Based on Phonetic Confusion Model[J].Journal of Information Engineering University,2014,15(4):459-465.
Authors:LU Li-hua  ZHANG Lian-hai
Affiliation:(Information Engineering University, Zhengzhou 450001, China)
Abstract:To improve the performance of spoken term detection systems, a query expansion method for out-of-vocabulary (OOV) based on phonetic confusion model is presented in the weighted finite- state transducer framework (WFST). The problem of OOV is solved by expanding the queries to multiple pronunciation sequences. First, a pronunciation sequence is generated by grapheme-to-pho- neme model; then, the pronunciation sequence is expanded to N-best sequences by phonetic confusion model to compensate for potential differences caused by recognition errors in deriving index and query representations, thus reducing the missing alarm rate effectively. The experimental results show that the OOV retrieval performance of the system is improved significantly by the expansion based on phonetic confusion model.
Keywords:query expansion for out-of-vocabulary  grapheme-to-phoneme  phonetic confusion mod-el  weighted finite-state transducer  spoken term detection
本文献已被 维普 等数据库收录!
点击此处可从《信息工程大学学报》浏览原始摘要信息
点击此处可从《信息工程大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号