首页 | 本学科首页   官方微博 | 高级检索  
     


Efficient data selection for speech recognition based on prior confidence estimation using speech and monophone models
Affiliation:1. Universidade Federal do Ceará, Av. José de Freitas Queiroz, 5003 – Cedro CEP 63900-000, Quixadá, CE, Brazil;2. Universidade Estadual do Ceará, Fortaleza, CE, Brazil;3. IBM Research Brazil, Rio de Janeiro, RJ, Brazil;4. Universidade do Vale do Rio dos Sinos, São Leopoldo, RS, Brazil;5. Universidade Federal do Ceará, Quixadá, CE, Brazil
Abstract:This paper proposes an efficient speech data selection technique that can identify those data that will be well recognized. Conventional confidence measure techniques can also identify well-recognized speech data. However, those techniques require a lot of computation time for speech recognition processing to estimate confidence scores. Speech data with low confidence should not go through the time-consuming recognition process since they will yield erroneous spoken documents that will eventually be rejected. The proposed technique can select the speech data that will be acceptable for speech recognition applications. It rapidly selects speech data with high prior confidence based on acoustic likelihood values and using only speech and monophone models. Experiments show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while providing equivalent data selection performance for speech recognition and spoken document retrieval.
Keywords:Speech recognition  Spoken document retrieval  Data selection  Context independent model  Gaussian mixture model
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号