首页 | 本学科首页   官方微博 | 高级检索  
     


Phone-based filter parameter optimization of filter and sum robust speech recognition using likelihood maximization
Authors:Bahram Kouhi-Jelehkaran  Hamidreza Bakhshi  Farbod Razzazi
Affiliation:1. Department of Electrical Engineering, Islamic Azad University, Science and Research Branch, Tehran, Iran;2. Department of Electrical Engineering, Shahed University, Tehran, Iran
Abstract:Because of noise and reverberation, accuracy of speech recognition systems decreases when the distance between talker and microphone increases. By the using of microphone arrays and appropriate filtering of received signals, the accuracy of recognizer can be increased. Many different methods for using microphone arrays have been proposed that can be classified into two main approaches: systems that perform in two independent stages of array processing and then recognition and systems that use array processing to generate a sequence of features which maximize the likelihood of generating the correct hypothesis in recognition phase. Following second approach, in this paper a new method for microphone array processing is proposed in which the parameters of array processing are adjusted in calibration phase based on phones used in language and maximum likelihood method. Optimized filter parameters are stored and used during recognition phase. A new modified Viterbi algorithm using optimal phone-based filter parameters is used for recognition phase. The proposed algorithm is analytically formulated and Persian language is used to find any improvement in speech recognition accuracy compared with results of delay and sum and utterance-based filter and sum algorithms. The results show 12.2% improvement in accuracy compared to utterance-based algorithm.
Keywords:Maximum likelihood filtering optimization  Microphone array processing  Robust speech recognition
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号