Speech recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Speech recognition

Authors:	Alotaibi YA Shahsavari MM

Affiliation:	Dept. of Electr. & Comput. Eng., Florida Inst. of Technol., Melbourne, FL;

Abstract:	The authors have designed, successfully trained and tested an Arabic speech recognition system. This system was implemented using C++ programming language on Windows 95. It can be partitioned into five main modules. These modules are the front-end, feature extraction, training, pattern recognition and decision making and display. The front-end module functions as signal preparation and calibration. This includes: setting the signal sampling rate, removing the DC component from the signal, setting the scaling factor of the signal and detecting the endpoints of the utterance. The endpoint task removes the non-speech signal portions created by the speaker's pauses. This reduces the system computation time needed and the memory requirements. The feature extraction module is mainly a digital signal processor. The training module is the one that finds the best templates for every word or sound (phonemes) in the system's database. In short, this module needs to be executed only one time before users can utilize the system. The next module is the pattern recognition module. Its function is to compare the given utterance (test utterance) to all the stored templates (the reference module). The decision and display module functions as an interface between the user and the hidden system modules. In other words, after getting the recognition module results, this module displays the best candidate(s) and/or their likelihood percentage. The error rates are computed and displayed in this module

Keywords:

设为首页 | 免责声明 | 关于勤云 | 加入收藏