Evaluation of discrete transforms for use in digital speech recognition |
| |
Authors: | H.A. Barger, K.R. Rao |
| |
Affiliation: | Collins Radio Group of Rockwell International, Dallas, Texas, U.S.A.; Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX 76019, U.S.A. |
| |
Abstract: | Traditionally, the FFT (fast Fourier transform) has been used in speech recognition algorithms. Other discrete transforms, such as the Walsh-Hadamard transform (WHT) and the rapid transform (RT), can play equally important roles in the recognition process, as they have advantages in implementation and hardware realization. The capability of these transforms to recognize phonemes, based on training matrices and various matching criteria, is investigated. The speech database consists of ten sentences spoken by ten different speakers (all male). For recognition purposes, the speech is sampled at 20 kHz and sectioned into 10 ms intervals. Training matrices for all three transforms are developed. Test matrices in the transform domain are compared with the prototypes using these criteria, which leads to the decision process. The WHT and RT appear to offer promise compared with the FFT, as they are easier to implement and yield recognition results comparable to those of the FFT. Other distance measures and recognition schemes are proposed for improving the classification accuracy. |
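As a rough illustration of the pipeline the abstract describes (10 ms frames at 20 kHz, one transform per frame, comparison against per-phoneme prototypes), a minimal Python sketch follows. The zero-padding of each 200-sample frame to 256 points, the squared Euclidean distance, the output ordering of the transforms, and all function names are assumptions made for illustration; the abstract does not state how the authors handled frame length or which matching criteria they used.

```python
SAMPLE_RATE_HZ = 20_000  # from the abstract
FRAME_MS = 10            # from the abstract
PADDED_LEN = 256         # assumption: next power of two above 200 samples


def frame_signal(samples, frame_len):
    """Split samples into consecutive non-overlapping frames; drop a partial tail."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def pad_to(frame, n):
    """Zero-pad a frame to length n (assumed preprocessing, not from the paper)."""
    return list(frame) + [0] * (n - len(frame))


def fwht(x):
    """Unnormalized fast Walsh-Hadamard transform, natural (Hadamard) ordering."""
    a = list(x)
    n = len(a)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    h = 1
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                u, v = a[j], a[j + h]
                a[j], a[j + h] = u + v, u - v  # butterfly: sum and difference
        h *= 2
    return a


def rapid_transform(x):
    """Rapid transform: WHT-like butterflies using the absolute difference.

    The result is unchanged by a cyclic shift of the input.
    """
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return list(x)
    half = n // 2
    sums = [x[i] + x[i + half] for i in range(half)]
    diffs = [abs(x[i] - x[i + half]) for i in range(half)]
    return rapid_transform(sums) + rapid_transform(diffs)


def nearest_prototype(features, prototypes):
    """Return the label whose prototype has the smallest squared Euclidean distance."""
    def dist(label):
        return sum((f - p) ** 2 for f, p in zip(features, prototypes[label]))
    return min(prototypes, key=dist)


if __name__ == "__main__":
    frame_len = SAMPLE_RATE_HZ * FRAME_MS // 1000  # 200 samples per 10 ms frame
    signal = [(i % 7) - 3 for i in range(450)]     # synthetic stand-in for speech
    for frame in frame_signal(signal, frame_len):
        padded = pad_to(frame, PADDED_LEN)
        print(fwht(padded)[:4], rapid_transform(padded)[:4])
```

Both transforms above use only additions and subtractions, which is the kind of implementation advantage over the FFT that the abstract refers to.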
| |
Keywords: | |
This article is indexed in ScienceDirect and other databases. |
|