Autoregressive modeling of speech trajectory transformed to the reconstructed phase space for ASR purposes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Autoregressive modeling of speech trajectory transformed to the reconstructed phase space for ASR purposes

Authors:	Yasser Shekofteh Farshad Almasganj

Affiliation:	Biomedical Engineering Department, Amirkabir University of Technology, Hafez Avenue, P.O. Box 15875-4413, Tehran, Iran

Abstract:	Investigating new effective feature extraction methods applied to the speech signal is an important approach to improve the performance of automatic speech recognition (ASR) systems. Owing to the fact that the reconstructed phase space (RPS) is a proper field for true detection of signal dynamics, in this paper we propose a new method for feature extraction from the trajectory of the speech signal in the RPS. This method is based upon modeling the speech trajectory using the multivariate autoregressive (MVAR) method. Moreover, in the following, we benefit from linear discriminant analysis (LDA) for dimension reduction. The LDA technique is utilized to simultaneously decorrelate and reduce the dimension of the final feature set. Experimental results show that the MVAR of order 6 is appropriate for modeling the trajectory of speech signals in the RPS. In this study recognition experiments are conducted with an HMM-based continuous speech recognition system and a naive Bayes isolated phoneme classifier on the Persian FARSDAT and American English TIMIT corpora to compare the proposed features to some older RPS-based and traditional spectral-based MFCC features.

Keywords:	Feature extraction Speech recognition system Signal embedding Reconstructed phase space Multivariate autoregressive model
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏