Advanced classification approach for neuronal phoneme recognition system based on efficient constructive training algorithm |
| |
Authors: | Mondher Frikha Sabeur Masmoudi Ahmed Ben Hamida Mohamed Chtourou |
| |
Affiliation: | 1. Advanced Technologies for Medical and Signals, Research Unity, ATMS, ENIS, Sfax University, Sfax, Tunisia 2. Control & Energies Managment (CEM) Lab, Research Laboratory, ENIS, Sfax University, Sfax, Tunisia
|
| |
Abstract: | This paper introduces a neural network optimization procedure allowing the generation of multilayer perceptron (MLP) network topologies with few connections, low complexity and high classification performance for phoneme’s recognition. An efficient constructive algorithm with incremental training using a new proposed Frame by Frame Neural Networks (FFNN) classification approach for automatic phoneme recognition is thus proposed. It is based on a novel recruiting hidden neuron’s procedure for a single hidden-layer. After an initializing phase started with initial small number of hidden neurons, this algorithm allows the Neural Networks (NNs) to adjust automatically its parameters during the training phase. The modular FFNN classification method is then constructed and tested to recognize 5 broad phonetic classes extracted from the TIMIT database. In order to take into account the speech variability related to the coarticulation effect, a Context Window of Three Successive Frame’s (CWTSF) analysis is applied. Although, an important reduction of the computational training time is observed, this technique penalized the overall Phone Recognition Rate (PRR) and increased the complexity of the recognition system. To alleviate these limitations, two feature dimensionality reduction techniques respectively based on Principal Component Analysis (PCA) and Self Organizing Maps (SOM) are investigated. It is observed an important improvement in the performance of the recognition system when the PCA technique is applied. Optimal neuronal phone recognition architecture is finally derived according to the following criteria: best PRR, minimum computational training time and complexity of the BPNN architecture. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|