首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Speech and speaker recognition is an important topic to be performed by a computer system. In this paper, an expert speaker recognition system based on optimum wavelet packet entropy is proposed for speaker recognition by using real speech/voice signal. This study contains both the combination of the new feature extraction and classification approach by using optimum wavelet packet entropy parameter values. These optimum wavelet packet entropy values are obtained from measured real English language speech/voice signal waveforms using speech experimental set. A genetic-wavelet packet-neural network (GWPNN) model is developed in this study. GWPNN includes three layers which are genetic algorithm, wavelet packet and multi-layer perception. The genetic algorithm layer of GWPNN is used for selecting the feature extraction method and obtaining the optimum wavelet entropy parameter values. In this study, one of the four different feature extraction methods is selected by using genetic algorithm. Alternative feature extraction methods are wavelet packet decomposition, wavelet packet decomposition – short-time Fourier transform, wavelet packet decomposition – Born–Jordan time–frequency representation, wavelet packet decomposition – Choi–Williams time–frequency representation. The wavelet packet layer is used for optimum feature extraction in the time–frequency domain and is composed of wavelet packet decomposition and wavelet packet entropies. The multi-layer perceptron of GWPNN, which is a feed-forward neural network, is used for evaluating the fitness function of the genetic algorithm and for classification speakers. The performance of the developed system has been evaluated by using noisy English speech/voice signals. The test results showed that this system was effective in detecting real speech signals. The correct classification rate was about 85% for speaker classification.  相似文献   

2.
In this paper, we present automatic classification models for ultrasonic flaw signals acquired from carbon-fiber-reinforced polymer specimens. Different state-of-the-art strategies based on wavelet transform are utilized for feature extraction. Furthermore, a wavelet packet transform-based local energy feature extraction method is proposed to solve the deficiencies of the existing methods. Artificial neural networks and support vector machines are trained to validate the effectiveness of different feature extraction methods for flaw signal classification. Experimental results show that the proposed method can extract reliable features to effectively classify the different ultrasonic flaw signals with high accuracy.  相似文献   

3.
This paper presents the study of speaker identification for security systems based on the energy of speaker utterances. The proposed system consisted of a combination of signal pre-process, feature extraction using wavelet packet transform (WPT) and speaker identification using artificial neural network. In the signal pre-process, the amplitude of utterances, for a same sentence, were normalized for preventing an error estimation caused by speakers’ change in volume. In the feature extraction, three conventional methods were considered in the experiments and compared with the irregular decomposition method in the proposed system. In order to verify the effect of the proposed system for identification, a general regressive neural network (GRNN) was used and compared in the experimental investigation. The experimental results demonstrated the effectiveness of the proposed speaker identification system and were compared with the discrete wavelet transform (DWT), conventional WPT and WPT in Mel scale.  相似文献   

4.
The problem of cloud data classification from satellite imagery using neural networks is considered. Several image transformations such as singular value decomposition (SVD) and wavelet packet (WP) were used to extract the salient spectral and textural features attributed to satellite cloud data in both visible and infrared (IR) channels. In addition, the well-known gray-level cooccurrence matrix (GLCM) method and spectral features were examined for the sake of comparison. Two different neural-network paradigms namely probability neural network (PNN) and unsupervised Kohonen self-organized feature map (SOM) were examined and their performance were also benchmarked on the geostationary operational environmental satellite (GOES) 8 data. Additionally, a postprocessing scheme was developed which utilizes the contextual information in the satellite images to improve the final classification accuracy. Overall, the performance of the PNN when used in conjunction with these feature extraction and postprocessing schemes showed the potential of this neural-network-based cloud classification system.  相似文献   

5.
This paper presents an effective method based on support vector machines (SVM) for identification of power system disturbances. Because of its advantages in signal processing applications, the wavelet transform (WT) is used to extract the distinctive features of the voltage signals. After the wavelet decomposition, the characteristic features of each disturbance waveforms are obtained. The wavelet energy criterion is also applied to wavelet detail coefficients to reduce the sizes of data set. After feature extraction stage SVM is used to classify the power system disturbance waveforms and the performance of SVM is compared with the artificial neural networks (ANN).  相似文献   

6.
Despite some successes, the process of Automatic Facial Recognition (AFR) remains a significant challenge when unconstrained imaging conditions are involved. The authors believe that this occurs because an effective feature extraction method of facial images has not been found so far. In this paper a new approach to extract powerful local discriminatory features is described. First, the wavelet transform is used for extraction of multi-resolution coarse features, and then the emphasis is placed on the extraction of Multiscale fine Local Discriminatory Features (MLDFs). Instead of using traditional wavelet features, the authors examine the multiscale local statistical characteristics to derive stronger discriminatory features based on some important wavelet subbands. To efficiently utilise potentials of the extracted multi-MLDFs, an integrated recognition system is developed where the multi-classifiers first conduct the corresponding coarse classification, then a decision making scheme is used to associate different priorities with each of the classifiers to make the final recognition. Experiments have shown that this scheme provides superior performance to popular methods, such as Principal Components Analysis (PCA or Eigenface), wavelet features, neural networks, etc.  相似文献   

7.
Acoustical parameters extracted from the recorded voice samples are actively pursued for accurate detection of vocal fold pathology. Most of the system for detection of vocal fold pathology uses high quality voice samples. This paper proposes a hybrid expert system approach to detect vocal fold pathology using the compressed/low quality voice samples which includes feature extraction using wavelet packet transform, clustering based feature weighting and classification. In order to improve the robustness and discrimination ability of the wavelet packet transform based features (raw features), we propose clustering based feature weighting methods including k-means clustering (KMC), fuzzy c-means (FCM) clustering and subtractive clustering (SBC). We have investigated the effectiveness of raw and weighted features (obtained after applying feature weighting methods) using four different classifiers: Least Square Support Vector Machine (LS-SVM) with radial basis kernel, k-means nearest neighbor (kNN) classifier, probabilistic neural network (PNN) and classification and regression tree (CART). The proposed hybrid expert system approach gives a promising classification accuracy of 100% using the feature weighting methods and also it has potential application in remote detection of vocal fold pathology.  相似文献   

8.
In this work, an average framing linear prediction coding (AFLPC) technique for text-independent speaker identification systems is presented. Conventionally, linear prediction coding (LPC) has been applied in speech recognition applications. However, in this study the combination of modified LPC with wavelet transform (WT), termed AFLPC, is proposed for speaker identification. The investigation procedure is based on feature extraction and voice classification. In the phase of feature extraction, the distinguished speaker’s vocal tract characteristics were extracted using the AFLPC technique. The size of a speaker’s feature vector can be optimized in term of an acceptable recognition rate by means of genetic algorithm (GA). Hence, an LPC order of 30 is found to be the best according to the system performance. In the phase of classification, probabilistic neural network (PNN) is applied because of its rapid response and ease in implementation. In the practical investigation, performances of different wavelet transforms in conjunction with AFLPC were compared with one another. In addition, the capability analysis on the proposed system was examined by comparing it with other systems proposed in literature. Consequently, the PNN classifier achieves a better recognition rate (97.36%) with the wavelet packet (WP) and AFLPC termed WPLPCF feature extraction method. It is also suggested to analyze the proposed system in additive white Gaussian noise (AWGN) and real noise environments; 58.56% for 0 dB and 70.52% for 5 dB. The recognition rates for the whole database of the Gaussian mixture model (GMM) reached the lowest value in case of small number of training samples.  相似文献   

9.
表面肌电(Surface electromyography,sEMG)信号直接、客观地反映了神经和肌肉的活动功能状态,已获得广泛应用。本文设计了一种sEMG信号采集电路并以单通道形式采集上肢5种动作的sEMG信号,经小波包变换提取6种特征(其中一种引自基于小波变换的特征提取方法)并分别结合PCA和KPCA进行处理;再分别用BP神经网络和SVM进行动作识别。此外,对比了小波变换的特征提取;讨论了KPCA与PCA在特征变换上的差异。所提取的基于小波包变换的6种特征有5种的识别率均超过95.7%,其中引入的高低频系数组合特征在BP神经网络下平均识别率超过99%。基于小波变换提取的5种特征经KPCA变换后也达到较高的识别率。实验结果表明,本文的sEMG信号采集方法及其特征提取方法均达到较好效果。  相似文献   

10.
冯静  舒宁 《中国图象图形学报》2009,14(10):2042-2046
提出了一种新颖的用于高光谱遥感图像特征提取的子波变换算法.与二进小波变换按恒Q准则划分频域不同的是,该算法通过改变相邻子波的带宽比,可以实现更为灵活的频域划分.采用子波能量的离散余弦变换作为特征矢量,然后进行无监督C均值聚类实验和有监督RBF(径向基函数)神经网络分类实验.实验结果表明,子波变换能量的离散余弦变换特征可以有效地描述光谱曲线特征,且正确分类率高于传统的小波变换.  相似文献   

11.
This paper presents the experimental pilot study to investigate the effects of pulsed electromagnetic field (PEMF) at extremely low frequency (ELF) in response to photoplethysmographic (PPG), electrocardiographic (ECG), electroencephalographic (EEG) activity. The assessment of wavelet transform (WT) as a feature extraction method was used in representing the electrophysiological signals. Considering that classification is often more accurate when the pattern is simplified through representation by important features, the feature extraction and selection play an important role in classifying systems such as neural networks. The PPG, ECG, EEG signals were decomposed into time-frequency representations using discrete wavelet transform (DWT) and the statistical features were calculated to depict their distribution. Our pilot study investigation for any possible electrophysiological activity alterations due to ELF PEMF exposure, was evaluated by the efficiency of DWT as a feature extraction method in representing the signals. As a result, this feature extraction has been justified as a feasible method.  相似文献   

12.
In last year’s, the expert target recognition has been become very important topic in radar literature. In this study, a target recognition system is introduced for expert target recognition (ATR) using radar target echo signals of High Range Resolution (HRR) radars. This study includes a combination of an adaptive feature extraction and classification using optimum wavelet entropy parameter values. The features used in this study are extracted from radar target echo signals. Herein, a genetic wavelet extreme learning machine classifier model (GAWELM) is developed for expert target recognition. The GAWELM composes of three stages. These stages of GAWELM are genetic algorithm, wavelet analysis and extreme learning machine (ELM) classifier. In previous studies of radar target recognition have shown that the learning speed of feedforward networks is in general much slower than required and it has been a major disadvantage. There are two important causes. These are: (1) the slow gradient-based learning algorithms are commonly used to train neural networks, and (2) all the parameters of the networks are fixed iteratively by using such learning algorithms. In this paper, a new learning algorithm named extreme learning machine (ELM) for single-hidden layer feedforward networks (SLFNs) Ahern et al., 1989, Al-Otum and Al-Sowayan, 2011, Avci et al., 2005a, Avci et al., 2005b, Biswal et al., 2009, Frigui et al., in press, Cao et al., 2010, Guo et al., 2011, Famili et al., 1997, Han and Huang, 2006, Huang et al., 2011, Huang et al., 2006, Huang and Siew, 2005, Huang et al., 2009, Jiang et al., 2011, Kubrusly and Levan, 2009, Le et al., 2011, Lhermitte et al., in press, Martínez-Martínez et al., 2011, Matlab, 2011, Nelson et al., 2002, Nejad and Zakeri, 2011, Tabib et al., 2009, Tang et al., 2011, which randomly choose hidden nodes and analytically determines the output weights of SLFNs, to eliminate the these disadvantages of feedforward networks for expert target recognition area. Then, the genetic algorithm (GA) stage is used for obtaining the feature extraction method and finding the optimum wavelet entropy parameter values. Herein, the optimal one of four variant feature extraction methods is obtained by using a genetic algorithm (GA). The four feature extraction methods proposed GAWELM model are discrete wavelet transform (DWT), discrete wavelet transform–short-time Fourier transform (DWT–STFT), discrete wavelet transform–Born–Jordan time–frequency transform (DWT–BJTFT), and discrete wavelet transform–Choi–Williams time–frequency transform (DWT–CWTFT). The discrete wavelet transform stage is performed for optimum feature extraction in the time–frequency domain. The discrete wavelet transform stage includes discrete wavelet transform and calculating of discrete wavelet entropies. The extreme learning machine (ELM) classifier is performed for evaluating the fitness function of the genetic algorithm and classification of radar targets. The performance of the developed GAWELM expert radar target recognition system is examined by using noisy real radar target echo signals. The applications results of the developed GAWELM expert radar target recognition system show that this GAWELM system is effective in rating real radar target echo signals. The correct classification rate of this GAWELM system is about 90% for radar target types used in this study.  相似文献   

13.
Textural features of high-resolution remote sensing imagery are a powerful data source for improving classification accuracy because using only spectral information is not sufficient for the classification of objects with within-field spectral variability. This study presents the methods of using an object-oriented texture analysis algorithm for improving high-resolution remote sensing imagery classification, including wavelet packet transform texture analysis, the grey-level co-occurrence matrix (GLCM) and local spatial statistics. Wavelet packet transform texture analysis, with the method of optimization and selection of wavelet texture for feature extraction, is a good candidate for object-oriented classification. Feature optimization is used to reduce the data dimensions in combinations of textural sub-bands and spectral bands. The result of the classification accuracy assessment indicates the improvement of texture analysis for object-oriented classification in this study. Compared with the traditional method that uses only spectral bands, the combination of GLCM homogeneity and spectral bands increases the overall accuracy from 0.7431 to 0.9192. Furthermore, wavelet packet transform texture analysis is the optimal method, increasing the overall accuracy to 0.9216 using a smaller data dimension. Local spatial statistical measures also increase the classification total accuracy, but only from 0.7431 to 0.8088. This study demonstrates that wavelet packet and statistical textures can be used to improve object-oriented classification; specifically, the texture analysis based on the multiscale wavelet packet transform is optimal for increasing the classification accuracy using a smaller data dimension.  相似文献   

14.
ECG signals are an important source of information in the diagnosis of atrial conduction pathology. Nevertheless, diagnosis by visual inspection is a difficult task. This work introduces a novel wavelet feature extraction method for atrial fibrillation derived from the average framing percentage energy (AFE) of terminal wavelet packet transform (WPT) sub signals. Probabilistic neural network (PNN) is used for classification. The presented method is shown to be a potentially effective discriminator in an automated diagnostic process. The ECG signals taken from the MIT-BIH database are used to classify different arrhythmias together with normal ECG. Several published methods were investigated for comparison. The best recognition rate selection was obtained for AFE. The classification performance achieved accuracy 97.92%. It was also suggested to analyze the presented system in an additive white Gaussian noise (AWGN) environment; 55.14% for 0 dB and 92.53% for 5 dB. It was concluded that the proposed approach of automating classification is worth pursuing with larger samples to validate and extend the present study.  相似文献   

15.
In this paper, an intelligent diagnosis for fault gear identification and classification based on vibration signal using discrete wavelet transform and adaptive neuro-fuzzy inference system (ANFIS) is presented. The discrete wavelet transform (DWT) technique plays one of the important roles for signal feature extraction in the proposed system. The abnormal transient signals will show in different decomposition levels and can be used to recognize the various faults by the DWT figure. However, many fault conditions are hard to inspect accurately by the naked eye. In the present study, the feature extraction method based on discrete wavelet transform with energy spectrum is proposed. The different order wavelets are considered to identify fault features accurately. The database is established by feature vectors of energy spectrum which are used as input pattern in the training and identification process. Furthermore, the ANFIS is proposed to identify and classify the fault gear positions and the gear fault conditions in the fault diagnosis system. The proposed ANFIS includes both the fuzzy logic qualitative approximation and the adaptive neural network capability. The experimental results verified that the proposed ANFIS has more possibilities in fault gear identification. The ANFIS achieved an accuracy identification rate which was more satisfactory than traditional vision inspection in the proposed system.  相似文献   

16.
针对目前运动想象脑电信号特征提取单一,分类识别准确率低等现象,结合卷积神经网络分类器,提出了一种多维度特征加权融合的特征融合算法来提高运动想象脑电识别率。对预处理后的脑电信号进行小波包变换,提取其共空间特征、能量特征、边际谱熵特征以及非线性动力学特征,然后加权融合,使用卷积神经网络分类器分类。为验证算法的合理性,使用BCI-IV Dataset 2a数据集对提出的特征融合算法进行验证分析,结果表明,所提出的加权特征融合算法结合CNN分类器可以有效提高运动想象识别准确率。实验中,9位志愿者平均分类准确率达到75.88%,平均Kappa系数为0.70。  相似文献   

17.
This paper proposes a new method for speaker feature extraction based on Formants, Wavelet Entropy and Neural Networks denoted as FWENN. In the first stage, five formants and seven Shannon entropy wavelet packet are extracted from the speakers’ signals as the speaker feature vector. In the second stage, these 12 feature extraction coefficients are used as inputs to feed-forward neural networks. Probabilistic neural network is also proposed for comparison. In contrast to conventional speaker recognition methods that extract features from sentences (or words), the proposed method extracts the features from vowels. Advantages of using vowels include the ability to recognize speakers when only partially-recorded words are available. This may be useful for deaf-mute persons or when the recordings are damaged. Experimental results show that the proposed method succeeds in the speaker verification and identification tasks with high classification rate. This is accomplished with minimum amount of information, using only 12 coefficient features (i.e. vector length) and only one vowel signal, which is the major contribution of this work. The results are further compared to well-known classical algorithms for speaker recognition and are found to be superior.  相似文献   

18.
为了提高利用高压水射流靶物反射声信号识别靶物材质的效率,针对地雷探测过程常见的地雷、石块、砖块和木块4种靶物,采用不同的特征提取方法来识别靶物材质。在分析Mel频率倒谱系数及小波包变换倒谱系数基本原理的基础上,结合靶物反射声信号的特点,提出了一种基于Mel频率倒谱和小波包变换倒谱特征融合的特征提取方法:利用小波包变换将原始靶物反射声信号划分为若干子频段,选取其中一个子频段作为低频和高频的划分层;低频部分提取Mel频率倒谱系数作为特征值,高频部分则提取小波包变换倒谱系数作为特征值,将2组特征值线性合并为一组新的特征向量,用于靶物材质的识别。采用最小二乘支持向量机建立多分类模型,验证基于单一特征和基于特征融合的特征提取方法的识别率。实验结果表明,在取得低频与高频的最佳划分层时,基于特征融合的特征提取方法的平均识别率达到82.812 5%,较单一的利用Mel频率倒谱系数或小波包变换倒谱系数作为特征向量时的平均识别率分别提高了10.312 5%和7.812 5%。  相似文献   

19.
We present a robust algorithm for sequential imbalance detection (detecting a change of properties) for random processes with a wavelet packet transform. Based on this detector and artificial neural networks, we develop a classification system for different types of imbalance. We compare the resulting system with Shewhart control charts. The resulting system can be successfully used in selective control and under other conditions of imbalance detection and classification related to insufficient information about the signal before and after the change.  相似文献   

20.
《Applied Soft Computing》2008,8(1):225-231
Recently, significant of the robust texture image classification has increased. The texture image classification is used for many areas such as medicine image processing, radar image processing, etc. In this study, a new method for invariant pixel regions texture image classification is presented. Wavelet packet entropy adaptive network based fuzzy inference system (WPEANFIS) was developed for classification of the twenty 512 × 512 texture images obtained from Brodatz image album. There, sixty 32 × 32 image regions were randomly selected (overlapping or non-overlapping) from each of these 20 images. Thirty of these image regions and other 30 of these image regions are used for training and testing processing of the WPEANFIS, respectively. In this application study, Daubechies, biorthogonal, coiflets, and symlets wavelet families were used for wavelet packet transform part of the WPEANFIS algorithm, respectively. In this way, effects to correct texture classification performance of these wavelet families were compared. Efficiency of WPEANFIS developed method was tested and a mean %93.12 recognition success was obtained.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号