共查询到13条相似文献,搜索用时 0 毫秒
1.
2.
In this paper, a frame linear predictive coding spectrum (FLPCS) technique for speaker identification is presented. Traditionally, linear predictive coding (LPC) was applied in many speech recognition applications, nevertheless, the modification of LPC termed FLPCS is proposed in this study for speaker identification. The analysis procedure consists of feature extraction and voice classification. In the stage of feature extraction, the representative characteristics were extracted using the FLPCS technique. Through the approach, the size of the feature vector of a speaker can be reduced within an acceptable recognition rate. In the stage of classification, general regression neural network (GRNN) and Gaussian mixture model (GMM) were applied because of their rapid response and simplicity in implementation. In the experimental investigation, performances of different order FLPCS coefficients which were induced from the LPC spectrum were compared with one another. Further, the capability analysis on GRNN and GMM was also described. The experimental results showed GMM can achieve a better recognition rate with feature extraction using the FLPCS method. It is also suggested the GMM can complete training and identification in a very short time. 相似文献
3.
支持向量机作为一种新的统计学习方法,在说话人识别中得到了广泛应用.本文针对支持向量机在说话人辨识中的大样本训练耗时问题,提出对语音参数进行模糊核聚类的约简方法,选择聚类边界的语音参数作为支持向量,可以在不影响识别率的情况下,减少支持向量机的训练量.并通过实验验证了该方法的有效性. 相似文献
4.
Classification of process trends based on fuzzified symbolic representation and hidden Markov models 总被引:1,自引:0,他引:1
This paper presents a strategy to represent and classify process data for detection of abnormal operating conditions. In representing the data, a wavelet-based smoothing algorithm is used to filter the high frequency noise. A shape analysis technique called triangular episodes then converts the smoothed data into a semi-qualitative form. Two membership functions are implemented to transform the quantitative information in the triangular episodes to a purely symbolic representation. The symbolic data is classified with a set of sequence matching hidden Markov models (HMMs), and the classification is improved by utilizing a time correlated HMM after the sequence matching HMM. The method is tested on simulations with a non-isothermal CSTR and compared with methods that use a back-propagation neural network with and without an ARX model. 相似文献
5.
Vitoantonio Bevilacqua Lucia Cariello Gaetano Carro Domenico Daleno Giuseppe Mastronardi 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2008,12(7):615-621
Face recognition from an image or video sequences is emerging as an active research area with numerous commercial and law
enforcement applications. In this paper different Pseudo 2-dimension Hidden Markov Models (HMMs) are introduced for a face
recognition showing performances reasonably fast for binary images. The proposed P2-D HMMs are made up of five levels of states,
one for each significant facial region in which the input frontal images are sequenced: forehead, eyes, nose, mouth and chin.
Each of P2-D HMMs has been trained by coefficients of an artificial neural network used to compress a bitmap image in order
to represent it with a number of coefficients that is smaller than the total number of pixels. All the P2-D HMMs, applied
to the input set consisting of the Olivetti Research Laboratory face database combined to others photos, have achieved good
rates of recognition and, in particular, the structure 3-6-6-6-3 has achieved a rate of recognition equal to 100%. 相似文献
6.
A. Xafopoulos Author Vitae Author Vitae G. Almpanidis Author VitaeAuthor Vitae 《Pattern recognition》2004,37(3):583-594
This paper deals with language identification in the domain of web documents. The proposed system is built on hidden Markov models (HMMs) that enable the modeling of character sequences. Furthermore, the use of HMMs provides the means for language tracking, that is, language identification across the segments of a multilingual document. 相似文献
7.
Ki Yong Lee 《Pattern recognition letters》2004,25(16):3846-1817
To reduce the high dimensionality required for training of feature vectors in speaker identification, we propose an efficient GMM based on local PCA with fuzzy clustering. The proposed method firstly partitions the data space into several disjoint clusters by fuzzy clustering, and then performs PCA using the fuzzy covariance matrix on each cluster. Finally, the GMM for speaker is obtained from the transformed feature vectors with reduced dimension in each cluster. Compared to the conventional GMM with diagonal covariance matrix, the proposed method shows faster result with less storage maintaining same performance. 相似文献
8.
9.
C. de TrazegniesAuthor Vitae C. UrdialesAuthor VitaeA. BanderaAuthor Vitae F. SandovalAuthor Vitae 《Pattern recognition》2003,36(11):2571-2584
In this paper, a 3D object recognition algorithm is proposed. Objects are recognized by studying planar images corresponding to a sequence of views. Planar shape contours are represented by their adaptively calculated curvature functions, which are decomposed in the Fourier domain as a linear combination of a set of representative shapes. Finally, sequences of views are identified by means of Hidden Markov Models. The proposed system has been tested for artificial and real objects. Distorted and noisy versions of the objects were correctly clustered together. 相似文献
10.
11.
Junxi Sun Author Vitae Author Vitae Yazhu Chen Author Vitae Author Vitae 《Pattern recognition》2004,37(7):1315-1324
The wavelet analysis is an efficient tool for the detection of image edges. Based on the wavelet analysis, we present an unsupervised learning algorithm to detect image edges in this paper. A wavelet domain vector hidden Markov tree (WD-VHMT) is employed in our algorithm to model the statistical properties of multiscale and multidirectional (subband) wavelet coefficients of an image. With this model, each wavelet coefficient is viewed as an observation of its hidden state and the hidden state indicates if the wavelet coefficient belongs to an edge. The WD-VHMT model can be learned by an expectation-maximization algorithm. After the model is learned, we employ an extended Viterbi algorithm to uncover the hidden state sequences according to the maximum a posterior estimation. The experiment results of the edge detection for several images are provided to evaluate our algorithm. 相似文献
12.
13.
分析了几类主要的P2P业务识别方法,重点分析了基于流的内在特征的各种识别方法,并对其优缺点作出评价,指出了P2P识别技术进一步的发展方向. 相似文献