首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 10 毫秒
1.
This letter describes a method to increase hyperspectral image classification accuracy (CA) and segmentation accuracy (SA) using spectral warping, which is a nonlinear transformation that warps the frequency content of a signal. In the proposed approach, the frequency content corresponding to spectral data for the hyperspectral image was nonlinearly transformed along the spectral axis using warping. Classification and segmentation algorithms were estimated for the transformed spectral values to show the impact of warping. Experimental results are provided for different values of the warping parameter and it is shown that applying spectral warping increases CA and SA for appropriate warping parameters.  相似文献   

2.
We present an approach for the joint segmentation and classification of a time series. The segmentation is on the basis of a menu of possible statistical models: each of these must be describable in terms of a sufficient statistic, but there is no need for these sufficient statistics to be the same, and these can be as complex (for example, cepstral features or autoregressive coefficients) as fits. All that is needed is the probability density function (PDF) of each sufficient statistic under its own assumed model--presumably this comes from training data, and it is particularly appealing that there is no need at all for a joint statistical characterization of all the statistics. There is similarly no need for an a-priori specification of the number of sections, as the approach uses an appropriate penalization of an over-zealous segmentation. The scheme has two stages. In stage one, rough segmentations are implemented sequentially using a piecewise generalized likelihood ratio (GLR); in the second stage, the results from the first stage (both forward and backward) are refined. The computational burden is remarkably small, approximately linear with the length of the time series, and the method is nicely accurate in terms both of discovered number of segments and of segmentation accuracy. A hybrid of the approach with one based on Gibbs sampling is also presented; this combination is somewhat slower but considerably more accurate.  相似文献   

3.
Machine hearing is an emerging research field that is analogous to machine vision in that it aims to equip computers with the ability to hear and recognise a variety of sounds. It is a key enabler of natural human–computer speech interfacing, as well as in areas such as automated security surveillance, environmental monitoring, smart homes/buildings/cities. Recent advances in machine learning allow current systems to accurately recognise a diverse range of sounds under controlled conditions. However doing so in real-world noisy conditions remains a challenging task. Several front–end feature extraction methods have been used for machine hearing, employing speech recognition features like MFCC and PLP, as well as image-like features such as AIM and SIF. The best choice of feature is found to be dependent upon the noise environment and machine learning techniques used. Machine learning methods such as deep neural networks have been shown capable of inferring discriminative classification rules from less structured front–end features in related domains. In the machine hearing field, spectrogram image features have recently shown good performance for noise-corrupted classification using deep neural networks. However there are many methods of extracting features from spectrograms. This paper explores a novel data-driven feature extraction method that uses variance-based criteria to define spectral pooling of features from spectrograms. The proposed method, based on maximising the pooled spectral variance of foreground and background sound models, is shown to achieve very good performance for robust classification.  相似文献   

4.
Neural Computing and Applications - Lung cancer is a deadly disease if not diagnosed in its early stages. However, early detection of lung cancer is a challenging task due to the shape and size of...  相似文献   

5.
Feature extraction methods for sound events have been traditionally based on parametric representations specifically developed for speech signals, such as the well-known Mel Frequency Cepstrum Coefficients (MFCC). However, the discrimination capabilities of these features for Acoustic Event Classification (AEC) tasks could be enhanced by taking into account the spectro-temporal structure of acoustic event signals. In this paper, a new front-end for AEC which incorporates this specific information is proposed. It consists of two different stages: short-time feature extraction and temporal feature integration. The first module aims at providing a better spectral representation of the different acoustic events on a frame-by-frame basis, by means of the automatic selection of the optimal set of frequency bands from which cepstral-like features are extracted. The second stage is designed for capturing the most relevant temporal information in the short-time features, through the application of Non-Negative Matrix Factorization (NMF) on their periodograms computed over long audio segments. The whole front-end has been evaluated in clean and noisy conditions. Experiments show that the removal of certain frequency bands (which are mainly located in the medium region of the spectrum for clean conditions and in low frequencies for noisy environments) in the short-time feature computation process in conjunction with the NMF technique for temporal feature integration improves significantly the performance of a Support Vector Machine (SVM) based AEC system with respect to the use of conventional MFCCs.  相似文献   

6.
New hyperspectral sensors can collect a large number of spectral bands, which provide a capability to distinguish various objects and materials on the earth. However, the accurate classification of these images is still a big challenge. Previous studies demonstrate the effectiveness of combination of spectral data and spatial information for better classification of hyperspectral images. In this article, this approach is followed to propose a novel three-step spectral–spatial method for classification of hyperspectral images. In the first step, Gabor filters are applied for texture feature extraction. In the second step, spectral and texture features are separately classified by a probabilistic Support Vector Machine (SVM) pixel-wise classifier to estimate per-pixel probability. Therefore, two probabilities are obtained for each pixel of the image. In the third step, the total probability is calculated by a linear combination of the previous probabilities on which a control parameter determines the efficacy of each one. As a result, one pixel is assigned to one class which has the highest total probability. This method is performed in multivariate analysis framework (MAF) on which one pixel is represented by a d-dimensional vector, d is the number of spectral or texture features, and in functional data analysis (FDA) on which one pixel is considered as a continuous function. The proposed method is evaluated with different training samples on two hyperspectral data. The combination parameter is experimentally obtained for each hyperspectral data set as well as for each training samples. This parameter adjusts the efficacy of the spectral versus texture information in various areas such as forest, agricultural or urban area to get the best classification accuracy. Experimental results show high performance of the proposed method for hyperspectral image classification. In addition, these results confirm that the proposed method achieves better results in FDA than in MAF. Comparison with some state-of-the-art spectral–spatial classification methods demonstrates that the proposed method can significantly improve classification accuracies.  相似文献   

7.
Vehicle segmentation and classification using deformable templates   总被引:21,自引:0,他引:21  
This paper proposes a segmentation algorithm using deformable template models to segment a vehicle of interest both from the stationary complex background and other moving vehicles in an image sequence. We define a polygonal template to characterize a general model of a vehicle and derive a prior probability density function to constrain the template to be deformed within a set of allowed shapes. We propose a likelihood probability density function which combines motion information and edge directionality to ensure that the deformable template is contained within the moving areas in the image and its boundary coincides with strong edges with the same orientation in the image. The segmentation problem is reduced to a minimization problem and solved by the Metropolis algorithm. The system was successfully tested on 405 image sequences containing multiple moving vehicles on a highway  相似文献   

8.
Face recognition is a challenging task in computer vision and pattern recognition. It is well-known that obtaining a low-dimensional feature representation with enhanced discriminatory power is of paramount importance to face recognition. Moreover, recent research has shown that the face images reside on a possibly nonlinear manifold. Thus, how to effectively exploit the hidden structure is a key problem that significantly affects the recognition results. In this paper, we propose a new unsupervised nonlinear feature extraction method called spectral feature analysis (SFA). The main advantages of SFA over traditional feature extraction methods are: (1) SFA does not suffer from the small-sample-size problem; (2) SFA can extract discriminatory information from the data, and we show that linear discriminant analysis can be subsumed under the SFA framework; (3) SFA can effectively discover the nonlinear structure hidden in the data. These appealing properties make SFA very suitable for face recognition tasks. Experimental results on three benchmark face databases illustrate the superiority of SFA over traditional methods.  相似文献   

9.
In this paper we propose a method for improving the performance of the segmentation of speech waveforms to phonetic units. The proposed method is based on the well known Viterbi time-alignment algorithm and utilizes the phonetic boundary predictions from multiple speech parameterization techniques. Specifically, we utilize the most appropriate, with respect to boundary type, phone transition position prediction as initial point to start Viterbi time-alignment for the prediction of the successor phonetic boundary. The proposed method was evaluated on the TIMIT database, with the exploitation of several, well known in the area of speech processing, Fourier-based and wavelet-based speech parameterization algorithms. The experimental results for the tolerance of 20 milliseconds indicated an improvement of the absolute segmentation accuracy of approximately 0.70%, when compared to the baseline speech segmentation scheme.  相似文献   

10.
目的 高光谱影像(hyperspectral image,HSI)中“同物异谱,异物同谱”的现象普遍存在,使分类结果存在严重的椒盐噪声问题。HSI中的空间地物结构复杂多样,单一尺度的空间特征提取方法无法有效地表达地物类间差异和区分地物边界。有效解决光谱混淆和空间尺度问题是提高分类精度的关键。方法 结合多尺度超像素和奇异谱分析,提出一种新的高光谱影像分类方法,从而充分挖掘地物的局部空间特征和光谱特征,解决空间尺度和光谱混淆的问题,提高分类精度。利用多尺度超像素对影像进行分割,获取不同尺度的分割影像,同时在分割区域内进行均值滤波,减少类内的光谱差异,增强类间的光谱差异;对每个区域计算平均光谱向量,并利用奇异谱分析方法获取光谱的主要鉴别特征,同时消除噪声的影响;利用支持向量机对不同尺度超像素分割影像进行分类,并进行决策融合,得到最终的分类结果。结果 实验选取了两个标准高光谱数据集和一个真实数据集,结果表明,利用本文算法提取的光谱—空间特征进行分类,比直接在原始数据上进行分类分别提高约26.8%、9.2%和13%的精度;与先进的深度学习SSRN (spectral-spatial residual network)算法相比,本文算法在精度上分别提升约5.2%、0.7%和4%,并且运行时间仅为前者的18.3%、45.4%和62.1%,处理效率更高。此外,在训练样本有限的情况下,两个标准数据集的样本分别为1%和0.2%时,本文算法均能取得87%以上的分类精度。结论 针对高光谱影像分类中的难题,提出一种新的融合光谱和多尺度空间特征的HSI分类方法。实验结果表明,本文方法优于对比方法,可以产生更精细的分类结果。  相似文献   

11.
We present a novel method for segmenting images with texture and nontexture regions. Local spectral histograms are feature vectors consisting of histograms of chosen filter responses, which capture both texture and nontexture information. Based on the observation that the local spectral histogram of a pixel location can be approximated through a linear combination of the representative features weighted by the area coverage of each feature, we formulate the segmentation problem as a multivariate linear regression, where the solution is obtained by least squares estimation. Moreover, we propose an algorithm to automatically identify representative features corresponding to different homogeneous regions, and show that the number of representative features can be determined by examining the effective rank of a feature matrix. We present segmentation results on different types of images, and our comparison with other methods shows that the proposed method gives more accurate results.  相似文献   

12.
Abstract

The problem of classifying clouds seen on meteorological satellite images into different types is one which requires the use of textural as well as spectral information. Since multi-spectral features are of prime importance, textural features must be considered as augmenting, rather than replacing, spectral measures. Several textural features are studied to determine their discriminating power across a number of cloud classes including those which have previously been found difficult to separate. Although several features in the frequency domain are tested they are found to be less useful than those in the spatial domain with only one exception. The specific features recommended for use in classification depend on the type of classification to be undertaken. Specifically, different features should be used for a multi-dimensional feature space analysis than for a binary-tree rule-based classification.  相似文献   

13.
图像分割作为图像识别的一个重要处理步骤,但存在效果不理想或者计算复杂度过高的问题。提出一种新的灰度图像二值化的方法。该方法将Ncut作为谱聚类的量度,在计算该值时使用基于图像灰度级的权重矩阵,而非普通基于图像像素的权重矩阵。这样,计算复杂度和空间复杂度都明显降低。通过对实际场景中文本图像的实验,数据表明此方法在时间和系统开销方面比传统基于阈值的分割方法具有更优的性能。  相似文献   

14.
This paper is devoted to the sequential detection of abrupt changes in spectral characteristics of digital signals, as this problem occurs for the segmentation of real signals such as speech, EEG, ECG, or geophysical signals. The limitations of a classical test are emphasized and some new algorithms are presented. They are based upon the use of two autoregressive models and some distance measures between them, such as the log-likelihood ratio and Kullback's divergence between conditional probability laws. All these algorithms are compared both via a simulation study and from a theoretical point of view.  相似文献   

15.
Semantic image segmentation aims to partition an image into non-overlapping regions and assign a pre-defined object class label to each region. In this paper, a semantic method combining low-level features and high-level contextual cues is proposed to segment natural scene images. The proposed method first takes the gist representation of an image as its global feature. The image is then over-segmented into many super-pixels and histogram representations of these super-pixels are used as local features. In addition, co-occurrence and spatial layout relations among object classes are exploited as contextual cues. Finally the features and cues are integrated into the inference framework based on conditional random field by defining specific potential terms and introducing weighting functions. The proposed method has been compared with state-of-the-art methods on the MSRC database, and the experimental results show its effectiveness.  相似文献   

16.
A new algorithm using invariant spectral features for segmenting colour images is presented in this paper. Input data are three primary images obtained from a colour sensor. The input colour image is transformed to IHS (Intensity, Hue, Saturation) colour space. This colour space transform compensates for illumination variations and delivers image pixel values with low variance for individual colour regions, hence contributing to simplified segmentation. The hue and saturation images are then separately filtered and combined. The resulting image is segmented by means of a threshold process. An opening operation on the segmented image completes the algorithm. Experimental results obtained for several images are presented. Issues related to illumination and sensors are also addressed.  相似文献   

17.
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification which employs support vector machines (SVMs). Five audio classes are considered in this paper: silence, music, background sound, pure speech, and non- pure speech which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We have evaluated the performance of SVM on different audio type-pairs classification with testing unit of different- length and compared the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM). We also evaluated the effectiveness of some new proposed features. Experiments on a database composed of about 4- hour audio data show that the proposed classifier is very efficient on audio classification and segmentation. It also shows the accuracy of the SVM-based method is much better than the method based on KNN and GMM.  相似文献   

18.
Skin segmentation using color pixel classification: analysis and comparison   总被引:8,自引:0,他引:8  
This work presents a study of three important issues of the color pixel classification approach to skin segmentation: color representation, color quantization, and classification algorithm. Our analysis of several representative color spaces using the Bayesian classifier with the histogram technique shows that skin segmentation based on color pixel classification is largely unaffected by the choice of the color space. However, segmentation performance degrades when only chrominance channels are used in classification. Furthermore, we find that color quantization can be as low as 64 bins per channel, although higher histogram sizes give better segmentation performance. The Bayesian classifier with the histogram technique and the multilayer perceptron classifier are found to perform better compared to other tested classifiers, including three piecewise linear classifiers, three unimodal Gaussian classifiers, and a Gaussian mixture classifier.  相似文献   

19.
A classification procedure for National Oceanic and Atmospheric Administration Advanced Very High Resolution Radiometer (NOAA AVHRR) data based on image segmentation following supervised classification by regions is presented. The procedure was appliedto channel 1 (0.58-0.68 mu m) and channel 2 (0.72-1.10 mu m) AVHRR mosaics composed of images acquired between 13 and 26 June 1993 for the state of Mato Grosso, Brazil. Eight vegetation classes were identified using this procedure. The result was compared with an existing vegetation map of Mato Grosso state for reference. The quantitative evaluation yielded a kappa coefficient of 0.4. The result indicate that image segmentation and supervised classification by regions is a procedure that is useful for mapping vegetation classes on a regional scale.  相似文献   

20.
Image classification usually requires complicated segmentation to separate foreground objects from the background scene. However, the statistical content of a background scene can actually provide very useful information for classification. In this paper, we propose a new hybrid pyramid kernel which incorporates local features extracted from both dense regular grids and interest points for image classification, without requiring segmentation. Features extracted from dense regular grids can better capture information about the background scene, while interest points detected at corners and edges can better capture information about the salient objects. In our algorithm, these two local features are combined in both the spatial and the feature-space domains, and are organized into pyramid representations. In order to obtain better classification accuracy, we fine-tune the parameters involved in the similarity measure, and we determine discriminative regions by means of relevance feedback. From the experimental results, we observe that our algorithm can achieve a 6.37 % increase in performance as compared to other pyramid-representation-based methods. To evaluate the applicability of the proposed hybrid kernel to large-scale databases, we have performed a cross-dataset experiment and investigated the effect of foreground/background features on each of the kernels. In particular, the proposed hybrid kernel has been proven to satisfy Mercer’s condition and is efficient in measuring the similarity between image features. For instance, the computational complexity of the proposed hybrid kernel is proportional to the number of features.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号