1.
B. Demir 《International journal of remote sensing》2013,34(12):3657-3663
This letter describes a method to increase hyperspectral image classification accuracy (CA) and segmentation accuracy (SA) using spectral warping, which is a nonlinear transformation that warps the frequency content of a signal. In the proposed approach, the frequency content corresponding to spectral data for the hyperspectral image was nonlinearly transformed along the spectral axis using warping. Classification and segmentation algorithms were estimated for the transformed spectral values to show the impact of warping. Experimental results are provided for different values of the warping parameter and it is shown that applying spectral warping increases CA and SA for appropriate warping parameters.
2.
Wang Z.J. Willett P. 《IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics》2004,34(2):1056-1067
We present an approach for the joint segmentation and classification of a time series. The segmentation is on the basis of a menu of possible statistical models: each of these must be describable in terms of a sufficient statistic, but there is no need for these sufficient statistics to be the same, and these can be as complex (for example, cepstral features or autoregressive coefficients) as fits. All that is needed is the probability density function (PDF) of each sufficient statistic under its own assumed model; presumably this comes from training data, and it is particularly appealing that there is no need at all for a joint statistical characterization of all the statistics. There is similarly no need for an a priori specification of the number of sections, as the approach uses an appropriate penalization of an over-zealous segmentation. The scheme has two stages. In stage one, rough segmentations are implemented sequentially using a piecewise generalized likelihood ratio (GLR); in the second stage, the results from the first stage (both forward and backward) are refined. The computational burden is remarkably small, approximately linear with the length of the time series, and the method is nicely accurate in terms both of discovered number of segments and of segmentation accuracy. A hybrid of the approach with one based on Gibbs sampling is also presented; this combination is somewhat slower but considerably more accurate.
3.
Meraj Talha Rauf Hafiz Tayyab Zahoor Saliha Hassan Arslan Lali M. IkramUllah Ali Liaqat Bukhari Syed Ahmad Chan Shoaib Umar 《Neural computing & applications》2021,33(17):10737-10750
Lung cancer is a deadly disease if not diagnosed in its early stages. However, early detection of lung cancer is a challenging task due to the shape and size of...
4.
Feature extraction methods for sound events have been traditionally based on parametric representations specifically developed for speech signals, such as the well-known Mel Frequency Cepstrum Coefficients (MFCC). However, the discrimination capabilities of these features for Acoustic Event Classification (AEC) tasks could be enhanced by taking into account the spectro-temporal structure of acoustic event signals. In this paper, a new front-end for AEC which incorporates this specific information is proposed. It consists of two different stages: short-time feature extraction and temporal feature integration. The first module aims at providing a better spectral representation of the different acoustic events on a frame-by-frame basis, by means of the automatic selection of the optimal set of frequency bands from which cepstral-like features are extracted. The second stage is designed for capturing the most relevant temporal information in the short-time features, through the application of Non-Negative Matrix Factorization (NMF) on their periodograms computed over long audio segments. The whole front-end has been evaluated in clean and noisy conditions. Experiments show that the removal of certain frequency bands (which are mainly located in the medium region of the spectrum for clean conditions and in low frequencies for noisy environments) in the short-time feature computation process, in conjunction with the NMF technique for temporal feature integration, significantly improves the performance of a Support Vector Machine (SVM) based AEC system with respect to the use of conventional MFCCs.
5.
A probabilistic SVM approach for hyperspectral image classification using spectral and texture features (total citations: 1; self-citations: 0; citations by others: 1)
Reza Seifi Majdar 《International journal of remote sensing》2017,38(15):4265-4284
New hyperspectral sensors can collect a large number of spectral bands, which provide a capability to distinguish various objects and materials on the earth. However, the accurate classification of these images is still a big challenge. Previous studies demonstrate the effectiveness of combining spectral data and spatial information for better classification of hyperspectral images. In this article, this approach is followed to propose a novel three-step spectral–spatial method for classification of hyperspectral images. In the first step, Gabor filters are applied for texture feature extraction. In the second step, spectral and texture features are separately classified by a probabilistic Support Vector Machine (SVM) pixel-wise classifier to estimate per-pixel probability. Therefore, two probabilities are obtained for each pixel of the image. In the third step, the total probability is calculated by a linear combination of the previous probabilities, in which a control parameter determines the efficacy of each one. As a result, each pixel is assigned to the class which has the highest total probability. This method is performed in a multivariate analysis framework (MAF), in which one pixel is represented by a d-dimensional vector, where d is the number of spectral or texture features, and in functional data analysis (FDA), in which one pixel is considered as a continuous function. The proposed method is evaluated with different training samples on two hyperspectral data sets. The combination parameter is experimentally obtained for each hyperspectral data set as well as for each set of training samples. This parameter adjusts the efficacy of the spectral versus texture information in various areas such as forest, agricultural or urban areas to get the best classification accuracy. Experimental results show high performance of the proposed method for hyperspectral image classification. In addition, these results confirm that the proposed method achieves better results in FDA than in MAF. Comparison with some state-of-the-art spectral–spatial classification methods demonstrates that the proposed method can significantly improve classification accuracies.
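The third step of the method above (fusing the two per-pixel probabilities and picking the most probable class) can be illustrated with a minimal sketch. The abstract only says the combination is linear with a control parameter; the convex form `alpha * p_spectral + (1 - alpha) * p_texture` and the names `fuse_and_classify` and `alpha` are assumptions made here for illustration, not the authors' exact formulation.

```python
from typing import Sequence


def fuse_and_classify(
    p_spectral: Sequence[float],
    p_texture: Sequence[float],
    alpha: float,
) -> int:
    """Return the index of the class with the highest combined probability.

    p_spectral and p_texture hold one probability per class for a single
    pixel, e.g. as produced by two separate probabilistic classifiers.
    alpha in [0, 1] weights spectral against texture evidence (assumed form).
    """
    if len(p_spectral) != len(p_texture):
        raise ValueError("both probability vectors must cover the same classes")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    # Linear combination of the two per-class probabilities.
    total = [
        alpha * ps + (1.0 - alpha) * pt
        for ps, pt in zip(p_spectral, p_texture)
    ]
    # Assign the pixel to the class with the highest total probability.
    return max(range(len(total)), key=total.__getitem__)
```

With `alpha` close to 1 the spectral classifier dominates, and with `alpha` close to 0 the texture classifier dominates, which mirrors the abstract's remark that the parameter is tuned per data set.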
6.
Vehicle segmentation and classification using deformable templates (total citations: 21; self-citations: 0; citations by others: 21)
Dubuisson Jolly M.-P. Lakshmanan S. Jain A.K. 《IEEE transactions on pattern analysis and machine intelligence》1996,18(3):293-308
This paper proposes a segmentation algorithm using deformable template models to segment a vehicle of interest both from the stationary complex background and other moving vehicles in an image sequence. We define a polygonal template to characterize a general model of a vehicle and derive a prior probability density function to constrain the template to be deformed within a set of allowed shapes. We propose a likelihood probability density function which combines motion information and edge directionality to ensure that the deformable template is contained within the moving areas in the image and its boundary coincides with strong edges with the same orientation in the image. The segmentation problem is reduced to a minimization problem and solved by the Metropolis algorithm. The system was successfully tested on 405 image sequences containing multiple moving vehicles on a highway.
7.
Fei Wang Jingdong Wang James Kwok 《Pattern recognition》2007,40(10):2786-2797
Face recognition is a challenging task in computer vision and pattern recognition. It is well-known that obtaining a low-dimensional feature representation with enhanced discriminatory power is of paramount importance to face recognition. Moreover, recent research has shown that the face images reside on a possibly nonlinear manifold. Thus, how to effectively exploit the hidden structure is a key problem that significantly affects the recognition results. In this paper, we propose a new unsupervised nonlinear feature extraction method called spectral feature analysis (SFA). The main advantages of SFA over traditional feature extraction methods are: (1) SFA does not suffer from the small-sample-size problem; (2) SFA can extract discriminatory information from the data, and we show that linear discriminant analysis can be subsumed under the SFA framework; (3) SFA can effectively discover the nonlinear structure hidden in the data. These appealing properties make SFA very suitable for face recognition tasks. Experimental results on three benchmark face databases illustrate the superiority of SFA over traditional methods.
8.
Iosif Mporas Todor Ganchev Nikos Fakotakis 《International Journal of Speech Technology》2008,11(2):73-85
In this paper we propose a method for improving the performance of the segmentation of speech waveforms to phonetic units. The proposed method is based on the well known Viterbi time-alignment algorithm and utilizes the phonetic boundary predictions from multiple speech parameterization techniques. Specifically, we utilize the most appropriate, with respect to boundary type, phone transition position prediction as initial point to start Viterbi time-alignment for the prediction of the successor phonetic boundary. The proposed method was evaluated on the TIMIT database, with the exploitation of several, well known in the area of speech processing, Fourier-based and wavelet-based speech parameterization algorithms. The experimental results for the tolerance of 20 milliseconds indicated an improvement of the absolute segmentation accuracy of approximately 0.70%, when compared to the baseline speech segmentation scheme.
9.
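Image segmentation is an important processing step in image recognition, but existing methods suffer from unsatisfactory results or excessive computational complexity. A new method for binarizing grayscale images is proposed. The method uses Ncut as the spectral clustering measure and, when computing it, uses a weight matrix based on the gray levels of the image rather than the usual weight matrix based on image pixels. As a result, both computational complexity and space complexity are markedly reduced. Experiments on text images from real scenes show that the method outperforms traditional threshold-based segmentation methods in terms of time and system overhead.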
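The idea of evaluating Ncut on gray levels rather than on pixels can be sketched as follows. This is a minimal illustration, not the paper's algorithm: the weight between two gray levels, `hist[i] * hist[j] * exp(-(i - j)^2 / sigma^2)`, and the names `ncut_threshold` and `sigma` are assumptions chosen here. The point it shows is that the weight matrix has at most `levels × levels` entries regardless of image size.

```python
import math
from typing import Sequence


def ncut_threshold(pixels: Sequence[int], levels: int = 256, sigma: float = 10.0) -> int:
    """Return the gray level t that minimises Ncut between {g <= t} and {g > t}."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    # Only gray levels that actually occur take part in the graph.
    present = [g for g in range(levels) if hist[g] > 0]
    if len(present) < 2:
        raise ValueError("need at least two distinct gray levels")

    def weight(i: int, j: int) -> float:
        # Assumed similarity between gray levels i and j, scaled by pixel counts.
        return hist[i] * hist[j] * math.exp(-((i - j) ** 2) / (sigma ** 2))

    best_t = present[0]
    best_value = math.inf
    for idx in range(len(present) - 1):
        a = present[: idx + 1]   # levels at or below the candidate threshold
        b = present[idx + 1:]    # levels above the candidate threshold
        cut = sum(weight(i, j) for i in a for j in b)
        assoc_a = sum(weight(i, j) for i in a for j in present)
        assoc_b = sum(weight(i, j) for i in b for j in present)
        value = cut / assoc_a + cut / assoc_b  # normalised cut criterion
        if value < best_value:
            best_value = value
            best_t = present[idx]
    return best_t
```

A pixel `p` would then be assigned to the dark class when `p <= t` and to the bright class otherwise. The triple loop is left unoptimised for readability; partial sums could be reused between candidate thresholds.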
10.
Z. Q. GU C. N. DUNCAN P. M. GRANT C. F. N. COWAN E. RENSHAW M. A. MUGGLESTONE 《International journal of remote sensing》2013,34(5):953-968
The problem of classifying clouds seen on meteorological satellite images into different types is one which requires the use of textural as well as spectral information. Since multi-spectral features are of prime importance, textural features must be considered as augmenting, rather than replacing, spectral measures. Several textural features are studied to determine their discriminating power across a number of cloud classes including those which have previously been found difficult to separate. Although several features in the frequency domain are tested they are found to be less useful than those in the spatial domain with only one exception. The specific features recommended for use in classification depend on the type of classification to be undertaken. Specifically, different features should be used for a multi-dimensional feature space analysis than for a binary-tree rule-based classification.
11.
This paper is devoted to the sequential detection of abrupt changes in spectral characteristics of digital signals, as this problem occurs for the segmentation of real signals such as speech, EEG, ECG, or geophysical signals. The limitations of a classical test are emphasized and some new algorithms are presented. They are based upon the use of two autoregressive models and some distance measures between them, such as the log-likelihood ratio and Kullback's divergence between conditional probability laws. All these algorithms are compared both via a simulation study and from a theoretical point of view.
12.
Semantic image segmentation aims to partition an image into non-overlapping regions and assign a pre-defined object class label to each region. In this paper, a semantic method combining low-level features and high-level contextual cues is proposed to segment natural scene images. The proposed method first takes the gist representation of an image as its global feature. The image is then over-segmented into many super-pixels and histogram representations of these super-pixels are used as local features. In addition, co-occurrence and spatial layout relations among object classes are exploited as contextual cues. Finally the features and cues are integrated into the inference framework based on conditional random field by defining specific potential terms and introducing weighting functions. The proposed method has been compared with state-of-the-art methods on the MSRC database, and the experimental results show its effectiveness.
13.
Phung SL Bouzerdoum A Chai D 《IEEE transactions on pattern analysis and machine intelligence》2005,27(1):148-154
This work presents a study of three important issues of the color pixel classification approach to skin segmentation: color representation, color quantization, and classification algorithm. Our analysis of several representative color spaces using the Bayesian classifier with the histogram technique shows that skin segmentation based on color pixel classification is largely unaffected by the choice of the color space. However, segmentation performance degrades when only chrominance channels are used in classification. Furthermore, we find that color quantization can be as low as 64 bins per channel, although higher histogram sizes give better segmentation performance. The Bayesian classifier with the histogram technique and the multilayer perceptron classifier are found to perform better compared to other tested classifiers, including three piecewise linear classifiers, three unimodal Gaussian classifiers, and a Gaussian mixture classifier.
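A Bayesian classifier with the histogram technique, as named in the abstract above, can be sketched in a few lines: class-conditional color histograms are estimated from labeled pixels, and a pixel is called skin when the likelihood ratio exceeds a threshold. The class name `HistogramBayesClassifier`, the add-one smoothing, and the default threshold of 1.0 are assumptions for illustration; the 64 bins per channel follow the quantization level mentioned in the abstract.

```python
from collections import Counter
from typing import Iterable, Tuple

RGB = Tuple[int, int, int]


class HistogramBayesClassifier:
    """Skin / non-skin pixel classifier based on quantised color histograms."""

    def __init__(self, bins: int = 64) -> None:
        if 256 % bins != 0:
            raise ValueError("bins must divide 256")
        self.bins = bins
        self.skin: Counter = Counter()
        self.nonskin: Counter = Counter()

    def _key(self, rgb: RGB) -> RGB:
        # Quantise each 8-bit channel into `bins` equal-width bins.
        step = 256 // self.bins
        return (rgb[0] // step, rgb[1] // step, rgb[2] // step)

    def fit(self, skin_pixels: Iterable[RGB], nonskin_pixels: Iterable[RGB]) -> "HistogramBayesClassifier":
        for p in skin_pixels:
            self.skin[self._key(p)] += 1
        for p in nonskin_pixels:
            self.nonskin[self._key(p)] += 1
        return self

    def likelihood_ratio(self, rgb: RGB) -> float:
        key = self._key(rgb)
        cells = self.bins ** 3
        # Add-one smoothing (an assumption here) keeps empty cells from
        # producing a zero denominator.
        p_skin = (self.skin[key] + 1) / (sum(self.skin.values()) + cells)
        p_nonskin = (self.nonskin[key] + 1) / (sum(self.nonskin.values()) + cells)
        return p_skin / p_nonskin

    def is_skin(self, rgb: RGB, threshold: float = 1.0) -> bool:
        return self.likelihood_ratio(rgb) > threshold
```

Raising `threshold` trades missed skin pixels for fewer false detections; in practice it would be chosen from a validation set rather than fixed at 1.0.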
14.
Content-based audio classification and segmentation by using support vector machines (total citations: 9; self-citations: 0; citations by others: 9)
Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification which employs support vector machines (SVMs). Five audio classes are considered in this paper: silence, music, background sound, pure speech, and non-pure speech which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We have evaluated the performance of SVM on different audio type-pairs classification with testing unit of different-length and compared the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM). We also evaluated the effectiveness of some new proposed features. Experiments on a database composed of about 4-hour audio data show that the proposed classifier is very efficient on audio classification and segmentation. It also shows the accuracy of the SVM-based method is much better than the method based on KNN and GMM.
15.
A new algorithm using invariant spectral features for segmenting colour images is presented in this paper. Input data are three primary images obtained from a colour sensor. The input colour image is transformed to IHS (Intensity, Hue, Saturation) colour space. This colour space transform compensates for illumination variations and delivers image pixel values with low variance for individual colour regions, hence contributing to simplified segmentation. The hue and saturation images are then separately filtered and combined. The resulting image is segmented by means of a threshold process. An opening operation on the segmented image completes the algorithm. Experimental results obtained for several images are presented. Issues related to illumination and sensors are also addressed.
16.
Image classification usually requires complicated segmentation to separate foreground objects from the background scene. However, the statistical content of a background scene can actually provide very useful information for classification. In this paper, we propose a new hybrid pyramid kernel which incorporates local features extracted from both dense regular grids and interest points for image classification, without requiring segmentation. Features extracted from dense regular grids can better capture information about the background scene, while interest points detected at corners and edges can better capture information about the salient objects. In our algorithm, these two local features are combined in both the spatial and the feature-space domains, and are organized into pyramid representations. In order to obtain better classification accuracy, we fine-tune the parameters involved in the similarity measure, and we determine discriminative regions by means of relevance feedback. From the experimental results, we observe that our algorithm can achieve a 6.37% increase in performance as compared to other pyramid-representation-based methods. To evaluate the applicability of the proposed hybrid kernel to large-scale databases, we have performed a cross-dataset experiment and investigated the effect of foreground/background features on each of the kernels. In particular, the proposed hybrid kernel has been proven to satisfy Mercer’s condition and is efficient in measuring the similarity between image features. For instance, the computational complexity of the proposed hybrid kernel is proportional to the number of features.
17.
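The quality of flame image segmentation is very important for combustion monitoring based on digital imaging. Because of the influence of the furnace background and the combustion conditions, it is difficult to meet the requirements for segmentation speed and segmentation accuracy (that is, how close the segmentation result is to the real flame) at the same time. An unsupervised flame image segmentation method based on multi-scale color features and wavelet texture features (MCWT) is proposed to improve both the quality and the speed of flame image segmentation. A feature matrix is built by combining the color features and the wavelet texture features of the flame image; the feature matrix is compressed, and the flame region is first detected at the compressed scale. The flame edge region at the original scale is then determined from the flame edge at the compressed scale, a feature matrix of the flame edge region is built, and a further segmentation step yields an accurate flame segmentation result. The method is used to segment flame images taken under different combustion conditions in an industrial coal-combustion test furnace and is compared with traditional segmentation methods. The experimental results show that, compared with other traditional segmentation methods, the proposed method segments flame images under different combustion conditions more accurately and faster, and that it also gives better segmentation results for flame images containing Gaussian noise and salt-and-pepper noise.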
18.
Document Segmentation is a process that aims to filter documents while identifying certain regions of interest. Generally, the regions of interest include texts, graphics (image occupied regions) and the background. This paper presents a novel top-bottom approach to perform document segmentation using texture features that are extracted from the specified/selected documents. A mask of suitable size is used to summarize textural features, and statistical parameters are captured as blocks in document images. Four textural features that are extracted from masks using the gray level co-occurrence matrix (GLCM) include entropy, contrast, energy and homogeneity. Furthermore, two statistical parameters extracted from corresponding masks are the modal and median pixel values. The extracted attributes allow the classification of each mask or block as text, graphics, and background. A feedforward network is trained on the 6 extracted attributes, using documents obtained from a public database; an error rate of 15.77% is achieved. Furthermore, it is shown that this novel approach produces promising performance in segmenting documents and is expected to be significantly efficient for content-based information retrieval systems. Detection of duplicate documents within large databases is another potential area of application.
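The six per-block attributes listed in the abstract above (four GLCM texture features plus the modal and median pixel values) can be computed as in the following sketch. The feature formulas vary between toolkits; the definitions used here (base-2 entropy, contrast weighted by `(i - j)^2`, energy as the sum of squared entries, homogeneity with `1 + |i - j|` in the denominator) and the single horizontal offset are assumptions for illustration, as are the function names.

```python
import math
import statistics
from collections import Counter
from typing import Dict, List, Sequence

Block = Sequence[Sequence[int]]


def glcm(block: Block, levels: int, dx: int = 1, dy: int = 0) -> List[List[float]]:
    """Normalised gray level co-occurrence matrix for one pixel offset."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(block), len(block[0])
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[block[r][c]][block[r2][c2]] += 1
                total += 1
    if total == 0:
        raise ValueError("block is too small for the chosen offset")
    return [[v / total for v in row] for row in counts]


def texture_features(p: List[List[float]]) -> Dict[str, float]:
    """Entropy, contrast, energy and homogeneity of a normalised GLCM."""
    entropy = contrast = energy = homogeneity = 0.0
    for i, row in enumerate(p):
        for j, v in enumerate(row):
            if v > 0:
                entropy -= v * math.log2(v)
            contrast += (i - j) ** 2 * v
            energy += v * v
            homogeneity += v / (1 + abs(i - j))
    return {"entropy": entropy, "contrast": contrast,
            "energy": energy, "homogeneity": homogeneity}


def block_attributes(block: Block, levels: int) -> Dict[str, float]:
    """Six attributes for one block: four GLCM features, mode and median."""
    flat = [v for row in block for v in row]
    attrs = texture_features(glcm(block, levels))
    attrs["mode"] = Counter(flat).most_common(1)[0][0]
    attrs["median"] = statistics.median(flat)
    return attrs
```

Each block's six-value vector would then be the input to a classifier that labels the block as text, graphics, or background.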
19.
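Medical image segmentation using an improved genetic fuzzy clustering algorithm (total citations: 1; self-citations: 0; citations by others: 1)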
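The global random search of a genetic algorithm can overcome the tendency of the fuzzy C-means (FCM) clustering algorithm to become trapped in local optima in medical image segmentation, but choosing the initial search range of the genetic algorithm depends on human experience. To address this, the cluster centers obtained by a fast-converging hard clustering algorithm are used as a reference, and a small data range around them is taken as the initial search space of the genetic algorithm. The method keeps FCM from being trapped in local optima and also speeds up the convergence of the genetic algorithm. Experiments show that the method performs much better than the standard genetic fuzzy algorithm.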