首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
It is well known that microarray printing, hybridization, and washing oftentimes create erroneous measurements, and these errors detrimentally impact machine microarray spot quality classification. Thus, it is crucial to identify and remove these errors if automation is to replace the still common practice of visually assessing spot quality, an extremely expensive and time-consuming procedure. A major problem in microarray spot quality classification methods proposed in the literature is the correlation among the features extracted from the spots. In this paper, we propose using a random subspace ensemble of neural networks and a feature selection algorithm to improve the performance of our microarray spot quality classification method. Our best method obtains an error under the receiver operating characteristic curve (EAUR) of 0.3 outperforming the stand-alone support vector machine EAUR of 1.7. The consistency of our proposed approach makes it a viable alternative to the labour-intensive manual method of spot quality assessment.  相似文献   

2.

A novel partially supervised classification technique is proposed, which allows the efficient mapping of a specific land-cover class (or a few land-cover classes) of interest, by using only training samples belonging to the class or classes selected. It is based on a combined use of a Radial Basis Function network, which models the image data distribution, and a Markov Random Field approach, which exploits the spatial-contextual information. The result is high classification accuracy comparable to that provided by fully supervised classifiers.  相似文献   

3.
Web page classification has become a challenging task due to the exponential growth of the World Wide Web. Uniform Resource Locator (URL)‐based web page classification systems play an important role, but high accuracy may not be achievable as URL contains minimal information. Nevertheless, URL‐based classifiers along with rejection framework can be used as a first‐level filter in a multistage classifier, and a costlier feature extraction from contents may be done in later stages. However, noisy and irrelevant features present in URL demand feature selection methods for URL classification. Therefore, we propose a supervised feature selection method by which relevant URL features are identified using statistical methods. We propose a new feature weighting method for a Naive Bayes classifier by embedding the term goodness obtained from the feature selection method. We also propose a rejection framework to the Naive Bayes classifier by using posterior probability for determining the confidence score. The proposed method is evaluated on the Open Directory Project and WebKB data sets. Experimental results show that our method can be an effective first‐level filter. McNemar tests confirm that our approach significantly improves the performance.  相似文献   

4.
5.
一种基于微阵列数据的集成分类方法*   总被引:1,自引:0,他引:1  
针对现有的微阵列数据集成分类方法分类精度不高这一问题,提出了一种Bagging-PCA-SVM方法。该方法首先采用Bootstrap技术对训练样本集重复取样,构成大量训练样本子集,然后在每个子集上进行特征选择和主成分分析以消除噪声基因与冗余基因;最后利用支持向量机作为分类器,采用多数投票的方法预测样本的类属。通过三个数据集进行了测试,测试结果表明了该方法的有效性和可行性。  相似文献   

6.
7.
Surface defect detection is very important to guarantee the quality of ceramic tiles production. At present, this process is usually performed manually in the ceramic tile industry, which is low efficiency and time-consuming. For small surface defects detection of high-resolution ceramic tiles image, an intelligent detection method for surface defects of ceramic tiles based on an improved you only look once version 5 (YOLOv5) algorithm is presented. Firstly, the high-resolution ceramic tile images are cropped into slices, and the Bottleneck module in the YOLOv5s network is optimized by introducing depthwise convolution and replaced in the whole network. Then, feature extraction is performed using the improved Shufflenetv2 backbone, and an attention mechanism is added to the backbone network to improve the feature extraction ability. The path aggregation network (PAN) and Feature Pyramid Networks (FPN) neck are used to enhance the feature extraction, and finally, the YOLO head is used to identify and locate the ceramic tile defects. The multiple sliding windows detection method is proposed to detect the original ceramic tile image which is faster than the single sliding window detection method. The experimental results show that compared with the original YOLOv5s detection algorithm, the parameters of the model are reduced by 20.46 %, the floating point operations are reduced by 26.22 %, and the mean average precision (mAP) of the proposed method is 96.73 % in the ceramic tile image slice test set which has 1.93 % improvement in mAP than the original YOLOv5s. Compare with other object detection methods, the method proposed in this paper also has certain advantages. In the high-resolution ceramic tile images test set, the mAP of the proposed algorithm is 86.44 % by using the multiple sliding window detection method. The ceramic defect detection experiment has verified the feasibility of the method proposed in this paper.  相似文献   

8.
Feature selection, both for supervised as well as for unsupervised classification is a relevant problem pursued by researchers for decades. There are multiple benchmark algorithms based on filter, wrapper and hybrid methods. These algorithms adopt different techniques which vary from traditional search-based techniques to more advanced nature inspired algorithm based techniques. In this paper, a hybrid feature selection algorithm using graph-based technique has been proposed. The proposed algorithm has used the concept of Feature Association Map (FAM) as an underlying foundation. It has used graph-theoretic principles of minimal vertex cover and maximal independent set to derive feature subset. This algorithm applies to both supervised and unsupervised classification. The performance of the proposed algorithm has been compared with several benchmark supervised and unsupervised feature selection algorithms and found to be better than them. Also, the proposed algorithm is less computationally expensive and hence has taken less execution time for the publicly available datasets used in the experiments, which include high-dimensional datasets.  相似文献   

9.
As the popularity of the portable document format (PDF) file format increases, research that facilitates PDF text analysis or extraction is necessary. Heading detection is a crucial component of PDF-based text classification processes. This research involves training a supervised learning model to detect headings by systematically testing and selecting classifier features using recursive feature elimination. Results indicate that decision tree is the best classifier with an accuracy of 95.83%, sensitivity of 0.981, and a specificity of 0.946. This research into heading detection contributes to the field of PDF-based text extraction and can be applied to the automation of large scale PDF text analysis in a variety of professional and policy-based contexts.  相似文献   

10.
The discovery of the genes involved in genetic diseases is a very important step towards the understanding of the nature of these diseases. In-lab identification is a difficult, time-consuming task, where computational methods can be very useful. In silico identification algorithms can be used as a guide in future studies. Previous works in this topic have not taken into account that no reliable sets of negative examples are available, as it is not possible to ensure that a given gene is not related to any genetic disease. In this paper, this feature of the nature of the problem is considered, and identification is approached as a partially supervised classification problem. In addition, we have performed a more specific method to identify disease genes by classifying, for the first time, genes causing dominant and recessive diseases independently. We base this separation on previous results that show that these two types of genes present differences in their sequence properties. In this paper, we have applied a new model averaging algorithm to the identification of human genes associated with both dominant and recessive Mendelian diseases.  相似文献   

11.
The notion of “fuzzy separability” is introduced for fuzzy sets of patterns. A supervised learning algorithm is proposed for estimation of membership functions that yield hierarchical partitioning of the feature space for fuzzy separable pattern classes under confusion. Finally we present a methodology for the design of a classifier composed of hierarchical binary decision trees.  相似文献   

12.
Artificial Intelligence Review - Longitudinal datasets of human ageing studies usually have a high volume of missing data, and one way to handle missing values in a dataset is to replace them with...  相似文献   

13.
Hui  Yang  Mei  Xuesong  Jiang  Gedong  Zhao  Fei  Ma  Ziwei  Tao  Tao 《Journal of Intelligent Manufacturing》2022,33(3):753-769
Journal of Intelligent Manufacturing - During the batch assembly analysis of linear axis of machine tool, assembly quality evaluation is crucial to reduce assembly quality fluctuations and improve...  相似文献   

14.
Determining a proper distance metric is often a crucial step for machine learning. In this paper, a boosting algorithm is proposed to learn a Mahalanobis distance metric. Similar to most boosting algorithms, the proposed algorithm improves a loss function iteratively. In particular, the loss function is defined in terms of hypothesis margins, and a metric matrix base-learner specific to the boosting framework is also proposed. Experimental results show that the proposed approach can yield effective Mahalanobis distance metrics for a variety of data sets, and demonstrate the feasibility of the proposed approach.  相似文献   

15.
In this paper we introduce a novel supervised manifold learning technique called Supervised Laplacian Eigenmaps (S-LE), which makes use of class label information to guide the procedure of non-linear dimensionality reduction by adopting the large margin concept. The graph Laplacian is split into two components: within-class graph and between-class graph to better characterize the discriminant property of the data. Our approach has two important characteristics: (i) it adaptively estimates the local neighborhood surrounding each sample based on data density and similarity and (ii) the objective function simultaneously maximizes the local margin between heterogeneous samples and pushes the homogeneous samples closer to each other.Our approach has been tested on several challenging face databases and it has been conveniently compared with other linear and non-linear techniques, demonstrating its superiority. Although we have concentrated in this paper on the face recognition problem, the proposed approach could also be applied to other category of objects characterized by large variations in their appearance (such as hand or body pose, for instance).  相似文献   

16.
Machine vision based inspection systems are in great focus nowadays for quality control applications. The proposed work presents a novel approach for classification of wood knot defects for an automated inspection. The proposed technique utilizes gray level co-occurrence matrix and laws texture energy measures as texture feature extractors and feed-forward back-propagation neural network as classifier. The proposed work involves the comparison of gray level co-occurrence matrix based features with laws texture energy measures based features. Firstly it takes contrast, correlation, energy and homogeneity as input parameters to a feed-forward back propagation neural network to predict wood defects and then it take energy calculated from laws texture energy measures based energy maps as input feature to a feed-forward back propagation neural network. Mean Square Error (MSE) for training data is found to be 0.0718 and 90.5% overall average classification accuracy is achieved when laws texture energy measures based features are used as input to the neural network as compared to gray level co-occurrence matrix based input features where MSE for training data is found to be 0.10728 and 84.3% overall average classification accuracy is achieved. The proposed technique shows promising results to classify wood defects using a feed forward back-propagation neural network.  相似文献   

17.
Liu  Mingyi  Tu  Zhiying  Xu  Xiaofei  Wang  Zhongjie  Wang  Yan 《Software and Systems Modeling》2023,22(3):919-939
Software and Systems Modeling - Services are flourishing drastically both on the Internet and in the real world. In addition, services have become much more interconnected to facilitate...  相似文献   

18.
A multi-scale supervised neural architecture, called Multi-Scale SOON, is proposed for natural texture classification. This architecture recognizes the input textured image through a hierarchical categorization structure in multiple scales. This process consists of three sequential phases: a multi-scale feature extraction, a scale prototype pattern generation, and a multi-scale prototype fusion pattern classification. First phase extracts scale textural features using the Gabor filtering. Then, a hierarchical categorization shapes the classification. First categorization level generates the scale prototypes and an upper level categorizes the prototypes fusion. Three increasing complexity tests over the well-known Brodatz database are performed in order to quantify the Multi-Scale SOON behavior. The comparison to other standout methods proves Multi-Scale SOON behavior to be satisfactory. The tests, including the entire texture album, show the stability and robustness of the Multi-Scale SOON response.  相似文献   

19.
A novel ensemble of classifiers for microarray data classification   总被引:1,自引:0,他引:1  
Yuehui  Yaou   《Applied Soft Computing》2008,8(4):1664-1669
Micorarray data are often extremely asymmetric in dimensionality, such as thousands or even tens of thousands of genes and a few hundreds of samples. Such extreme asymmetry between the dimensionality of genes and samples presents several challenges to conventional clustering and classification methods. In this paper, a novel ensemble method is proposed. Firstly, in order to extract useful features and reduce dimensionality, different feature selection methods such as correlation analysis, Fisher-ratio is used to form different feature subsets. Then a pool of candidate base classifiers is generated to learn the subsets which are re-sampling from the different feature subsets with PSO (Particle Swarm Optimization) algorithm. At last, appropriate classifiers are selected to construct the classification committee using EDAs (Estimation of Distribution Algorithms). Experiments show that the proposed method produces the best recognition rates on four benchmark databases.  相似文献   

20.
Currently, quality estimation (QE) is mostly addressed using supervised learning approaches. In this paper we show that unsupervised and weakly supervised approaches (using a small training set) perform almost as well as supervised ones, for a significantly lower cost. More generally, we study the various possible definitions, parameters, evaluation methods and approaches for QE, in order to show that there are multiple possible configurations for this task.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号