Similar literature
1.
Classification is one of the most popular data mining techniques and is applied to many scientific and industrial problems. The efficiency of a classification model is evaluated by two parameters: the accuracy and the interpretability of the model. While most existing methods claim superior accuracy over others, their models are usually complex and hard for users to understand. In this paper, we propose a novel classification model that is based on easily interpretable fuzzy association rules and fulfils both efficiency criteria. Since the accuracy of a classification model can be strongly affected by the partitioning of numerical attributes, this paper discusses several fuzzy and crisp partitioning techniques. The proposed classification method is compared to 15 previously published association rule-based classifiers by testing them on five benchmark data sets. The results show that the fuzzy association rule-based classifier presented in this paper offers a compact, understandable and accurate classification model.

2.
This paper proposes a classification method that is based on easily interpretable fuzzy rules and fully capitalizes on two key technologies: pruning the outliers in the training data with SVMs (support vector machines), i.e., eliminating the influence of outliers on the learning process; and finding a fuzzy set with a sound linguistic interpretation to describe each class, based on AFS (axiomatic fuzzy set) theory. Compared with other fuzzy rule-based methods, the proposed models are usually more compact and easier for users to understand, since each class is described by far fewer rules. The proposed method has two other advantages: each rule obtained from the proposed algorithm is simply a conjunction of some linguistic terms, and no parameters need to be tuned. The proposed classification method is compared with previously published fuzzy rule-based classifiers by testing them on 16 UCI data sets. The results show that the fuzzy rule-based classifier presented in this paper offers a compact, understandable and accurate classification scheme. A balance is achieved between interpretability and accuracy.

3.
This paper presents an on-line multi-stage sorting algorithm capable of adapting to different populations. The sorting algorithm selects, on-line, the most appropriate classifier and feature subsets for the incoming population. It has two levels: a low level for population detection and a high level for classifier selection, which incorporates feature selection. Population detection is achieved by an on-line unsupervised clustering algorithm that analyzes product variability. The classifier selection uses n fuzzy kNN classifiers, each trained with a different feature combination, whose outputs serve as input to a fuzzy rule-based decision system. The n fuzzy kNN classifiers are re-trained when the rule-based system cannot assign an existing classifier with a high confidence level. Classification results for synthetic and real-world databases are presented.

4.
How good are fuzzy If-Then classifiers? (Total citations: 9; self-citations: 0; citations by others: 9)
This paper gives some known theoretical results about fuzzy rule-based classifiers and offers a few new ones. The ability of Takagi-Sugeno-Kang (TSK) fuzzy classifiers to match exactly and to approximate classification boundaries is discussed. The lemma by Klawonn and Klement about the exact match of a classification boundary in R^2 is extended from monotonic to arbitrary functions. Equivalence between fuzzy rule-based and nonfuzzy classifiers (1-nn and Parzen) is outlined. We specify the conditions under which a class of fuzzy TSK classifiers turns into lookup tables. It is shown that if the rule base consists of all possible rules (all combinations of linguistic labels on the input features), the fuzzy TSK model is a lookup classifier with hyperbox cells, regardless of the type (shape) of the membership functions used. The question "why fuzzy?" is addressed in the light of these results.

5.
We propose a two-layer decision fusion technique, called Fuzzy Stacked Generalization (FSG), which establishes a hierarchical distance learning architecture. At the base-layer of an FSG, fuzzy k-NN classifiers receive different feature sets, each of which is extracted from the same dataset, to gain multiple views of the dataset. At the meta-layer, a fusion space is first constructed by aggregating the decision spaces of all the base-layer classifiers. Then, a fuzzy k-NN classifier is trained in the fusion space by minimizing the difference between the large-sample and N-sample classification error. In order to measure the degree of collaboration among the base-layer classifiers and the diversity of the feature spaces, a new measure, called shareability, is introduced. Shareability is defined as the number of samples that are correctly classified by at least one of the base-layer classifiers in FSG. In the experiments, we observe that FSG performs better than the popular distance learning and ensemble learning algorithms when the shareability measure is large enough that most of the samples are correctly classified by at least one of the base-layer classifiers. The relationship between the proposed and state-of-the-art diversity measures is experimentally analyzed. The tests performed on a variety of artificial and real-world benchmark datasets show that the classification performance of FSG increases compared to that of state-of-the-art ensemble learning and distance learning methods as the number of classes increases.
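
The shareability definition in the abstract above (the number of samples correctly classified by at least one base-layer classifier) can be illustrated with a minimal sketch. The function name, the label encoding and the toy predictions are assumptions made for illustration; they are not taken from the paper.

```python
from typing import Sequence


def shareability(y_true: Sequence[int],
                 base_predictions: Sequence[Sequence[int]]) -> int:
    """Count samples that at least one base classifier labels correctly.

    base_predictions[j][i] is the label that (hypothetical) base
    classifier j assigns to sample i.
    """
    return sum(
        any(preds[i] == label for preds in base_predictions)
        for i, label in enumerate(y_true)
    )


# Toy data, invented for this example.
y_true = [0, 1, 1, 2]
base_predictions = [
    [0, 0, 1, 1],  # hypothetical base classifier A
    [1, 1, 0, 1],  # hypothetical base classifier B
]
print(shareability(y_true, base_predictions))  # 3: the last sample is missed by both
```

Dividing the count by the number of samples would give a ratio that is easier to compare across data sets; that normalization is a convenience suggested here, not something the abstract states.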

6.
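Based on an analysis of interval-valued fuzzy set theory and existing fuzzy classifiers, two classifier algorithms are proposed, built on interval-valued reasoning and on the AFS structure, respectively. Experiments on the Iris data show that the first algorithm gives good classification results, and that the second not only performs as well as the first but can also handle some databases that describe fuzzy concepts. Both algorithms have good application prospects in classification practice.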

7.
Classification, a data mining technique, has widespread applications, including medical diagnosis and targeted marketing. Knowledge discovery from databases in the form of association rules is one of the important data mining tasks. An integrated approach, classification based on association rules, has drawn the attention of the data mining community over the last decade. While attention has mainly focused on increasing classifier accuracy, little effort has been devoted to building interpretable and less complex models. This paper discusses the development of a compact associative classification model using a hill-climbing approach and fuzzy sets. The proposed methodology builds the rule base by selecting rules that contribute to increasing training accuracy, thus balancing classification accuracy against the number of classification association rules. The results indicate that the proposed associative classification model can achieve competitive accuracy on benchmark datasets with continuous attributes and offers better interpretability than other rule-based systems.

8.
We propose two models for improving the performance of rule-based classification in unbalanced and highly imprecise domains. Both models are probabilistic frameworks aimed at boosting the performance of basic rule-based classifiers. The first model implements a global-to-local scheme, where the response of a global rule-based classifier is refined by performing a probabilistic analysis of the coverage of its rules. In particular, the coverage of the individual rules is used to learn local probabilistic models, which ultimately refine the predictions from the corresponding rules of the global classifier. The second model implements a dual local-to-global strategy, in which single classification rules are combined within an exponential probabilistic model in order to boost the overall performance as a side effect of mutual influence. Several variants of the basic ideas are studied, and their performance is thoroughly evaluated and compared with state-of-the-art algorithms on standard benchmark datasets.

9.
The notion of a rough set was originally proposed by Pawlak [Z. Pawlak, Rough sets, International Journal of Computer and Information Sciences 11 (5) (1982) 341-356]. Later on, Dubois and Prade [D. Dubois, H. Prade, Rough fuzzy sets and fuzzy rough sets, International Journal of General System 17 (2-3) (1990) 191-209] introduced rough fuzzy sets and fuzzy rough sets as a generalization of rough sets. This paper deals with an interval-valued fuzzy information system by integrating the classical Pawlak rough set theory with the interval-valued fuzzy set theory, and discusses the basic rough set theory for interval-valued fuzzy information systems. First, we define the rough approximation of an interval-valued fuzzy set on the universe U in the classical Pawlak approximation space and in the generalized approximation space, i.e., the spaces on which the interval-valued rough fuzzy set model is built. Second, several interesting properties of the approximation operators are examined, and the interrelationships of the interval-valued rough fuzzy set models in the classical Pawlak approximation space and the generalized approximation space are investigated. Third, we discuss the attribute reduction of interval-valued fuzzy information systems. Finally, methods of knowledge discovery for interval-valued fuzzy information systems are presented with an example.

10.

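For multi-attribute decision-making problems in which the decision information takes the form of interval-valued intuitionistic fuzzy numbers and the attribute weights are completely unknown, a decision method based on an improved interval-valued intuitionistic fuzzy entropy and a new score function is proposed. First, the attribute weights are determined using the improved interval-valued intuitionistic fuzzy entropy. Then, the interval-valued intuitionistic fuzzy weighted arithmetic averaging operator is used to aggregate the information and obtain the overall attribute value of each alternative. It is further pointed out that existing score functions can fail to rank alternatives or can produce rankings that do not match reality; a new score function is given and used to rank the alternatives. Finally, an example demonstrates the effectiveness of the proposed method.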
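
The aggregation step mentioned in the abstract above can be sketched as follows. This is the interval-valued intuitionistic fuzzy weighted arithmetic averaging operator in its commonly cited form, not the paper's own code. The tuple layout `((a, b), (c, d))`, the function name `iifwa` and the example weights are assumptions; in the abstract the weights come from the improved entropy, which is not reproduced here, and neither is the new score function.

```python
from math import prod
from typing import List, Tuple

# An interval-valued intuitionistic fuzzy number, written here as
# ((a, b), (c, d)): membership interval [a, b], non-membership interval [c, d].
IVIFN = Tuple[Tuple[float, float], Tuple[float, float]]


def iifwa(values: List[IVIFN], weights: List[float]) -> IVIFN:
    """Weighted arithmetic average of interval-valued intuitionistic fuzzy numbers."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    # Membership bounds: 1 - prod((1 - x_j) ** w_j)
    a = 1 - prod((1 - v[0][0]) ** w for v, w in zip(values, weights))
    b = 1 - prod((1 - v[0][1]) ** w for v, w in zip(values, weights))
    # Non-membership bounds: prod(x_j ** w_j)
    c = prod(v[1][0] ** w for v, w in zip(values, weights))
    d = prod(v[1][1] ** w for v, w in zip(values, weights))
    return ((a, b), (c, d))


# Hypothetical ratings of one alternative on two attributes, with assumed weights.
ratings = [((0.4, 0.5), (0.2, 0.3)), ((0.6, 0.7), (0.1, 0.2))]
print(iifwa(ratings, [0.5, 0.5]))
```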

11.
In this paper, we define various induced intuitionistic fuzzy aggregation operators, including the induced intuitionistic fuzzy ordered weighted averaging (OWA) operator, the induced intuitionistic fuzzy hybrid averaging (I-IFHA) operator, the induced interval-valued intuitionistic fuzzy OWA operator, and the induced interval-valued intuitionistic fuzzy hybrid averaging (I-IIFHA) operator. We also establish various properties of these operators. An approach based on the I-IFHA operator and the intuitionistic fuzzy weighted averaging (WA) operator is then developed to solve multi-attribute group decision-making (MAGDM) problems in which the attribute weights and the decision makers' (DMs') weights are real numbers and the attribute values provided by the DMs are intuitionistic fuzzy numbers (IFNs). A second approach, based on the I-IIFHA operator and the interval-valued intuitionistic fuzzy WA operator, is developed to solve MAGDM problems in which the attribute values provided by the DMs are interval-valued IFNs. Furthermore, the induced intuitionistic fuzzy hybrid geometric operator and the induced interval-valued intuitionistic fuzzy hybrid geometric operator are proposed. Finally, a numerical example is presented to illustrate the developed approaches.

12.
Fuzzy classification has attracted great interest because it can use simple, linguistically interpretable rules and overcomes the limitations of symbolic or crisp rule-based classifiers. This paper introduces an extension of the fuzzy classifier: a neutrosophic classifier, which uses neutrosophic logic. Neutrosophic logic is a generalized logic that is capable of effectively handling indeterminacy, stochasticity, and acquisition errors that fuzzy logic cannot handle. The proposed neutrosophic classifier is compared with commonly used fuzzy classifiers on the following parameters: nature of the membership functions, number of rules, and indeterminacy in the results generated. The paper shows that the neutrosophic classifier, as an extended fuzzy classifier, optimizes these parameters in comparison with its fuzzy counterpart. The paper concludes by arguing that neutrosophic logic, though still in its nascent stage, holds potential for further exploration in different domains.

13.
Background: Detection and monitoring of respiratory-related illness is an important aspect of pulmonary medicine. Acoustic signals extracted from the human body are used to detect respiratory pathology accurately. Objectives: The aim of this study is to develop a prototype telemedicine tool to detect respiratory pathology using computerized respiratory sound analysis. Methods: Around 120 subjects (40 normal, 40 with continuous lung sounds (20 wheeze and 20 rhonchi), and 40 with discontinuous lung sounds (20 fine crackles and 20 coarse crackles)) were included in this study. The respiratory sounds were segmented into respiratory cycles using a fuzzy inference system, and the S-transform was then applied to these respiratory cycles. Statistical features were extracted from the S-transform matrix. The extracted features were statistically significant with p < 0.05. To classify the respiratory pathology, KNN, SVM and ELM classifiers were implemented using the statistical features obtained from the data. Results: The validation showed that the training classification rate of the ELM classifier with an RBF kernel was high compared with the SVM and KNN classifiers. Training also took less time for ELM than for the SVM and KNN classifiers. The overall mean classification rate for the ELM classifier was 98.52%. Conclusion: The telemedicine software tool was developed using the ELM classifier. The tool performed extraordinarily well in detecting respiratory pathology and is well validated.

14.
In this paper, we present the induced generalized intuitionistic fuzzy ordered weighted averaging (I-GIFOWA) operator. It is a new aggregation operator that generalizes the IFOWA operator and includes all the characteristics of both the generalized IFOWA and the induced IFOWA operators. It provides a very general formulation that includes, as special cases, a wide range of aggregation operators for intuitionistic fuzzy information, including all the particular cases of the I-IFOWA operator, the GIFOWA operator and the induced intuitionistic fuzzy ordered geometric (I-IFOWG) operator. We also present the induced generalized interval-valued intuitionistic fuzzy ordered weighted averaging (I-GIIFOWA) operator to accommodate settings in which the given arguments are interval-valued intuitionistic fuzzy sets. Further, we develop procedures for applying these operators to group multiple attribute decision making problems with intuitionistic fuzzy or interval-valued intuitionistic fuzzy information. Finally, we present an application to show the effectiveness of the developed methods.
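
To make the "induced" and "generalized" ingredients concrete, the sketch below applies them to ordinary non-negative real numbers. This is a deliberate simplification of the abstract above, which works with intuitionistic fuzzy values. The function name, the parameter `lam` and the sample data are assumptions for illustration only.

```python
from typing import Sequence


def induced_generalized_owa(inducers: Sequence[float],
                            arguments: Sequence[float],
                            weights: Sequence[float],
                            lam: float = 1.0) -> float:
    """Induced generalized OWA on non-negative reals (crisp simplification).

    Arguments are reordered by decreasing order-inducing value, then
    combined as (sum_j w_j * b_j ** lam) ** (1 / lam).
    """
    if not (len(inducers) == len(arguments) == len(weights)):
        raise ValueError("inducers, arguments and weights must have equal length")
    if lam == 0:
        raise ValueError("lam must be non-zero")
    # The inducer, not the argument's own size, decides the position.
    ordered = [arg for _, arg in sorted(zip(inducers, arguments),
                                        key=lambda pair: pair[0],
                                        reverse=True)]
    return sum(w * b ** lam for w, b in zip(weights, ordered)) ** (1.0 / lam)


# Invented example: the argument with inducer 5 takes the first weight.
print(induced_generalized_owa([2, 5, 3], [0.2, 0.8, 0.5], [0.5, 0.3, 0.2]))
```

With `lam=1` this reduces to the plain induced OWA operator; other values of `lam` give generalized means of the reordered arguments.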

15.
The aim of this paper is to investigate decision making problems with interval-valued intuitionistic fuzzy preference information, in which the preferences provided by the decision maker over alternatives are incomplete or uncertain. We define some new preference relations, including the additive consistent incomplete interval-valued intuitionistic fuzzy preference relation, the multiplicative consistent incomplete interval-valued intuitionistic fuzzy preference relation and the acceptable incomplete interval-valued intuitionistic fuzzy preference relation. Based on the arithmetic average and the geometric mean, respectively, we give two procedures for extending acceptable incomplete interval-valued intuitionistic fuzzy preference relations to complete interval-valued intuitionistic fuzzy preference relations. Then, using the interval-valued intuitionistic fuzzy averaging operator or the interval-valued intuitionistic fuzzy geometric operator, an approach to decision making based on the incomplete interval-valued intuitionistic fuzzy preference relation is given, and the developed approach is applied to a practical problem. It is worth pointing out that if the interval-valued intuitionistic fuzzy preference relation is reduced to the real-valued intuitionistic fuzzy preference relation, then all the above results reduce to their counterparts, which can be applied to decision making problems with incomplete intuitionistic fuzzy preference information.

16.
Linguistic fuzzy modelling, developed by linguistic fuzzy rule-based systems, allows us to model systems by building a linguistic model that human beings can interpret. Linguistic fuzzy modelling comes with two contradictory requirements: interpretability and accuracy. In recent years, researchers' interest in obtaining more interpretable linguistic fuzzy models has grown. Whereas measures of accuracy are straightforward and well known, interpretability measures are difficult to define, since interpretability depends on several factors, mainly the model structure, the number of rules, the number of features, the number of linguistic terms and the shape of the fuzzy sets. Moreover, because the concept is subjective, the choice of appropriate interpretability measures is still an open problem. In this paper, we present an overview of the proposed interpretability measures and techniques for obtaining more interpretable linguistic fuzzy rule-based systems. To this end, we propose a taxonomy based on a double axis: “complexity versus semantic interpretability”, considering the two main kinds of measures; and “rule base versus fuzzy partitions”, considering the different components of the knowledge base to which both kinds of measures can be applied. The main aim is to provide a well-established framework that facilitates a better understanding of the topic and well-founded future work.

17.
In this paper, building on the Maclaurin symmetric mean (MSM) operator, the dual MSM (DMSM) operator and the q-rung interval-valued orthopair fuzzy set (q-RIVOFS), we develop some novel MSM operators for the q-rung interval-valued orthopair fuzzy environment: the q-rung interval-valued orthopair fuzzy MSM operator, the q-rung interval-valued orthopair fuzzy weighted MSM (q-RIVOFWMSM) operator, the q-rung interval-valued orthopair fuzzy DMSM operator, and the q-rung interval-valued orthopair fuzzy weighted DMSM operator. In addition, some desirable properties and numerical examples of these new operators are given in detail. These new operators have the advantage of considering the interrelationship of arguments and can deal with multiple attribute group decision-making problems with q-rung interval-valued orthopair fuzzy information. Finally, a real-world example of green supplier selection in green supply chain management is provided to demonstrate the proposed approach and to verify its rationality and scientific soundness.
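
For readers unfamiliar with the Maclaurin symmetric mean that the operators above extend, here is a minimal sketch of the classical MSM on non-negative real numbers. It does not cover the q-rung interval-valued orthopair fuzzy versions from the abstract, and the function name and sample values are assumptions.

```python
from itertools import combinations
from math import comb, prod
from typing import Sequence


def maclaurin_symmetric_mean(values: Sequence[float], k: int) -> float:
    """Classical MSM: ((sum of products over all k-subsets) / C(n, k)) ** (1 / k)."""
    n = len(values)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(values)")
    total = sum(prod(subset) for subset in combinations(values, k))
    return (total / comb(n, k)) ** (1.0 / k)


values = [1.0, 2.0, 4.0]  # invented sample arguments
for k in (1, 2, 3):
    print(k, maclaurin_symmetric_mean(values, k))
```

With k = 1 the result is the arithmetic mean and with k = n it is the geometric mean; intermediate k takes products over k-element subsets, which is how the operator captures interrelationships among arguments.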

18.
The simultaneous use of multiple classifiers has been shown to improve performance in classification problems. The selection of an optimal set of classifiers is an important part of multiple classifier systems, and the independence of classifier outputs is generally considered an advantage for obtaining better multiple classifier systems. In this paper, the need for classifier independence is examined from the point of view of classification performance. The performance achieved with classifiers having independent joint distributions is compared to that of other classifiers defined to have the best and worst joint distributions. These distributions are obtained by formulating the combination operation as an optimization problem. The analysis yields several important observations about classifier selection, which are then used to analyze the problem of selecting an additional classifier to be used with an available multiple classifier system.

19.
In this paper, we introduce a new adaptive rule-based classifier for multi-class classification of biological data that addresses several problems of classifying such data: overfitting, noisy instances and class-imbalanced data. It is well known that rules are an effective way of representing data in a human-interpretable form. The proposed rule-based classifier combines the random subspace and boosting approaches with an ensemble of decision trees to construct a set of classification rules without involving global optimisation. The classifier uses the random subspace approach to avoid overfitting, the boosting approach to classify noisy instances, and the ensemble of decision trees to deal with the class-imbalance problem. The classifier uses two popular classification techniques: decision tree and k-nearest-neighbor algorithms. Decision trees are used for evolving classification rules from the training data, while k-nearest-neighbor is used for analysing the misclassified instances and removing vagueness between contradictory rules. It runs a series of k iterations to develop a set of classification rules from the training data and, in a boosting-like manner, pays more attention to the misclassified instances in the next iteration. This paper focuses in particular on producing an optimal ensemble classifier that helps to improve the prediction accuracy of the DNA variant identification and classification task. The performance of the proposed classifier is compared with that of well-established machine learning and data mining algorithms on genomic data (148 exome data sets) of Brugada syndrome and on 10 real benchmark life sciences data sets from the UCI (University of California, Irvine) machine learning repository. The experimental results indicate that the proposed classifier has exemplary classification accuracy on different types of biological data. Overall, the proposed classifier offers good prediction accuracy for the classification of new DNA variants, where noisy and misclassified variants are optimised to increase test performance.

20.
Non-parametric classification procedures based on a certainty measure and the nearest neighbour rule for motor unit potential (MUP) classification during electromyographic (EMG) signal decomposition were explored. A diversity-based classifier fusion approach is developed and evaluated to achieve improved classification performance. The developed system allows the construction of a set of non-parametric base classifiers and then automatically chooses, from the pool of base classifiers, subsets of classifiers to form candidate classifier ensembles. The system selects the classifier ensemble members by exploiting a diversity measure for selecting classifier teams. The kappa statistic is used as the diversity measure to estimate the level of agreement between base classifier outputs, i.e., to measure the degree of decision similarity between base classifiers. The pool of base classifiers consists of two kinds of classifiers, adaptive certainty-based classifiers (ACCs) and adaptive fuzzy k-NN classifiers (AFNNCs), and both utilize different types of features. Once the patterns are assigned to their classes by the classifier fusion system, firing pattern consistency statistics for each class are calculated to detect classification errors in an adaptive fashion. The performance of the developed system was evaluated using real and simulated EMG signals and was compared with the performance of the constituent base classifiers and of the fixed ensemble containing the full set of base classifiers. Across the EMG signal data sets used, the diversity-based classifier fusion approach had better average classification performance overall, especially in terms of reducing classification errors.
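
The abstract above uses the kappa statistic to estimate agreement between the outputs of two base classifiers. The sketch below computes the standard pairwise Cohen's kappa on hard labels. It is offered as an illustration under that assumption: the abstract does not specify the exact variant used or how kappa values are turned into ensemble choices. The function name and sample outputs are invented.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohen_kappa(labels_a: Sequence[Hashable],
                labels_b: Sequence[Hashable]) -> float:
    """Cohen's kappa between the label outputs of two classifiers."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("inputs must be non-empty and of equal length")
    # Observed proportion of samples on which the two outputs agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance from each classifier's label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both classifiers always emit the same single label
    return (observed - expected) / (1.0 - expected)


# Invented outputs of two hypothetical base classifiers.
print(cohen_kappa([0, 1, 0, 1], [0, 1, 0, 1]))  # identical outputs
print(cohen_kappa([0, 0, 1, 1], [0, 1, 0, 1]))  # chance-level agreement
```

A kappa near 1 means the two classifiers make nearly the same decisions, while lower values indicate more diverse outputs.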
