首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The variable precision rough sets model (VPRS) along with many derivatives of rough set theory (RST) necessitates a number of stages towards the final classification of objects. These include, (i) the identification of subsets of condition attributes (β-reducts in VPRS) which have the same quality of classification as the whole set, (ii) the construction of sets of decision rules associated with the reducts and (iii) the classification of the individual objects by the decision rules. The expert system exposited here offers a decision maker (DM) the opportunity to fully view each of these stages, subsequently empowering an analyst to make choices during the analysis. Its particular innovation is the ability to visually present available β-reducts, from which the DM can make their selection, a consequence of their own reasons or expectations of the analysis undertaken. The practical analysis considered here is applied on a real world application, the credit ratings of large banks and investment companies in Europe and North America. The snapshots of the expert system presented illustrate the variation in results from the ‘asymmetric’ consequences of the choice of β-reducts considered.  相似文献   

2.
Qinghua Hu  Jinfu Liu  Daren Yu 《Knowledge》2008,21(4):294-304
Feature subset selection presents a common challenge for the applications where data with tens or hundreds of features are available. Existing feature selection algorithms are mainly designed for dealing with numerical or categorical attributes. However, data usually comes with a mixed format in real-world applications. In this paper, we generalize Pawlak’s rough set model into δ neighborhood rough set model and k-nearest-neighbor rough set model, where the objects with numerical attributes are granulated with δ neighborhood relations or k-nearest-neighbor relations, while objects with categorical features are granulated with equivalence relations. Then the induced information granules are used to approximate the decision with lower and upper approximations. We compute the lower approximations of decision to measure the significance of attributes. Based on the proposed models, we give the definition of significance of mixed features and construct a greedy attribute reduction algorithm. We compare the proposed algorithm with others in terms of the number of selected features and classification performance. Experiments show the proposed technique is effective.  相似文献   

3.
Nowadays the rough set method is receiving increasing attention in remote sensing classification although one of the major drawbacks of the method is that it is too sensitive to the spectral confusion between-class and spectral variation within-class. In this paper, a novel remote sensing classification approach based on variable precision rough sets (VPRS) is proposed by relaxing subset operators through the inclusion error β. The remote sensing classification algorithm based on VPRS includes three steps: (1) spectral and textural information (or other input data) discretization, (2) feature selection, and (3) classification rule extraction. The new method proposed here is tested with Landsat-5 TM data. The experiment shows that admitting various inclusion errors β, can improve classification performance including feature selection and generalization ability. The inclusion of β also prevents the overfitting to the training data. With the inclusion of β, higher classification accuracy is obtained. When β=0 (i.e., the original rough set based classifier), overfitting to the training data occurs, with the overall accuracy=0.6778 and unrecognizable percentage=12%. When β=0.07, the highest classification performance is reached with overall accuracy and unrecognizable percentage up to 0.8873% and 2.6%, respectively.  相似文献   

4.
This paper introduces a new technique in the investigation of limited-dependent variable models. This paper illustrates that variable precision rough set theory (VPRS), allied with the use of a modern method of classification, or discretisation of data, can out-perform the more standard approaches that are employed in economics, such as a probit model. These approaches and certain inductive decision tree methods are compared (through a Monte Carlo simulation approach) in the analysis of the decisions reached by the UK Monopolies and Mergers Committee. We show that, particularly in small samples, the VPRS model can improve on more traditional models, both in-sample, and particularly in out-of-sample prediction. A similar improvement in out-of-sample prediction over the decision tree methods is also shown.Scope and purposeThe Monopolies and Mergers Commission (MMC) in the UK evaluates whether a given firm, or set of firms is behaving in a manner that is considered to be against the public interest, that is anti-competitive. The interpretation and prediction of the decisions made by the MMC is of importance to firm's possible future investment plans. Through the construction of decision rules using the variable precision rough sets (VPRS) model this interoperation and prediction is able to be undertaken. The importance of the concomitant variables in the decisions made is shown through a ‘leave n out’ Monte Carlo simulation approach. At the technical level this study illustrates one of the first applications of VPRS in an economic environment.  相似文献   

5.
Unsupervised Rough Set Classification Using GAs   总被引:10,自引:1,他引:9  
  相似文献   

6.
基于优势关系的变精度粗糙集模型将传统粗糙集中的等价关系扩展为优势关系,并结合变精度的思想来定义相关概念,从而可以处理具有偏好关系的信息并具有一定的容错能力。然而,传统优势关系的定义仍然过于严格,只有当一个对象x的每个属性值都优于另一个对象y时,该对象x才优于y。当属性个数较多时,这种优势关系的定义会导致对象的优势集偏小,影响到规则的提取和决策结果。为了解决这一问题,通过引入参数的方法扩展了传统优势关系的定义,并在此基础上进一步给出了扩展后的优势集和近似集的概念,建立了扩展优势关系下的变精度粗糙集模型,采用覆盖率和测试精度作为模型的评估指标。最后给出算例,并在UCI数据集上进行大量的实验将所提模型与传统优势关系下的变精度粗糙集模型进行比较。  相似文献   

7.
The variable precision rough set (VPRS) model extends the basic rough set (RS) theory with finite uni- verses and finite evaluative measures. VPRS is concerned with the equivalence and the contained relationship between two sets. In incompatible information systems, the inclusion degree and β upper (lower) approximation of the inconsistent equivalence class to the decision equivalence classes may be affected by the variable precision. The analysis of an example of incompatible decision table shows that there is a critical point in β available-values region. In the new β range limited at the critical point, the incompatible decision table can be converted to the coordination decision table reliably. The method and its algorithm implement are introduced for the critical value search. The examples of the inconsistent equivalence class transformation are exhibited. The results illustrate that this algorithm is rational and precise.  相似文献   

8.
9.
将粗糙集理论中属性重要度和依赖度的概念与分级聚类离散化算法相结合,提出了一种纳税人连续型属性动态的离散化算法。首先将纳税数据对象的每个连续型属性划分为2类,然后利用粗糙集理论计算每个条件属性对于决策属性的重要度,再通过重要度由大至小排序进行增类运算,最后将保持与原有数据对象集依赖度一致的分类结果输出。该算法能够动态地对数据对象进行类别划分,实现纳税人连续型属性的离散化。通过采用专家分析和关联分析的实验结果,验证了该算法具有较高的纳税人连续型属性离散化精度和性能。  相似文献   

10.
Rough sets theory has proved to be a useful mathematical tool for classification and prediction. However, as many real‐world problems deal with ordering objects instead of classifying objects, one of the extensions of the classical rough sets approach is the dominance‐based rough sets approach, which is mainly based on substitution of the indiscernibility relation by a dominance relation. In this article, we present a dominance‐based rough sets approach to reasoning in incomplete ordered information systems. The approach shows how to find decision rules directly from an incomplete ordered decision table. We propose a reduction of knowledge that eliminates only that information that is not essential from the point of view of the ordering of objects or decision rules. © 2005 Wiley Periodicals, Inc. Int J Int Syst 20: 13–27, 2005.  相似文献   

11.
Probabilistic Decision Tables in the Variable Precision Rough Set Model   总被引:5,自引:0,他引:5  
The Variable Precision Rough Set Model (VPRS) is an extension of the original rough set model. This extension is directed towards deriving decision table-based predictive models from data with parametrically adjustable degrees of accuracy. The imprecise nature of such models leads to quite significant modification of the classical notion of decision table. This is accomplished by introducing the idea of approximation region-based, or probabilistic decision table which is a tabular specification of three, in general uncertain, disjunctive decision rules corresponding to rough approximation regions: positive, boundary and negative regions. The focus of the paper is on the extraction of such decision tables from data, their relationship to conjunctive rules and probabilistic assessment of decision confidence with such rules.  相似文献   

12.
陈健  赵跃龙 《微机发展》2008,18(3):203-206
粗糙集理论是一种处理模糊和不确定知识的一种新型数学工具,在很多领域取得了成功的应用。但是经典粗糙集理论处理的分类必须是完全正确的,在实际应用中,缺乏对噪声数据的适应能力,为了克服这个缺点,提出一种变精度的粗糙集模型,以适应实际应用的需要。对变精度粗糙集理论的数据预处理、属性约简、值约简和规则提取等问题进行了分析和研究,提出属性约简算法和基于求核值属性的归纳值约简算法,并将其运用于医疗系统的手术诊断数据表的数据挖掘分析过程中,所得到的实验结果与专家诊断结果基本吻合,取得了较好的实际应用效果。  相似文献   

13.
A rough set theory is a new mathematical tool to deal with uncertainty and vagueness of decision system and it has been applied successfully in all the fields. It is used to identify the reduct set of the set of all attributes of the decision system. The reduct set is used as preprocessing technique for classification of the decision system in order to bring out the potential patterns or association rules or knowledge through data mining techniques. Several researchers have contributed variety of algorithms for computing the reduct sets by considering different cases like inconsistency, missing attribute values and multiple decision attributes of the decision system. This paper focuses on the review of the techniques for dimensionality reduction under rough set theory environment. Further, the rough sets hybridization with fuzzy sets, neural network and metaheuristic algorithms have also been reviewed. The performance analysis of the algorithms has been discussed in connection with the classification.  相似文献   

14.
Credit scoring is the term used to describe methods utilized for classifying applicants for credit into classes of risk. This paper evaluates two induction approaches, rough sets and decision trees, as techniques for classifying credit (business) applicants. Inductive learning methods, like rough sets and decision trees, have a better knowledge representational structure than neural networks or statistical procedures because they can be used to derive production rules. If decision trees have already been used for credit granting, the rough sets approach is rarely utilized in this domain. In this paper, we use production rules obtained on a sample of 1102 business loans in order to compare the classification abilities of the two techniques. We show that decision trees obtain better results with 87.5% of good classifications with a pruned tree, against 76.7% for rough sets. However, decision trees make more type–II errors than rough sets, but fewer type–I errors.  相似文献   

15.
This study proposes a technique based upon Fuzzy C-Means (FCM) classification theory and related fuzzy theories for choosing an appropriate value of the Variable Precision Rough Set (VPRS) threshold parameter (β) when applied to the classification of continuous information systems. The VPRS model is then combined with a moving Average Autoregressive Exogenous (ARX) prediction model and Grey Systems theory to create an automatic stock market forecasting and portfolio selection mechanism. In the proposed mechanism, financial data are collected automatically every quarter and are input to an ARX prediction model to forecast the future trends of the collected data over the next quarter or half-year period. The forecast data are then reduced using a GM(1, N) model, classified using a FCM clustering algorithm, and then supplied to a VPRS classification module which selects appropriate investment stocks in accordance with a pre-determined set of decision-making rules. Finally, a grey relational analysis technique is employed to weight the selected stocks in such a way as to maximize the rate of return of the stock portfolio. The validity of the proposed approach is demonstrated using electronic stock data extracted from the financial database maintained by the Taiwan Economic Journal (TEJ). The portfolio results obtained using the proposed hybrid model are compared with those obtained using a Rough Set (RS) selection model. The effects of the number of attributes of the RS lower approximation set and VPRS β-lower approximation set on the classification are systematically examined and compared. Overall, the results show that the proposed stock forecasting and stock selection mechanism not only yields a greater number of selected stocks in the β-lower approximation set than in the RS approximation set, but also yields a greater rate of return.  相似文献   

16.
Rough Sets Theory is often applied to the task of classification and prediction, in which objects are assigned to some pre-defined decision classes. When the classes are preference-ordered, the process of classification is referred to as sorting. To deal with the specificity of sorting problems an extension of the Classic Rough Sets Approach, called the Dominance-based Rough Sets Approach, was introduced. The final result of the analysis is a set of decision rules induced from what is called rough approximations of decision classes. The main role of the induced decision rules is to discover regularities in the analyzed data set, but the same rules, when combined with a particular classification method, may also be used to classify/sort new objects (i.e. to assign the objects to appropriate classes). There exist many different rule induction strategies, including induction of an exhaustive set of rules. This strategy produces the most comprehensive knowledge base on the analyzed data set, but it requires a considerable amount of computing time, as the complexity of the process is exponential. In this paper we present a shortcut that allows classifying new objects without generating the rules. The presented approach bears some resemblance to the idea of lazy learning.  相似文献   

17.
The fuzzy rough set (FRS) model has been introduced to handle databases with real values. However, FRS was sensitive to misclassification and perturbation (here misclassification means error or missing values in classification, and perturbation means small changes of numerical data). The variable precision rough sets (VPRSs) model was introduced to handle databases with misclassification. However, it could not effectively handle the real-valued datasets. Now, it is valuable from theoretical and practical aspects to combine FRS and VPRS so that a powerful tool, which not only can handle numerical data but also is less sensitive to misclassification and perturbation, can be developed. In this paper, we set up a model named fuzzy VPRSs (FVPRSs) by combining FRS and VPRS with the goal of making FRS a special case. First, we study the knowledge representation ways of FRS and VPRS, and then, propose the set approximation operators of FVPRS. Second, we employ the discernibility matrix approach to investigate the structure of attribute reductions in FVPRS and develop an algorithm to find all reductions. Third, in order to overcome the NP-complete problem of finding all reductions, we develop some fast heuristic algorithms to obtain one near-optimal attribute reduction. Finally, we compare FVPRS with RS, FRS, and several flexible RS-based approaches with respect to misclassification and perturbation. The experimental comparisons show the feasibility and effectiveness of FVPRS.  相似文献   

18.
Rough sets for adapting wavelet neural networks as a new classifier system   总被引:2,自引:2,他引:0  
Classification is an important theme in data mining. Rough sets and neural networks are two techniques applied to data mining problems. Wavelet neural networks have recently attracted great interest because of their advantages over conventional neural networks as they are universal approximations and achieve faster convergence. This paper presents a hybrid system to extract efficiently classification rules from decision table. The neurons of such hybrid network instantiate approximate reasoning knowledge gleaned from input data. The new model uses rough set theory to help in decreasing the computational effort needed for building the network structure by using what is called reduct algorithm and a rules set (knowledge) is generated from the decision table. By applying the wavelets, frequencies analysis, rough sets and dynamic scaling in connection with neural network, novel and reliable classifier architecture is obtained and its effectiveness is verified by the experiments comparing with traditional rough set and neural networks approaches.  相似文献   

19.
Abstract: Machine learning can extract desired knowledge from training examples and ease the development bottleneck in building expert systems. Most learning approaches derive rules from complete and incomplete data sets. If attribute values are known as possibility distributions on the domain of the attributes, the system is called an incomplete fuzzy information system. Learning from incomplete fuzzy data sets is usually more difficult than learning from complete data sets and incomplete data sets. In this paper, we deal with the problem of producing a set of certain and possible rules from incomplete fuzzy data sets based on rough sets. The notions of lower and upper generalized fuzzy rough approximations are introduced. By using the fuzzy rough upper approximation operator, we transform each fuzzy subset of the domain of every attribute in an incomplete fuzzy information system into a fuzzy subset of the universe, from which fuzzy similarity neighbourhoods of objects in the system are derived. The fuzzy lower and upper approximations for any subset of the universe are then calculated and the knowledge hidden in the information system is unravelled and expressed in the form of decision rules.  相似文献   

20.
以构建电子商务系统中的本体为出发点,分析现有的本体构建技术中存在的缺陷。针对这些不足,综合考虑变精度粗糙集模型和形式概念分析的相关理论,提出基于粗概念格模型来构建本体。将变精度粗糙集的β选取算法和可辨识矩阵属性约简算法进行了改进,使β 上、下分布的约简方法适用于形式背景的约简,从而提出基于变精度粗糙集的概念格约减算法;然后计算语义概念相似度,并以联合国标准产品与服务分类代码的本体元模型为核心本体,结合领域专家知识,建立电子商务领域本体模型。实验表明了粗概念格构建本体的高效性。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号