共查询到20条相似文献,搜索用时 0 毫秒
1.
文本分类是根据未知文本的内容将其划分到一个或多个预先定义的类别的过程,是许多基于内容的信息管理任务的重要组成部分.文本分类问题的难点是特征空间的高维性,通常采用特征选择作为降维的重要方法.将属性约简和文本分类的特点相结合,提出了一种基于粗糙集的特征选择算法即改进的快速约简算法.实验表明该算法是有效的,不仅可以降低特征空间的维度,而且能够维持高精度. 相似文献
2.
The dominance-based rough set approach is proposed as a methodology for plunge grinding process diagnosis. The process is analyzed and next its diagnosis is considered as a multi-criteria decision making problem based on the modelling of relationships between different process states and their symptoms using a set of rules induced from measured process data. The development of the diagnostic system is characterized by three phases. Firstly, the process experimental data is prepared in the form of a decision table. Using selected methods of signal processing, each process running is described by 17 process state features (condition attributes) and 5 criteria evaluating process state and results (decision attributes). The semantic correlation between all the attributes is modelled. Next, the phase of condition attributes selection and knowledge extraction are strictly integrated with the phase of the model evaluation using an iterative approach. After each loop of the iterative feature selection procedure the induction of rules is conducted using the VC-DomLEM algorithm. The classification capability of the induced rules is carried out using the leave-one-out method and a set of measures. The classification accuracy of individual models is in the range of 80.77–98.72 %. The induced set of rules constitutes a classifier for an assessment of new process run cases. 相似文献
3.
Feature subset selection is viewed as an important preprocessing step for pattern recognition, machine learning and data mining. Most of researches are focused on dealing with homogeneous feature selection, namely, numerical or categorical features. In this paper, we introduce a neighborhood rough set model to deal with the problem of heterogeneous feature subset selection. As the classical rough set model can just be used to evaluate categorical features, we generalize this model with neighborhood relations and introduce a neighborhood rough set model. The proposed model will degrade to the classical one if we specify the size of neighborhood zero. The neighborhood model is used to reduce numerical and categorical features by assigning different thresholds for different kinds of attributes. In this model the sizes of the neighborhood lower and upper approximations of decisions reflect the discriminating capability of feature subsets. The size of lower approximation is computed as the dependency between decision and condition attributes. We use the neighborhood dependency to evaluate the significance of a subset of heterogeneous features and construct forward feature subset selection algorithms. The proposed algorithms are compared with some classical techniques. Experimental results show that the neighborhood model based method is more flexible to deal with heterogeneous data. 相似文献
4.
Feature selection is viewed as an important preprocessing step for pattern recognition, machine learning and data mining. Traditional hill-climbing search approaches to feature selection have difficulties to find optimal reducts. And the current stochastic search strategies, such as GA, ACO and PSO, provide a more robust solution but at the expense of increased computational effort. It is necessary to investigate fast and effective search algorithms. Rough set theory provides a mathematical tool to discover data dependencies and reduce the number of features contained in a dataset by purely structural methods. In this paper, we define a structure called power set tree (PS-tree), which is an order tree representing the power set, and each possible reduct is mapped to a node of the tree. Then, we present a rough set approach to feature selection based on PS-tree. Two kinds of pruning rules for PS-tree are given. And two novel feature selection algorithms based on PS-tree are also given. Experiment results demonstrate that our algorithms are effective and efficient. 相似文献
5.
Feature selection plays a vital role in many areas of pattern recognition and data mining. The effective computation of feature selection is important for improving the classification performance. In rough set theory, many feature selection algorithms have been proposed to process static incomplete data. However, feature values in an incomplete data set may vary dynamically in real-world applications. For such dynamic incomplete data, a classic (non-incremental) approach of feature selection is usually computationally time-consuming. To overcome this disadvantage, we propose an incremental approach for feature selection, which can accelerate the feature selection process in dynamic incomplete data. We firstly employ an incremental manner to compute the new positive region when feature values with respect to an object set vary dynamically. Based on the calculated positive region, two efficient incremental feature selection algorithms are developed respectively for single object and multiple objects with varying feature values. Then we conduct a series of experiments with 12 UCI real data sets to evaluate the efficiency and effectiveness of our proposed algorithms. The experimental results show that the proposed algorithms compare favorably with that of applying the existing non-incremental methods. 相似文献
6.
On the use of evolutionary feature selection for improving fuzzy rough set based prototype selection
J. Derrac N. Verbiest S. García C. Cornelis F. Herrera 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2013,17(2):223-238
The k-nearest neighbors classifier is a widely used classification method that has proven to be very effective in supervised learning tasks. In this paper, a fuzzy rough set method for prototype selection, focused on optimizing the behavior of this classifier, is presented. The hybridization with an evolutionary feature selection method is considered to further improve its performance, obtaining a competent data reduction algorithm for the 1-nearest neighbors classifier. This hybridization is performed in the training phase, by using the solution of each preprocessing technique as the starting condition of the other one, within a cycle. The results of the experimental study, which have been contrasted through nonparametric statistical tests, show that the new hybrid approach obtains very promising results with respect to classification accuracy and reduction of the size of the training set. 相似文献
7.
专家系统中基于粗集的知识获取、更新与推理 总被引:9,自引:3,他引:9
知识获取、知识更新和不确定性推理是设计专家系统的重要方面。根据粗集理论,提出了一种专家系统的结构模型,该系统在规则获取的基础上,利用系统运行的实例增量式地更新知识库中的规则及其参数,以改善系统的性能,利用知识库中的规则及数量参数进行不确定性推理,得出结论的可信度。 相似文献
8.
In recent years, Reverse Logistics (RL) has been touted as one of the strategies of improving organization performance and generating a competitive advantage. In RL, the generic routing problem has become a focus since it provides a great flexibility in modeling, e.g., selection of suppliers by using a node as a supplier candidate in a network. To date, complicated networks make decision makers hard to search a desired routine. In addition, the traditional network defines and resolves such a problem only at one soot. The solution cannot be acquired from multiple perspectives like minimal cost, minimal delivery time, maximal reliability, and optimal “3Rs”—reduce, reuse, and recycle. In this study, rough set theory is applied to reduce complexity of the RL data sets and induct decision rules. Through incorporating the decision rules, the generic label correcting algorithm is used to solve generic routing problems by integrating various operators and comparators in the GLC algorithm. Consequently, the desired RL suppliers are selected. 相似文献
9.
10.
Joaquín Derrac Chris Cornelis Salvador García Francisco Herrera 《Information Sciences》2012,186(1):73-92
In recent years, fuzzy rough set theory has emerged as a suitable tool for performing feature selection. Fuzzy rough feature selection enables us to analyze the discernibility of the attributes, highlighting the most attractive features in the construction of classifiers. However, its results can be enhanced even more if other data reduction techniques, such as instance selection, are considered.In this work, a hybrid evolutionary algorithm for data reduction, using both instance and feature selection, is presented. A global process of instance selection, carried out by a steady-state genetic algorithm, is combined with a fuzzy rough set based feature selection process, which searches for the most interesting features to enhance both the evolutionary search process and the final preprocessed data set. The experimental study, the results of which have been contrasted through nonparametric statistical tests, shows that our proposal obtains high reduction rates on training sets which greatly enhance the behavior of the nearest neighbor classifier. 相似文献
11.
《Computers & Mathematics with Applications》2006,51(9-10):1507-1518
In this paper, we study the fuzzy reasoning based on a new fuzzy rough set. First, we define a broad family of new lower and upper approximation operators of fuzzy sets between different universes using a set of axioms. Then, based on the approximation operators above, we propose the fuzzy reasoning based on the new fuzzy rough set. By means of the above fuzzy reasoning based on the new fuzzy rough set, for a given premise, we can obtain the fuzzy reasoning consequence expressed by the fuzzy interval constructed by the above two approximations of fuzzy sets. Furthermore, through the defuzzification of the lower and upper approximations, we can get the corresponding two values constructing the interval used as the fuzzy reasoning consequence after defuzzification. Then, from the above interval, a suitable value can be selected as the final reasoning consequence so that some special constraints are satisfied as possibly. At last, we apply the fuzzy reasoning based on the new fuzzy rough set to the scheduling problems, and numerical computational results show that the fuzzy reasoning based on the new fuzzy rough set is more suitable for the scheduling problems compared with the fuzzy reasoning based on the CRI method and the III method. 相似文献
12.
A rough set approach to knowledge reduction based on inclusion degree and evidence reasoning theory 总被引:6,自引:0,他引:6
Abstract: The theory of rough sets is an extension of set theory for studying intelligent systems characterized by insufficient and incomplete information. We discuss the basic concept and properties of knowledge reduction based on inclusion degree and evidence reasoning theory, and propose a knowledge discovery approach based on inclusion degree and evidence reasoning theory. 相似文献
13.
《Expert systems with applications》2014,41(1):92-104
We consider the issue of supplier selection by using rule-based methodology. Supplier Selection (SS) is an important activity in Logistics and Supply Chain Management in today’s global market. It is one of major applications of Multiple Criteria Decision Analysis (MCDA) that concerns about preference-related decision information. The rule-based methodology is proven of its effectiveness in handling preference information and performs well in sorting or ranking alternatives. However, how to utilize them in SS still remains open for more studies. In this paper, we propose a novel Believable Rough Set Approach (BRSA). This approach performs the complete problem-solving procedures including (1) criteria analysis, (2) rough approximation, (3) decision rule induction, and (4) a scheme for rule application. Unlike other rule-based solutions that just extract certain information, the proposed solution additionally extracts valuable uncertain information for rule induction. Due to such mechanism, BRSA outperforms other solutions in evaluation of suppliers. A detailed empirical study is provided for demonstration of decision-making procedures and multiple comparisons with other proposals. 相似文献
14.
Rough set theory is one of the effective methods to feature selection, which can preserve the meaning of the features. The essence of rough set approach to feature selection is to find a subset of the original features. Since finding a minimal subset of the features is a NP-hard problem, it is necessary to investigate effective and efficient heuristic algorithms. Ant colony optimization (ACO) has been successfully applied to many difficult combinatorial problems like quadratic assignment, traveling salesman, scheduling, etc. It is particularly attractive for feature selection since there is no heuristic information that can guide search to the optimal minimal subset every time. However, ants can discover the best feature combinations as they traverse the graph. In this paper, we propose a new rough set approach to feature selection based on ACO, which adopts mutual information based feature significance as heuristic information. A novel feature selection algorithm is also given. Jensen and Shen proposed a ACO-based feature selection approach which starts from a random feature. Our approach starts from the feature core, which changes the complete graph to a smaller one. To verify the efficiency of our algorithm, experiments are carried out on some standard UCI datasets. The results demonstrate that our algorithm can provide efficient solution to find a minimal subset of the features. 相似文献
15.
Multimedia Tools and Applications - Multimedia technologies are increasingly used in design of CNC grinding machines. The self-diagnosis system is a necessary part. To make use of multimedia and... 相似文献
16.
Petroleum is an important strategic material which is connected with the vitals and safety of the national economy, and the supplier selections are related to the safety of petroleum production and supply. However, the traditional approaches for supplier selections are limited in subjective evaluation of weights, inaccurate assessing rules, and inefficient decision-making. Although most of the current methods are widely applied in corporation management, a more efficient approach needs to be proposed for supplier selection of oil enterprise.This paper summarizes the particular characteristics of the supply chain of Chinese petroleum enterprises, analyzes the limitations of the traditional methods of supplier selection, and brought forward the method based on case reasoning system (CBR) for petroleum enterprises. The method based on data mining techniques which solves three key problems of CBR, includes calculating the weights of the attributes with information entropy in case warehouse organizing process objectively, evaluating the similarities with k-prototype clustering between the original and target cases in case retrieving process exactly, and extracting the potential rules with back propagation neural networks from conclusions in maintenance and revising process efficiently. It demonstrates the advantages, practicability and validity of this method via case study finally. 相似文献
17.
18.
Applied Intelligence - With increases in feature dimensions and the emergence of hierarchical class structures, hierarchical feature selection has become an important data preprocessing step in... 相似文献
19.
Pattern Analysis and Applications - Multi-label feature selection has been essential in many big data applications and plays a significant role in processing high-dimensional data. However, the... 相似文献
20.
Software cost estimation is one of the most crucial activities in software development process. In the past decades, many methods have been proposed for cost estimation. Case based reasoning (CBR) is one of these techniques. Feature selection is an important preprocessing stage of case based reasoning. Most existing feature selection methods of case based reasoning are ‘wrappers’ which can usually yield high fitting accuracy at the cost of high computational complexity and low explanation of the selected features. In our study, the mutual information based feature selection (MICBR) is proposed. This approach hybrids both ‘wrapper’ and ‘filter’ mechanism which is another kind of feature selector with much lower complexity than wrappers, and the features selected by filters are likely to be generalized to other conditions. The MICBR is then compared with popular feature selectors and the published works. The results show that the MICBR is an effective feature selector for case based reasoning by overcoming some of the limitations and computational complexities of other feature selection techniques in the field. 相似文献