首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
In this paper, a review on condition monitoring of induction motors is first presented. Then, an ensemble of hybrid intelligent models that is useful for condition monitoring of induction motors is proposed. The review covers two parts, i.e., (i) a total of nine commonly used condition monitoring methods of induction motors; and (ii) intelligent learning models for condition monitoring of induction motors subject to single and multiple input signals. Based on the review findings, the Motor Current Signature Analysis (MCSA) method is selected for this study owing to its online, non-invasive properties and its requirement of only single input source; therefore leading to a cost-effective condition monitoring method. A hybrid intelligent model that consists of the Fuzzy Min–Max (FMM) neural network and the Random Forest (RF) model comprising an ensemble of Classification and Regression Trees is developed. The majority voting scheme is used to combine the predictions produced by the resulting FMM–RF ensemble (or FMM–RFE) members. A benchmark problem is first deployed to evaluate the usefulness of the FMM–RFE model. Then, the model is applied to condition monitoring of induction motors using a set of real data samples. Specifically, the stator current signals of induction motors are obtained using the MCSA method. The signals are processed to produce a set of harmonic-based features for classification using the FMM–RFE model. The experimental results show good performances in both noise-free and noisy environments. More importantly, a set of explanatory rules in the form of a decision tree can be extracted from the FMM–RFE model to justify its predictions. The outcomes ascertain the effectiveness of the proposed FMM–RFE model in undertaking condition monitoring tasks, especially for induction motors, under different environments.  相似文献   

2.
Traffic flow classification to identify applications and activity of users is widely studied both to understand privacy threats and to support network functions such as usage policies and QoS. For those needs, real time classification is required and classifier’s complexity is as important as accuracy, especially given the increasing link speeds also in the access section of the network. We propose the application of a highly efficient classification system, specifically Min–Max neuro-fuzzy networks trained by PARC algorithm, and compare it with popular classification systems, by considering traffic data sets collected in different epochs and places. We show that Min–Max networks achieve high accuracy, in line with the best performing algorithms on Weka (SVM, Random Tree, Random Forest). The required classification model complexity is much lower with Min–Max networks with respect to the other models, enabling the implementation of effective classification algorithms in real time on inexpensive platforms.  相似文献   

3.
In this paper, classification of ball bearing faults using vibration signals is presented. A review of condition monitoring using vibration signals with various intelligent systems is first presented. A hybrid intelligent model, FMM-RF, consisting of the Fuzzy Min-Max (FMM) neural network and the Random Forest (RF) model, is proposed. A benchmark problem is tested to evaluate the practicality of the FMM-RF model. The proposed model is then applied to a real-world dataset. In both cases, power spectrum and sample entropy features are used for classification. Results from both experiments show good accuracy achieved by the proposed FMM-RF model. In addition, a set of explanatory rules in the form of a decision tree is extracted to justify the predictions. The outcomes indicate the usefulness of FMM-RF in performing classification of ball bearing faults.  相似文献   

4.
In this paper, a hybrid soft computing model comprising the Fuzzy Min-Max (FMM) neural network and the Classification and Regression Tree (CART) for motor fault detection and diagnosis is described. Specifically, the hybrid model, known as FMM-CART, is used to detect and classify fault conditions of induction motors in both offline and online environments. A series of experiments is conducted, whereby the Motor Current Signature Analysis (MCSA) method is applied to form a database containing stator current signatures under different motor conditions. The signal harmonics from the power spectral density (PSD) are extracted, and used as the discriminative input features for fault classification with FMM-CART. Three main induction motor conditions, viz. broken rotor bars, stator winding faults, and unbalanced supply, are used to evaluate the effectiveness of FMM-CART. The results indicate that FMM-CART is able to detect motor faults in the early stage, in order to avoid further damage to the induction motor as well as the overall machine or system that uses the motor in its operations.  相似文献   

5.
The aim of this article is to construct a practical intrusion detection system (IDS) that properly analyses the statistics of network traffic pattern and classify them as normal or anomalous class. The objective of this article is to prove that the choice of effective network traffic features and a proficient machine-learning paradigm enhances the detection accuracy of IDS. In this article, a rule-based approach with a family of six decision tree classifiers, namely Decision Stump, C4.5, Naive Baye's Tree, Random Forest, Random Tree and Representative Tree model to perform the detection of anomalous network pattern is introduced. In particular, the proposed swarm optimisation-based approach selects instances that compose training set and optimised decision tree operate over this trained set producing classification rules with improved coverage, classification capability and generalisation ability. Experiment with the Knowledge Discovery and Data mining (KDD) data set which have information on traffic pattern, during normal and intrusive behaviour shows that the proposed algorithm produces optimised decision rules and outperforms other machine-learning algorithm.  相似文献   

6.
In this paper, a hybrid online learning model that combines the fuzzy min–max (FMM) neural network and the Classification and Regression Tree (CART) for motor fault detection and diagnosis tasks is described. The hybrid model, known as FMM-CART, incorporates the advantages of both FMM and CART for undertaking data classification (with FMM) and rule extraction (with CART) problems. In particular, the CART model is enhanced with an importance predictor-based feature selection measure. To evaluate the effectiveness of the proposed online FMM-CART model, a series of experiments using publicly available data sets containing motor bearing faults is first conducted. The results (primarily prediction accuracy and model complexity) are analyzed and compared with those reported in the literature. Then, an experimental study on detecting imbalanced voltage supply of an induction motor using a laboratory-scale test rig is performed. In addition to producing accurate results, a set of rules in the form of a decision tree is extracted from FMM-CART to provide explanations for its predictions. The results positively demonstrate the usefulness of FMM-CART with online learning capabilities in tackling real-world motor fault detection and diagnosis tasks.  相似文献   

7.
The use of data mining approaches for analyzing patients trace in different medical databases has become an important research field especially with the evolution of these methods and their contributions in medical decision support. In this paper, we develop a new clinical decision support system (CDSS) to diagnose Coronary Artery Diseases (CAD). According to CAD experts, Angiography is most accurate CAD diagnosis technique. However, it has many aftereffects and is very costly. Existing studies showed that CAD diagnosis requires heterogeneous patients traces from medical history while applying data mining techniques to achieve high accuracy. In this paper, an automatic approach to design CDSS for CAD assessment is proposed. The proposed diagnosis model is based on Random Forest algorithm, C5.0 decision tree algorithm and Fuzzy modeling. It consists of two stages: first, Random Forest algorithm is used to rank the features and a C5.0 decision tree based approach for crisp rule generation is developed. Then, we created the fuzzy inference system. The generation of fuzzy weighted rules is carried out automatically from the previous crisp rules. Moreover, a critical issue about the CDSS is that some values of the features are missing in most cases. A new method to deal with the problem of missing data, which allows evaluating the similarity despite the missing information, was proposed. Finally, experimental results underscore very promising classification accuracy of 90.50% while optimizing training time using UCI (the University of California at Irvine) heart diseases datasets compared to the previously reported results.  相似文献   

8.
学习样本的质量和数量对于智能数据分类系统至关重要,但在数据分类系统中没有一个通用的良好方法用于发现有意义的样本。以此为动机,提出数据集合凸边界的概念,给出了快速发现有意义样本集合的方法。首先,利用箱型函数对学习样本集合中的异常和特征不全样本进行清洗;接着,提出数据锥的概念,对归一化的学习样本进行锥形分割;最后,对每个锥形样本子集进行中心化,以凸边界为基础提取距离凸边界差异极小的样本构成凸边界样本集合。实验在12个UCI数据集上进行,并与高斯朴素贝叶斯(GNB)、决策树(CART)、线性判别分析(LDA)、提升算法(AdaBoost)、随机森林(RF)和逻辑回归(LR)这六种经典的数据分类算法进行对比。结果表明,各个算法在凸边界样本集合的训练时间显著缩短,同时保持了分类性能。特别地,对包含噪声数据较多的数据集,如剖腹产、电网稳定性、汽车评估等数据集,凸边界样本集合能使分类性能得到提升。为了更好地评价凸边界样本集合的效率,以样本变化率和分类性能变化率的比值定义了样本清洗效率,并用该指标来客观评价凸边界样本的意义。清洗效率大于1时说明方法有效,且数值越高效果越好。在脉冲星数据集合上,所提方法对GNB算法的清洗效率超过68,说明所提方法性能优越。  相似文献   

9.
针对现有汉语重音检测方法正确率较低的问题,利用声学、词典和语法相关特征的不同分类器组合,基于Boosting分类回归树+条件随机场的互补模型,提出一种改进的汉语重音检测方法.在ASCCD语料库上的实验结果表明,该方法能获得84.9%的重音检测正确率,相比基于神经网络+决策树的基线系统提高2.7%.  相似文献   

10.
Parallel Formulations of Decision-Tree Classification Algorithms   总被引:5,自引:0,他引:5  
Classification decision tree algorithms are used extensively for data mining in many domains such as retail target marketing, fraud detection, etc. Highly parallel algorithms for constructing classification decision trees are desirable for dealing with large data sets in reasonable amount of time. Algorithms for building classification decision trees have a natural concurrency, but are difficult to parallelize due to the inherent dynamic nature of the computation. In this paper, we present parallel formulations of classification decision tree learning algorithm based on induction. We describe two basic parallel formulations. One is based on Synchronous Tree Construction Approach and the other is based on Partitioned Tree Construction Approach. We discuss the advantages and disadvantages of using these methods and propose a hybrid method that employs the good features of these methods. We also provide the analysis of the cost of computation and communication of the proposed hybrid method. Moreover, experimental results on an IBM SP-2 demonstrate excellent speedups and scalability.  相似文献   

11.
Rough sets for adapting wavelet neural networks as a new classifier system   总被引:2,自引:2,他引:0  
Classification is an important theme in data mining. Rough sets and neural networks are two techniques applied to data mining problems. Wavelet neural networks have recently attracted great interest because of their advantages over conventional neural networks as they are universal approximations and achieve faster convergence. This paper presents a hybrid system to extract efficiently classification rules from decision table. The neurons of such hybrid network instantiate approximate reasoning knowledge gleaned from input data. The new model uses rough set theory to help in decreasing the computational effort needed for building the network structure by using what is called reduct algorithm and a rules set (knowledge) is generated from the decision table. By applying the wavelets, frequencies analysis, rough sets and dynamic scaling in connection with neural network, novel and reliable classifier architecture is obtained and its effectiveness is verified by the experiments comparing with traditional rough set and neural networks approaches.  相似文献   

12.
Database management systems are very sophisticated, efficient, and fast in information retrieval tasks involving traditional data sets such as numbers, strings, and so on, but many limitations become evident when the data are more complex, that is, high or nondimensional data. Considering some existing problems in information retrieval processes, this work proposes a hybrid system that combines a model of the ART family neural network, ART2‐A, with the Slim‐Tree data structure, which is a metric access method. This approach is an alternative to perform clustering on data in an intelligent way so that the data can be recovered from the corresponding Slim‐Tree. The proposed hybrid system is able to perform range and k‐nearest neighbor queries, which is not an inherent characteristic in implementations involving artificial neural networks. Furthermore, experimental results showed that the performance of the hybrid system was better than the performance of Slim‐Tree. © 2007 Wiley Periodicals, Inc. Int J Int Syst 22: 319–336, 2007.  相似文献   

13.
The objective of this paper is to construct a lightweight Intrusion Detection System (IDS) aimed at detecting anomalies in networks. The crucial part of building lightweight IDS depends on preprocessing of network data, identifying important features and in the design of efficient learning algorithm that classify normal and anomalous patterns. Therefore in this work, the design of IDS is investigated from these three perspectives. The goals of this paper are (i) removing redundant instances that causes the learning algorithm to be unbiased (ii) identifying suitable subset of features by employing a wrapper based feature selection algorithm (iii) realizing proposed IDS with neurotree to achieve better detection accuracy. The lightweight IDS has been developed by using a wrapper based feature selection algorithm that maximizes the specificity and sensitivity of the IDS as well as by employing a neural ensemble decision tree iterative procedure to evolve optimal features. An extensive experimental evaluation of the proposed approach with a family of six decision tree classifiers namely Decision Stump, C4.5, Naive Baye’s Tree, Random Forest, Random Tree and Representative Tree model to perform the detection of anomalous network pattern has been introduced.  相似文献   

14.
三支决策基于代价敏感,通过引入延迟决策,在信息不完备的情况下,能够使分类更加合理。考虑具有混合属性特征的决策信息系统优化决策问题,在混合属性信息系统上定义了邻域关系,构建了基于邻域关系的决策粗糙集模型。在此基础上将其应用于痛风临床诊断决策问题,运用多次迭代学习的方法对痛风数据进行分类。与SVM(Support Vector Machine)、RF(Random Forest)、LR(Logistic Regression)分类算法进行对比,证明了该方法的优越性。根据分类结果发现因素之间的内在联系,获取分类规则,探究痛风与肝功、肾功、血脂、血糖的相关性,为痛风成因研究和诊断治疗提供知识支持和决策支持。  相似文献   

15.
The Cloud system shows its growing functionalities in various industrial applications. The safety towards data transfer seems to be a threat where Network Intrusion Detection System (NIDS) is measured as an essential element to fulfill security. Recently, Machine Learning (ML) approaches have been used for the construction of intellectual IDS. Most IDS are based on ML techniques either as unsupervised or supervised. In supervised learning, NIDS is based on labeled data where it reduces the efficiency of the reduced model to identify attack patterns. Similarly, the unsupervised model fails to provide a satisfactory outcome. Hence, to boost the functionality of unsupervised learning, an effectual auto-encoder is applied for feature selection to select good features. Finally, the Naïve Bayes classifier is used for classification purposes. This approach exposes the finest generalization ability to train the data. The unlabelled data is also used for adoption towards data analysis. Here, redundant and noisy samples over the dataset are eliminated. To validate the robustness and efficiency of NIDS, the anticipated model is tested over the NSL-KDD dataset. The experimental outcomes demonstrate that the anticipated approach attains superior accuracy with 93%, which is higher compared to J48, AB tree, Random Forest (RF), Regression Tree (RT), Multi-Layer Perceptrons (MLP), Support Vector Machine (SVM), and Fuzzy. Similarly, False Alarm Rate (FAR) and True Positive Rate (TPR) of Naive Bayes (NB) is 0.3 and 0.99, respectively. When compared to prevailing techniques, the anticipated approach also delivers promising outcomes.  相似文献   

16.
Today, feature selection is an active research in machine learning. The main idea of feature selection is to choose a subset of available features, by eliminating features with little or no predictive information, as well as redundant features that are strongly correlated. There are a lot of approaches for feature selection, but most of them can only work with crisp data. Until now there have not been many different approaches which can directly work with both crisp and low quality (imprecise and uncertain) data. That is why, we propose a new method of feature selection which can handle both crisp and low quality data. The proposed approach is based on a Fuzzy Random Forest and it integrates filter and wrapper methods into a sequential search procedure with improved classification accuracy of the features selected. This approach consists of the following main steps: (1) scaling and discretization process of the feature set; and feature pre-selection using the discretization process (filter); (2) ranking process of the feature pre-selection using the Fuzzy Decision Trees of a Fuzzy Random Forest ensemble; and (3) wrapper feature selection using a Fuzzy Random Forest ensemble based on cross-validation. The efficiency and effectiveness of this approach is proved through several experiments using both high dimensional and low quality datasets. The approach shows a good performance (not only classification accuracy, but also with respect to the number of features selected) and good behavior both with high dimensional datasets (microarray datasets) and with low quality datasets.  相似文献   

17.
遥感图像决策树分类器研究与实现   总被引:6,自引:0,他引:6  
针对传统分类方法在处理空间特征分布极为复杂的数据时效果不佳的缺点[1],结合“分层思想”的树分类技术,将广泛用于数据挖掘模型中的CART决策树算法应用到遥感影像分类中,具有更好的弹性和鲁棒性,且分类结构简单明了,达到了更好地分类效果。并以VC 6.0作为开发工具,设计了一种特殊的数据结构,实现了该分类系统。实践表明,该系统具有很好的稳定性和交互性,实用性较强。  相似文献   

18.
Imperfect information inevitably appears in real situations for a variety of reasons. Although efforts have been made to incorporate imperfect data into classification techniques, there are still many limitations as to the type of data, uncertainty, and imprecision that can be handled. In this paper, we will present a Fuzzy Random Forest ensemble for classification and show its ability to handle imperfect data into the learning and the classification phases. Then, we will describe the types of imperfect data it supports. We will devise an augmented ensemble that can operate with others type of imperfect data: crisp, missing, probabilistic uncertainty, and imprecise (fuzzy and crisp) values. Additionally, we will perform experiments with imperfect datasets created for this purpose and datasets used in other papers to show the advantage of being able to express the true nature of imperfect information.  相似文献   

19.
入侵检测系统是网络安全体系的重要组成部分, 本文针对当前入侵检测系统普遍存在的误报、漏报和缺乏自适应性问题,采用ODM的分类算法中的--决策树分类算法、支持向量机分类算法、朴素贝叶斯算法和二元变量逻辑回归算法等4种重点技术对其实验数据进行模型建立和测试,通过对实验的总结分析,在通用的入侵检测模型的基础上建立了一个以数据库为核心的入侵检测模型,并阐述了模型的构成和工作流程.  相似文献   

20.
Building performance has been shown to degrade significantly after commissioning, resulting in increased energy consumption and associated greenhouse gas emissions. Fault Detection and Diagnosis (FDD) protocols using existing sensor networks and IoT devices have the potential to minimize this waste by continually identifying system degradation and re-tuning control strategies to adapt to real building performance. Due to its significant contribution to greenhouse gas emissions, the performance of gas boiler systems for building heating is critical. A review of boiler performance studies has been used to develop a set of common faults and degraded performance conditions, which have been integrated into a MATLAB/Simscape emulator. This resulted in a labeled dataset with approximately 10,000 simulations of steady-state performance for each of 14 non-condensing boilers. The collected data is used for training and testing fault classification using K-nearest neighbour, Decision tree, Random Forest, and Support Vector Machines. The results show that the Decision Tree, Random Forest, and Support Vector Machines method provide high prediction accuracy, consistently exceeding 95%, and generalization across multiple boilers is not possible due to low classification accuracy.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号