共查询到20条相似文献,搜索用时 0 毫秒
1.
具有数据挖掘功能的电力远程监测管理系统 总被引:1,自引:0,他引:1
针对电力远程监测管理系统中参数众多和信息量巨大的特点,提出了一种具有数据挖掘功能的系统,从监测数据库的海量信息中提取隐含的事先未知的潜在有用信息,作为对系统进行有效管理的依据.介绍了系统的监测原理、硬件结构和软件流程.实际应用表明,这种自动测试系统能够实时监测电压和电流等状态信息,分析其内在联系从而达到准确监测、控制和自我调整的目的. 相似文献
2.
Chao-Hui LeeJessie Chia-Yu Chen Vincent S. Tseng 《Computer methods and programs in biomedicine》2011,101(1):44-61
Chronic asthmatic sufferers need to be constantly observed to prevent sudden attacks. In order to improve the efficiency and effectiveness of patient monitoring, we proposed in this paper a novel data mining mechanism for predicting attacks of chronic diseases by considering of both bio-signals of patients and environmental factors. We proposed two data mining methods, namely Pattern Based Decision Tree (PBDT) and Pattern Based Class-Association Rule (PBCAR). Both methods integrate the concepts of sequential pattern mining to extract features of asthma attacks, and then build classifiers with the concepts of decision tree mining and rule-based method respectively. Besides the general clinical data of patients, we considered environmental factors, which are related to many chronic diseases. For experimental evaluations, we adopted the children asthma allergic dataset collated from a hospital in Taiwan as well as the environmental factors like weather and air pollutant data. The experimental results show that PBCAR delivers 86.89% of accuracy and 84.12% of recall, and PBDT shows 87.52% accuracy and 85.59 of recall. These results also indicate that our methods can perform high accuracy and recall on predictions of chronic disease attacks. The readable rules of both classifiers can provide patients and healthcare workers with insights on essential illness related information. At the same time, additional environmental factors of input data are also proven to be valuable in predicting attacks. 相似文献
3.
Smart meter monitoring and data mining techniques for predicting refrigeration system performance 总被引:1,自引:0,他引:1
《Expert systems with applications》2014,41(5):2144-2156
A major challenge in many countries is providing sufficient energy for human beings and for supporting economic activities while minimizing social and environmental harm. This study predicted coefficient of performance (COP) for refrigeration equipment under varying amounts of refrigerant (R404A) with the aids of data mining (DM) techniques. The performance of artificial neural networks (ANNs), support vector machines (SVMs), classification and regression tree (CART), multiple regression (MR), generalized linear regression (GLR), and chi-squared automatic interaction detector (CHAID) were applied within DM process. After obtaining the COP value, abnormal equipment conditions can be evaluated for refrigerant leakage. Analytical results from cross-fold validation method are compared to determine the best models. The study shows that DM techniques can be used for accurately and efficiently predicting COP. In the liquid leakage phase, ANNs provide the best performance. In the vapor leakage phase, the best model is the GLR model. Experimental results confirm that systematic analyses of model construction processes are effective for evaluating and optimizing refrigeration equipment performance. 相似文献
4.
Software testing is both a time and resource-consuming activity in software development. The most difficult parts of software testing are the generation and prioritization of test data. Principally these two parts are performed manually. Hence introducing an automation approach will significantly reduce the total cost incurred in the software development lifecycle. A number of automatic test case generation (ATCG) and prioritization approaches have been explored. In this paper, we propose two approaches: (1) a pathspecific approach for ATCG using the following metaheuristic techniques: the genetic algorithm (GA), particle swarm optimization (PSO) and artificial bee colony optimization (ABC); and (2) a test case prioritization (TCP) approach using PSO. Based on our experimental findings, we conclude that ABC outperforms the GA and PSO-based approaches for ATC.G Moreover, the results for PSO on TCP arguments demonstrate biased applicability for both small and large test suites against random, reverse and unordered prioritization schemes. Therefore, we focus on conducting a comprehensive and exhaustive study of the application of metaheuristic algorithms in solving ATCG and TCP problems in software engineering. 相似文献
5.
Research groups, large businesses, government agencies and other organizations are using improved data mining technologies and techniques to discover meaningful patterns in huge databases, and now, data mining has been refined to the point where even people who aren't highly trained statisticians can use this complex data analysis tool. Data mining's increased popularity is due partly to technological improvements that permit faster, more effective analyses of databases 相似文献
6.
7.
Distributed data mining: a survey 总被引:1,自引:1,他引:0
Li Zeng Ling Li Lian Duan Kevin Lu Zhongzhi Shi Maoguang Wang Wenjuan Wu Ping Luo 《Information Technology and Management》2012,13(4):403-409
Most data mining approaches assume that the data can be provided from a single source. If data was produced from many physically distributed locations like Wal-Mart, these methods require a data center which gathers data from distributed locations. Sometimes, transmitting large amounts of data to a data center is expensive and even impractical. Therefore, distributed and parallel data mining algorithms were developed to solve this problem. In this paper, we survey the-state-of-the-art algorithms and applications in distributed data mining and discuss the future research opportunities. 相似文献
8.
9.
Ahmed Samet Eric Lefèvre Sadok Ben Yahia 《Journal of Intelligent Information Systems》2016,47(1):135-163
Associative classification has been shown to provide interesting results whenever of use to classify data. With the increasing complexity of new databases, retrieving valuable information and classifying incoming data is becoming a thriving and compelling issue. The evidential database is a new type of database that represents imprecision and uncertainty. In this respect, extracting pertinent information such as frequent patterns and association rules is of paramount importance task. In this work, we tackle the problem of pertinent information extraction from an evidential database. A new data mining approach, denoted EDMA, is introduced that extracts frequent patterns overcoming the limits of pioneering works of the literature. A new classifier based on evidential association rules is thus introduced. The obtained association rules, as well as their respective confidence values, are studied and weighted with respect to their relevance. The proposed methods are thoroughly experimented on several synthetic evidential databases and showed performance improvement. 相似文献
10.
现有的电力仪表图像智能监控系统存在着稳定性差、效率低下的弊端,为了解决上述问题,引入数据挖掘技术对电力仪表图像智能监控系统进行设计与研究。电力仪表图像智能监控系统硬件设计包括图像采集单元、数据挖掘单元、无线通信单元与控制器单元设计,软件设计包括数据挖掘终端节点软件、无线通信协调器节点软件与控制器软件设计。通过系统硬件与软件的设计,实现了电力仪表图像智能监控系统的运行。通过仿真对比实验得到,与现有的电力仪表图像智能监控系统相比,设计的电力仪表图像智能监控系统极大地提升了系统的稳定性与监控效率,充分说明设计的电力仪表图像智能监控系统具备更好的监控性能。 相似文献
11.
《Advanced Engineering Informatics》2014,28(2):166-185
Although the integration of engineering data within the framework of product data management systems has been successful in the recent years, the holistic analysis (from a systems engineering perspective) of multi-disciplinary data or data based on different representations and tools is still not realized in practice. At the same time, the application of advanced data mining techniques to complete designs is very promising and bears a high potential for synergy between different teams in the development process. In this paper, we propose shape mining as a framework to combine and analyze data from engineering design across different tools and disciplines. In the first part of the paper, we introduce unstructured surface meshes as meta-design representations that enable us to apply sensitivity analysis, design concept retrieval and learning as well as methods for interaction analysis to heterogeneous engineering design data. We propose a new measure of relevance to evaluate the utility of a design concept. In the second part of the paper, we apply the formal methods to passenger car design. We combine data from different representations, design tools and methods for a holistic analysis of the resulting shapes. We visualize sensitivities and sensitive cluster centers (after feature reduction) on the car shape. Furthermore, we are able to identify conceptual design rules using tree induction and to create interaction graphs that illustrate the interrelation between spatially decoupled surface areas. Shape data mining in this paper is studied for a multi-criteria aerodynamic problem, i.e. drag force and rear lift, however, the extension to quality criteria from different disciplines is straightforward as long as the meta-design representation is still applicable. 相似文献
12.
Longitudinal data refer to the situation where repeated observations are available for each sampled object. Clustered data,
where observations are nested in a hierarchical structure within objects (without time necessarily being involved) represent
a similar type of situation. Methodologies that take this structure into account allow for the possibilities of systematic
differences between objects that are not related to attributes and autocorrelation within objects across time periods. A standard
methodology in the statistics literature for this type of data is the mixed effects model, where these differences between
objects are represented by so-called “random effects” that are estimated from the data (population-level relationships are
termed “fixed effects,” together resulting in a mixed effects model). This paper presents a methodology that combines the
structure of mixed effects models for longitudinal and clustered data with the flexibility of tree-based estimation methods.
We apply the resulting estimation method, called the RE-EM tree, to pricing in online transactions, showing that the RE-EM
tree is less sensitive to parametric assumptions and provides improved predictive power compared to linear models with random
effects and regression trees without random effects. We also apply it to a smaller data set examining accident fatalities,
and show that the RE-EM tree strongly outperforms a tree without random effects while performing comparably to a linear model
with random effects. We also perform extensive simulation experiments to show that the estimator improves predictive performance
relative to regression trees without random effects and is comparable or superior to using linear models with random effects
in more general situations. 相似文献
13.
HD-Eye: visual mining of high-dimensional data 总被引:3,自引:0,他引:3
Clustering in high-dimensional databases poses an important problem. However, we can apply a number of different clustering algorithms to high-dimensional data. The authors consider how an advanced clustering algorithm combined with new visualization methods interactively clusters data more effectively. Experiments show these techniques improve the data mining process 相似文献
14.
15.
《Applied Artificial Intelligence》2013,27(5-6):545-561
This paper presents a new means of selecting quality data for mining multiple data sources. Traditional data-mining strategies obtain necessary data from internal and external data sources and pool all the data into a huge homogeneous dataset for discovery. In contrast, our data-mining strategy identifies quality data from (internal and external) data sources for a mining task. A framework is advocated for generating quality data. Experimental results demonstrate that application of this new data collecting technique can not only identify quality data, but can also efficiently reduce the amount of data that must be considered during mining. 相似文献
16.
Time series analysis has always been an important and interesting research field due to its frequent appearance in different applications. In the past, many approaches based on regression, neural networks and other mathematical models were proposed to analyze the time series. In this paper, we attempt to use the data mining technique to analyze time series. Many previous studies on data mining have focused on handling binary-valued data. Time series data, however, are usually quantitative values. We thus extend our previous fuzzy mining approach for handling time-series data to find linguistic association rules. The proposed approach first uses a sliding window to generate continues subsequences from a given time series and then analyzes the fuzzy itemsets from these subsequences. Appropriate post-processing is then performed to remove redundant patterns. Experiments are also made to show the performance of the proposed mining algorithm. Since the final results are represented by linguistic rules, they will be friendlier to human than quantitative representation. 相似文献
17.
This research addresses the problem of analyzing the temporal dynamics of business organizations. In particular, we concentrate on inferring the related businesses, i.e., are there groups of companies that are highly correlated through some measurement (metric)? We argue that business relationships derived from general literature (i.e., newspaper articles, news items etc.) may help us create a network of related companies (business networks). On the other hand, relative movement of stock prices can give us an indication of related companies (asset graphs). We also expect to see some relationships between these two kinds of networks. We adapt the asset graph construction approach from the literature for our asset graph implementations, and then, define our methodology for business network construction. Finally, an introduction to the exploration of some relationships between the asset graphs and business networks is presented. 相似文献
18.
19.
20.
Many organizations struggle with the massive amount of data they collect. Today, data does more than serve as the ingredients for churning out statistical reports. They help support efficient operations in many organizations, and to some extent, data provide the competitive intelligence organizations need to survive in today's economy. Data mining can't always deliver timely and relevant results because data are constantly changing. However, stream-data processing might be more effective, judging by the Matrix project. 相似文献