首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 171 毫秒
1.
针对数据挖掘中挖掘过程不透明以及用户交互少的问题,本文设计并实现了VISDMiner系统。VISDMiner系统将可视化技术和数据挖掘技术结合在一起,提供对挖掘过程中各阶段产生的可视化子结果集的分析。用户可根据自己的领域知识和经验去调整数据挖掘算法模型的参数和可视化模型的参数,促进算法和挖掘分析过程的有效调优。为了处理高维数据集,VISDMiner系统采用一种基于最大信息系数的主成分分析改进算法MIC-PCA,该算法主要是针对传统PCA算法降维能力和分类准确率低的问题进行改进。实验结果表明,VISDMine不仅实现了数据挖掘过程的可视化,还提高了用户对数据挖掘〖JP2〗执行结果的可理解性,其采用的改进的MIC-PCA算法提高了PCA算法的降维能力和分类准确率。  相似文献   

2.
关联规则下数据挖掘可视化技术的探讨与实现   总被引:1,自引:1,他引:0  
随着数据挖掘技术的不断发展以及数据挖掘工具应用的日益深入,数据挖掘可视化技术已成为数据挖掘领域新兴的研究热点之一。本文首先探讨数据挖掘可视化技术的主流发展趋势;然后基于Netbeans6.1开发平台,采用Java、JFreeChart等开发技术,结合PMML标准,完成关联规则下的数据挖掘可视化技术的设计与实施。实践表明:所完成的工作能够将关联规则下的数据挖掘过程以可视化的元素展示给参与数据挖掘的用户,让用户能够对挖掘结果进行有效评估。  相似文献   

3.
迄今为止,数据挖掘与知识发现软件的功能不再停留在"挖掘"这个单一功能的实现,而已延伸到数据挖掘与知识发现的过程,即包括数据的预处理、数据挖掘、模型评估与可视化,在单纯的模型可视化基础上扩充了数据可视化与数据挖掘过程可视化.主要讨论了数据挖掘的方法与可视化技术,指出了未来的研究方向.  相似文献   

4.
体可视化是一门交互地视觉呈现三维数据场的技术,允许用户直观方便地理解数据中所包含的几何结构和特征信息,在医学影像、石油勘探、大气气象和科学计算模拟等领域得到广泛应用.由于三维数据场内部特征的复杂性,实现高表达力的体可视化需要同时考虑数据的特性和用户的因素.基于感知的体可视化技术在可视化流程中考虑人类感知过程和认知过程的特点,使得可视化的结果符合人类感知特性和习惯,达到有效地揭示数据特征的目的.文中从用户驱动的体数据分析、符合感知的可视化设计和表意性体可视化3个方面对基于感知的体可视化的相关技术进行梳理总结,并指出该领域需要进一步探索的研究方向.  相似文献   

5.
在对Web应用挖掘的基本步骤作系统性研究的基础上,设计了一个Web应用挖掘可视化系统.该系统能够对用户访问Web时服务器方留下的访问记录进行挖掘,从中得出用户的访问模式和访问兴趣,并对所得出的结果进行可视化的处理.为了识别用户浏览模式利用Apriori算法对Web应用挖掘过程中预处理阶段所产生的用户会话文件进行了挖掘.采用Web图可视化了Web站点的拓扑结构以及各节点访问计数和登录计数信息.Web图的新颖之处在于两点:首先,为了将Web拓扑结构映射到Web图上,利用了站点拓扑结构数据和站点应用数据;其次,在绘制表示用户登录计数的信息层时允许通过使用动态布局的方法,以及为每一层的节点重新分配360度周长的方法来解决节点之间的冲突问题.文中较详细地阐述了该系统对Web应用数据挖掘可视化界面布局的具体措施.  相似文献   

6.
石油工业大数据具有无限潜力与价值,将大数据与数据挖掘技术应用其中,不仅可以提升石油行业工业化水平,而且对石油行业智慧化发展起到强有力地推动作用.由此提出了一个Web架构驱动的、集成了数据挖掘五大模块的新型工业知识挖掘系统-即石油工业数据挖掘系统,包含:数据集管理、预处理算法管理、数据挖掘算法管理以及数据挖掘流程管理和数据结果可视化五大模块.本系统实现了完全自助式的数据提取、数据预处理、数据分析与知识挖掘和结果可视化展示的完整知识挖掘流程.通过以Web的形式满足油田不同层级的用户在不同场景下的即时使用需求,极大提高了系统的灵活性.通过本系统,油田的技术开发人员可忽略大数据的搭建以及其他复杂构建过程,更好的服务于油田数据建模和分析.  相似文献   

7.
一个面向大规模数据库的数据挖掘系统   总被引:18,自引:0,他引:18  
钱卫宁  魏藜  王焱  钱海蕾  周傲英 《软件学报》2002,13(8):1540-1545
数据挖掘融合了数据库技术、人工智能和统计学,是目前的研究热点.为了能够集成当前数据挖掘的主要技术并使它们协同工作,在进行数据挖掘基本算法研究的基础上研制开发了一个数据挖掘系统--Golden-Eye.系统实现了在数据挖掘研究中的一些最新成果,集成了泛化、数据清洗这两个数据准备操作以及关联规则发现、例外规则发现、时序模式发现、分类器构造、聚类分析等基本数据挖掘操作,并实现了对挖掘操作的基本管理和结果的图形化显示.整个框架设计充分体现了系统的完整性、协调性和高效性:自底向上将存储控制模块、数据预处理模块、挖掘操作模块、挖掘库管理模块有机地结合在一起,在底层实现了对包括中间结果在内的数据的统一管理,在上层为用户提供了可视化的界面.实验结果表明,该系统能够在大规模数据库上成功地完成用户所指定的数据挖掘操作.  相似文献   

8.
陈亮  卢欣荣  曹文梁 《福建电脑》2007,(7):30-31,49
迄今为止,数据挖掘与知识发现软件的功能不再停留在“挖掘”这个单一功能的实现,而已延伸到数据挖掘与知识发现的过程.即包括数据的预处理、数据挖掘、模型评估与可视化;在单纯的模型可视化基础上扩充了数据可视化与数据挖掘过程可视化:本文着重讨论了数据挖掘的方法与可视化技术,最后提出了未来的研究方向。  相似文献   

9.
DMVisualMiner:一个可视化数据挖掘分析平台   总被引:1,自引:0,他引:1  
DMVisualMiner是将可视化技术应用于数据挖掘领域而开发的一个数据分析平台.可视化数据挖掘主要应用在4个方面数据准备阶段的可视化、模型生成阶段的可视化、结果呈现阶段的可视化、数据挖掘流程的可视化.实现了对数据挖掘各个方面的可视化,同时DMVisualMiner采用构件的设计方法,利用插件的概念增强了系统的可扩展性,设计并实现了基于XML的模型表示方法,使得DMVisualMiner能够和预言模型系统集成,并能在网络环境下发布.  相似文献   

10.
数据挖掘技术的快速发展,能帮助用户更换的挖掘数据库中隐藏的丰富知识,而受挖掘技术复杂性影响,一些用户很难挖掘完整的数据,对数据的理解和掌握也比较吃力。采用图形和图像的形式,能帮助用户理解和掌握数据挖掘的结果,可视化数据挖掘技术便应运而生。  相似文献   

11.
This paper presents a nonlinear structural health inference technique, based on an interactive data mining approach. A mining control agent emulating cognitive process of human analysts is developed and integrated in the data mining loop, analyzing and verifying the output of the data miner and controlling the data mining process to improve the interaction between human users and computer system. Additionally, an artificial neural network method, which is adopted as a core component of the proposed interactive data mining method, is evolved by adding a novelty detecting and retraining function for handling complicated nuclear power plant quake-proof data. Based on proposed approach, an information inference system has been developed. To demonstrate how the proposed technique can be used as a powerful tool for inferring of structural health status in unclear power plant, quake-proof testing data have been applied.  相似文献   

12.
Mining sequential patterns with regular expression constraints   总被引:5,自引:0,他引:5  
Discovering sequential patterns is an important problem in data mining with a host of application domains including medicine, telecommunications, and the World Wide Web. Conventional sequential pattern mining systems provide users with only a very restricted mechanism (based on minimum support) for specifying patterns of interest. As a consequence, the pattern mining process is typically characterized by lack of focus and users often end up paying inordinate computational costs just to be inundated with an overwhelming number of useless results. We propose the use of Regular Expressions (REs) as a flexible constraint specification tool that enables user-controlled focus to be incorporated into the pattern mining process. We develop a family of novel algorithms (termed SPIRIT-Sequential Pattern mining with Regular expression consTraints) for mining frequent sequential patterns that also satisfy user-specified RE constraints. The main distinguishing factor among the proposed schemes is the degree to which the RE constraints are enforced to prune the search space of patterns during computation. Our solutions provide valuable insights into the trade-offs that arise when constraints that do not subscribe to nice properties (like anti monotonicity) are integrated into the mining process  相似文献   

13.
约束关联规则的增量式维护算法   总被引:6,自引:0,他引:6  
关联规则的挖掘是一个重要的数据挖掘问题,在关联规则的挖掘过程中加入约束条件,是实现用户参与挖掘的一种方式。在有约束的关联规则挖掘过程中,用户会不断调整约束条件,并要求更新挖掘结果。针对这种情况,提出了约束关联规则的增量式维护算法Separate_M,当约束条件发生变化时,在原有挖掘结果的基础上实现增量式更新,较重新运行Separate算法而言,减小了搜索空间,节约了时间,提高了挖掘效率。  相似文献   

14.
The loosely coupled relationships between visualization and analytical data mining (DM) techniques represent the majority of the current state of art in visual data mining; DM modeling is typically an automatic process with very limited forms of guidance from users. A conceptual model of the visualization support to DM modeling process and a novel interactive visual decision tree (IVDT) classification process have been proposed in this paper, with the aim of exploring humans’ pattern recognition ability and domain knowledge to facilitate the knowledge discovery process. An IVDT for categorical input attributes has been developed and experimented on 20 subjects to test three hypotheses regarding its potential advantages. The experimental results suggested that, compared to the automatic modeling process as typically applied in current decision tree modeling tools, IVDT process can improve the effectiveness of modeling in terms of producing trees with relatively high classification accuracies and small sizes, enhance users’ understanding of the algorithm, and give them greater satisfaction with the task.  相似文献   

15.
The Web-its resources and users-offers a wealth of information for data mining and knowledge discovery. Up to now, a great deal of work has been done applying data mining and machine learning methods to discover novel and useful knowledge on the Web. However, many techniques aim only at extracting knowledge for human users to view and use. Recently, more and more work addresses Web for knowledge that computer systems will use. You can apply such actionable knowledge back to the Web for measurable performance improvements. This special issue of IEEE Intelligent Systems features five articles that address the problem of actionable Web mining.  相似文献   

16.
A data warehouse is an important decision support system with cleaned and integrated data for knowledge discovery and data mining systems. In reality, the data warehouse mining system has provided many applicable solutions in industries, yet there are still many problems causing users extra problems in discovering knowledge or even failing to obtain the real and useful knowledge they need. To improve the overall data warehouse mining process, we present an intelligent data warehouse mining approach incorporated with schema ontology, schema constraint ontology, domain ontology and user preference ontology. The structures of these ontologies are illustrated and how they benefit the mining process is also demonstrated by examples utilizing rule mining. Finally, we present a prototype multidimensional association mining system, which with intelligent assistance through the support of the ontologies, can help users build useful data mining models, prevent ineffective pattern generation, discover concept extended rules, and provide an active knowledge re-discovering mechanism.  相似文献   

17.
Mining closed frequent itemsets from data streams is of interest recently. However, it is not easy for users to determine a proper minimum support threshold. Hence, it is more reasonable to ask users to set a bound on the result size. Therefore, an interactive single-pass algorithm, called TKC-DS (top-K frequent closed itemsets of data streams), is proposed for mining top-K closed itemsets from data streams efficiently. A novel data structure, called CIL (closed itemset lattice), is developed for maintaining the essential information of closed itemsets generated so far. Experimental results show that the proposed TKC-DS algorithm is an efficient method for mining top-K frequent itemsets from data streams.  相似文献   

18.
Data mining has proven to be very useful in order to extract information from data in many different contexts. However, due to the complexity of data mining techniques, it is required the know-how of an expert in this field to select and use them. Actually, adequately applying data mining is out of the reach of novice users which have expertise in their area of work, but lack skills to employ these techniques. In this paper, we use both model-driven engineering and scientific workflow standards and tools in order to develop named S3Mining framework, which supports novice users in the process of selecting the data mining classification algorithm that better fits with their data and goal. To this aim, this selection process uses the past experiences of expert data miners with the application of classification techniques over their own datasets. The contributions of our S3Mining framework are as follows: (i) an approach to create a knowledge base which stores the past experiences of experts users, (ii) a process that provides the expert users with utilities for the construction of classifiers’ recommenders based on the existing knowledge base, (iii) a system that allows novice data miners to use these recommenders for discovering the classifiers that better fit for solving their problem at hand, and (iv) a public implementation of the framework’s workflows. Finally, an experimental evaluation has been conducted to shown the feasibility of our framework.  相似文献   

19.
Due to the increasing availability and sophistication of data recording techniques, multiple information sources and distributed computing are becoming the important trends of modern information systems. Many applications such as security informatics and social computing require a ubiquitous data analysis platform so that decisions can be made rapidly under distributed and dynamic system environments. Although data mining has now been popularly used to achieve such goals, building a data mining system is, however, a nontrivial task, which may require a complete understanding on numerous data mining techniques as well as solid programming skills. Employing agent techniques for data analysis thus becomes increasingly important, especially for users not familiar with engineering and computational sciences, to implement an effective ubiquitous mining platform. Such data mining agents should, in practice, be intelligent, complete, and compact. In this paper, we present an interactive data mining agent — OIDM (online interactive data mining), which provides three categories (classification, association analysis, and clustering) of data mining tools, and interacts with the user to facilitate the mining process. The interactive mining is accomplished through interviewing the user about the data mining task to gain efficient and intelligent data mining control. OIDM can help users find appropriate mining algorithms, refine and compare the mining process, and finally achieve the best mining results. Such interactive data mining agent techniques provide alternative solutions to rapidly deploy data mining techniques to broader areas of data intelligence and knowledge informatics.  相似文献   

20.
VISMiner:一个交互式可视化数据挖掘原型系统   总被引:6,自引:0,他引:6  
交互式可视化数据挖掘是利用可视化技术进行联机数据挖掘的技术。基于SOM的交互式可视化数据挖掘原型系统VISMiner的主要目的是将数据挖掘与数据可视化及OLAP进行集成,允许用户以交互的方式从SOM的标记图或距离图中选定感兴趣区域加以深入分析。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号