首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 609 毫秒
1.
Architecture for knowledge discovery and knowledge management   总被引:1,自引:0,他引:1  
In this paper, we propose I-MIN model for knowledge discovery and knowledge management in evolving databases. The model splits the KDD process into three phases. The schema designed during the first phase, abstracts the generic mining requirements of the KDD process and provides a mapping between the generic KDD process and (user) specific KDD subprocesses. The generic process is executed periodically during the second phase and windows of condensed knowledge called knowledge concentrates are created. During the third phase, which corresponds to actual mining by the end users, specific KDD subprocesses are invoked to mine knowledge concentrates. The model provides a set of mining operators for the development of mining applications to discover and renew, preserve and reuse, and share knowledge for effective knowledge management. These operators can be invoked by either using a declarative query language or by writing applications.The architectural proposal emulates a DBMS like environment for the managers, administrators and end users in the organization. Knowledge management functions, like sharing and reuse of the discovered knowledge among the users and periodic updating of the discovered knowledge are supported. Complete documentation and control of all the KDD endeavors in an organization are facilitated by the I-MIN model. This helps in structuring and streamlining the KDD operations in an organization.  相似文献   

2.
王文剑 《计算机工程》2000,26(11):56-57
知识挖掘(KDD)应该不仅能够提供较精确的预测结果,而且提取的规则也应该是可以解释的。讨论了从预测模型中进行规则抽取的一般技术,并介绍了作者用神经网络方法抽取规则的算法。  相似文献   

3.
本文对如何构建一个全新的解析智能系统进行了描述。该系统的特点是把基于化学计量学的解析算法和专家系统柔性集成到一起,并把数据库知识发现(KDD)技术和专家数据库(ED)技术作为构建专家系统知识库的核心工具。利用解析算法可使复杂化学体系简化,结合改进的专家系统可实现化学体系定性定量及结构解析的智能化。  相似文献   

4.
Inductive database languages: requirements and examples   总被引:1,自引:1,他引:0  
Inductive databases (IDBs) represent a database perspective on Knowledge discovery in databases (KDD). In an IDB, the KDD application can express both queries capable of accessing and manipulating data, and queries capable of generating, manipulating, and applying patterns allowing to formalize the notion of mining process. The feature that makes them different from other data mining applications is exactly the idea of looking at the support for knowledge discovery as an extension of the query process. This paper draws a list of desirable properties to be taken into account in the definition of an IDB framework. They involve several dimensions, such as the expressiveness of the language in representing data and models, the closure principle, the capability to provide a support for an efficient algorithm programming. These requirements are a basis for a comparative study that highlights strengths and weaknesses of existing IDB approaches. The paper focuses on the SQL-based ATLaS language/system, on the logic-based LDL++{\mathcal{LDL}++} language/system, and on the XML-based KDDML language/system.  相似文献   

5.
Knowledge Discovery in Databases (KDD) focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns within them. While most work on KDD has been concerned with structured databases, there has been little work on handling the huge amount of information that is available only in unstructured textual form. This paper describes the KDT system for Knowledge Discovery in Text, in which documents are labeled by keywords, and knowledge discovery is performed by analyzing the co-occurrence frequencies of the various keywords labeling the documents. We show how this keyword-frequency approach supports a range of KDD operations, providing a suitable foundation for knowledge discovery and exploration for collections of unstructured text.  相似文献   

6.
This research adopts a framework that synthesizes Knowledge Discovery in Database (KDD), Cross Industry Standard Process for Data Mining (CRISP-DM), and agile practices. The application of this framework is demonstrated through an institutional case study of three knowledge discovery projects: Persistence, Retention, and Donor projects. Results from the case study suggest that (a) interaction and iteration are foundations for the success of a knowledge discovery project, especially one with a strong business focus; (b) agile practices facilitate the interaction and iteration nature of a knowledge discovery project; (c) adding business understanding and deployment steps from CRISP-DM to KDD explicitly helps data miners stay focused on the ultimate goals of the project—the needs of the business and the users.  相似文献   

7.

While knowledge discovery in databases (KDD) is defined as an iterative sequence of the following steps: data pre-processing, data mining, and post data mining, a significant amount of research in data mining has been done, resulting in a variety of algorithms and techniques for each step. However, a single data-mining technique has not been proven appropriate for every domain and data set. Instead, several techniques may need to be integrated into hybrid systems and used cooperatively during a particular data-mining operation. That is, hybrid solutions are crucial for the success of data mining. This paper presents a hybrid framework for identifying patterns from databases or multi-databases. The framework integrates these techniques for mining tasks from an agent point of view. Based on the experiments conducted, putting different KDD techniques together into the agent-based architecture enables them to be used cooperatively when needed. The proposed framework provides a highly flexible and robust data-mining platform and the resulting systems demonstrate emergent behaviors although it does not improve the performance of individual KDD techniques.  相似文献   

8.
面向复杂系统的知识发现过程模型KD(D&K)及其应用   总被引:1,自引:0,他引:1  
为适应复杂系统的知识发现的需要, 在双库协同机制及其诱导的KDD* 过程模型,双基融合机制及其诱导的KDK*过程模型的基础上,借鉴协同原理,提出了将KDD* 与KDK* 有机地融合在一起的、双库协同机制与双基融合机制协同工作的知识发现过程模型KD(DK);描述了KD(DK) 的总体流程、动态知识库系统及其特征;并在农业施肥和植保领域的应用过程中得到验证.  相似文献   

9.
介绍了一种新的KDD模型,和以往的数据驱动式KDD模型不同,并试图利用假说驱动(Hypothesis-driven)的方式来发现新的知识。该文的目标是提供一种KDD工具支持适合人脑风格的假说-检验发现过程,同时最大限度地削减在此过程中人类的工作量。文章提出了为削减人的工作量所作的各种努力。  相似文献   

10.
Knowledge discovery in databases using lattices   总被引:3,自引:0,他引:3  
The rapid pace at which data gathering, storage and distribution technologies are developing is outpacing our advances in techniques for helping humans to analyse, understand, and digest the vast amounts of resulting data. This has led to the birth of knowledge discovery in databases (KDD) and data mining—a process that has the goal to selectively extract knowledge from data. A range of techniques, including neural networks, rule-based systems, case-based reasoning, machine learning, statistics, etc. can be applied to the problem. We discuss the use of concept lattices, to determine dependences in the data mining process. We first define concept lattices, after which we show how they represent knowledge and how they are formed from raw data. Finally, we show how the lattice-based technique addresses different processes in KDD, especially visualization and navigation of discovered knowledge.  相似文献   

11.
12.
13.
数据库知识发现系统及领域知识在其中的作用   总被引:1,自引:0,他引:1  
论述了一种理想化的知识发现系统模型 ,及其各组成部分的功能。进一步讨论了领域知识在其中的重要作用。  相似文献   

14.
最近几年,知识发现研究的进展很快。目前,在知识发现领域图像数据知识发现形成了新的研究热点。本文介绍了基于Hilbert空间理论的图像知识发现模型IMDFSSM,采用模式(定义为Hilbert空间中的矢量)来定量地表征图像数据的知识表示和参与知识发现过程。然后用图像挖掘系统作为实例进行了验证,结果表明该模型对于图像数据的知识发现过程具有指导性作用。  相似文献   

15.
基于粗集理论的Null值估算方法研究   总被引:1,自引:0,他引:1  
刘业政  杨善林 《计算机工程》2001,27(10):41-42,45
在数据库管理系统中,空值(Null)在所有非主码属性中都可能出现。粗集数据分析不同于其它知识发现方法,特别大模型假设方法的一种方法。文章通过扩展粗集理论,研究了空值的估算方法。  相似文献   

16.
In this paper we describe the final version of a knowledge discovery system, Telecommunication Network Alarm Sequence Analyzer (TASA), for telecommunication networks alarm data analysis. The system is based on the discovery of recurrent, temporal patterns of alarms in databases; these patterns, episode rules, can be used in the construction of real-time alarm correlation systems. Also association rules are used for identifying relationships between alarm properties. TASA uses a methodology for knowledge discovery in databases (KDD) where one first discovers large collections of patterns at once, and then performs interactive retrievals from the collection of patterns. The proposed methodology suits very well such KDD formalisms as association and episode rules, where large collections of potentially interesting rules can be found efficiently. When searching for the most interesting rules, simple threshold-like restrictions, such as rule frequency and confidence may satisfy a large number of rules. In TASA, this problem can be alleviated by templates and pattern expressions that describe the form of rules that are to be selected or rejected. Using templates the user can flexibly specify the focus of interest, and also iteratively refine it. Different versions of TASA have been in prototype use in four telecommunication companies since the beginning of 1995. TASA has been found useful in, e.g. finding long-term, rather frequently occurring dependencies, creating an overview of a short-term alarm sequence, and evaluating the alarm data base consistency and correctness.  相似文献   

17.
基于数据库的知识发现系统设计与实现   总被引:4,自引:0,他引:4  
文章分析了知识发现和数据开采的关系及研究现状,针对数据开采面临的挑战,给出了参照ChristopherM.的KDD参考模型设计的基于数据库的知识发现系统的体系结构和多数据源数据获取的数据库接口设计。该系统能从多数据源获取数据或通过数据产生器产生数据,并能从这些数据中抽取信息,如能自动生成复合公式。  相似文献   

18.
This paper interprets the outputs from the multilayer perceptron (MLP) network by finding the input data features at the input layer of the network which activate the hidden layer feature detectors. This leads directly to the deduction of the significant data inputs, the inputs that the network actually uses to perform the input/output mapping for a classification task, and the discovery of the most significant of these data inputs. The analysis presents a method for providing explanations for the network outputs and for representing the knowledge learned by the network in the form of significant input data relationships. During network development the explanation facilities and data relationships can be used for network validation and verification, and after development, for rule induction and data mining where this method provides a potential tool for knowledge discovery in databases (KDD).  相似文献   

19.
文章通过对基于数据库的知识发现系统(KDD)的研究,提出了双库协同机制,它改变了KDD的结构、运行过程与机制,形成新的知识发现系统KDD。将该发现系统应用于农业领域,为合理地指导农业生产提供了科学的决策,因而具有重要的理论意义和实用价值。  相似文献   

20.
在基于数据库和知识库的知识发现系统(KDD&K)的研究中,需对知识库中的重复、冗余、矛盾、循环的知识进行实时校验、修改,并能够发现知识短缺,指导KDD过程进行聚焦;在KDK过程中,需要找出有关联的知识组成的知识域以便于归纳、解释等具体应用需求,针对于此,该文提出了一种基于知识节点(属性)的图矩阵、二维链表、产生式规则的三级管理模式和数据存储结构,通过知识库管理系统(KBMS)实现了二层逻辑结构和一层物理结构的三层独立映射关系,大大压缩了知识的搜索空间。经在KDD&K原型系统中的具体应用,该知识库系统结构的定义以及相应的KBMS完全满足上述要求,并可推广至通用的大、中型知识库系统。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号