共查询到20条相似文献,搜索用时 15 毫秒
1.
大规模数据库中的知识获取 总被引:1,自引:0,他引:1
一、前言数据是知识的潭泉,拥有大量的数据与拥有许多有用的知识完全是两回事.为了有效地利用大量的公共数据,必须更好地理解这些数据,并从其中快速、准确地发现知识.这里所说的知识是指大量数据中存在的规律性(r egularity)或不同属性值之间所存在的[I F THEN〕规则.将所获取的知识附加于仅由事实数据(fact data)构成的传统数据库上,既可强化数据库的查询能力,又可给数据库提供推理能力, 相似文献
2.
探讨了数据库应用重要技术-数据挖掘技术,描述了数据查询的结构,已有的各种方式及常用技术,涵盖了大部分流行工具,比如分类、聚类、概括,总结了目前使用的统计与机器学习方法和新的解决方案。 相似文献
3.
最近几年,知识发现研究的进展很快。目前,在知识发现领域图像数据知识发现形成了新的研究热点。本文介绍了基于Hilbert空间理论的图像知识发现模型IMDFSSM,采用模式(定义为Hilbert空间中的矢量)来定量地表征图像数据的知识表示和参与知识发现过程。然后用图像挖掘系统作为实例进行了验证,结果表明该模型对于图像数据的知识发现过程具有指导性作用。 相似文献
4.
Knowledge Discovery Through Self-Organizing Maps: Data Visualization and Query Processing 总被引:2,自引:1,他引:2
In data mining, the usefulness of a data pattern depends on the user of the database and does not solely depend on the statistical
strength of the pattern. Based on the premise that heuristic search in combinatorial spaces built on computer and human cognitive
theories is useful for effective knowledge discovery, this study investigates how the use of self-organizing maps as a tool
of data visualization in data mining plays a significant role in human–computer interactive knowledge discovery. This article
presents the conceptual foundations of the integration of data visualization and query processing for knowledge discovery,
and proposes a set of query functions for the validation of self-organizing maps in data mining.
Received 1 November 1999 / Revised 2 March 2000 / Accepted in revised form 20 October 2000 相似文献
5.
Large databases are becoming increasingly common in civil infrastructure applications. Although it is relatively simple to
specifically query these databases at a low level, more abstract questions like ‘How does the environment affect pavement
cracking?’ are difficult to answer with traditional methods. Data mining techniques can provide a solution for learning abstract
knowledge from civil infrastruc-ture databases. However, data mining needs to be performed within a systematic process to
ensure correct and reproducible results. Many decisions must be made during this process, making it difficult for novice analysts
to apply data mining techniques thoroughly. This paper presents an application of a knowledge discovery process to data collected
for an ‘intelligent’ building. The knowledge discovery process is illustrated and explained through this case study. Additionally,
we discuss the importance of this case study in the context of a research effort to develop an interactive guide for the knowledge
discovery process. 相似文献
6.
7.
8.
9.
10.
利用基因遗传算法从数据库自动生成知识库 总被引:4,自引:0,他引:4
此文提出一种从数据库自动生成知识库的新方法。该方法从数据库到知识库的优化目标函数,利用基因遗传算法的优化手段,直接从数据库中生成性能较优的知识库。文中论述了该方法的原理与具体实现过程。 相似文献
11.
DCS故障诊断专家系统中知识自动获取的研究 总被引:2,自引:0,他引:2
着重叙述了知识自动获取系统,其与目前最常用的开发工具不同,在设计与开发中采用了最新的数据库软件来作为开发工具,同时对新系统中知识输入的步骤、知识输入界面及知识输方法作了介绍。 相似文献
12.
基于知识发现的范例推理系统 总被引:1,自引:0,他引:1
1 引言范例推理(Case-Based Reasoning,CBR)是近十几年来人工智能中发展起来的区别于基于规则推理的一种推理模式,它是指借用旧的事例或经验来解决问题、评价解决方案、解释异常情况或理解新情况。CBR兴起的主要原因是传统的基于规则的系统存在诸多的缺点,如:在知识获取问题上存在困难,对于处理过的问题没有记忆而导致推理效率低下,不能有效地处理例外情况,整体性能较为脆弱等等,而CBR恰好能解决以上问题。 相似文献
13.
TIAN Yuan MENG Zhi-qing 《数字社区&智能家居》2007,2(10):1067
We are obtaining a large database of some objects' records of fluctuations of a stock market,medical treatments,changes of weather in certain area and so on,where each record consists of multi-attributes taking multi-values changing with time. Our work is motivated by prediction,which is different from the work in 4,5,8,11. We want to help learn from past data and make informed decisions for the future. This paper is very significant to perfect the theory and the development of the temporal data mining. 相似文献
14.
15.
该文针对目前数据挖掘的研究状况,理论上提出了将基于属性分类方法和多元线形回归算法相结合的算法,首先使用基于属性分类的方法将原始数据库进行属性分类,化简,去掉次要的条件属性,最后得出一个简化的表格,找出影响决策属性的主要因素,根据此表,可以得出简单的ifthen规则;然后使用多元线形回归求出它们之间的近似定量关系,得出一个最优回归方程。 相似文献
16.
Three perspectives of data mining 总被引:2,自引:0,他引:2
Zhi-Hua Zhou 《Artificial Intelligence》2003,143(1):139-146
This paper reviews three recent books on data mining written from three different perspectives, i.e., databases, machine learning, and statistics. Although the exploration in this paper is suggestive instead of conclusive, it reveals that besides some common properties, different perspectives lay strong emphases on different aspects of data mining. The emphasis of the database perspective is on efficiency because this perspective strongly concerns the whole discovery process and huge data volume. The emphasis of the machine learning perspective is on effectiveness because this perspective is heavily attracted by substantive heuristics working well in data analysis although they may not always be useful. As for the statistics perspective, its emphasis is on validity because this perspective cares much for mathematical soundness behind mining methods. 相似文献
17.
18.
来自应用、社会、经济等各方面的迫切需求,以及不断升温的研究兴趣,使知识发现和数据采掘成为目前一个不断发展的领域。本文介绍了知识发现和数据采掘技术的产生背景、基本任务、方法及其应用,同时还简要介绍了目前已有的成熟的KDD系统及其将来的发展。 相似文献
19.
Marc Boullé 《Machine Learning》2006,65(1):131-165
While real data often comes in mixed format, discrete and continuous, many supervised induction algorithms require discrete
data. Efficient discretization of continuous attributes is an important problem that has effects on speed, accuracy and understandability
of the induction models. In this paper, we propose a new discretization method MODL1, founded on a Bayesian approach. We introduce a space of discretization models and a prior distribution defined on this model
space. This results in the definition of a Bayes optimal evaluation criterion of discretizations. We then propose a new super-linear
optimization algorithm that manages to find near-optimal discretizations. Extensive comparative experiments both on real and
synthetic data demonstrate the high inductive performances obtained by the new discretization method.
Editor: Tom Fawcett
1French patent No. 04 00179. 相似文献
20.
提出一种基于信息论与集合论的基本理论相结合的方法,用来从数据库发现分类规则知识;利用该方法可以快速发现知识,且发现的知识简捷、可靠。 相似文献