1.

A concurrent rule scheduling algorithm for active rules

Ying Susan D. Suzanne W. 《Data & Knowledge Engineering》2007,60(3):530-546

The use of rules in a distributed environment creates new challenges for the development of active rule execution models. In particular, since a single event can trigger multiple rules that execute over distributed sources of data, it is important to make use of concurrent rule execution whenever possible. This paper presents the details of the integration rule scheduling (IRS) algorithm. Integration rules are active database rules that are used for component integration in a distributed environment. The IRS algorithm identifies rule conflicts for multiple rules triggered by the same event through static, compile-time analysis of the read and write sets of each rule. A unique aspect of the algorithm is that the conflict analysis includes the effects of nested rule execution that occurs as a result of using an execution model with an immediate coupling mode. The algorithm therefore identifies conflicts that may occur as a result of the concurrent execution of different rule triggering sequences. The rules are then formed into a priority graph before execution, defining the order in which rules triggered by the same event should be processed. Rules with the same priority can be executed concurrently. The IRS algorithm guarantees confluence in the final state of the rule execution. The IRS algorithm is applicable for rule scheduling in both distributed and centralized rule execution environments.
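The idea of read/write-set conflict analysis followed by priority grouping can be illustrated with a minimal Python sketch. It is not the IRS algorithm itself: it ignores nested rule triggering, and the rule names, the dictionary layout, and the use of declaration order to break ties are all assumptions made for illustration.

```python
def conflicts(a, b):
    """Two rules conflict if either one writes data the other reads or writes."""
    return bool(
        a["write"] & (b["read"] | b["write"])
        or b["write"] & (a["read"] | a["write"])
    )


def priority_levels(rules):
    """Group rules into priority levels (hypothetical helper).

    Each rule is placed one level after the latest earlier-declared rule
    it conflicts with; rules sharing a level have no mutual conflicts
    and could therefore run concurrently.
    """
    items = list(rules.items())
    level = {}
    for i, (name, rw) in enumerate(items):
        lvl = 0
        for other, other_rw in items[:i]:
            if conflicts(rw, other_rw):
                lvl = max(lvl, level[other] + 1)
        level[name] = lvl

    grouped = {}
    for name, lvl in level.items():
        grouped.setdefault(lvl, []).append(name)
    return [grouped[k] for k in sorted(grouped)]


# Hypothetical rules triggered by the same event.
rules = {
    "r1": {"read": {"orders"}, "write": {"stock"}},
    "r2": {"read": {"stock"}, "write": {"alerts"}},
    "r3": {"read": {"customers"}, "write": {"mail"}},
}
levels = priority_levels(rules)
```

Here r2 reads what r1 writes, so it is scheduled one level later, while r3 touches unrelated data and shares the first level with r1.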

2.

Generalized production rules as a basis for integrating active and deductive databases

Palopoli L. Torlone R. 《Knowledge and Data Engineering, IEEE Transactions on》1997,9(6):848-862

The authors address the problem of providing a homogeneous framework for integrating, in a database environment, active rules, which allow the specification of actions to be executed whenever certain events take place, and deductive rules, which allow the specification of deductions in a logic programming style. It is widely recognized that both kinds of rules enhance the capabilities of database systems since they provide very natural mechanisms for the management of various important activities (e.g., knowledge representation, complex data manipulation, integrity constraint enforcement, view maintenance). However, in spite of their strong relationship, little work has been done on the unification of these powerful paradigms. They present a rule-based language with an event-driven semantics that allows programmers to express both active and deductive computations. The language is based on a new notion of production rules whose effect is both a change of state and an answer to a query. By using several examples, they show that this simple language schema allows one to uniformly define different computations on data, including complex data manipulations, deductive evaluations, and active rule processing. They define the semantics of the language and then describe the architecture of a preliminary implementation of the language. Finally, they report on the application and experience of using the language.

3.

Maintaining data-driven rules in databases

Gal A. Etzion O. 《Computer》1995,28(1):28-38

A new model with an invariant-based language effectively handles data-driven rules in databases and uses the rules' inherent semantic properties and supporting mechanisms to meet high-level language requirements. It is an extension of the basic PARDES model developed by Opher Etzion in 1990 to support derivations and integrity constraints in databases. The model's invariant-based language, unlike other programming languages, can follow data-driven rules' semantic properties. Such rules are activated by modifications of data items in a database, and they play an important role in many applications that maintain complex relationships between data items or interdependencies between parts of the database. Applications include expert systems, real-time databases, simulations, and decision-support systems. The authors present requirements for choosing an adequate programming style that uses data-driven rules. These requirements are based on software-engineering criteria such as compatibility with a high-level language and verifiability of the rule language. The authors show that contemporary database programming styles fail to meet these requirements, and they present the invariant-based language as a viable solution.

4.

Effective timestamping in databases (total citations: 3; self-citations: 0; citations by others: 3)

Kristian Torp Christian S. Jensen Richard T. Snodgrass 《The VLDB Journal The International Journal on Very Large Data Bases》2000,8(3-4):267-288

Many existing database applications place various timestamps on their data, rendering temporal values such as dates and times prevalent in database tables. During the past two decades, several dozen temporal data models have appeared, all with timestamps being integral components. The models have used timestamps for encoding two specific temporal aspects of database facts, namely transaction time, when the facts are current in the database, and valid time, when the facts are true in the modeled reality. However, with few exceptions, the assignment of timestamp values has been considered only in the context of individual modification statements. This paper takes the next logical step: It considers the use of timestamping for capturing transaction and valid time in the context of transactions. The paper initially identifies and analyzes several problems with straightforward timestamping, then proceeds to propose a variety of techniques aimed at solving these problems. Timestamping the results of a transaction with the commit time of the transaction is a promising approach. The paper studies how this timestamping may be done using a spectrum of techniques. While many database facts are valid until now, the current time, this value is absent from the existing temporal types. Techniques that address this problem using different substitute values are presented. Using a stratum architecture, the performance of the different proposed techniques is studied. Although querying and modifying time-varying data is accompanied by a number of subtle problems, we present a comprehensive approach that provides application programmers with simple, consistent, and efficient support for modifying bitemporal databases in the context of user transactions. Received: March 11, 1998 / Accepted July 27, 1999

5.

Discovering causal rules in relational databases

Floriana Esposito Donato Malerba Vincenza Ripa Giovanni Semeraro 《Applied Artificial Intelligence》2013,27(1):71-84

This article explores the combined application of inductive learning algorithms and causal inference techniques to the problem of discovering causal rules among the attributes of a relational database. Given some relational data, each field can be considered as a random variable, and a hybrid graph can be built by detecting conditional independencies among variables. The induced graph represents genuine and potential causal relations as well as spurious associations. When the variables are discrete or have been discretized to test conditional independencies, supervised induction algorithms can be used to learn causal rules, that is, conditional statements in which causes appear as antecedents and effects as consequences. The approach is illustrated by means of some experiments conducted on different data sets.

6.

Mining multiple-level association rules in large databases (total citations: 2; self-citations: 0; citations by others: 2)

Jiawei Han Yongjian Fu 《Knowledge and Data Engineering, IEEE Transactions on》1999,11(5):798-805

A top-down progressive deepening method is developed for efficient mining of multiple-level association rules from large transaction databases based on the a priori principle. A group of variant algorithms is proposed based on the ways of sharing intermediate results, with the relative performance tested and analyzed. The enforcement of different interestingness measurements to find more interesting rules, and the relaxation of rule conditions for finding “level-crossing” association rules, are also investigated. The study shows that efficient algorithms can be developed for the discovery of interesting and strong multiple-level association rules from large databases.

7.

Mining spatial association rules in image databases (total citations: 2; self-citations: 0; citations by others: 2)

Anthony J.T. Lee Ruey-Wen Hong Wen-Kwang Tsao 《Information Sciences》2007,177(7):1593-1608

In this paper, we propose a novel spatial mining algorithm, called 9DLT-Miner, to mine the spatial association rules from an image database, where every image is represented by the 9DLT representation. The proposed method consists of two phases. First, we find all frequent patterns of length one. Next, we use frequent k-patterns (k ≥ 1) to generate all candidate (k + 1)-patterns. For each candidate pattern generated, we scan the database to count the pattern’s support and check if it is frequent. The steps in the second phase are repeated until no more frequent patterns can be found. Since our proposed algorithm prunes most of the impossible candidates, it is more efficient than the Apriori algorithm. The experimental results show that 9DLT-Miner runs 2-5 times faster than the Apriori algorithm.
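The two-phase, level-wise loop described in this abstract (find frequent patterns of length one, then repeatedly generate and count longer candidates) can be sketched in Python. This sketch uses plain itemsets in the style of the Apriori algorithm rather than 9DLT spatial patterns, so it illustrates only the general candidate-generation scheme, not 9DLT-Miner; the function name and data are hypothetical.

```python
def frequent_patterns(transactions, min_support):
    """Level-wise frequent itemset mining over plain itemsets (a sketch)."""
    transactions = [frozenset(t) for t in transactions]

    def support(pattern):
        # One database scan per candidate: count transactions containing it.
        return sum(1 for t in transactions if pattern <= t)

    # Phase 1: frequent patterns of length one.
    items = {item for t in transactions for item in t}
    current = {
        frozenset([item])
        for item in items
        if support(frozenset([item])) >= min_support
    }

    result = {}
    k = 1
    # Phase 2: repeat until no more frequent patterns can be found.
    while current:
        for pattern in current:
            result[pattern] = support(pattern)
        # Join frequent k-patterns into candidate (k + 1)-patterns.
        candidates = {a | b for a in current for b in current if len(a | b) == k + 1}
        # Prune candidates that have an infrequent k-subset.
        candidates = {c for c in candidates if all(c - {x} in current for x in c)}
        current = {c for c in candidates if support(c) >= min_support}
        k += 1
    return result


# Hypothetical toy database.
db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}]
found = frequent_patterns(db, min_support=2)
```

With a minimum support of 2, every single item and every pair is frequent in this toy database, while the triple {a, b, c} occurs only once and is discarded.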

8.

Mining interesting association rules from customer databases and transaction databases (total citations: 1; self-citations: 0; citations by others: 1)

Pauray S. M. Tsai Chien-Ming Chen 《Information Systems》2004,29(8):139-696

In this paper, we examine a new data mining issue of mining association rules from customer databases and transaction databases. The problem is decomposed into two subproblems: identifying all the large itemsets from the transaction database and mining association rules from the customer database and the large itemsets identified. For the first subproblem, we propose an efficient algorithm to discover all the large itemsets from the transaction database. Experimental results show that by our approach, the total execution time can be reduced significantly. For the second subproblem, a relationship graph is constructed according to the identified large itemsets from the transaction database and the priorities of condition attributes from the customer database. Based on the relationship graph, we present an efficient graph-based algorithm to discover interesting association rules embedded in the transaction database and the customer database.

9.

Temporal triggers in active databases (total citations: 2; self-citations: 0; citations by others: 2)

Sistla A.P. Wolfson O. 《Knowledge and Data Engineering, IEEE Transactions on》1995,7(3):471-486

In this paper we propose two languages, called Future Temporal Logic (FTL) and Past Temporal Logic (PTL), for specifying temporal triggers. Some examples of trigger conditions that can be specified in our language are the following: “The value of a certain attribute increases by more than 10% in 10 minutes,” “A tuple that satisfies a certain predicate is added to the database at least 10 minutes before another tuple, satisfying a different condition, is added to the database.” Such triggers are important for monitoring and control applications. In addition to the languages, we present algorithms for processing the trigger conditions specified in these languages, namely, procedures for determining when the trigger conditions are satisfied. These methods can be added as a “temporal” component to an existing database management system. A preliminary prototype of the temporal component that uses the FTL language has been built on top of Sybase running on SUN workstations.
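The first example condition quoted in this abstract (a value rising by more than 10% within 10 minutes) can be checked directly over a recorded history, as in the Python sketch below. This is neither FTL or PTL syntax nor the paper's evaluation procedure; the function name, the `(timestamp, value)` history layout, and the brute-force pairwise scan are assumptions chosen for illustration.

```python
def increased_by_more_than(history, ratio=0.10, window=600):
    """Check a temporal trigger condition over a value history (a sketch).

    history: list of (timestamp_in_seconds, value) pairs sorted by time.
    Returns True if some value exceeds an earlier positive value by more
    than `ratio`, with at most `window` seconds between the two readings.
    """
    for i, (t0, v0) in enumerate(history):
        for t1, v1 in history[i + 1:]:
            if t1 - t0 > window:
                break  # later readings fall outside the time window
            if v0 > 0 and v1 > v0 * (1 + ratio):
                return True
    return False


# Hypothetical readings: 100 -> 111 within 9 minutes satisfies the condition.
fired = increased_by_more_than([(0, 100), (300, 105), (540, 111)])
# The same rise spread over more than 10 minutes does not.
not_fired = increased_by_more_than([(0, 100), (700, 120)])
```

A production trigger component would evaluate such conditions incrementally as updates arrive instead of rescanning the whole history.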

10.

Efficient mining of association rules in distributed databases (total citations: 14; self-citations: 0; citations by others: 14)

Cheung D.W. Ng V.T. Fu A.W. Yongjian Fu 《Knowledge and Data Engineering, IEEE Transactions on》1996,8(6):911-922

Many sequential algorithms have been proposed for the mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. In this study, an efficient algorithm called DMA (Distributed Mining of Association rules) is proposed. It generates a small number of candidate sets and requires only O(n) messages for support-count exchange for each candidate set, where n is the number of sites in a distributed database. The algorithm has been implemented on an experimental testbed, and its performance is studied. The results show that DMA has superior performance in distributed databases when compared with the direct application of a popular sequential algorithm.

11.

Data-driven discovery of quantitative rules in relational databases (total citations: 9; self-citations: 0; citations by others: 9)

Han J. Cai Y. Cercone N. 《Knowledge and Data Engineering, IEEE Transactions on》1993,5(1):29-40

A quantitative rule is a rule associated with quantitative information which assesses the representativeness of the rule in the database. An efficient induction method is developed for learning quantitative rules in relational databases. With the assistance of knowledge about concept hierarchies, data relevance, and expected rule forms, attribute-oriented induction can be performed on the database, which integrates database operations with the learning process and provides a simple, efficient way of learning quantitative rules from large databases. The method involves the learning of both characteristic rules and classification rules. Quantitative information facilitates quantitative reasoning, incremental learning, and learning in the presence of noise. Moreover, learning qualitative rules can be treated as a special case of learning quantitative rules. It is shown that attribute-oriented induction provides an efficient and effective mechanism for learning various kinds of knowledge rules from relational databases.

12.

Two estimation methods for association rules in incomplete databases

Wang Xin 《计算机应用》 (Journal of Computer Applications) 2004,24(8):63-65

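In relational databases, missing data is often unavoidable. The key problem in mining association rules from an incomplete database is how to estimate the support and confidence of the rules. This paper presents two estimation methods for association rule mining in incomplete databases and gives a brief comparison of them.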
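Support and confidence, the two quantities this abstract refers to, can be computed as in the Python sketch below. The abstract does not describe the paper's two estimation methods, so the treatment of missing values here (skipping every record that lacks a value for an attribute used by the rule) is only one simple assumption, and the function name and record layout are hypothetical.

```python
def support_confidence(records, antecedent, consequent):
    """Estimate support and confidence of antecedent => consequent (a sketch).

    records: list of dicts mapping attribute -> value, with None for missing.
    antecedent, consequent: dicts mapping attribute -> required value.
    Assumption: records missing any attribute used by the rule are ignored.
    """
    attrs = set(antecedent) | set(consequent)
    usable = [r for r in records if all(r.get(a) is not None for a in attrs)]
    if not usable:
        return 0.0, 0.0

    match_a = [r for r in usable if all(r[a] == v for a, v in antecedent.items())]
    match_both = [r for r in match_a if all(r[a] == v for a, v in consequent.items())]

    support = len(match_both) / len(usable)
    confidence = len(match_both) / len(match_a) if match_a else 0.0
    return support, confidence


# Hypothetical table with one missing value for attribute "y".
records = [
    {"x": 1, "y": 1},
    {"x": 1, "y": 0},
    {"x": 1, "y": None},
    {"x": 0, "y": 1},
    {"x": 1, "y": 1},
]
sup, conf = support_confidence(records, {"x": 1}, {"y": 1})
```

Four of the five records are usable for the rule x=1 => y=1; two of them satisfy both sides and three satisfy the antecedent, which gives a support of 0.5 and a confidence of 2/3 under this assumption.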

13.

Priority assignment in real-time active databases (total citations: 1; self-citations: 0; citations by others: 1)

Rajendran M. Sivasankaran John A. Stankovic Don Towsley Bhaskar Purimetla Krithi Ramamritham 《The VLDB Journal The International Journal on Very Large Data Bases》1996,5(1):19-34

Active databases and real-time databases have been important areas of research in the recent past. It has been recognized that many benefits can be gained by integrating real-time and active database technologies. However, not much work has been done in the area of transaction processing in real-time active databases. This paper deals with an important aspect of transaction processing in real-time active databases, namely the problem of assigning priorities to transactions. In these systems, time-constrained transactions trigger other transactions during their execution. We present three policies for assigning priorities to parent, immediate and deferred transactions executing on a multiprocessor system and then evaluate the policies through simulation. The policies use different amounts of semantic information about transactions to assign the priorities. The simulator has been validated against the results of earlier published studies. We conducted experiments in three settings: a task setting, a main memory database setting and a disk-resident database setting. Our results demonstrate that dynamically changing the priorities of transactions, depending on their behavior (triggering rules), yields a substantial improvement in the number of triggering transactions that meet their deadline in all three settings. Edited by Henry F. Korth and Amith Sheth. Received November 1994 / Accepted March 20, 1995

14.

On periodic resource scheduling for continuous-media databases

Minos N. Garofalakis Banu Özden Avi Silberschatz 《The VLDB Journal The International Journal on Very Large Data Bases》1998,7(4):206-225

The Enhanced Pay-Per-View (EPPV) model for providing continuous-media services associates with each continuous-media clip a display frequency that depends on the clip's popularity. The aim is to increase the number of clients that can be serviced concurrently beyond the capacity limitations of available resources, while guaranteeing a constraint on the response time. This is achieved by sharing periodic continuous-media streams among multiple clients. The EPPV model offers a number of advantages over other data-sharing schemes (e.g., batching), which make it more attractive to large-scale service providers. In this paper, we provide a comprehensive study of the resource-scheduling problems associated with supporting EPPV for continuous-media clips with (possibly) different display rates, frequencies, and lengths. Our main objective is to maximize the amount of disk bandwidth that is effectively scheduled under the given data layout and storage constraints. Our formulation gives rise to NP-hard combinatorial optimization problems that fall within the realm of hard real-time scheduling theory. Given the intractability of the problems, we propose novel heuristic solutions with polynomial-time complexity. We also present preliminary experimental results for the average case behavior of the proposed scheduling schemes and examine how they compare to each other under different workloads. A major contribution of our work is the introduction of a robust scheduling framework that, we believe, can provide solutions for a variety of realistic EPPV resource-scheduling scenarios, as well as any scheduling problem involving regular, periodic use of a shared resource. Based on this framework, we propose various interesting research directions for extending the results presented in this paper. Received June 9, 1998 / Accepted October 13, 1998

15.

Parallel mining of association rules from text databases

John D. Holt Soon M. Chung 《The Journal of supercomputing》2007,39(3):273-299

In this paper, we propose a new algorithm named Parallel Multipass with Inverted Hashing and Pruning (PMIHP) for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., sets of words) that need to be counted. The new PMIHP algorithm is a parallel version of our Multipass with Inverted Hashing and Pruning (MIHP) algorithm (Holt, Chung in: Proc of the 14th IEEE int’l conf on tools with artificial intelligence, 2002, pp 49–56), which was shown to be more efficient than other existing algorithms in the context of mining text databases. The PMIHP algorithm reduces the overhead of communication between miners running on different processors because they mine local databases asynchronously and prune the global candidates by using the Inverted Hashing and Pruning technique. Compared with the well-known Count Distribution algorithm (Agrawal, Shafer in: (1996) IEEE Trans Knowl Data Eng 8(6):962–969), PMIHP demonstrates superior performance characteristics for mining association rules in large text databases, and when the minimum support level is low, its speedup is superlinear as the number of processors increases. These experiments were performed on a cluster of Linux workstations using a collection of Wall Street Journal articles. This research was supported in part by Ohio Board of Regents, LexisNexis, and AFRL/Wright Brothers Institute (WBI).

16.

Real time fuzzy scheduling rules in FMS

Felix T. S. Chan H. K. Chan A. Kazerooni 《Journal of Intelligent Manufacturing》2003,14(3-4):341-350

This paper presents a real-time fuzzy expert system for scheduling parts in a flexible manufacturing system (FMS). First, some vagueness and uncertainties in scheduling rules are indicated, and then a fuzzy-logic approach is proposed to improve the system performance by considering multiple performance measures. This approach focuses on characteristics of the system's status, instead of parts, to assign priorities to the parts waiting to be processed. Second, a simulation model is developed, and it shows that the proposed fuzzy logic-based decision-making process keeps all performance measures at a good level. By making use of intelligent tools, the proposed approach provides a promising alternative to traditional rules for solving scheduling problems in FMSs.

17.

Analysis and optimization of active databases

Danilo Montesi Riccardo Torlone 《Data & Knowledge Engineering》2002,40(3):241-271

We introduce a new formal semantics for active databases that relies on a transaction rewriting technique. A user-defined transaction, which is viewed here as a sequence of atomic database updates forming a semantic atomic unit, is translated by means of active rules into induced one(s). These transactions embody active rule semantics which can be either immediate or deferred. Rule semantics, confluence, equivalence and optimization are then formally investigated and characterized in a solid framework that naturally extends a known model for relational database transactions.

18.

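Solving the multi-objective resource-constrained project scheduling problem with an ant colony algorithm: modification and comparison in combination with different scheduling rules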

《计算机工程与应用》 (Computer Engineering and Applications) 2017,(5):249-254

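Most of the existing literature studies single-objective project scheduling problems that minimize the project duration, while the multi-objective resource-constrained project scheduling problem (RCPSP), which jointly considers total project duration, total tardiness, and total tardiness cost, has rarely been explored. This paper constructs a multi-objective RCPSP model and combines ant colony optimization (ACO) with the AM scheduling rule, a local heuristic function proposed by synthesizing existing scheduling rules, to obtain the modified AM_ACO algorithm. A new pheromone update scheme is designed, and the Taguchi method is used to test and analyze the ACO parameter values. Finally, test instances from PSPLIB are used to compare and verify the solution quality and efficiency of the AM_ACO algorithm. The comparison confirms that the AM_ACO algorithm achieves higher solution quality and efficiency.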

19.

Effective iterative algorithms in scheduling theory

Ya. A. Zinder V. V. Shkurba 《Cybernetics and Systems Analysis》1985,21(1):86-90

20.

Effective periodic pattern mining in time series databases

Manziba Akanda Nishi Chowdhury Farhan Ahmed Md. Samiullah Byeong-Soo Jeong 《Expert systems with applications》2013,40(8):3015-3027

The goal of analyzing a time series database is to find whether and how frequently a periodic pattern is repeated within the series. Periodic pattern mining is the problem of finding such temporal regularity. However, most existing algorithms have a major limitation in mining patterns of interest to users: they can only mine patterns of a specific length in which all events occur sequentially, one after another, in exact positions within the pattern. There are, however, scenarios where a pattern can be flexible, that is, it may be interesting and can be mined by neglecting any number of unimportant events between important events, giving the pattern a variable length. Moreover, existing algorithms can detect only a specific type of periodicity in time series databases and require interaction from the user to determine periodicity. In this paper, we propose an algorithm for periodic pattern mining in time series databases that does not rely on the user for the period value or period type of the pattern and can detect all types of periodic patterns at the same time; these flexibilities are missing in existing algorithms. The proposed algorithm lets the user generate different kinds of patterns by skipping intermediate events in a time series database and find the periodicity of the patterns within the database. It improves on pattern generation using a suffix tree, because suffix-tree-based algorithms are weak in this particular area of pattern generation. Compared with the existing algorithms, the proposed algorithm improves the generation of different kinds of interesting patterns and detects whether a generated pattern is periodic or not.
We have tested the performance of our algorithm on both synthetic and real-life data from different domains and found a large number of interesting event sequences that were missed by existing algorithms; the proposed algorithm was efficient in generating flexible patterns and detecting their periodicity on both types of data.