Efficient Rule-Based Attribute-Oriented Induction for Data Mining |
| |
Authors: | David W. Cheung H.Y. Hwang Ada W. Fu Jiawei Han |
| |
Affiliation: | (1) Department of Computer Science and Information Systems, The University of Hong Kong, Hong Kong;(2) Department of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong;(3) Department of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong;(4) School of Computing Science, Simon Fraser University, Canada |
| |
Abstract: | Data mining has become an important technique which has tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique and has been successfully implemented in the data mining system DBMiner (Han et al. Proc. 1996 Int'l Conf. on Data Mining and Knowledge Discovery (KDD'96), Portland, Oregon, 1996). However, its induction capability is limited by the unconditional concept generalization. In this paper, we extend the concept generalization to rule-based concept hierarchy, which enhances greatly its induction power. When previously proposed induction algorithm is applied to the more general rule-based case, a problem of induction anomaly occurs which impacts its efficiency. We have developed an efficient algorithm to facilitate induction on the rule-based case which can avoid the anomaly. Performance studies have shown that the algorithm is superior than a previously proposed algorithm based on backtracking. |
| |
Keywords: | data mining knowledge discovery in databases rule-based concept generalization rule-based concept hierarchy attribute-oriented induction inductive learning learning and adaptive systems |
本文献已被 SpringerLink 等数据库收录! |
|