Surrogate maximization/minimization algorithms and extensions |
| |
Authors: | Zhihua Zhang James T. Kwok Dit-Yan Yeung |
| |
Affiliation: | (1) Faculty of Information Technology, Monash University, PO Box 75, Clayton, Vic., 3800, Australia |
| |
Abstract: | Pattern discovery techniques, such as association rule discovery, explore large search spaces of potential patterns to find
those that satisfy some user-specified constraints. Due to the large number of patterns considered, they suffer from an extreme
risk of type-1 error, that is, of finding patterns that appear due to chance alone to satisfy the constraints on the sample
data. This paper proposes techniques to overcome this problem by applying well-established statistical practices. These allow
the user to enforce a strict upper limit on the risk of experimentwise error. Empirical studies demonstrate that standard
pattern discovery techniques can discover numerous spurious patterns when applied to random data and when applied to real-world
data result in large numbers of patterns that are rejected when subjected to sound statistical evaluation. They also reveal
that a number of pragmatic choices about how such tests are performed can greatly affect their power.
Editor: Johannes Fürnkranz.
An erratum to this article can be found at |
| |
Keywords: | Pattern discovery Statistical evaluation Association rules |
本文献已被 SpringerLink 等数据库收录! |
|