Regression analysis of the number of association rules |
| |
Authors: | Wei-Guo Yi Ming-Yu Lu Zhi Liu |
| |
Affiliation: | 1. Department of Information Science and Technology, Dalian Maritime University, Dalian 116026, PRC2. Department of Software Institute, Dalian Jiaotong University, Dalian 116052, PRC |
| |
Abstract: | The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coefficients to test the fitting effects of the equations and uses significance test to verify whether the coefficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coefficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values. |
| |
Keywords: | Association rules regression analysis multiple correlation coefficients interest support confidence |
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录! |
| 点击此处可从《国际自动化与计算杂志》浏览原始摘要信息 |
|
点击此处可从《国际自动化与计算杂志》下载全文 |
|