首页 | 本学科首页   官方微博 | 高级检索  
     


Mining optimized support rules for numeric attributes
Affiliation:1. Bell Laboratories, Murray Hill, NJ 07974, USA;2. Korea Advanced Institute of Science and Technology and Advanced Information Technology Research Center, Taejon 305-701, South Korea;1. Dept. of Civil Engrg., Inje Univ., 197 Inje-ro, Gimhae, Gyeongnam 50834, South Korea;2. Dept. of Environ. Engrg., Nakdong River Environmental Research Center, Inje Univ., 197 Inje-ro, Gimhae, Gyeongnam 50834, South Korea;3. School of Sci. & Engrg., Univ. of Dundee, Perth Rd., Dundee DD1 4HN, UK;4. Dept. of Environ. Sci. & Engrg., Inje Univ., 197 Inje-ro, Gimhae, Gyeongnam 50834, South Korea;1. Research & Development Division, SK Hynix Semiconductor Inc., San 136-1, Ami-ri, Bubal-eub, Icheon-si, Kyoungki-do 467-701, Republic of Korea;2. Center for Artificial Low Dimensional Electronic Systems, Institute for Basic Science (IBS), 77 Cheongam-ro, Pohang 790-784, Republic of Korea
Abstract:Mining association rules on large data sets have received considerable attention in recent years. Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support, confidence or gain of the rule is maximized. In this paper, we generalize the optimized support association rule problem by permitting rules to contain disjunctions over uninstantiated numeric attributes. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving the uninstantiated attribute. For rules containing a single numeric attribute, we present a dynamic programming algorithm for computing optimized association rules. Furthermore, we propose bucketing technique for reducing the input size, and a divide and conquer strategy that improves the performance significantly without sacrificing optimality. We also present approximation algorithms based on dynamic programming for two numeric attributes. Our experimental results for a single numeric attribute indicate that our bucketing and divide and conquer enhancements are very effective in reducing the execution times and memory requirements of our dynamic programming algorithm. Furthermore, they show that our algorithms scale up almost linearly with the attribute's domain size as well as the number of disjunctions.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号