首页 | 本学科首页   官方微博 | 高级检索  
     


Mining non-redundant diverse patterns: an information theoretic perspective
Authors:Chaofeng Sha   Jian Gong  Aoying Zhou
Affiliation:(1) Department of Computer Science, University of Rochester, Rochester, NY, 14627;(2) Oracle Corporation, 500 Oracle Parkway, M/S 4op9, Redwood Shores, CA, 94065
Abstract:The discovery of diversity patterns from binary data is an important data mining task. In this paper, we propose the problem of mining highly diverse patterns called non-redundant diversity patterns (NDPs). In this framework, entropy is adopted to measure the diversity of itemsets. In addition, an algorithm called NDP miner is proposed to exploit both monotone properties of entropy diversity measure and pruning power for the efficient discovery of non-redundant diversity patterns. Finally, our experimental results are given to show that the NDP miner can efficiently identify non-redundant diversity patterns.
Keywords:
本文献已被 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号