
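An Effective Method for Privacy-Preserving Association Rule Mining
Cite this article: ZHANG Peng, TONG Yun-Hai, TANG Shi-Wei, YANG Dong-Qing, MA Xiu-Li. An Effective Method for Privacy Preserving Association Rule Mining [J]. Journal of Software, 2006, 17(8): 1764-1774.
Authors: ZHANG Peng  TONG Yun-Hai  TANG Shi-Wei  YANG Dong-Qing  MA Xiu-Li
Affiliations: 1. School of Electronics Engineering and Computer Science, Peking University, Beijing 100871
2. School of Electronics Engineering and Computer Science, Peking University, Beijing 100871; National Laboratory on Machine Perception (Peking University), Beijing 100871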
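Abstract: Privacy preservation is an important research problem in data mining. Its goal is to obtain accurate models and analysis results without precise access to the true original data. To improve both the degree of protection for private data and the accuracy of the mining results, this paper proposes an effective method for privacy-preserving association rule mining. First, it combines the two basic privacy-preserving strategies, data perturbation and query restriction, into a new randomized data processing method, Randomized Response with Partial Hiding (RRPH), which transforms and hides the original data. Then, on this basis, it gives a simple and efficient frequent itemset generation algorithm for data processed by RRPH, and thereby achieves privacy-preserving association rule mining. Theoretical analysis and experimental results both show that the RRPH-based method performs well in terms of privacy, accuracy, efficiency, and applicability.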

Keywords: privacy preservation  data mining  association rule  frequent itemset  randomized response
Received: 2005-01-27
Revised: 2006-01-09

An Effective Method for Privacy Preserving Association Rule Mining
ZHANG Peng, TONG Yun-Hai, TANG Shi-Wei, YANG Dong-Qing and MA Xiu-Li. An Effective Method for Privacy Preserving Association Rule Mining [J]. Journal of Software, 2006, 17(8): 1764-1774.
Authors:ZHANG Peng  TONG Yun-Hai  TANG Shi-Wei  YANG Dong-Qing and MA Xiu-Li
Affiliation: 1. School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China; 2. National Laboratory on Machine Perception (Peking University
Abstract: Privacy preservation is one of the most important topics in data mining. The purpose is to discover accurate patterns without precise access to the original data. To improve both privacy preservation and mining accuracy, an effective method for privacy preserving association rule mining is presented in this paper. First, a new data preprocessing approach, Randomized Response with Partial Hiding (RRPH), is proposed. In this approach, the two privacy preserving strategies, data perturbation and query restriction, are combined to transform and hide the original data. Then, a privacy preserving association rule mining algorithm based on RRPH is presented. As shown in the theoretical analysis and the experimental results, privacy preserving association rule mining based on RRPH can achieve significant improvements in terms of privacy, accuracy, efficiency, and applicability.
Keywords:privacy preservation  data mining  association rule  frequent itemset  randomized response
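
The abstract does not state RRPH's randomization rule or parameters, so the following is only an illustrative sketch of the general randomized-response idea that such methods build on, not the authors' algorithm. It assumes each true 0/1 item value is reported truthfully with probability `p_keep`, replaced by 1 with probability `p_one`, and otherwise replaced by 0. The names `perturb`, `estimate_support`, `p_keep`, and `p_one` are hypothetical and do not come from the paper. Under that assumption the expected fraction of reported 1s is `p_keep * support + p_one`, which can be inverted to estimate the true support of a single item.

```python
import random


def perturb(bit, p_keep, p_one, rng):
    """Return the reported value for one true 0/1 item value.

    Assumed scheme (not taken from the paper): keep the true value with
    probability p_keep, report 1 with probability p_one, else report 0.
    """
    u = rng.random()
    if u < p_keep:
        return bit  # report the true value
    if u < p_keep + p_one:
        return 1    # report 1 regardless of the truth
    return 0        # report 0 regardless of the truth


def estimate_support(reported, p_keep, p_one):
    """Estimate the true fraction of 1s from the perturbed reports.

    E[observed] = p_keep * support + p_one, so we invert that relation.
    """
    observed = sum(reported) / len(reported)
    return (observed - p_one) / p_keep


# Synthetic example: 30% of 10,000 transactions contain the item.
rng = random.Random(0)
true_bits = [1] * 3000 + [0] * 7000
reported = [perturb(b, 0.6, 0.2, rng) for b in true_bits]
estimate = estimate_support(reported, 0.6, 0.2)
print(f"estimated support: {estimate:.3f}")
```

The miner only ever sees `reported`, yet the estimate should land close to the true support of 0.3 on a sample this large. Extending the inversion from single items to itemsets is what a frequent itemset algorithm for perturbed data has to handle, and that part is not reproduced here.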
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.