首页 | 本学科首页   官方微博 | 高级检索  
     

基于差分隐私保护的DP-DBScan聚类算法研究
引用本文:吴伟民,黄焕坤.基于差分隐私保护的DP-DBScan聚类算法研究[J].计算机工程与科学,2015,37(4):830-834.
作者姓名:吴伟民  黄焕坤
作者单位:广东工业大学计算机学院,广东广州,510006
基金项目:广州市科技计划资助项目(2012Y2-00046)
摘    要:差分隐私保护是一种基于数据失真的隐私保护方法,通过添加随机噪声使敏感数据失真的同时也保证数据的统计特性。针对DBScan聚类算法在聚类分析过程中会泄露隐私的问题,提出一种新的基于差分隐私保护的DP-DBScan聚类算法。在满足ε-差分隐私保护的前提下,DP-DBScan聚类算法在基于密度的DBScan聚类算法上引入并实现了差分隐私保护。算法能够有效地保护个人隐私,适用于不同规模和不同维度的数据集。实验结果表明,与DBScan聚类算法相比,DP-DBScan聚类算法在添加少量随机噪声的情况下能保持聚类的有效性并获得差分隐私保护。

关 键 词:差分隐私  DBScan  DP-DBScan  隐私保护  数据挖掘
收稿时间:2014-01-13
修稿时间:2014-04-03

A DP-DBScan clustering algorithm based on differential privacy preserving
WU Wei-min , HUANG Huan-kun.A DP-DBScan clustering algorithm based on differential privacy preserving[J].Computer Engineering & Science,2015,37(4):830-834.
Authors:WU Wei-min  HUANG Huan-kun
Affiliation:(School of Computer,Guangdong University of Technology,Guangzhou 510006,China)
Abstract:Differential privacy preserving is a privacy preserving method based on data distortion,which protects the sensitive data and keeps the data statistical properties by adding random noise.To protect data privacy for the clustering process of DBScan, we present a novel DP-DBScan clustering algorithm in the framework of differential privacy preserving.Subjected to the restriction on ε differential privacy, the proposed DP-DBScan clustering algorithm can not only protect personal privacy effectively but can be applied to data sets of different sizes and dimensions.Experimental results show that,compared with the DBScan clustering method,the DP-DBScan clustering algorithm achieves clustering validity as well as differential privacy preserving when a small amount of noise are added.
Keywords:differential privacy  DBScan  DP-DBScan  privacy preserving  data mining
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号