首页 | 本学科首页   官方微博 | 高级检索  
     

基于属性分割的差分隐私异构多属性数据发布
引用本文:张小玉,沈国华,杨阳.基于属性分割的差分隐私异构多属性数据发布[J].计算机系统应用,2022,31(10):225-235.
作者姓名:张小玉  沈国华  杨阳
作者单位:南京航空航天大学 计算机科学与技术学院, 南京 211106;南京航空航天大学 高安全系统的软件开发与验证技术工业和信息化部重点实验室, 南京 211106;南京航空航天大学 计算机科学与技术学院, 南京 211106;南京航空航天大学 高安全系统的软件开发与验证技术工业和信息化部重点实验室, 南京 211106;南京大学 软件新技术与产业化协同创新中心, 南京 210093
基金项目:国家自然科学基金(61772270)
摘    要:针对现有多属性数据隐私发布方法无法兼顾属性的敏感性差异和计算效率低的问题, 提出了一种基于属性分割的差分隐私异构多属性数据发布方法HMPrivBayes. 首先, 设计了满足差分隐私的谱聚类算法分割原始数据集, 其中相似矩阵的生成借助于属性最大信息系数. 其次, 借助属性信息, 该方法使用满足差分隐私的改进贝叶斯网络构建算法分别为每个数据子集构建贝叶斯网络. 最后, 以属性归一化风险熵为权重分配隐私预算, 对贝叶斯网络提取的属性联合分布添加异构噪声扰动, 实现了异构多属性数据保护. 实验结果表明, HMPrivBayes可以在减少注入合成数据集中噪声量的同时, 提高合成数据计算效率.

关 键 词:差分隐私  异构多属性数据发布  谱聚类  属性分割  贝叶斯网络  隐私保护
收稿时间:2022/1/6 0:00:00
修稿时间:2022/2/17 0:00:00

Differentially Private Heterogeneous Multi-attribute Data Publication via Attribute Segmentation
ZHANG Xiao-Yu,SHEN Guo-Hu,YANG Yang.Differentially Private Heterogeneous Multi-attribute Data Publication via Attribute Segmentation[J].Computer Systems& Applications,2022,31(10):225-235.
Authors:ZHANG Xiao-Yu  SHEN Guo-Hu  YANG Yang
Affiliation:College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;Key Laboratory of Safety-critical Software, Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;Key Laboratory of Safety-critical Software, Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing University, Nanjing 210093, China
Abstract:Multi-attribute data privacy publication fails to balance the difference in attribute sensitivity and computational efficiency. For this reason, HMPrivBayes, a heterogeneous multi-attribute data publishing method with differential privacy based on attribute segmentation, is proposed. Firstly, the spectral clustering algorithm satisfying differential privacy is designed to segment the original data set, in which the similarity matrix is generated by the attribute maximum information coefficient. Secondly, with the help of attribute information, this method uses an improved Bayesian network construction algorithm to build Bayesian networks for each data subset. Finally, HMPrivBayes adds heterogeneous noise disturbance to the attribute joint distribution extracted from the Bayesian network to realize the protection of heterogeneous multi-attribute data, in which privacy budget is allocated based on the normalized risk entropy of attribute. The experimental results show that HMPrivBayes not only reduces the added noise but also improves the computational efficiency of synthetic data.
Keywords:differential privacy  heterogeneous multi-attribute data publishing  spectral clustering  attribute segmentation  Bayesian network  privacy protection
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号