首页 | 本学科首页   官方微博 | 高级检索  
     

不完全数据集的差分隐私保护决策树研究
引用本文:沈思倩,毛宇光,江冠儒.不完全数据集的差分隐私保护决策树研究[J].计算机科学,2017,44(6):139-143, 149.
作者姓名:沈思倩  毛宇光  江冠儒
作者单位:南京航空航天大学计算机科学与技术学院 南京211106,南京航空航天大学计算机科学与技术学院 南京211106;南京大学计算机软件新技术国家重点实验室 南京210093,卡尔斯鲁厄理工学院计算机系 巴登-符腾堡州76131
摘    要:主要研究在对不完全数据集进行决策树分析时,如何加入差分隐私保护技术。首先简单介绍了差分隐私ID3算法和差分隐私随机森林决策树算法;然后针对上述算法存在的缺陷和不足进行了修改,提出指数机制的差分隐私随机森林决策树算法;最后对于不完全数据集提出了一种新的WP(Weight Partition)缺失值处理方法,能够在不需要插值的情况下,使决策树分析算法既能满足差分隐私保护,也能拥有更高的预测准确率和适应性。实验证明,无论是Laplace机制还是指数机制,无论是ID3算法还是随机森林决策树算法,都能适用于所提方法。

关 键 词:差分隐私保护  不完全数据集  ID3算法  随机森林决策树
收稿时间:2016/5/22 0:00:00
修稿时间:2016/7/18 0:00:00

Method of Constructing Differential Privacy Decision Tree Classifier with Incomplete Data Sets
SHEN Si-qian,MAO Yu-guang and JIANG Guan-ru.Method of Constructing Differential Privacy Decision Tree Classifier with Incomplete Data Sets[J].Computer Science,2017,44(6):139-143, 149.
Authors:SHEN Si-qian  MAO Yu-guang and JIANG Guan-ru
Affiliation:College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China,College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China;State Key Laboratory for Novel Software Technology,Nanjing University,Nanjing 210093,China and Informatics,Karlsruhe Institute of Technology,Baden-Württemberg 76131,Germany
Abstract:We mainly studied the problem of constructing differential privacy decision tree classifier with incomplete data sets.We first introduced the differential privacy ID3 decision tree algorithm and differentially private random decision tree algorithm.Then we considered the weakness of the algorithms talked above,and created a new differentially private random decision tree algorithm with exponential mechanism.Finally,an approach for decision tree classifier with incomplete data sets was proposed,which yields better prediction while maintaining good privacy without inserting values,called WP(Weight Partition).And the experimental results show that our approach is suitable for either differential privacy ID3 decision trees or differentially private random decision trees,either laplace or exponential mechanism.
Keywords:Differential privacy  Incomplete data sets  ID3 decision tree algorithm  Random decision tree algorithm
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号