首页 | 本学科首页   官方微博 | 高级检索  
     

一种保护原始数据的多属性值分类挖掘算法*
引用本文:李兴国,周志纯,刘辉. 一种保护原始数据的多属性值分类挖掘算法*[J]. 计算机应用研究, 2008, 25(8): 2332-2334
作者姓名:李兴国  周志纯  刘辉
作者单位:(合肥工业大学 管理学院, 合肥 230009)
基金项目:国家自然科学基金资助项目(70631003);安徽省人文社科基地研究项目(2006sk008)
摘    要:针对基于随机响应的隐私保护分类挖掘算法仅适用于原始数据属性值是二元的问题,设计了一种适用于多属性值原始数据的隐私保护分类挖掘算法。算法分为两个部分:a)通过比较参数设定值和随机产生数之间的大小,决定是否改变原始数据的顺序,以实现对原始数据进行变换,从而起到保护数据隐私性的目的;b)通过求解信息增益比例的概率估计值,在伪装后的数据上构造决策树。

关 键 词:数据挖掘; 隐私保护; 分类; 决策树

Algorithm for privacy preserving classification mining with multivariate data
LI Xing guo,ZHOU Zhi chun,LIU Hui. Algorithm for privacy preserving classification mining with multivariate data[J]. Application Research of Computers, 2008, 25(8): 2332-2334
Authors:LI Xing guo  ZHOU Zhi chun  LIU Hui
Affiliation:(School of Management, Hefei University of Technology, Hefei 230009, China)
Abstract:Randomized response technique was used in privacy preserving classification mining, and had acquired good results. But the method was only fit for binary data. To solve this problem, this paper dasigned an algorithm which was fit for multivariate data for privacy preserving classification mining. The algorithm divided into two parts. In the first part, compared the size of parameter and random generated number to decide whether the sequence of the original data should be changed or not, in order to disguise the original data and then protected the privacy of data. In the second part, estimated the value of gain ratio to build the decision tree on disguised data.
Keywords:data mining   privacy preserving   classification   decision tree
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号