首页 | 本学科首页   官方微博 | 高级检索  
     

机器学习中的隐私攻击与防御
引用本文:刘睿瑄,陈红,郭若杨,赵丹,梁文娟,李翠平.机器学习中的隐私攻击与防御[J].软件学报,2020,31(3):866-892.
作者姓名:刘睿瑄  陈红  郭若杨  赵丹  梁文娟  李翠平
作者单位:数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872;数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872;数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872;数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872;数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872;数据工程与知识工程教育部重点实验室(中国人民大学),北京100872;中国人民大学信息学院,北京 100872
基金项目:国家重点研发计划(2018YFB1004401),国家自然科学基金(61532021,61772537,61772536,61702522)
摘    要:大数据时代丰富的信息来源促进了机器学习技术的蓬勃发展,然而机器学习模型的训练集在数据采集、模型训练等各个环节中存在的隐私泄露风险,为人工智能环境下的数据管理提出了重大挑战.传统数据管理中的隐私保护方法无法满足机器学习中多个环节、多种场景下的隐私保护要求.分析并展望了机器学习技术中隐私攻击与防御的研究进展和趋势.首先介绍了机器学习中隐私泄露的场景和隐私攻击的敌手模型,并根据攻击者策略分类梳理了机器学习中隐私攻击的最新研究;介绍了当前机器学习隐私保护的主流基础技术,进一步分析了各技术在保护机器学习训练集隐私时面临的关键问题,重点分类总结了5种防御策略以及具体防御机制;最后展望了机器学习技术中隐私防御机制的未来方向和挑战.

关 键 词:数据管理  机器学习  隐私保护  隐私攻击
收稿时间:2019/7/19 0:00:00
修稿时间:2019/9/10 0:00:00

Survey on Privacy Attacks and Defenses in Machine Learning
LIU Rui-Xuan,CHEN Hong,GUO Ruo-Yang,ZHAO Dan,LIANG Wen-Juan and LI Cui-Ping.Survey on Privacy Attacks and Defenses in Machine Learning[J].Journal of Software,2020,31(3):866-892.
Authors:LIU Rui-Xuan  CHEN Hong  GUO Ruo-Yang  ZHAO Dan  LIANG Wen-Juan and LI Cui-Ping
Affiliation:School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China,School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China,School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China,School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China,School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China and School of Information, Renmin University of China, Beijing 100872, China;Key Laboratory of Data Engineering and Knowledge Engineering of Ministry of Education(Renmin University of China), Beijing 100872, China
Abstract:In the era of big data, a rich source of data prompts the development of machine learning technology. However, risks of privacy leakage of models'' training data in data collecting and training stages pose essential challenges to data management in the artificial intelligence age. Traditional privacy preserving methods of data management and analysis could not satisfy the complex privacy problems in various stages and scenarios of machine learning. This paper surveys the state-of-the-art works of privacy attacks and defenses in machine learning. On the one hand, scenarios of privacy leakage and adversarial models of privacy attacks are illustrated. Also, specific works of privacy attacks are classified with respect to adversarial strategies. On the other hand, 3 main technologies which are commonly applied in privacy preserving of machine learning are introduced and key problems of their applications are pointed out. In addition, 5 defense strategies and corresponding specific mechanisms are elaborated. Finally, future works and challenges of privacy preserving in machine learning are concluded.
Keywords:data management  machine learning  privacy preserving  privacy attacks
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号