首页 | 本学科首页   官方微博 | 高级检索  
     


Privacy-preserving distributed projected one-point bandit online optimization over directed graphs
Authors:Mengli Wei  Zhiqiang Yang  Qiutong Ji  Zhongyuan Zhao
Affiliation:1. School of Cyber Science and Engineering, Southeast University, Nanjing, China;2. College of Automation, Nanjing University of Information Science and Technology, Nanjing, China

Contribution: Data curation, ?Investigation, Writing - original draft, Writing - review & editing;3. School of Cyber Science and Engineering, Southeast University, Nanjing, China

Contribution: Data curation, Formal analysis, ?Investigation, Software;4. College of Automation, Nanjing University of Information Science and Technology, Nanjing, China

Abstract:The distributed online optimization (DOO) problem with privacy-preserving properties over multiple agents is considered in this paper, where the network model is built by a strongly connected directed graph. To solve this problem, a stochastic bandit DOO algorithm based on differential privacy is proposed. This algorithm uses row- and column-stochastic matrix as the weighting matrices, the requirement of the double random weighting matrix is released. To handle the unknown objective function, the one-point bandit is used to estimate the true gradient information, and the estimated gradient information is used to update of decision variables. Different from the existing DOO algorithms that ignore privacy issues, this algorithm successfully protects the privacy of nodes through a differential privacy policy. Theoretical results show that the algorithm can not only achieve sublinear regret bounds but also protect the privacy of nodes. Finally, simulation results verify the effectiveness of the algorithm.
Keywords:differential privacy  distributed online optimization  multi-agent systems  one-point bandit feedback
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号