Optimal mixed block withholding attacks based on reinforcement learning

Authors:	Yilei Wang, Guoyu Yang, Tao Li, Lifeng Zhang, Yanli Wang, Lishan Ke, Yi Dou, Shouzhe Li, Xiaomei Yu

Affiliation:	1. School of Information Science and Engineering, Qufu Normal University, Rizhao, China; 2. School of Information and Electrical Engineering, Ludong University, Yantai, China; 3. School of Computer Science and Cyber Engineering, Guangzhou University, Guangzhou, China; 4. Department of Computing, Hong Kong Polytechnic University, Hong Kong, China; 5. Department of Computer Science, University of Wisconsin-Madison, Madison, Wisconsin, USA; 6. School of Information Science and Engineering, Shandong Normal University, Jinan, China

Abstract:	Vulnerabilities in cryptocurrencies facilitate adversarial attacks, and attackers therefore have incentives to increase their rewards through strategic behaviors. Block withholding (BWH) attacks are one such behavior: attackers withhold blocks in target pools to subvert the blockchain ecosystem. Furthermore, BWH attacks may overwhelm countermeasures when combined with selfish mining or other strategic behaviors, as in fork after withholding (FAW) attacks and power adaptive withholding (PAW) attacks. In other words, attackers may be intelligent enough to dynamically adapt their behaviors toward optimal attacking strategies. In this paper, we propose mixed-BWH attacks for intelligent attackers, who leverage reinforcement learning to determine the strategic behaviors that maximize their rewards. More specifically, the intelligent attackers strategically switch among BWH, FAW, and PAW attacks, aiming to find the behaviors that yield maximal rewards. The choice of attacking actions is formalized as a Markov decision process and solved with reinforcement learning. The simulation results show that, for the attackers, the rewards of the mixed strategy are much higher than those of the honest strategy. Therefore, the attackers have sufficient incentive to adopt the mixed strategy.

Keywords:	block withholding attacks; Markov chain; reinforcement learning; strategic behaviors
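
The idea in the abstract, an attacker that learns when to switch among BWH, FAW, and PAW by treating the choice as a Markov decision process, can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' model: the two states (`no_fork`, `fork`), the reward table `MEAN_REWARD`, the transition probability `FORK_PROB`, and all hyperparameters are hypothetical placeholders chosen only to make the example run. No mining dynamics are simulated.

```python
import random

# The three attack modes named in the abstract.
ACTIONS = ["BWH", "FAW", "PAW"]

# Hypothetical coarse states; the paper's actual state space is not given here.
STATES = ["no_fork", "fork"]

# Hypothetical placeholder mean rewards per (state, action).
# These are NOT results from the paper.
MEAN_REWARD = {
    ("no_fork", "BWH"): 1.0,
    ("no_fork", "FAW"): 0.6,
    ("no_fork", "PAW"): 0.8,
    ("fork", "BWH"): 0.4,
    ("fork", "FAW"): 1.2,
    ("fork", "PAW"): 0.9,
}

# Hypothetical probability that the next round starts in the "fork" state.
FORK_PROB = 0.3


def step(state, action, rng):
    """Return a noisy placeholder reward and a randomly drawn next state."""
    reward = MEAN_REWARD[(state, action)] + rng.gauss(0.0, 0.05)
    next_state = "fork" if rng.random() < FORK_PROB else "no_fork"
    return reward, next_state


def q_learning(steps=20000, alpha=0.05, gamma=0.9, epsilon=0.1, seed=0):
    """Run epsilon-greedy tabular Q-learning on the toy MDP above."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    state = "no_fork"
    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        reward, next_state = step(state, action, rng)
        # Standard Q-learning update toward the bootstrapped target.
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
        state = next_state
    return q


def greedy_policy(q):
    """Pick the highest-valued attack mode in each state."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in STATES}


if __name__ == "__main__":
    print(greedy_policy(q_learning()))
```

Because the placeholder transitions do not depend on the chosen action, the learned policy should simply recover the highest-reward action in each state. A realistic model would need states and rewards derived from pool sizes, attacker hash power, and fork outcomes, which this sketch leaves out.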
