A reinforcement learning approach to stochastic business games

Authors:	Kiran Kumar Ravulapati, Jaideep Rao, Tapas K. Das

Affiliation:	^a Delta Technology, Atlanta, GA, USA; ^b Pilgrim Software, Tampa, FL, USA; ^c Department of Industrial and Management Systems Engineering, University of South Florida, Tampa, FL, USA

Abstract:	The Internet revolution has resulted in increased competition among providers of goods and services to lure customers by tearing down the barriers of time and distance. For example, a home buyer shopping for a mortgage loan through the Internet is now a potential customer for a large number of lending institutions throughout the world. The lenders (players, in generic game theory nomenclature) seeking to capture this customer are involved in a nonzero-sum stochastic game. Stochastic games are among the least studied and understood of the management science problems, and no computationally tractable solution technique is available for multi-player nonzero-sum stochastic games. We now develop a computer-simulation-based machine learning algorithm that can be used to solve nonzero-sum stochastic game problems that are modeled as competitive Markov decision processes. The methodology based on this algorithm is implemented on a supply chain inventory planning problem with a limited state space. The equilibrium reward obtained from the stochastic game problem is compared with a logical upper bound obtained from the corresponding Markov decision problem in which a single decision maker (player) is substituted for all the competing players in the game. Several numerical versions of the problem are studied to assess the performance of the methodology. The results obtained from our methodology for the inventory planning problems are within 0.8% of the upper bound.
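
The comparison described in the abstract, a game solution measured against the single-decision-maker upper bound, can be illustrated with a toy sketch. This is not the authors' algorithm: it swaps in plain independent Q-learning, assumes a discounted rather than any other reward criterion, and uses a hypothetical two-player, two-state game whose transition table `P` and reward table `R` are randomly generated. Independent learners are not guaranteed to reach an equilibrium, so the sketch only shows how the gap to the upper bound could be computed.

```python
import random

N_STATES, N_ACTIONS, GAMMA = 2, 2, 0.9   # assumed toy sizes and discount factor
rng = random.Random(0)

def random_dist(n):
    """Random probability distribution over n next states (hypothetical data)."""
    w = [rng.random() for _ in range(n)]
    t = sum(w)
    return [x / t for x in w]

# P[s][a1][a2]: next-state distribution; R[i][s][a1][a2]: reward to player i.
P = [[[random_dist(N_STATES) for _ in range(N_ACTIONS)]
      for _ in range(N_ACTIONS)] for _ in range(N_STATES)]
R = [[[[rng.uniform(0, 10) for _ in range(N_ACTIONS)]
       for _ in range(N_ACTIONS)] for _ in range(N_STATES)] for _ in range(2)]

def independent_q_learning(steps=50000, alpha=0.1, eps=0.1):
    """Each player runs Q-learning over its own actions only, by simulation."""
    Q = [[[0.0] * N_ACTIONS for _ in range(N_STATES)] for _ in range(2)]
    s = 0
    for _ in range(steps):
        acts = []
        for i in range(2):
            if rng.random() < eps:                      # explore
                acts.append(rng.randrange(N_ACTIONS))
            else:                                       # exploit
                acts.append(max(range(N_ACTIONS), key=lambda a: Q[i][s][a]))
        a1, a2 = acts
        nxt = rng.choices(range(N_STATES), weights=P[s][a1][a2])[0]
        for i in range(2):
            target = R[i][s][a1][a2] + GAMMA * max(Q[i][nxt])
            Q[i][s][acts[i]] += alpha * (target - Q[i][s][acts[i]])
        s = nxt
    # Greedy policy per player: policy[i][s] is player i's action in state s.
    return [[max(range(N_ACTIONS), key=lambda a: Q[i][s][a])
             for s in range(N_STATES)] for i in range(2)]

def total_backup(V, s, a1, a2):
    """One-step lookahead on the players' combined reward."""
    r = R[0][s][a1][a2] + R[1][s][a1][a2]
    return r + GAMMA * sum(p * v for p, v in zip(P[s][a1][a2], V))

def evaluate_total(policy, sweeps=500):
    """Discounted combined reward of a fixed joint policy."""
    V = [0.0] * N_STATES
    for _ in range(sweeps):
        V = [total_backup(V, s, policy[0][s], policy[1][s])
             for s in range(N_STATES)]
    return V

def centralized_optimum(sweeps=500):
    """Upper bound: one decision maker picks both actions (value iteration)."""
    V = [0.0] * N_STATES
    for _ in range(sweeps):
        V = [max(total_backup(V, s, a1, a2)
                 for a1 in range(N_ACTIONS) for a2 in range(N_ACTIONS))
             for s in range(N_STATES)]
    return V

policy = independent_q_learning()
game_value = evaluate_total(policy)
upper_bound = centralized_optimum()
# Percentage gap to the upper bound, per starting state.
gap_percent = [100.0 * (u - g) / u for g, u in zip(game_value, upper_bound)]
print(gap_percent)
```

The single decision maker optimizes over every joint action, so its value can never fall below that of any joint policy the two learners settle on; this is why it serves as a logical upper bound. The gap printed here depends entirely on the random toy data and says nothing about the 0.8% figure reported in the abstract.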

This article is indexed in InformaWorld and other databases.