采用时间差分算法的九路围棋机器博弈系统 A 9×9 Go computer game system using temporal difference期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

采用时间差分算法的九路围棋机器博弈系统

引用本文：	张小川,唐艳,梁宁宁.采用时间差分算法的九路围棋机器博弈系统[J].智能系统学报,2012,7(3):278-282.

作者姓名：	张小川唐艳梁宁宁

作者单位：	重庆理工大学计算机科学与工程学院,重庆,400054

基金项目：	重庆市教委科研项目(KJ120824);重庆市自然科学基金资助项目(2007BB2415)

摘要：	围棋机器博弈是机器博弈中重要的分支之一,其庞大的博弈空间给机器博弈研究者带来了巨大挑战.目前围棋机器博弈多采用静态估值搜索与蒙特卡洛树搜索,故将时间差分算法引入至九路围棋机器博弈系统中,提出基于时间差分算法的围棋机器博弈系统模型,该博弈系统具有一定的自学习能力,能在不断的对弈中逐步提高博弈能力.通过与采用α-β搜索算法的博弈系统进行实际对弈,证明了该方法的可行性.
关键词：	机器博弈九路围棋围棋机器博弈时间差分算法
A 9×9 Go computer game system using temporal difference

ZHANG Xiaochuan , TANG Yan , LIANG Ningning.A 9×9 Go computer game system using temporal difference[J].CAAL Transactions on Intelligent Systems,2012,7(3):278-282.

Authors:	ZHANG Xiaochuan TANG Yan LIANG Ningning

Affiliation:	(College of Computer Science and Engineering,Chongqing University of Technology,Chongqing 400054,China)

Abstract:	Computer Go is an important branch of computer games and presents great challenges to computer game researchers due to its need for huge game space.Presently,the static evaluation method and the Monte-Carlo tree search method are widely used in Go computer games.In this paper,a temporal difference algorithm was introduced to the 9×9 Go computer game system which gave it self-learning capability,thereby improving the game levels as a result of the continuous training.Through playing chess with a system which adopts an α-β algorithm,the new method was proven to be effective.

Keywords:	computer game 9×9 Go Go computer game temporal difference
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏