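A Method for Evaluating the Quality of Self-Play Game Learning Examples (一种自对弈棋局学习样例质量评价方法)
Cite this article: JI Bo, YOU Hui-bin, LU Hong-xing, TIAN Xin, LIU Hong-chuan. A method for evaluating the quality of self-play game learning examples [J]. Mini-micro Systems (小型微型计算机系统), 2021(3): 467-471.
Authors: JI Bo  YOU Hui-bin  LU Hong-xing  TIAN Xin  LIU Hong-chuan
Affiliations: School of Information Engineering, Zhengzhou University; Fourth Generation of Industry Research Institute, Industrial Technology Research Institute, Zhengzhou University
Funding: Supported by the National Key R&D Program of China (2018YFB1201403) and the National Natural Science Foundation of China (61772475, 61502434).
Abstract: In computer board-game learning, self-play learning means learning that relies only on the course of play and the final win or loss. Apart from the rules of the game, no domain knowledge is preset at any point, and there is no expert guidance. Although self-play learning based on the minimax algorithm, α-β pruning, and Monte Carlo search has achieved outstanding results, targeted research on evaluating the quality of learning examples is still lacking. This paper therefore proposes, for the first time, a method for evaluating the quality of self-play game learning examples. The method uses a composite sample-size indicator T (a linear combination of the example repetition rate and the number of examples) to decide how large the set of learning examples should be. Experiments on checkers show that the method effectively controls the size of the learning example set and substantially reduces the computational cost of generating learning examples without degrading the learning result.

Keywords: computer game playing; self-play; checkers; example quality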

Method for Evaluating Quality of Self-play Chess Game Learning Examples
JI Bo, YOU Hui-bin, LU Hong-xing, TIAN Xin, LIU Hong-chuan. Method for Evaluating Quality of Self-play Chess Game Learning Examples [J]. Mini-micro Systems, 2021(3): 467-471.
Authors:JI Bo  YOU Hui-bin  LU Hong-xing  TIAN Xin  LIU Hong-chuan
Affiliation: (School of Information Engineering, Zhengzhou University, Zhengzhou 450001, China; Fourth Generation of Industry Research Institute, Zhengzhou University, Zhengzhou 450001, China)
Abstract: In computer board-game learning, self-play learning refers to learning that relies only on the moves played and the final win or loss. Apart from the rules of the game, no domain knowledge is preset anywhere in the process, and there is no expert guidance. Although self-play learning based on the minimax algorithm, α-β pruning, and Monte Carlo search has achieved excellent results, targeted research on evaluating the quality of learning examples is still lacking. This paper therefore proposes, for the first time, a method for evaluating the quality of self-play game learning examples. The method uses a composite sample-size indicator T, a linear combination of the example repetition rate and the number of examples, to determine the size of the learning example set. Experiments on checkers show that the method effectively controls the size of the learning example set and greatly reduces the computational cost of generating learning examples without degrading the learning result.
Keywords: computer game; self-play; checkers; sample quality
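The abstract describes the indicator T only as a linear combination of the example repetition rate and the number of examples; it does not give the weights, the normalization, or the exact definition of repetition. The sketch below is therefore an illustration under stated assumptions, not the authors' formula: the names `repetition_rate` and `size_indicator`, the weights `w_repeat` and `w_count`, the cap `max_samples`, and the string encoding of board positions are all hypothetical.

```python
from collections import Counter
from typing import Hashable, Iterable, Sequence


def repetition_rate(samples: Iterable[Hashable]) -> float:
    """Fraction of samples that duplicate another sample (0.0 if empty).

    Assumed definition: 1 - (distinct samples / total samples).
    """
    counts = Counter(samples)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return 1.0 - len(counts) / total


def size_indicator(
    samples: Sequence[Hashable],
    max_samples: int,
    w_repeat: float = 0.5,  # hypothetical weight, not from the paper
    w_count: float = 0.5,   # hypothetical weight, not from the paper
) -> float:
    """Hypothetical T: weighted sum of repetition rate and normalized count."""
    normalized_count = min(len(samples) / max_samples, 1.0)
    return w_repeat * repetition_rate(samples) + w_count * normalized_count


# Toy data: board positions encoded as strings (hypothetical encoding).
positions = ["a", "b", "a", "c", "a", "b"]
t_value = size_indicator(positions, max_samples=12)
print(f"repetition = {repetition_rate(positions):.2f}, T = {t_value:.2f}")
```

One plausible use, again an assumption rather than something the abstract states, is to keep generating self-play games until T crosses a chosen threshold, on the reasoning that a high repetition rate signals that further games add few new positions.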
This article is indexed in VIP (维普) and other databases.