首页 | 本学科首页   官方微博 | 高级检索  
     

基于遗传算法的Bayesian网中连续变量离散化的研究
引用本文:王飞,刘大有,薛万欣.基于遗传算法的Bayesian网中连续变量离散化的研究[J].计算机学报,2002,25(8):794-800.
作者姓名:王飞  刘大有  薛万欣
作者单位:1. 复旦大学计算机科学与工程系,上海,200433;复旦大学智能信息处理开放实验室,上海,200433
2. 吉林大学计算机科学与技术学院,长春,130012
基金项目:国家“八六三”高技术研究发展计划 ( 86 3-30 6 -ZD0 5 -0 1-2 ),国家自然科学基金 ( 6 98830 0 3),教育部高校博士点专项科研基金项目,教育部符号计算与知识工程重点实验室资助
摘    要:文中如何从含有离散变量和连续变量的混合数据中学习Bayesian网进行了研究,提出了一种基于遗传算法的连续变量散化算法,在该处中给出了兼顾离散模型准确度和复杂度的适应度函数;并基于对离散化的实质性分析,定义了离散策略等价的概念,由此制定了离散策略的编码方案;进一步设计了变换离散策略的遗传算法。算法不存在局部极值问题,且不需要事先给定变量序关系,模拟实验结果表明,该算法能有效地对连续变量散化,从而使得从混合数据中学到的Bayesian网具有较好性能。

关 键 词:遗传算法  Bayesian网  连续变量离散化  离散变量  学习算法
修稿时间:2001年6月11日

Discretizing Continuous Variables of Bayesian Networks Based on Genetic Algorithms
WANG Fei , LIU Da-You XUE Wan-Xin.Discretizing Continuous Variables of Bayesian Networks Based on Genetic Algorithms[J].Chinese Journal of Computers,2002,25(8):794-800.
Authors:WANG Fei  LIU Da-You XUE Wan-Xin
Affiliation:WANG Fei 1,2) LIU Da-You 3) XUE Wan-Xin 3) 1)
Abstract:Based on Genetic algorithm, this paper presents a discretization algorithm called DGA. Compared with univariate discretization that deals with the single variables individually, the result from DGA is more exact because it is a multivariate discretization method, whereby each variable is discretized taking into account its interaction with the other variables. Besides, it searches for the best discretization strategy by genetic operators, avoiding finding local maxima and specifying an ordering between the variables in advance which are inevitable for deterministic search adopted by previous discretization algorithms. In this paper, (1) fitness function is given that pays attention to not only accuracy and concision of discretization model, but also accuracy and concision of learned structure; (2) encoding is described by giving definition of discretization police equivalence based on essential of discretization strategy; (3) genetic operators are designed that switch individuals to evolve good discretization policy. Experimental results show that this algorithm can effectively discretize continuous variables so that the learned structure takes on good performance.
Keywords:learning Bayesian networks  discrete variable  continuous variable  discretization  genetic algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号