首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的连续属性全局离散化算法
引用本文:石红,沈毅,刘志言.一种改进的连续属性全局离散化算法[J].电机与控制学报,2004,8(3):268-270,288.
作者姓名:石红  沈毅  刘志言
作者单位:哈尔滨工业大学,控制科学与工程系,黑龙江,哈尔滨,150001
摘    要:连续属性的离散化问题是粗糙集理论研究的一个重要内容,通过对一种局部离散化方法的改进,提出了全局的离散化算法。利用粗糙集理论,首先定义一致性的度量(辨别函数),修改了基于“最小描述长度准则”的离散化算法,实现了全局离散,弥补了前者引入不一致的缺陷;在保持数据一致性的前提下,进一步分析了离散中分割点的冗金并进行了约简。实验通过基于粗糙集的分类工具,在几组典型数据集上得到了预期的满意结果,验证了该算法的有效性。

关 键 词:离散化  连续属性  粗糙集  最小描述长度  一致性
文章编号:1007-449X(2004)03-0268-03
修稿时间:2004年6月5日

Improved global discretization algorithm of continuous attributes
SHI HOng,SHEN Yi,LIU Zhi-yan.Improved global discretization algorithm of continuous attributes[J].Electric Machines and Control,2004,8(3):268-270,288.
Authors:SHI HOng  SHEN Yi  LIU Zhi-yan
Affiliation:SHI HOng,SHEN Yi,LIU Zhi-yan Department of Control Science and Engineering,Harbin Institute of Technology,Harbin 150001,China
Abstract:The problem of discretization of continuous attributes is an important issue in the research of rough sets theory. By modifying the local method that is based on the MDLPC criterion with the help of rough sets theory, a global discretization algorithm is proposed. In the first stage, it modifies the criterion of selecting the best cut point in the MDLPC method, and makes the MDLPC method globalized by introducing inconsistency checking based on rough set theory to preserve the fidelity of the original data. Then the reduction of cut points is performed, which will not change the consistency level and lead to small size learning model. The algorithm is tested on several data sets, and the results are satisfactory, which proved its effectiveness.
Keywords:discretization  continuous attributes  rough sets  MDLPC  consistency
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号