首页 | 本学科首页   官方微博 | 高级检索  
     

用于不平衡数据分类的0阶TSK型模糊系统
引用本文:顾晓清,蒋亦樟,王士同.用于不平衡数据分类的0阶TSK型模糊系统[J].自动化学报,2017,43(10):1773-1788.
作者姓名:顾晓清  蒋亦樟  王士同
作者单位:1.江南大学数字媒体学院 无锡 214122
基金项目:国家自然科学基金61572236中央高校基本科研业务费专项资金资助项目JUSRP51614A江苏省自然科学基金资助BK20160187国家自然科学基金61502058国家自然科学基金61572085
摘    要:处理不平衡数据分类时,传统模糊系统对少数类样本识别率较低.针对这一问题,首先,在前件参数学习上,提出了竞争贝叶斯模糊聚类(Bayesian fuzzy clustering based on competitive learning,BFCCL)算法,BFCCL算法考虑不同类别样本聚类中心间的排斥作用,采用交替迭代的执行方式并通过马尔科夫蒙特卡洛方法获得模型参数最优解.其次,在后件参数学习上,基于大间隔的策略并通过参数调节使得少数类到分类面的距离大于多数类到分类面的距离,该方法能有效纠正分类面的偏移.基于上述思想以0阶TSK型模糊系统为具体研究对象构造了适用于不平衡数据分类问题的0阶TSK型模糊系统(0-TSK-IDC).人工和真实医学数据集实验结果表明,0-TSK-IDC在不平衡数据分类问题中对少数类和多数类均具有较高的识别率,且具有良好的鲁棒性和可解释性.

关 键 词:不平衡数据    分类    马尔科夫蒙特卡洛    Takagi-Sugeno-Kang型模糊系统
收稿时间:2016-02-29

Zero-order TSK-type Fuzzy System for Imbalanced Data Classification
Affiliation:1.School of Digital Media, Jiangnan University, Wuxi 2141222.School of Information Science & Engineering, Changzhou University, Changzhou 213164
Abstract:When learning from imbalanced datasets, the traditional fuzzy systems have a low rate of identification over the minority class. Firstly, in the antecedent parameter learning stage, a new clustering method, called Bayesian fuzzy clustering based on competitive learning (BFCCL), is proposed to partition the input space for the antecedents of if-then rules. BFCCL considers the repulsed force of clustering prototypes between different classes, and uses an alternating iterative strategy to obtain the optimal model parameters by Markov chain Monte Carlo method. Secondly, in the consequent parameter learning stage, based on the maximum separation strategy and by keeping the distance between the minority class and the classification hyperplane larger than the distance between the majority class and the hyperplane, the method can effectively correct the skewness of the classification hyperplane. Based on the above ideas, a zero-orderTakagi-Sugeno-Kang fuzzy system for imbalanced data classification (0-TSK-IDC) is proposed. Experimental results on artificial and real-world medicine datasets illustrate the effectiveness of 0-TSK-IDC on both minority and majority classes in imbalanced data classification, as well as its good robustness and interpretability.
Keywords:
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号