首页 | 本学科首页   官方微博 | 高级检索  
     

DP聚类的可信性加权模糊支持向量机
引用本文:盛晓遐,杨志民,王甜甜. DP聚类的可信性加权模糊支持向量机[J]. 计算机工程与应用, 2019, 55(10): 169-178. DOI: 10.3778/j.issn.1002-8331.1804-0054
作者姓名:盛晓遐  杨志民  王甜甜
作者单位:1.浙江工业大学 理学院,杭州 3100232.浙江工业大学 之江学院,杭州 310024
基金项目:国家自然科学基金(No.10926198);浙江省自然科学基金(No.LY16A010020)
摘    要:由于SVM(Support Vector Machine)在有离群点和不平衡数据的问题中分类性能相对较低,有研究者提出了一种面向不均衡分类的隶属度加权模糊支持向量机,只是文中的模糊隶属度并不能较好衡量样本点对确定最佳分划超平面所做的贡献大小。针对以上问题提出了密度峰(Density Peaks,DP)聚类的可信性加权模糊支持向量机。首先由DP聚类找到离群点后剔除。再根据点到由DEC(Different Error Costs)确定的超平面的距离,得到初始隶属度,并用改进的FSVM-CIL(Fuzzy Support Vector Machines for Class Imbalance Learning)更新隶属度。之后剔除部分样本点,起到简约样本的作用,并减少数据不平衡带来的影响。通过实验验证了所提出算法的有效性。

关 键 词:离群点  不平衡数据  密度峰(DP)  加权模糊支持向量机  模糊隶属度  可信性

DP Clustering,Creditability Weighted Fuzzy Support Vector Machine
SHENG Xiaoxia,YANG Zhimin,WANG Tiantian. DP Clustering,Creditability Weighted Fuzzy Support Vector Machine[J]. Computer Engineering and Applications, 2019, 55(10): 169-178. DOI: 10.3778/j.issn.1002-8331.1804-0054
Authors:SHENG Xiaoxia  YANG Zhimin  WANG Tiantian
Affiliation:1.College of Science, Zhejiang University of Technology, Hangzhou 310023, China2.Zhijiang College, Zhejiang University of Technology, Hangzhou 310024, China
Abstract:Considering that SVM(Support Vector Machine) has relatively low classification performance in the case of outliers and unbalanced data, a weighted fuzzy support vector machine was proposed. And the fuzzy membership in that paper is not a good measure for the contribution of the sample to the determination of the optimal separating hyperplane. Thus, a DP(Density Peaks)clustering, creditability weighted fuzzy support vector machine is proposed. Outliers are found by DP clustering, then the outliers are eliminated. The distance from every sample to the hyperplane determined by DEC(Different Error Costs)is used to bulid the initial degree of membership. Then the degree of membership is updated with the improved FSVM-CIL(Fuzzy Support Vector Machines for Class Imbalance Learning). Finally, some samples are removed, which reduces the number of samples and reduces the impact of data imbalances. The effectiveness of the proposed algorithm is verified by experiments.
Keywords:outliers,unbalanced data,Density Peaks(DP),weighted fuzzy support vector machine,fuzzy membership  creditability,
本文献已被 维普 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号