首页 | 本学科首页   官方微博 | 高级检索  
     

一种面向不平衡数据的结构化SVM集成分类器
引用本文:袁兴梅,杨明,杨杨. 一种面向不平衡数据的结构化SVM集成分类器[J]. 模式识别与人工智能, 2013, 26(3): 315-320
作者姓名:袁兴梅  杨明  杨杨
作者单位:1.南京师范大学计算机科学与技术学院南京210023
2.南京工程学院信息化建设与管理办公室南京211167
3.南京师范大学强化培养学院南京210023
基金项目:国家自然科学基金项目(No.60873176,40871176,61003116);江苏省自然科学基金重点项目(No.BK2011005)、江苏省自然科学基金项目(No.BK2011782,BK2010263);南京工程学院青年基金项目(No.QKJB2011028)资助
摘    要:为改进面向不平衡数据的SVM分类器性能,以结构化SVM为基础,提出一种基于代价敏感的结构化支持向量机集成分类器模型.该模型首先通过训练样本的聚类,得到隐含在数据中的结构信息,并对样本进行初始加权.运用AdaBoost策略对各样本的权重进行动态调整,适当增大少数类样本的权重,使小类中误分的样本代价增大,以此来改进不平衡数据的分类性能.实验结果表明,该算法可有效提高不平衡数据的分类性能.

关 键 词:不平衡数据  结构化支持向量机(StASVM)  代价敏感  
收稿时间:2011-05-09

An Ensemble Classifier Based on Structural Support Vector Machine for Imbalanced Data
YUAN Xing-Mei,YANG Ming,YANG Yang. An Ensemble Classifier Based on Structural Support Vector Machine for Imbalanced Data[J]. Pattern Recognition and Artificial Intelligence, 2013, 26(3): 315-320
Authors:YUAN Xing-Mei  YANG Ming  YANG Yang
Affiliation:1.School of Computer Science and Technology,Nanjing Normal University,Nanjing 210023
2.Office of Information Construction and Management,Nanjing Institute of Technology,Nanjing 211167
3.Honor School,Nanjing Normal University,Nanjing 210023
Abstract:To improve the performance of Support Vector Machine(SVM) classifier for imbalanced data,an ensemble classifier model based on structural SVM is introduced by incorporating cost-sensitive strategy. In the proposed classifier model,the training data is partitioned into several group by Ward hierarchical clustering algorithm,the structure information hidden in data is obtained,and the weight of every sample is initialized by using the prior knowledge hidden in clusters. Furthermore,employing AdaBoost strategy,the weight of each sample is dynamically adjusted effectively,and the weights of minority class samples are relatively increased. Hence,the cost of the misclassified positive samples is also increased for improving the classification accuracy of positive samples(minority class samples). The experimental results show that the proposed model effectively improves the classification performance of the imbalanced data.
Keywords:Imbalanced Data  Structural ASVM (StASVM)  Cost-Sensitive  
本文献已被 CNKI 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号