Boosting imbalanced data learning with Wiener process oversampling期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Boosting imbalanced data learning with Wiener process oversampling

Authors:	Qian Li Gang Li Wenjia Niu Yanan Cao Liang Chang Jianlong Tan Li Guo

Affiliation:	1.Institute of Information Engineering,Chinese Academy of Sciences,Beijing,China;2.School of Information Technology,Deakin University,Geelong,Australia;3.Guangxi Key Laboratory of Trusted Software,Guilin University of Electronic Technology,Guilin,China

Abstract:	Learning from imbalanced data is a challenging task in a wide range of applications, which attracts significant research efforts from machine learning and data mining community. As a natural approach to this issue, oversampling balances the training samples through replicating existing samples or synthesizing new samples. In general, synthesization outperforms replication by supplying additional information on the minority class. However, the additional information needs to follow the same normal distribution of the training set, which further constrains the new samples within the predefined range of training set. In this paper, we present the Wiener process oversampling (WPO) technique that brings the physics phenomena into sample synthesization. WPO constructs a robust decision region by expanding the attribute ranges in training set while keeping the same normal distribution. The satisfactory performance of WPO can be achieved with much lower computing complexity. In addition, by integrating WPO with ensemble learning, the WPOBoost algorithm outperformsmany prevalent imbalance learning solutions.

Keywords:
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏