首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进的代价敏感决策树的网络贷款分类
引用本文:郭冰楠,吴广潮.基于改进的代价敏感决策树的网络贷款分类[J].计算机应用,2019,39(10):2888-2892.
作者姓名:郭冰楠  吴广潮
作者单位:华南理工大学数学学院,广州,510640;华南理工大学数学学院,广州,510640
摘    要:在网络贷款用户数据集中,贷款成功和贷款失败的用户数量存在着严重的不平衡,传统的机器学习算法在解决该类问题时注重整体分类正确率,导致贷款成功用户的预测精度较低。针对此问题,在代价敏感决策树敏感函数的计算中加入类分布,以减弱正负样本数量对误分类代价的影响,构建改进的代价敏感决策树;以该决策树作为基分类器并以分类准确度作为衡量标准选择表现较好的基分类器,将它们与最后阶段生成的分类器集成得到最终的分类器。实验结果表明,与已有的常用于解决此类问题的算法(如MetaCost算法、代价敏感决策树、AdaCost算法等)相比,改进的代价敏感决策树对网络贷款用户分类可以降低总体的误分类错误率,具有更强的泛化能力。

关 键 词:不平衡  代价敏感  网络贷款  集成学习  决策树
收稿时间:2019-03-22
修稿时间:2019-05-09

Classification of online loan based on improved cost-sensitive decision tree
GUO Bingnan,WU Guangchao.Classification of online loan based on improved cost-sensitive decision tree[J].journal of Computer Applications,2019,39(10):2888-2892.
Authors:GUO Bingnan  WU Guangchao
Affiliation:College of Mathematics, South China University of Technology, Guangzhou Guangdong 510640, China
Abstract:In the online loan user data set, there is a serious imbalance between the number of successful and failed loan users. The traditional machine learning algorithm pays attention to the overall classification accuracy when solving such problems, which leads to lower prediction accuracy of successful loan users. In order to solve this problem, the class distribution was added to the calculation of cost-sensitive decision tree sensitivity function, in order to weaken the impact of positive and negative samples on the misclassification cost, and an improved cost-sensitive decision tree based on ID3 (ID3cs)was constructed. With the improved cost-sensitive decision tree as the base classifier and the classification accuracy as the criterion, the base classifiers with better performance were selected and integrated with the classifier generated in the last stage to obtain the final classifier. Experimental results show that compared with the existing algorithms to solve such problems (such as MetaCost algorithm, cost-sensitive decision tree, AdaCost algorithm), the improved cost-sensitive decision tree can reduce the overall misclassification rate of online loan users and has stronger generalization ability.
Keywords:imbalance                                                                                                                        cost-sensitive                                                                                                                        online loan                                                                                                                        integrated learning                                                                                                                        decision tree
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号