首页 | 本学科首页   官方微博 | 高级检索  
     

计算机辅助乳腺癌诊断中的非平衡学习技术
引用本文:沈晔,李敏丹,夏顺仁.计算机辅助乳腺癌诊断中的非平衡学习技术[J].浙江大学学报(自然科学版 ),2013,47(1):1-7.
作者姓名:沈晔  李敏丹  夏顺仁
作者单位:1.浙江大学 生物医学工程与仪器科学学院,浙江 杭州310027; 2.中国计量学院 信号与信息处理系,浙江 杭州 310018
基金项目:国家自然科学基金资助项目(60772092,81101903)
摘    要:针对计算机辅助诊断(CAD)中学习算法处理非平衡数据时,分类器预测具有大类样本的分类误差小,而稀有类样本的分类误差大的倾向性分类问题,提出基于反向k近邻的欠采样新方法.通过去除大类样本集中的噪声及冗余样本、保留具有类别代表性且可靠的样本作为有效样本以此平衡训练样本集,解决了欠采样引起的类别信息的丢失问题.基于UCI Breast-cancer数据集的仿真实验结果表明,该方法解决了非平衡学习问题的有效性,进一步的横向评测对比显示该算法性能显著优于其他同类算法.

关 键 词:计算机辅助诊断  非平衡学习  支持向量机  反向k近邻  欠采样

Learning algorithm with non-balanced data for computer-aided diagnosis of breast cancer
SHEN Ye,LI Min-dan,XIA Shun-ren.Learning algorithm with non-balanced data for computer-aided diagnosis of breast cancer[J].Journal of Zhejiang University(Engineering Science),2013,47(1):1-7.
Authors:SHEN Ye  LI Min-dan  XIA Shun-ren
Affiliation:1(1.School of Biomedical Engineering and Instrument Science,Zhejiang University,Hangzhou 310027,China; 2.Department of Signal and Information Processing,China Jiliang University,Hangzhou 310018,China)
Abstract:When the learning algorithm handles non-balanced data in the computer-aided diagnosis, the prediction result of classifier is undesirably biased. The classification error of the big samples is small, while the classification error of the small samples is great. A reverse k nearest neighbor subsampling method was proposed in order to address the non-balanced learning issue. By removing the noisy and redundant samples from the big samples, and keeping the representative and reliable samples as the effective samples, the balanced training samples was realized, and the problem of the loss of the class information resulted from the subsampling was solved. The simulation results with the Breast-cancer dataset in UCI Machine Learning Repository show the validity of the algorithm to deal with the learning problems for non-balanced data. The experimental results show that the algorithm obviously outperforms existing methods.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
点击此处可从《浙江大学学报(自然科学版 )》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号