首页 | 本学科首页   官方微博 | 高级检索  
     

中医临床不均衡数据疾病分类方法研究
引用本文:潘主强,张林,张磊,李国正,颜仕星.中医临床不均衡数据疾病分类方法研究[J].智能系统学报,2017,12(6):848-856.
作者姓名:潘主强  张林  张磊  李国正  颜仕星
作者单位:1. 西南石油大学 计算机科学学院, 四川 成都 610500;2. 中国中医科学院 中医临床基础医学研究所, 北京 100700;3. 中国中医科学院 中医药数据中心, 北京 100700;4. 上海金灯台信息科技有限公司, 上海 201800
摘    要:基于欠采样的不均衡数据分类算法是一种随机数据优化算法,但它不能最好地反映中医临床原始数据的分布并解决数据的特征冗余问题。提出了基于预测风险的最远病例不均衡装袋算法(PRFS-FPUSAB)。该算法中首先基于欠采样提出了改进的抽样方式尽可能地反映原始数据分布,然后结合集成学习、预测风险标准提高不均衡的分类性能并进行特征选择。在中医临床采集的经络电阻数据上的实验结果表明,该算法改善了曲线下面积并且选择的特征也符合中医学相关理论。

关 键 词:中医临床  不均衡数据分类  原始数据分布  特征选择

Research on classification of diseases of clinical imbalanced data in traditional Chinese medicine
PAN Zhuqiang,ZHANG Lin,ZHANG Lei,LI Guozheng,YAN Shixing.Research on classification of diseases of clinical imbalanced data in traditional Chinese medicine[J].CAAL Transactions on Intelligent Systems,2017,12(6):848-856.
Authors:PAN Zhuqiang  ZHANG Lin  ZHANG Lei  LI Guozheng  YAN Shixing
Affiliation:1. School of Computer Science, Southwest Petroleum University, Chengdu 610500, China;2. Institute of Basic Research in Clinical Medicine of Traditional Chinese Medicine, China Academy of Chinese Medical Science, Beijing 100700, China;3. National D
Abstract:An algorithm based on under-sampling unbalanced data classification is a stochastic data optimization algorithm. However, in traditional Chinese medicine (TCM), it is difficult to best reflect the distribution of original clinical data to solve the problem of feature redundancy in data. Therefore, in this paper, the PRFS-FPUSAB algorithm is proposed. In the algorithm, an improved sampling method is proposed based on under-sampling. The original data distribution is reflected as much as possible; then, the classification is improved by combining integrated learning, prediction risk, and feature selection. The experimental results on meridian resistance data collected from TCM show that the algorithm improves the area under the curve, and the selected characteristics are also in accordance with TCM theory.
Keywords:Chinese medicine clinical  imbalance data classification  initial data distribution  feature selection
点击此处可从《智能系统学报》浏览原始摘要信息
点击此处可从《智能系统学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号