首页 | 本学科首页   官方微博 | 高级检索  
     

医疗体检数据预处理方法研究
引用本文:林予松,王培培,刘炜,李润知,王宗敏.医疗体检数据预处理方法研究[J].计算机应用研究,2017,34(4).
作者姓名:林予松  王培培  刘炜  李润知  王宗敏
作者单位:郑州大学互联网医疗与健康服务河南省协同创新中心,郑州大学互联网医疗与健康服务河南省协同创新中心,郑州大学互联网医疗与健康服务河南省协同创新中心,郑州大学互联网医疗与健康服务河南省协同创新中心,郑州大学互联网医疗与健康服务河南省协同创新中心
基金项目:智能医疗大数据分析关键支撑技术研究
摘    要:原始体检数据存在信息模糊、有噪声、不完整和冗余的问题,无法直接用于疾病的风险评估与预测。由于体检数据在结构和格式等方面的不足,不适合采用传统的数据预处理方法。为了充分挖掘体检数据中有价值的信息,从多角度提出了针对体检数据的预处理方法:通过基于压缩方法的数据归约,降低了体检数据预处理的时间及空间复杂度;通过基于分词和权值的字段匹配算法,完成了体检数据的清洗,解决了体检数据不一致的问题;通过基于线性函数的数据变换,实现了历年体检数据的一致性和连续性。实验结果表明,基于分词和权值的字段匹配算法,相对于传统算法具有更高的准确性。

关 键 词:体检数据  预处理  字段匹配算法  数据清洗  数据归约  数据变换
收稿时间:2016/3/15 0:00:00
修稿时间:2017/2/16 0:00:00

Research on preprocessing methods for medical examination data
LinYusong,WangPeipei,LiuWei,LiRunzhi and Wangzongmin.Research on preprocessing methods for medical examination data[J].Application Research of Computers,2017,34(4).
Authors:LinYusong  WangPeipei  LiuWei  LiRunzhi and Wangzongmin
Affiliation:Cooperative Innovation Center of Internet Healthcare,Cooperative Innovation Center of Internet Healthcare,Cooperative Innovation Center of Internet Healthcare,Cooperative Innovation Center of Internet Healthcare,
Abstract:The original physical examination data has many problems, including ambiguity, noise, incomplete and redundancy information, so it cannot be used for disease risk assessment and prediction directly. Traditional processing methods are not suitable for physical examination data because of its special structure and format. In order to solve these problems and make full use of the valuable information in the data, several methods are proposed in this paper. A compression-based data reduction method is used to reduce the time and space complexity of the data; a field matching algorithm based on segmentation and weights is used to complete the data cleaning and solve the problem of inconsistency; a data transformation method based on linear function is used to get the consistency and continuity of the history data. It is also proved that the filed matching algorithm in this paper is more accurate than the traditional method.
Keywords:Physical examination data  Data preprocessing  Field matching algorithm  Data reduction  Data cleaning  Data transformation
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号