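Medical Event Model and Open Dataset Construction for Clinical Research
Cite this article:LIU Xuli,JIN Jihao,RUAN Tong,GAO Daqi,YIN Yichao,GE Xiaoling.Medical Event Model and Open Dataset Construction for Clinical Research[J].Journal of Chinese Information Processing,2021,34(11):37-48.
Authors:LIU Xuli  JIN Jihao  RUAN Tong  GAO Daqi  YIN Yichao  GE Xiaoling
Affiliation:1.School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China;
2.Shuguang Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai 200021, China;
3.Children's Hospital of Fudan University, Shanghai 201108, China
Funding:National Major New Drug Development Project (2019ZX09201004); Fudan Pediatric Medical Consortium Internet Hospital Project Based on the Shanghai Regional Health Information Platform (201701013)
Abstract:Real-world research based on observational data from electronic medical records has become a hot topic in clinical research. However, the relational data model cannot directly support the representation of temporal relationships between medical events, or the knowledge-fusion queries, that research applications require. To address these problems, this paper proposes a new RDF-based representation model for medical observational data that clearly represents multiple event types, such as clinical examination, diagnosis, and treatment, together with the temporal relationships between events. For electronic medical record data from hospitals, an event graph is built in four steps: data preprocessing, data schema conversion, temporal relationship construction, and knowledge fusion. Specifically, using electronic medical record data from three Grade-A tertiary hospitals in Shanghai, the authors constructed a medical dataset covering 3 specialties, 173 395 medical events, and 501 335 temporal relationships between events, fused with 5 313 concepts from a Chinese medical knowledge base. Based on clinical literature and physicians' research needs, the paper provides 40 sample questions for the dataset, organized by study types in public health epidemiology such as etiology research and treatment research. Some of these questions are compared experimentally against a traditional relational database in terms of query construction and execution, demonstrating the advantages of the event graph. The dataset follows open linked data standards, is published on OpenKG, and offers an online SPARQL site at https://peg.ecustnlplab.com/dataset.html.

Keywords:electronic medical record data  patient event graph  knowledge fusion  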

Construction of An Open Dataset for Clinical Event Graph
LIU Xuli,JIN Jihao,RUAN Tong,GAO Daqi,YIN Yichao,GE Xiaoling.Construction of An Open Dataset for Clinical Event Graph[J].Journal of Chinese Information Processing,2021,34(11):37-48.
Authors:LIU Xuli  JIN Jihao  RUAN Tong  GAO Daqi  YIN Yichao  GE Xiaoling
Affiliation:1.School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China;
2.Shanghai Shuguang Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai 200021, China;
3.The Childrens Hospital of Fudan University, Shanghai 201108, China
Abstract:Clinical research based on observational data from electronic medical records has become a hot topic. This paper proposes a new RDF-based representation model for medical observational data. The model clearly represents multiple event types, such as clinical examination, diagnosis, and treatment, as well as the temporal relationships between events. Based on electronic medical records from hospitals, clinical event graphs are constructed in four steps: data preprocessing, RDF format conversion, temporal relationship construction, and knowledge fusion. Specifically, using the electronic medical records of three first-class hospitals in Shanghai, we constructed a medical dataset covering three specialties, 173 395 medical events, and 501 335 temporal relationships between events, linked with 5 313 concepts in the knowledge base. The paper further provides 40 sample queries for clinical retrospective research, including etiology analysis and treatment analysis, and compares some of them against a traditional relational database in terms of query formulation and execution. The dataset follows open linked data standards and is published on OpenKG with an online SPARQL site (https://peg.ecustnlplab.com/dataset.html).
Keywords:electronic medical record  patient event graph  knowledge fusion  
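
The temporal relationship construction step named in the abstract can be illustrated with a minimal sketch. It does not reproduce the paper's schema: the record layout, the event identifiers, and the `before` predicate are all hypothetical, and linking only consecutive events of the same patient is an assumption made here for brevity.

```python
from collections import defaultdict
from datetime import date

# Hypothetical records: (event_id, patient_id, event_type, event_date).
events = [
    ("e1", "p1", "examination", date(2019, 3, 1)),
    ("e3", "p1", "treatment", date(2019, 3, 5)),
    ("e2", "p1", "diagnosis", date(2019, 3, 2)),
    ("e4", "p2", "diagnosis", date(2019, 4, 1)),
]


def build_temporal_triples(events):
    """Link each patient's consecutive events with a hypothetical 'before' predicate."""
    by_patient = defaultdict(list)
    for event_id, patient_id, _event_type, when in events:
        by_patient[patient_id].append((when, event_id))

    triples = []
    for patient_id in sorted(by_patient):
        # Sort one patient's events chronologically, then pair neighbours.
        ordered = sorted(by_patient[patient_id])
        for (_, earlier), (_, later) in zip(ordered, ordered[1:]):
            triples.append((earlier, "before", later))
    return triples


triples = build_temporal_triples(events)
for subject, predicate, obj in triples:
    print(subject, predicate, obj)
```

Each resulting (subject, predicate, object) tuple has the shape of an RDF triple, which is what would let a graph query follow event order directly instead of comparing timestamps across joined tables.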