首页 | 本学科首页   官方微博 | 高级检索  
     

基于 XGBoost 的基因静态数据调控网络推断方法
引用本文:车丹丹,郭顺,姜青山.基于 XGBoost 的基因静态数据调控网络推断方法[J].集成技术,2020,9(2):50-59.
作者姓名:车丹丹  郭顺  姜青山
作者单位:中国科学院深圳先进技术研究院 深圳 518055;中国科学院大学深圳先进技术学院 深圳 518055;中国科学院深圳先进技术研究院 深圳 518055
基金项目:中国博士后科学基金项目(2018M633187);深圳市发改委健康大数据智能分析技术国家地方联合工程中心
摘    要:对于静态基因表达数据来说,推断基因调控网络仍是系统生物学中的一个挑战——存在大量识别难度高的直接或间接调控关系,而传统方法的准确性和可靠性还有待进一步提高。为此,该文提出一种基于 Boosting 集成模型的方法(XGBoost),应用随机化和正则化来解决模型过拟合问题,同时针对建模所得权重不一致的问题,对初始权重增加归一化和统计学方法处理。最终,采用 DREAM5挑战的基准数据集对所提出方法进行性能验证。实验结果表明,XGBoost 比现有其他方法获得更好的性能:在 in-silico 生成的模拟数据集中,接受者操作特征曲线面积(AUPR)和正确率-召回率曲线面积(AUROC)两个评估指标均显著优于现有方法;在 E.coli 和 S.cerevisiae 两种生物的真实实验数据中,AUROC 指标均高于现有最优方法。

关 键 词:基因调控网络  静态数据  Boosting模型  基因表达数据

XGBoost-Based Gene Network Inference Method for Steady-State Data
Authors:CHE Dandan  GUO Shun and JIANG Qingshan
Abstract:Inferring gene regulatory networks (GRNs) from steady gene expression data remains a challenge in systems biology. There are a large number of potential direct or indirect regulatory relationships that are difficult to be identified by traditional methods. To address this issue, we propose a new method based on boosting integrated model, and apply randomization and regularization to solve the model over fitting problem. For the inconsistent weights from different subproblems, we integrate normalization and statistical methods to deal with the initial weights. Using the benchmark datasets from DREAM5 challenges, it shows that our method achieves better performance than other state-of-the-art methods. In the simulated data set generated by in-silico, the two evaluation indicators of area under precision-recall curves (AUPR) and area under receiver operating characteristic (AUROC) are significantly better than existing methods, and the accuracy is higher in the real experimental data of two organisms, E.coli and S.cerevisiae. Especially for AUROC, the indicators are higher than the existing best methods.
Keywords:gene regulatory networks  static data  Boosting model  gene expression data
本文献已被 万方数据 等数据库收录!
点击此处可从《集成技术》浏览原始摘要信息
点击此处可从《集成技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号