
Parallel multi-sample alignment algorithm for gas chromatography-mass spectrometry data based on Sector/Sphere
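Cite this article: YANG Huihua, REN Hongjun, LI Lingqiao, DUAN Lixin, GUO Tuo, DU Lingling, QI Xiaoquan. Parallel multi-sample alignment algorithm for gas chromatography-mass spectrometry data based on Sector/Sphere[J]. Journal of Computer Applications, 2013, 33(1): 215-218.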
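Authors: YANG Huihua  REN Hongjun  LI Lingqiao  DUAN Lixin  GUO Tuo  DU Lingling  QI Xiaoquan
Affiliations: 1. School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China 2. Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China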
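Funding: National Natural Science Foundation of China (30860381, 31200227); Guangxi Natural Science Foundation (2012GXNSFAA053230); National 863 Program of China (2012AA10A304); Program for Excellent Talents in Guangxi Higher Education Institutions (Gui Jiao Ren [2011] No. 40); Open Fund of the Guangxi Key Laboratory of Trusted Software (kx201121)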
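Abstract: Processing gas chromatography-mass spectrometry (GC-MS) data is complex and computationally heavy, and the resulting long processing times seriously delay experimental progress. Taking retention-time alignment of multiple samples as an example, a parallel framework for GC-MS data processing was designed on the distributed platform Sector/Sphere, and a parallel multi-sample alignment algorithm was implemented. First, the similarity matrix of all samples is computed in a distributed manner. Next, the original sample set is divided into small sample sets according to hierarchical clustering, and the samples within each small set are aligned in a distributed manner. Finally, the alignment results of the sets are merged, using the average sample of each small set as the alignment reference. Experimental results show that the error rate of the parallel multi-sample alignment algorithm is 2.9%, and that a cluster of 4 PCs reaches a maximum speedup ratio of 3.29 when processing a large number of samples. The algorithm increases computation speed while maintaining a high accuracy rate, addressing the problem of excessive processing time.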
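The reported speedup can also be read as a parallel efficiency. The abstract does not define its speedup ratio, so the following assumes the usual definitions, with $T_1$ the single-machine run time and $T_p$ the run time on $p$ machines:

```latex
\[
S_p = \frac{T_1}{T_p}, \qquad
E_p = \frac{S_p}{p} = \frac{3.29}{4} \approx 0.82
\]
```

Under that assumption, the best case on the 4-PC cluster corresponds to roughly 82% of ideal linear scaling.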

Keywords: Sector/Sphere platform; distributed computing; parallel framework; multi-sample alignment
Received: 2012-07-05
Revised: 2012-08-12

Multiple samples alignment for GC-MS data in parallel on Sector/Sphere
YANG Huihua, REN Hongjun, LI Lingqiao, DUAN Lixin, GUO Tuo, DU Lingling, QI Xiaoquan. Multiple samples alignment for GC-MS data in parallel on Sector/Sphere[J]. Journal of Computer Applications, 2013, 33(1): 215-218.
Authors:YANG Huihua  REN Hongjun  LI Lingqiao  DUAN Lixin  GUO Tuo  DU Lingling  QI Xiaoquan
Affiliation: 1. School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
2. Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
Abstract: Processing Gas Chromatography-Mass Spectrometry (GC-MS) data is complex and time-consuming, which delays overall experimental progress. Taking the alignment of multiple samples as an example, a parallel framework for processing GC-MS data on Sector/Sphere was proposed, and an algorithm for aligning multiple samples in parallel was implemented. First, the similarity matrix of all the samples was computed. Then the sample set was divided into small sample sets by hierarchical clustering, and the samples in each set were aligned separately. Finally, the results of the sets were merged, using the average sample of each set as the alignment reference. The experimental results show that the error rate of the parallel alignment algorithm is 2.9% and that the speedup ratio reaches 3.29 on a cluster of 4 PCs. The algorithm therefore speeds up processing while keeping accuracy high, and addresses the problem of excessive processing time.
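The three steps in the abstract can be sketched as follows. This is a minimal toy illustration, not the authors' implementation: it assumes each sample is just a list of peak retention times of equal length, uses mean absolute difference as a stand-in for the similarity measure, models misalignment as a constant shift, and uses a local thread pool in place of Sector/Sphere. All function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations


def distance(a, b):
    """Toy dissimilarity: mean absolute retention-time difference (assumption)."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def distance_matrix(samples, pool):
    """Step 1: compute all pairwise distances, one pool task per pair."""
    pairs = list(combinations(range(len(samples)), 2))
    values = pool.map(lambda p: distance(samples[p[0]], samples[p[1]]), pairs)
    dist = [[0.0] * len(samples) for _ in samples]
    for (i, j), d in zip(pairs, values):
        dist[i][j] = dist[j][i] = d
    return dist


def cluster(dist, k):
    """Step 2a: average-linkage agglomerative clustering down to k groups."""
    groups = [[i] for i in range(len(dist))]

    def linkage(g, h):
        return sum(dist[i][j] for i in g for j in h) / (len(g) * len(h))

    while len(groups) > k:
        a, b = min(combinations(range(len(groups)), 2),
                   key=lambda ab: linkage(groups[ab[0]], groups[ab[1]]))
        merged = groups.pop(b)  # b > a, so index a stays valid
        groups[a].extend(merged)
    return groups


def mean_sample(samples):
    """Pointwise average of equally long samples."""
    return [sum(col) / len(col) for col in zip(*samples)]


def mean_offset(sample, reference):
    """Average retention-time offset of a sample relative to a reference."""
    return sum(s - r for s, r in zip(sample, reference)) / len(sample)


def align_group(samples):
    """Step 2b: shift every member of one small set onto the set's average."""
    ref = mean_sample(samples)
    return [[x - mean_offset(s, ref) for x in s] for s in samples]


def parallel_align(samples, k, workers=4):
    # The thread pool is only a local stand-in for a distributed platform.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        dist = distance_matrix(samples, pool)
        groups = cluster(dist, k)
        aligned = list(pool.map(
            lambda g: align_group([samples[i] for i in g]), groups))
    # Step 3: merge the sets, using each set's average sample as its reference.
    averages = [mean_sample(g) for g in aligned]
    global_ref = mean_sample(averages)
    result = [None] * len(samples)
    for indices, group, avg in zip(groups, aligned, averages):
        offset = mean_offset(avg, global_ref)
        for i, s in zip(indices, group):
            result[i] = [x - offset for x in s]
    return result


if __name__ == "__main__":
    # Four toy samples: the same three peaks with different constant shifts.
    data = [[1.0, 2.0, 3.0], [1.1, 2.1, 3.1], [6.0, 7.0, 8.0], [6.1, 7.1, 8.1]]
    for row in parallel_align(data, k=2):
        print(row)
```

Splitting the work per pair in step 1 and per small sample set in step 2 is what makes both stages independent tasks that a distributed platform could schedule on separate nodes; only the final merge needs results from every set.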
Keywords: Sector/Sphere platform; distributed computation; parallel framework; multiple samples alignment
This article is indexed in CNKI and other databases.