首页 | 本学科首页   官方微博 | 高级检索  
     

基于最大权值路径算法的DNA多序列比对方法
引用本文:霍红卫,肖智伟.基于最大权值路径算法的DNA多序列比对方法[J].软件学报,2007,18(2):185-195.
作者姓名:霍红卫  肖智伟
作者单位:西安电子科技大学,计算机学院,陕西,西安,710071
基金项目:国家自然科学基金;陕西省自然科学基金
摘    要:针对生物序列分析中的多序列比对问题,当输入数据量比较大时,人们提出了很多启发式的算法来改善计算速度和比对结果.提出了用于进行全局DNA多序列比对的一种方法:MWPAlign(maximum weighted path alignment).该算法把序列信息用de Bruijn图的形式表示,并将输入序列的信息记录在图的边上,这样,就将求调和序列的问题转化为求图的最大权值路径问题,使多序列比对问题的时间复杂度降低到几乎线性.实验结果显示:MWPAlign是可行的多序列比对算法,尤其对于变异率低于5.2%的大量序列数据,相对于CLUSTALW(cluster alignments weight),T-Coffee和HMMT(hidden Markov model training)有较好的比对结果和运算性能.

关 键 词:多序列比对  de  Bruijn图  调和序列  最大权值路径
收稿时间:4/9/2006 12:00:00 AM
修稿时间:2006-07-26

A Multiple Alignment Approach for DNA Sequences Based on the Maximum Weighted Path Algorithms
HUO Hong-Wei and XIAO Zhi-Wei.A Multiple Alignment Approach for DNA Sequences Based on the Maximum Weighted Path Algorithms[J].Journal of Software,2007,18(2):185-195.
Authors:HUO Hong-Wei and XIAO Zhi-Wei
Abstract:For multiple sequences alignment problem in molecular biological sequence analysis, when the input sequence number is very large, many heuristic algorithms have been proposed to improve the computation speed and the quality of alignment. An approach called MWPAlign (maximum weighted path alignment) is presented to do global multiple alignment for DNA sequences. In this method, a de Bruijn graph is used to express the input sequences information, which is recorded in the edges of the graph. As a result, a consensus-finding problem can be transformed to a maximum weighted path problem of the graph. MWPAlign obtains almost linear computation speed of the multiple sequences alignment problem. Experimental results show that the proposed algorithm is feasible, and for a large number of sequences with mutation rate lower than 5.2%, MWPALign can obtain better alignment results and has lower computational time as compared to CLUSTALW (cluster alignments weight), T-Coffee and HMMT (hidden Markov model training).
Keywords:multiple sequence alignment  de Bruijn graph  consensus sequence  maximum weighted path
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号