首页 | 本学科首页   官方微博 | 高级检索  
     

基于块排序索引的生物序列局部比对查询技术
引用本文:李永光,王镝,王国仁,马宜菲. 基于块排序索引的生物序列局部比对查询技术[J]. 计算机科学, 2005, 32(12): 159-163
作者姓名:李永光  王镝  王国仁  马宜菲
作者单位:东北大学信息科学与工程学院计算机系统研究所,沈阳,110004;东北大学信息科学与工程学院计算机系统研究所,沈阳,110004;东北大学信息科学与工程学院计算机系统研究所,沈阳,110004;东北大学信息科学与工程学院计算机系统研究所,沈阳,110004
基金项目:基金项目:教育部高等学校优秀青年教师学科研奖励计划基金资助项目;国家自然科学基金(60273079,60473074)资助.
摘    要:生物数据库中的查询是在生物序列数据集中查找与输入查询序列相似的目标,目前的一些流行工具如BLAST等,是利用启发式算法来提高查询的速度。然而,这些启发式算法无法找到所有的满足要求的结果,而一些精确算法,如动态规划算法,却需要非常高昂的代价。最近,一种新的技术,QASIS,提出了在后缀树的遍历中使用动态规划的精确查找算法,其性能与BLAST相当。但是它的主要缺点就是后缀树这种索引结构需要巨大的空间开销。本文采用基于无损压缩的块排序结构来索引超常的生物序列,减小索引的存储空间开销,有效地减少动态规划算法的计算代价。实验结果表明基于块排序索引的算法在性能方面优于OASIS算法。

关 键 词:生物序列  块排序索引  精确度

Block Sorting Index-based Techniques for Local Alignment Searches on Biological Sequences
LI Yong-Guang,WANG Di,WANG Guo-Ren,MA Yi-Fei. Block Sorting Index-based Techniques for Local Alignment Searches on Biological Sequences[J]. Computer Science, 2005, 32(12): 159-163
Authors:LI Yong-Guang  WANG Di  WANG Guo-Ren  MA Yi-Fei
Affiliation:Institufe Conputer System,College of Computer Science and Engineering,North-east Uuiversity,Shenyang 110004
Abstract:A common query against large protein and gene sequence data sets is to locate targets that are similar to an input query sequence.The current set popular search tools,such as BLAST,employ heuristics to improve the speed of such searches.However,such heuristics can sometimes miss targets,which in many cases is undesirable.The alterna- tive to BLAST is to use an accurate algorithm,Such as Smith-Waterman(S-W) algorithm.However,these accurate al- gorithms are computationaUy very expensive.Recently,a new technique,OASIS,has been proposed to improve the ef- ficiency and accuracy by employing dynamical programming during traversing suffix tree and its speed is comparable to BLAST.But its main drawback is too much memory consuming.We propose an efficient and accurate algorithm for lo- cally aligning genome sequences.We construct a block sorting index structure for the large sequence.The index struc- ture is less than the suffix tree index and can be fit for large data size.Experimental results show that our algorithm has better performance than OASIS.
Keywords:Sequence  Block sorting index  Accuracy
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号