首页 | 本学科首页   官方微博 | 高级检索  
     


Revisiting bounded context block‐sorting transformations
Authors:J Shane Culpepper  Matthias Petri  Simon J Puglisi
Affiliation:School of Computer Science & Information Technology, RMIT University, , Melbourne, VIC, 3001 Australia
Abstract:The Burrows–Wheeler Transform (BWT ) produces a permutation of a string X, denoted X?, by sorting the n cyclic rotations of X into full lexicographical order and taking the last column of the resulting n×n matrix to be X?. The transformation is reversible inurn:x-wiley:380644:media:spe1112:spe1112-math-0001 time. In this paper, we consider an alteration to the process, called k‐BWT , where rotations are only sorted to a depth k. We propose new approaches to the forward and reverse transform, and show that the methods are efficient in practice. More than a decade ago, two algorithms were independently discovered for reversing k‐BWT , both of which run inurn:x-wiley:380644:media:spe1112:spe1112-math-0002 time. Two recent algorithms have lowered the bounds for the reverse transformation tourn:x-wiley:380644:media:spe1112:spe1112-math-0003 andurn:x-wiley:380644:media:spe1112:spe1112-math-0004, respectively. We examine the practical performance for these reversal algorithms. We find that the originalurn:x-wiley:380644:media:spe1112:spe1112-math-0005 approach is most efficient in practice, and investigates new approaches, aimed at further speeding reversal, which store precomputed context boundaries in the compressed file. By explicitly encoding the context boundaries, we present anurn:x-wiley:380644:media:spe1112:spe1112-math-0006 reversal technique that is both efficient and effective. Finally, our study elucidates an inherently cache‐friendly – and hitherto unobserved – behavior in the reverse k‐BWT , which could lead to new applications of the k‐BWT transform. In contrast to previous empirical studies, we show that the partial transform can be reversed significantly faster than the full transform, without significantly affecting compression effectiveness. Copyright © 2011 John Wiley & Sons, Ltd.
Keywords:data compression  Burrows–  Wheeler transform  block‐sorting  suffix array
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号