Improving a lightweight LZ77 computation algorithm for running faster |
| |
Authors: | Wei Jun Liu Ge Nong Wai hong Chan Yi Wu |
| |
Affiliation: | 1. Computer Science Department, Sun Yat‐sen University, Guangzhou, China;2. School of Physics and Electronic Information, Gannan Normal University, Ganzhou, China;3. SYSU‐CMU Shunde International Joint Research Institute, Shunde, China;4. Department of Mathematics and Information Technology, The Hong Kong Institute of Education, Hong Kong |
| |
Abstract: | Computing the Lempel–Ziv factorization (LZ77) of a string is a key step in many applications. However, at the same time, it constitutes a bottleneck of the entire computation. The investigation of time and space efficient computation of the LZ77 has become an important topic. In this paper, we present a lightweight linear‐time algorithm called LZone for computing the LZ77, which is designed by improvements on the existing linear‐time space efficient LZ77 algorithm BGone for speed acceleration. For an input string T1..n] over a constant alphabet size of O(1), LZone requires only n words of workspace in addition to the input string and the output factorization, ?logn? bits per word. This is the same space requirement for the algorithm BGone. LZone has two versions, LZoneT and LZoneSA, corresponding to BGoneT and BGoneSA, respectively. Our experimental results show that for computing the LZ77 from an input string T, LZoneT and LZoneSA run at around 26% and 57%, respectively, faster than their counterparts in BGone. Moreover, for computing the LZ77 from the suffix array of T, the speed of LZoneSA is on average twice that of BGoneSA. Copyright © 2015 John Wiley & Sons, Ltd. |
| |
Keywords: | Lempel– Ziv factorization algorithm linear time lightweight data compression suffix array |
|
|