Sampled suffix array with minimizers |
| |
Authors: | Szymon Grabowski Marcin Raniszewski |
| |
Affiliation: | Institute of Applied Computer Science, Lodz University of Technology, ?ód?, Poland |
| |
Abstract: | Sampling (evenly) the suffixes from the suffix array is an old idea trading the pattern search time for reduced index space. A few years ago Claude et al. showed an alphabet sampling scheme allowing for more efficient pattern searches compared with the sparse suffix array, for long enough patterns. A drawback of their approach is the requirement that sought patterns need to contain at least one character from the chosen subalphabet. In this work, we propose an alternative suffix sampling approach with only a minimum pattern length as a requirement, which is more convenient in practice. Experiments show that our algorithm (in a few variants) achieves competitive time‐space tradeoffs on most standard benchmark data. Copyright © 2017 John Wiley & Sons, Ltd. |
| |
Keywords: | full‐text indexing sparse suffix array sampled suffix array minimizers |
|
|