Lempel—Ziv Index for q -Grams |
| |
Authors: | J. Kärkkäinen E. Sutinen |
| |
Affiliation: | (1) Department of Computer Science, P.O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Juha.Karkkainen@cs.Helsinki.FI., FI;(2) Department of Computer Science, P.O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Erkki.Sutinen@cs.Helsinki.FI., FI |
| |
Abstract: | We present a new sublinear-size index structure for finding all occurrences of a given q -gram in a text. Such a q -gram index is needed in many approximate pattern matching algorithms. All earlier q -gram indexes require at least O(n) space, where n is the length of the text. The new Lempel—Ziv index needs only O(n/log n) space while being as fast as previous methods. The new method takes advantage of repetitions in the text found by Lempel—Ziv parsing. Received November 1996; revised March 1997. |
| |
Keywords: | . q -Gram index, Approximate pattern matching, Text indexing, Lempel— Ziv parsing, String algorithms, Data compression. |
本文献已被 SpringerLink 等数据库收录! |
|