Approximate string matching with suffix automata |
| |
Authors: | Esko Ukkonen Derick Wood |
| |
Affiliation: | (1) Department of Computer Science, University of Helsinki, Teollisuuskatu 23, SF-00510 Helsinki, Finland;(2) Department of Computer Science, University of Waterloo, N2L 3G1 Waterloo, Ontario, Canada |
| |
Abstract: | Theapproximate string matching problem is, given a text string, a pattern string, and an integerk, to find in the text all approximate occurrences of the pattern. An approximate occurrence means a substring of the text with edit distance at mostk from the pattern. We give a newO(kn) algorithm for this problem, wheren is the length of the text. The algorithm is based on the suffix automaton with failure transitions and on the diagonalwise monotonicity of the edit distance table. Some experiments showing that the algorithm has a small overhead are reported. |
| |
Keywords: | Approximate string matching Edit distance Dynamic programming Suffix automaton Aho-Corasick automaton |
本文献已被 SpringerLink 等数据库收录! |
|