Pattern matching in the Hamming distance with thresholds |
| |
Authors: | Mikhail J. Atallah Timothy W. Duket |
| |
Affiliation: | Department of Computer Science, Purdue University, United States |
| |
Abstract: | It has long been known that pattern matching in the Hamming distance metric can be done in time, where n is the length of the text, m is the length of the pattern, and Σ is the alphabet. The classic algorithm for this is due to Abrahamson and Kosaraju. This paper considers the following generalization, motivated by the situation where the entries in the text and pattern are analog, or distorted by additive noise, or imprecisely given for some other reason: in any alignment of the pattern with the text, two aligned symbols a and b contribute +1 to the similarity score if they differ by no more than a given threshold θ, otherwise they contribute zero. We give an time algorithm for this more general version of the problem; the classic Hamming distance matching problem is the special case of θ=0. |
| |
Keywords: | Algorithms Pattern matching Convolution Hamming distance |
本文献已被 ScienceDirect 等数据库收录! |
|