1.

Approximate string matching with suffix automata

Esko Ukkonen; Derick Wood 《Algorithmica》1993,10(5):353-364

The approximate string matching problem is, given a text string, a pattern string, and an integer k, to find in the text all approximate occurrences of the pattern. An approximate occurrence means a substring of the text with edit distance at most k from the pattern. We give a new O(kn) algorithm for this problem, where n is the length of the text. The algorithm is based on the suffix automaton with failure transitions and on the diagonalwise monotonicity of the edit distance table. Some experiments showing that the algorithm has a small overhead are reported.
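
The problem statement in this abstract can be illustrated with a minimal sketch. The code below is not the O(kn) suffix-automaton algorithm of the paper; it is the classical column-by-column dynamic programming (usually attributed to Sellers), which takes O(mn) time for a pattern of length m. The function name `approximate_occurrences` is a hypothetical name chosen for this illustration.

```python
def approximate_occurrences(text, pattern, k):
    """Return the 1-based end positions j such that some substring of
    `text` ending at j has edit distance at most k from `pattern`.

    Classical O(m*n) dynamic programming, NOT the O(k*n) algorithm
    described in the abstract.
    """
    m = len(pattern)
    # col[i] = minimum edit distance between pattern[:i] and any
    # substring of the text that ends at the current text position.
    col = list(range(m + 1))
    ends = []
    for j, c in enumerate(text, start=1):
        prev_diag = col[0]   # value of D[i-1][j-1]
        col[0] = 0           # an occurrence may start anywhere in the text
        for i in range(1, m + 1):
            cur = min(
                prev_diag + (pattern[i - 1] != c),  # match or substitution
                col[i] + 1,                         # extra text character
                col[i - 1] + 1,                     # missing pattern character
            )
            prev_diag = col[i]
            col[i] = cur
        if col[m] <= k:
            ends.append(j)
    return ends
```

For example, `approximate_occurrences("abcdefg", "cxe", 1)` reports position 5, because the substring "cde" ending there differs from the pattern by one substitution.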

2.

Special factors and the combinatorics of suffix and factor automata

Gabriele Fici 《Theoretical computer science》2011,412(29):3604-3615

The suffix automaton (resp. factor automaton) of a finite word w is the minimal deterministic automaton recognizing the set of suffixes (resp. factors) of w. We study the relationships between the structure of the suffix and factor automata and classical combinatorial parameters related to the special factors of w. We derive formulae for the number of states of these automata. We also characterize the languages L_SA and L_FA of words having respectively suffix automaton and factor automaton with the minimal possible number of states.
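
To make the object of study concrete, here is a minimal sketch of the standard on-line suffix automaton construction, which is a textbook technique and not taken from this paper. It lets one count states for small words and test suffix membership. The names `build_suffix_automaton` and `accepts` are hypothetical, and the automaton is kept partial (no sink state).

```python
def build_suffix_automaton(w):
    """Build the suffix automaton of w with the standard on-line
    construction. Returns (transitions, terminal_states), where
    transitions[s] maps a letter to the next state and state 0 is initial.
    """
    nxt, link, length = [{}], [-1], [0]
    last = 0
    for ch in w:
        cur = len(nxt)
        nxt.append({}); link.append(-1); length.append(length[last] + 1)
        p = last
        # Add the new transition along the suffix-link chain.
        while p != -1 and ch not in nxt[p]:
            nxt[p][ch] = cur
            p = link[p]
        if p == -1:
            link[cur] = 0
        else:
            q = nxt[p][ch]
            if length[p] + 1 == length[q]:
                link[cur] = q
            else:
                # Split state q by cloning it.
                clone = len(nxt)
                nxt.append(dict(nxt[q])); link.append(link[q])
                length.append(length[p] + 1)
                while p != -1 and nxt[p].get(ch) == q:
                    nxt[p][ch] = clone
                    p = link[p]
                link[q] = clone
                link[cur] = clone
        last = cur
    # Terminal states are those on the suffix-link chain from `last`.
    terminal = set()
    p = last
    while p != -1:
        terminal.add(p)
        p = link[p]
    return nxt, terminal


def accepts(automaton, s):
    """Return True if s is a suffix of the word the automaton was built from."""
    nxt, terminal = automaton
    state = 0
    for ch in s:
        if ch not in nxt[state]:
            return False
        state = nxt[state][ch]
    return state in terminal
```

With this sketch, the word "aaaa" yields 5 states and "abb" yields 5 states as well, which gives a feel for how the number of states depends on the structure of w.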

3.

Jean-Pierre Duval; Thierry Lecroq; Arnaud Lefebvre 《RAIRO - Theoretical Informatics and Applications》2009,43(2):281-297

We present an on-line linear time and space algorithm to check if an integer array f is the border array of at least one string w built on a bounded or unbounded size alphabet Σ. First of all, we show a bijection between the border array of a string w and the skeleton of the DFA recognizing Σ*w, called a string matching automaton (SMA). Different strings can have the same border array, but the originality of the presented method is that the correspondence between a border array and a skeleton of an SMA is independent of the underlying strings. This makes it possible to design algorithms for validating and generating border arrays that outperform existing ones. The validating algorithm lowers the delay (maximal number of comparisons on one element of the array) from O(|w|) to 1 + min{|Σ|, 1 + log₂|w|} compared to existing algorithms. We then give results on the numbers of distinct border arrays depending on the alphabet size. We also present an algorithm that checks in linear time whether a given directed unlabeled graph G is the skeleton of an SMA on an alphabet of size s. Along the way, the algorithm can build one string w for which G is the SMA skeleton. https://doi.org/10.1051/ita:2008030
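
As a rough illustration of the notions in this abstract, the sketch below computes a border array with the usual failure-function recurrence and checks validity of a candidate array by exhaustive search. The exhaustive check is exponential and is only a stand-in for the definition; it is not the linear-time validation algorithm of the paper. The names `border_array` and `is_border_array_bruteforce` are hypothetical.

```python
from itertools import product


def border_array(w):
    """f[i] = length of the longest proper border of w[:i+1]."""
    f = [0] * len(w)
    k = 0
    for i in range(1, len(w)):
        # Fall back along shorter borders until the next letter matches.
        while k > 0 and w[i] != w[k]:
            k = f[k - 1]
        if w[i] == w[k]:
            k += 1
        f[i] = k
    return f


def is_border_array_bruteforce(f, alphabet_size=2):
    """Return True if f is the border array of at least one string over an
    alphabet of the given size. Tries every string, so it is only usable
    for very short arrays (assumption: illustration only).
    """
    letters = range(alphabet_size)
    return any(
        border_array(w) == list(f)
        for w in product(letters, repeat=len(f))
    )
```

For instance, [0, 0, 1, 1, 2] is the border array of "abaab", whereas [0, 1, 1] is not the border array of any string: a border of length 1 at position 2 forces the first two letters to be equal, and then a border of length 1 at position 3 would imply a border of length 2.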

4.

Parallel construction of a suffix tree with applications

A. Apostolico; C. Iliopoulos; G. M. Landau; B. Schieber; U. Vishkin 《Algorithmica》1988,3(1):347-365

Many string manipulations can be performed efficiently on suffix trees. In this paper a CRCW parallel RAM algorithm is presented that constructs the suffix tree associated with a string of n symbols in O(log n) time with n processors. The algorithm requires Θ(n²) space. However, the space needed can be reduced to O(n^(1+ε)) for any 0 < ε ≤ 1, with a corresponding slow-down proportional to 1/ε. Efficient parallel procedures are also given for some string problems that can be solved with suffix trees. The results of this paper have been achieved independently and simultaneously in [AI-86] and [LSV-86]. The research of U. Vishkin was supported by NSF Grant NSF-CCR-8615337, ONR Grant N00014-85-K-0046, and Foundation for Research in Electronics, Computers, and Communication, administered by the Israeli Academy of Sciences and Humanities. The research of A. Apostolico was carried out in part while visiting at the Istituto di Analisi dei Sistemi e Informatica, Rome, with support from the Italian National Research Council. The research of G. M. Landau, B. Schieber, and U. Vishkin was supported by the Applied Mathematical Sciences subprogram of the Office of Energy Research, U.S. Department of Energy under Contract DE-AC02-76ER03077.

5.

Estimation of the length of reset words for automata with simple idempotents

I. K. Rystsov 《Cybernetics and Systems Analysis》2000,36(3):339-344

A quadratic upper bound on the length of a minimal reset word is obtained for finite automata with simple idempotents. Each input symbol of the automata considered induces a transformation that is an idempotent with the unit defect or a bijection on the set of states. This bound is only twice as large as the well-known lower bound of this length. Translated from Kibernetika i Sistemnyi Analiz, No. 3, pp. 32–39, May–June, 2000.
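
For readers unfamiliar with reset words, the following sketch finds a shortest reset word by breadth-first search over subsets of states. This is a generic exponential-time method, not a technique from the paper. As an example it uses the Černý automaton, in which one letter is a cyclic permutation and the other is an idempotent of unit defect, so it belongs to the class described above. The names `shortest_reset_word` and `cerny` are hypothetical.

```python
from collections import deque


def shortest_reset_word(n, transitions):
    """Breadth-first search over subsets of {0, ..., n-1}.

    transitions maps each letter to a list t with t[s] = next state.
    Returns a shortest word sending all states to one state, or None
    if the automaton is not synchronizing.
    """
    start = frozenset(range(n))
    word_for = {start: ""}
    queue = deque([start])
    while queue:
        subset = queue.popleft()
        if len(subset) == 1:
            return word_for[subset]
        for letter, t in transitions.items():
            image = frozenset(t[s] for s in subset)
            if image not in word_for:
                word_for[image] = word_for[subset] + letter
                queue.append(image)
    return None


def cerny(n):
    """Cerny automaton: 'a' is a cyclic shift (a bijection); 'b' maps
    state n-1 to 0 and fixes the rest (an idempotent of unit defect)."""
    a = [(s + 1) % n for s in range(n)]
    b = [0 if s == n - 1 else s for s in range(n)]
    return {"a": a, "b": b}
```

On the 4-state Černý automaton the search returns a word of length 9 = (4 − 1)², which matches the classical lower bound referred to in the abstract.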

6.

From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction (cited by: 6; self-citations: 0; citations by others: 6)

R. Giegerich; S. Kurtz 《Algorithmica》1997,19(3):331-353

We review the linear-time suffix tree constructions by Weiner, McCreight, and Ukkonen. We use the terminology of the most recent algorithm, Ukkonen's on-line construction, to explain its historic predecessors. This reveals relationships much closer than one would expect, since the three algorithms are based on rather different intuitive ideas. Moreover, it completely explains the differences between these algorithms in terms of simplicity, efficiency, and implementation complexity. Received February 12, 1995; revised January 28, 1996.

7.

Data structures and algorithms for the string statistics problem

A. Apostolico; F. P. Preparata 《Algorithmica》1996,15(5):481-494

Given a textstring x of length n, the Minimal Augmented Suffix Tree T(x) of x is a digital-search index that returns, for any query string w and in a number of comparisons bounded by the length of w, the maximum number of nonoverlapping occurrences of w in x. It is shown that, denoting the length of x by n, T(x) can be built in time O(n log² n) and space O(n log n), off-line on a RAM. This research was supported in part, through the Leonardo Fibonacci Institute, by the Istituto Trentino di Cultura, Trento, Italy. Additional support was provided by NSF Grants CCR-8900305 and CCR-9201078, by NATO Grant CRG 900293, by the National Research Council of Italy, and by the ESPRIT III Basic Research Programme of the EC under Contract No. 9072 (Project GEPPCOM). Additional support was provided by NSF Grant CCR-91-96176 and ONR Contract N 00014-91-J-4052, ARPA Order 2225.
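
The quantity that the index returns can be pinned down with a small sketch. The code below computes the maximum number of nonoverlapping occurrences of w in x by a greedy left-to-right scan, which costs time proportional to the text for each query. It is only a reference definition, not the Minimal Augmented Suffix Tree, whose point is to answer the same query in a number of comparisons bounded by the length of w. The name `max_nonoverlapping` is hypothetical.

```python
def max_nonoverlapping(x, w):
    """Maximum number of pairwise nonoverlapping occurrences of w in x.

    Greedy scan: always taking the leftmost occurrence that starts at or
    after the end of the previous one is optimal for this problem.
    """
    if not w:
        raise ValueError("query string must be non-empty")
    count = 0
    i = x.find(w)
    while i != -1:
        count += 1
        # Resume the search just past the occurrence that was taken.
        i = x.find(w, i + len(w))
    return count
```

For example, "aba" occurs three times in "abababa" (at offsets 0, 2, and 4), but at most two of those occurrences are nonoverlapping, so the function returns 2.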

8.

Approximate swapped matching

Amihood Amir; Moshe Lewenstein; Ely Porat 《Information Processing Letters》2002,83(1):33-39

9.

“Computing with words” Concept Applied to Musical Information Retrieval

Bożena Kostek 《Electronic Notes in Theoretical Computer Science》2003,82(4):141
