An Approximate Determinization Algorithm for Weighted Finite-State Automata |
| |
Authors: | A. L. Buchsbaum R. Giancarlo J. R. Westbrook |
| |
Affiliation: | (1) AT&T Labs, Shannon Laboratory, 180 Park Avenue, Florham Park, NJ 07932, USA. {alb,jeffw}@research. att.com., US;(2) Dipartimento di Matematica ed Applicazioni, Universitá di Palermo, Via Archirafi 34, 90123 Palermo, Italy. raffaele@altair.math.unipa.it., IT |
| |
Abstract: | Nondeterministic weighted finite-state automata are a key abstraction in automatic speech recognition systems. The efficiency of automatic speech recognition depends directly on the sizes of these automata and the degree of nondeterminism present, so recent research has studied ways to determinize and minimize them, using analogues of classical automata determinization and minimization. Although, as we describe here, determinization can in the worst case cause poly-exponential blowup in the number of states of a weighted finite-state automaton, in practice it is remarkably successful. In extensive experiments in automatic speech recognition systems, deterministic weighted finite-state automata tend to be smaller than the corresponding nondeterministic inputs. Our observations show that these size reductions depend critically on the interplay between weights and topology in nondeterministic weighted finite-state automata. We exploit these observations to design a new approximate determinization algorithm, which produces a deterministic weighted finite-state automaton that preserves the strings of a weighted language but not necessarily their weights. We apply our algorithm to two different types of weighted finite-state automata that occur in automatic speech recognition systems and in each case provide extensive experimental results showing that, compared with current techniques, we achieve significant size reductions without affecting performance. In particular, for a standard test bed, we can reduce automatic speech recognition memory requirements by 25—35percent with negligible effects on recognition time and accuracy. Received March 31, 1998; revised January 29, 1999. |
| |
Keywords: | . Algorithms Determinization Weighted automata. |
本文献已被 SpringerLink 等数据库收录! |
|