Inference of finite-state transducers from regular languages |
| |
Authors: | Francisco Casacuberta, Enrique Vidal |
| |
Affiliation: | Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universidad Politécnica de Valencia, 46071 Valencia, Spain |
| |
Abstract: | Finite-state transducers are models used in several areas of pattern recognition and computational linguistics. One of these areas is machine translation, where approaches that build models automatically from training examples are becoming increasingly attractive. Finite-state transducers are well suited to constrained tasks where training samples of sentence pairs are available. This work proposes a technique for inferring finite-state transducers, based on formal relations between finite-state transducers and finite-state grammars. Given a training corpus of input-output sentence pairs, the proposed approach uses statistical alignment methods to produce a set of conventional strings, from which a stochastic finite-state grammar is inferred. This grammar is then transformed into the resulting finite-state transducer. The proposed methods are assessed through a series of machine translation experiments within the framework of the EUTRANS project. |
| |
Keywords: | Machine translation; Grammatical inference; Formal language theory; Stochastic finite-state transducers; Natural language processing |
This article is indexed in ScienceDirect and other databases. |