共查询到20条相似文献,搜索用时 0 毫秒
1.
In this paper, we develop an approach called syntax-based reordering (SBR) to handling the fundamental problem of word ordering for statistical machine translation (SMT). We propose to alleviate the word order challenge including morpho-syntactical and statistical information in the context of a pre-translation reordering framework aimed at capturing short- and long-distance word distortion dependencies. We examine the proposed approach from the theoretical and experimental points of view discussing and analyzing its advantages and limitations in comparison with some of the state-of-the-art reordering methods.In the final part of the paper, we describe the results of applying the syntax-based model to translation tasks with a great need for reordering (Chinese-to-English and Arabic-to-English). The experiments are carried out on standard phrase-based and alternative N-gram-based SMT systems. We first investigate sparse training data scenarios, in which the translation and reordering models are trained on a sparse bilingual data, then scaling the method to a large training set and demonstrating that the improvement in terms of translation quality is maintained. 相似文献
2.
We propose a novel approach to cross-lingual language model and translation lexicon adaptation for statistical machine translation
(SMT) based on bilingual latent semantic analysis. Bilingual LSA enables latent topic distributions to be efficiently transferred
across languages by enforcing a one-to-one topic correspondence during training. Using the proposed bilingual LSA framework,
model adaptation can be performed by, first, inferring the topic posterior distribution of the source text and then applying
the inferred distribution to an n-gram language model of the target language and translation lexicon via marginal adaptation. The background phrase table is
enhanced with the additional phrase scores computed using the adapted translation lexicon. The proposed framework also features
rapid bootstrapping of LSA models for new languages based on a source LSA model of another language. Our approach is evaluated
on the Chinese–English MT06 test set using the medium-scale SMT system and the GALE SMT system measured in BLEU and NIST scores.
Improvement in both scores is observed on both systems when the adapted language model and the adapted translation lexicon
are applied individually. When the adapted language model and the adapted translation lexicon are applied simultaneously,
the gain is additive. At the 95% confidence interval of the unadapted baseline system, the gain in both scores is statistically
significant using the medium-scale SMT system, while the gain in the NIST score is statistically significant using the GALE
SMT system. 相似文献
3.
Statistical machine translation systems are usually trained on large amounts of bilingual text (used to learn a translation
model), and also large amounts of monolingual text in the target language (used to train a language model). In this article
we explore the use of semi-supervised model adaptation methods for the effective use of monolingual data from the source language
in order to improve translation quality. We propose several algorithms with this aim, and present the strengths and weaknesses
of each one. We present detailed experimental evaluations on the French–English EuroParl data set and on data from the NIST
Chinese–English large-data track. We show a significant improvement in translation quality on both tasks. 相似文献
4.
Joan Albert Silvestre-Cerdà Jesús Andrés-Ferrer Jorge Civera 《Pattern recognition》2012,45(9):3183-3192
Explicit length modelling has been previously explored in statistical pattern recognition with successful results. In this paper, two length models along with two parameter estimation methods and two alternative parametrisations for statistical machine translation (SMT) are presented. More precisely, we incorporate explicit bilingual length modelling in a state-of-the-art log-linear SMT system as an additional feature function in order to prove the contribution of length information. Finally, a systematic evaluation on reference SMT tasks considering different language pairs proves the benefits of explicit length modelling. 相似文献
5.
对齐短语是决定统计机器翻译系统质量的核心模块。提出基于短语结构树的层次短语模型,这是利用串-树模型的思想对层次短语模型的扩展。基于短语结构树的层次短语模型是在双语对齐短语的基础之上结合英语短语结构树抽取翻译规则,并利用启发式策略获得翻译规则的扩展句法标记。采用翻译规则的统计机器翻译系统在不同数据集上具有稳定的翻译结果,在训练集和测试集的平均BlEU评分高于短语模型和层次短语模型的BLEU评分。 相似文献
6.
Assessing the quality of candidate translations involves diverse linguistic facets. However, most automatic evaluation methods in use today rely on limited quality assumptions, such as lexical similarity. This introduces a bias in the development cycle which in some cases has been reported to carry very negative consequences. In order to tackle this methodological problem, we explore a novel path towards heterogeneous automatic Machine Translation evaluation. We have compiled a rich set of specialized similarity measures operating at different linguistic dimensions and analyzed their individual and collective behaviour over a wide range of evaluation scenarios. Results show that measures based on syntactic and semantic information are able to provide more reliable system rankings than lexical measures, especially when the systems under evaluation are based on different paradigms. At the sentence level, while some linguistic measures perform better than most lexical measures, some others perform substantially worse, mainly due to parsing problems. Their scores are, however, suitable for combination, yielding a substantially improved evaluation quality. 相似文献
7.
Sankaranarayanan Ananthakrishnan Rohit Prasad David Stallard Prem Natarajan 《Computer Speech and Language》2013,27(2):397-406
The development of high-performance statistical machine translation (SMT) systems is contingent on the availability of substantial, in-domain parallel training corpora. The latter, however, are expensive to produce due to the labor-intensive nature of manual translation. We propose to alleviate this problem with a novel, semi-supervised, batch-mode active learning strategy that attempts to maximize in-domain coverage by selecting sentences, which represent a balance between domain match, translation difficulty, and batch diversity. Simulation experiments on an English-to-Pashto translation task show that the proposed strategy not only outperforms the random selection baseline, but also traditional active selection techniques based on dissimilarity to existing training data. 相似文献
8.
This paper proposes a novel method for phrase-based statistical machine translation based on the use of a pivot language.
To translate between languages L
s
and L
t
with limited bilingual resources, we bring in a third language, L
p
, called the pivot language. For the language pairs L
s
− L
p
and L
p
− L
t
, there exist large bilingual corpora. Using only L
s
− L
p
and L
p
− L
t
bilingual corpora, we can build a translation model for L
s
− L
t
. The advantage of this method lies in the fact that we can perform translation between L
s
and L
t
even if there is no bilingual corpus available for this language pair. Using BLEU as a metric, our pivot language approach
significantly outperforms the standard model trained on a small bilingual corpus. Moreover, with a small L
s
− L
t
bilingual corpus available, our method can further improve translation quality by using the additional L
s
− L
p
and L
p
− L
t
bilingual corpora. 相似文献
9.
Marta R. Costa-jussà Rafael E. Banchs 《Journal of Intelligent Information Systems》2011,37(2):139-154
In this paper, we propose and evaluate a novel dynamic feature function for log-linear model combinations in phrase-based
statistical machine translation. The feature function is inspired on the popularly known vector-space model which is typically
used in information retrieval and text mining applications, and it aims at improving translation unit selection at decoding
time by incorporating context information from the source language. Significant improvements on an English-Spanish experimental
corpus are presented and discussed. 相似文献
10.
This article describes a method that successfully exploits syntactic features for n-best translation candidate reranking using
perceptrons. We motivate the utility of syntax by demonstrating the superior performance of parsers over n-gram language models
in differentiating between Statistical Machine Translation output and human translations. Our approach uses discriminative
language modelling to rerank the n-best translations generated by a statistical machine translation system. The performance
is evaluated for Arabic-to-English translation using NIST’s MT-Eval benchmarks. While deep features extracted from parse trees
do not consistently help, we show how features extracted from a shallow Part-of-Speech annotation layer outperform a competitive
baseline and a state-of-the-art comparative reranking approach, leading to significant BLEU improvements on three different
test sets. 相似文献
11.
Word reordering is one of the challengeable problems of machine translation. It is an important factor of quality and efficiency of machine translation systems. In this paper, we introduce a novel reordering model based on an innovative structure, named, phrasal dependency tree. The phrasal dependency tree is a modern syntactic structure which is based on dependency relationships between contiguous non-syntactic phrases. The proposed model integrates syntactical and statistical information in the context of log-linear model aimed at dealing with the reordering problems. It benefits from phrase dependencies, translation directions (orientations) and translation discontinuity between translated phrases. In comparison with well-known and popular reordering models such as distortion, lexicalised and hierarchical models, the experimental study demonstrates the superiority of our model in terms of translation quality. Performance is evaluated for Persian → English and English → German translation tasks using Tehran parallel corpus and WMT07 benchmarks, respectively. The results report 1.54/1.7 and 1.98/3.01 point improvements over the baseline in terms of BLEU/TER metrics on Persian → English and German → English translation tasks, respectively. On average our model retrieved a significant impact on precision with comparable recall value with respect to the lexicalised and distortion models. 相似文献
12.
In Statistical Machine Translation (SMT), the constraints on word reorderings have a great impact on the set of potential translations that is explored during search. Notwithstanding computational issues, the reordering space of a SMT system needs to be designed with great care: if a larger search space is likely to yield better translations, it may also lead to more decoding errors, because of the added ambiguity and the interaction with the pruning strategy. In this paper, we study the reordering search space, using a state-of-the art translation system, where all reorderings are represented in a permutation lattice prior to decoding. This allows us to directly explore and compare different reordering schemes and oracle settings. We also study in detail a rule-based preordering system, varying the length and number of rules, the tagset used, as well as contrasting with purely combinatorial subsets of permutations. We carry out experiments on three language pairs in both directions: English-French, a close language pair; English-German and English-Czech, two much more challenging pairs. We show that even though it might be desirable to design better reordering spaces, model and search errors seem to be the most important issues. Therefore, improvements of the reordering space should come along with improvements of the associated models to be really effective. 相似文献
13.
In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, word- or utterance-levels, potentially enabling discrimination between different causes of low confidence recognizer output, such as unclear acoustics or mismatched pronunciation models. We have evaluated these confidence measures for utterance verification using a number of different metrics. Experiments reveal several trends in “profitability of rejection", as measured by the unconditional error rate of a hypothesis test. These trends suggest that crude pronunciation models can mask the relatively subtle reductions in confidence caused by out-of-vocabulary (OOV) words and disfluencies, but not the gross model mismatches elicited by non-speech sounds. The observation that a purely acoustic confidence measure can provide improved performance over a measure based upon both acoustic and language model information for data drawn from the Broadcast News corpus, but not for data drawn from the North American Business News corpus suggests that the quality of model fit offered by a trigram language model is reduced for Broadcast News data. We also argue that acoustic confidence measures may be used to inform the search for improved pronunciation models. 相似文献
14.
A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability 总被引:1,自引:13,他引:1
S. García A. Fernández J. Luengo F. Herrera 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2009,13(10):959-977
The experimental analysis on the performance of a proposed method is a crucial and necessary task to carry out in a research.
This paper is focused on the statistical analysis of the results in the field of genetics-based machine Learning. It presents
a study involving a set of techniques which can be used for doing a rigorous comparison among algorithms, in terms of obtaining
successful classification models. Two accuracy measures for multi-class problems have been employed: classification rate and
Cohen’s kappa. Furthermore, two interpretability measures have been employed: size of the rule set and number of antecedents.
We have studied whether the samples of results obtained by genetics-based classifiers, using the performance measures cited
above, check the necessary conditions for being analysed by means of parametrical tests. The results obtained state that the
fulfillment of these conditions are problem-dependent and indefinite, which supports the use of non-parametric statistics
in the experimental analysis. In addition, non-parametric tests can be satisfactorily employed for comparing generic classifiers
over various data-sets considering any performance measure. According to these facts, we propose the use of the most powerful
non-parametric statistical tests to carry out multiple comparisons. However, the statistical analysis conducted on interpretability
must be carefully considered. 相似文献
15.
Rejwanul Haque Sudip Kumar Naskar Antal van den Bosch Andy Way 《Machine Translation》2011,25(3):239-285
The translation features typically used in Phrase-Based Statistical Machine Translation (PB-SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated that integrating source context modelling directly into log-linear PB-SMT can positively influence the weighting and selection of target phrases, and thus improve translation quality. In this contribution we present a revised, extended account of our previous work on using a range of contextual features, including lexical features of neighbouring words, supertags, and dependency information. We add a number of novel aspects, including the use of semantic roles as new contextual features in PB-SMT, adding new language pairs, and examining the scalability of our research to larger amounts of training data. While our results are mixed across feature selections, classifier hyperparameters, language pairs, and learning curves, we observe that including contextual features of the source sentence in general produces improvements. The most significant improvements involve the integration of long-distance contextual features, such as dependency relations in combination with part-of-speech tags in Dutch-to-English subtitle translation, the combination of dependency parse and semantic role information in English-to-Dutch parliamentary debate translation, or supertag features in English-to-Chinese translation. 相似文献
16.
We present a phrase-based statistical machine translation approach which uses linguistic analysis in the preprocessing phase.
The linguistic analysis includes morphological transformation and syntactic transformation. Since the word-order problem is
solved using syntactic transformation, there is no reordering in the decoding phase. For morphological transformation, we
use hand-crafted transformational rules. For syntactic transformation, we propose a transformational model based on a probabilistic
context-free grammar. This model is trained using a bilingual corpus and a broad-coverage parser of the source language. This
approach is applicable to language pairs in which the target language is poor in resources. We considered translation from
English to Vietnamese and from English to French. Our experiments showed significant BLEU-score improvements in comparison
with Pharaoh, a state-of-the-art phrase-based SMT system. 相似文献
17.
Sauleh Eetemadi William Lewis Kristina Toutanova Hayder Radha 《Machine Translation》2015,29(3-4):189-223
Statistical machine translation has seen significant improvements in quality over the past several years. The single biggest factor in this improvement has been the accumulation of ever larger stores of data. We now find ourselves, however, the victims of our own success, in that it has become increasingly difficult to train on such large sets of data, due to limitations in memory, processing power, and ultimately, speed (i.e. data-to-models takes an inordinate amount of time). Moreover, the training data has a wide quality spectrum. A variety of methods for data cleaning and data selection have been developed to address these issues. Each of these methods employs a search or filtering algorithm to select a subset of the data, given a defined set of feature functions. In this paper we provide a comparative overview of research in this area based on application scenario, feature functions and search method. 相似文献
18.
《Computer Speech and Language》2007,21(2):350-372
We describe methods for improving the performance of statistical machine translation (SMT) between four linguistically different languages, i.e., Chinese, English, Japanese, and Korean by using morphosyntactic knowledge. For the purpose of reducing the translation ambiguities and generating grammatically correct and fluent translation output, we address the use of shallow linguistic knowledge, that is: (1) enriching a word with its morphosyntactic features, (2) obtaining shallow linguistically-motivated phrase pairs, (3) iteratively refining word alignment using filtered phrase pairs, and (4) building a language model from morphosyntactically enriched words. Previous studies reported that the introduction of syntactic features into SMT models resulted in only a slight improvement in performance in spite of the heavy computational expense, however, this study demonstrates the effectiveness of morphosyntactic features, when reliable, discriminative features are used. Our experimental results show that word representations that incorporate morphosyntactic features significantly improve the performance of the translation model and language model. Moreover, we show that refining the word alignment using fine-grained phrase pairs is effective in improving system performance. 相似文献
19.
Dekai Wu 《Machine Translation》2005,19(3-4):213-227
We offer a perspective on EBMT from a statistical MT standpoint, by developing a three-dimensional MT model space based on
three pairs of definitions: (1) logical versus statistical MT, (2) schema-based versus example-based MT, and (3) lexical versus
compositional MT. Within this space we consider the interplay of three key ideas in the evolution of transfer, example-based,
and statistical approaches to MT. We depict how all translation models face these issues in one way or another, regardless
of the school of thought, and suggest where the real questions for the future may lie. 相似文献
20.
Bilingual termbanks are important for many natural language processing applications, especially in translation workflows in industrial settings. In this paper, we apply a log-likelihood comparison method to extract monolingual terminology from the source and target sides of a parallel corpus. The initial candidate terminology list is prepared by taking all arbitrary n-gram word sequences from the corpus. Then, a well-known statistical measure (the Dice coefficient) is employed in order to remove any multi-word terms with weak associations from the candidate term list. Thereafter, the log-likelihood comparison method is applied to rank the phrasal candidate term list. Then, using a phrase-based statistical machine translation model, we create a bilingual terminology with the extracted monolingual term lists. We integrate an external knowledge source—the Wikipedia cross-language link databases—into the terminology extraction (TE) model to assist two processes: (a) the ranking of the extracted terminology list, and (b) the selection of appropriate target terms for a source term. First, we report the performance of our monolingual TE model compared to a number of the state-of-the-art TE models on English-to-Turkish and English-to-Hindi data sets. Then, we evaluate our novel bilingual TE model on an English-to-Turkish data set, and report the automatic evaluation results. We also manually evaluate our novel TE model on English-to-Spanish and English-to-Hindi data sets, and observe excellent performance for all domains. 相似文献