Word-to-Word Models of Translational Equivalence

I. Dan Melamed
DOI: https://doi.org/10.48550/arXiv.cmp-lg/9805006
1998-05-12
Abstract:Parallel texts (bitexts) have properties that distinguish them from other kinds of parallel data. First, most words translate to only one other word. Second, bitext correspondence is noisy. This article presents methods for biasing statistical translation models to reflect these properties. Analysis of the expected behavior of these biases in the presence of sparse data predicts that they will result in more accurate models. The prediction is confirmed by evaluation with respect to a gold standard -- translation models that are biased in this fashion are significantly more accurate than a baseline knowledge-poor model. This article also shows how a statistical translation model can take advantage of various kinds of pre-existing knowledge that might be available about particular language pairs. Even the simplest kinds of language-specific knowledge, such as the distinction between content words and function words, is shown to reliably boost translation model performance on some tasks. Statistical models that are informed by pre-existing knowledge about the model domain combine the best of both the rationalist and empiricist traditions.
Computation and Language
What problem does this paper attempt to address?
This paper attempts to study the natural selection dynamics described in the Fisher model through differential geometric methods. Specifically, the author introduced an affine connection, which is proven to be projectively Euclidean and equiaffine. Through this method, the selection dynamics is reformulated as the motion of an "effective particle" in an "effective external field", which is a tensor type. The paper also found the exact solutions of the Fisher equation under specific fitness matrices, which are related to the chromosomal imprinting effects in mammals. In addition, the paper discussed the biological significance of the differential geometric construction, especially the affine curvature as a direct consequence of allele coupling in the system, and related it to the inhomogeneity of the time flow in the selection process. In short, the core problem of the paper is to use differential geometric methods to understand how allele coupling in the Fisher model affects the dynamic process of natural selection.