A Lexicalist Approach to the Translation of Colloquial Text

Fred Popowich,Davide Turcato,Olivier Laurens,Paul McFetridge,J. Devlan Nicholson,Patrick McGivern,Maricela Corzo Pena,Lisa Pidruchney,Scott MacDonald
DOI: https://doi.org/10.48550/arXiv.cmp-lg/9706024
1997-06-18
Computation and Language
Abstract:Colloquial English (CE) as found in television programs or typical conversations is different than text found in technical manuals, newspapers and books. Phrases tend to be shorter and less sophisticated. In this paper, we look at some of the theoretical and implementational issues involved in translating CE. We present a fully automatic large-scale multilingual natural language processing system for translation of CE input text, as found in the commercially transmitted closed-caption television signal, into simple target sentences. Our approach is based on the Whitelock's Shake and Bake machine translation paradigm, which relies heavily on lexical resources. The system currently translates from English to Spanish with the translation modules for Brazilian Portuguese under development.
What problem does this paper attempt to address?