The Quality of Google Translate and ChatGPT English to Arabic Translation: The Case of Scientific Text Translation

Elham Alzain, Khalil A. Nagi, Faiz AlGobaei
DOI: https://doi.org/10.30564/fls.v6i3.6799
2024-08-27
Forum for Linguistic Studies
Abstract: This study investigates the quality of neural machine translation (NMT) and of large language models (LLMs) in English-to-Arabic translation. The research team uses Google Translate and ChatGPT to translate selected scientific texts and evaluates the outputs. Professional annotators provide both an error analysis and a human evaluation. The error analysis follows the error typology of the Multidimensional Quality Metrics (MQM) framework, and the human evaluation uses a 7-point Likert scale applied at the document level. Both the evaluation scores and the error counts show that Google Translate outperforms ChatGPT. However, the results indicate that both systems still require considerable further training. The study also suggests that annotated corpora need to be constructed. It provides valuable insights into the strengths and weaknesses of the two systems that will be useful to translators, researchers, and developers of machine translation systems.