Improving neural machine translation for low-resource Indian languages using rule-based feature extraction

Muskaan Singh, Ravinder Kumar, Inderveer Chana
DOI: https://doi.org/10.1007/s00521-020-04990-9
2020-06-02
Neural Computing and Applications
Abstract: Languages help to unite the world socially, culturally and technologically. Because people in different communities communicate in different languages, there is a strong need for inter-language translation to transfer and share information and ideas. Although Sanskrit is an ancient Indo-European language, a significant amount of work on processing it is still required to explore its full potential and open new vistas in computational linguistics and computer science. In this paper, we propose and present a machine translation system for translating Sanskrit to Hindi. The developed technique uses linguistic features from a rule-based feed to train a neural machine translation system. The work is novel and applicable to any low-resource language with rich morphology. It is a generic system covering various domains with minimal human intervention. The performance of the system is evaluated using automatic and linguistic measures. The results show that the proposed approach outperforms earlier work for this language pair.
computer science, artificial intelligence