Abstract:Most State-Of-The-Art (SOTA) Neural Machine Translation (NMT) systems today achieve outstanding results based only on large parallel corpora. The large-scale parallel corpora for high-resource languages is easily obtainable. However, the translation quality of NMT for morphologically rich languages is still unsatisfactory, mainly because of the data sparsity problem encountered in Low-Resource Languages (LRLs). In the low-resource NMT paradigm, Transfer Learning (TL) has been developed into one of the most efficient methods. It is difficult to train the model on high-resource languages to include the information in both parent and child models, as well as the initially trained model that only contains the lexicon features and word embeddings of the parent model instead of the child languages feature. In this work, we aim to address this issue by proposing the language-independent Hybrid Transfer Learning (HTL) method for LRLs by sharing lexicon embedding between parent and child languages without leveraging back translation or manually injecting noises. First, we train the High-Resource Languages (HRLs) as the parent model with its vocabularies. Then, we combine the parent and child language pairs using the oversampling method to train the hybrid model initialized by the previously parent model. Finally, we fine-tune the morphologically rich child model using a hybrid model. Besides, we explore some exciting discoveries on the original TL approach. Experimental results show that our model consistently outperforms five SOTA methods in two languages Azerbaijani (Az) and Uzbek (Uz). Meanwhile, our approach is practical and significantly better, achieving improvements of up to 4.94 and 4.84 BLEU points for low-resource child languages Az ! Zh and Uz ! Zh, respectively.

Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning

Transfer learning of language-independent end-to-end ASR with language model fusion

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Almost Unsupervised Text to Speech and Automatic Speech Recognition

Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks

Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping

Towards scalable efficient on-device ASR with transfer learning

Improving RNN Transducer Based ASR with Auxiliary Tasks

Extending Multilingual ASR to New Languages Using Supplementary Encoder and Decoder Components

End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning.

Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions

Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition

On-the-fly Text Retrieval for End-to-End ASR Adaptation

Multilingual Meta-Transfer Learning for Low-Resource Speech Recognition

Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study

Semantic Data Augmentation for End-to-End Mandarin Speech Recognition

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer

Enriching the Transfer Learning with Pre-Trained Lexicon Embedding for Low-Resource Neural Machine Translation

Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition