Abstract: This paper presents the SmurfCat team's solution for the Multilingual Text Detoxification task in the PAN-2024 competition. Using data augmentation through machine translation and a special filtering procedure, we collected an additional multilingual parallel dataset for text detoxification. Using the obtained data, we fine-tuned several multilingual sequence-to-sequence models, such as mT0 and Aya, on a text detoxification task. We applied the ORPO alignment technique to the final model. Our final model has only 3.7 billion parameters and achieves state-of-the-art results for the Ukrainian language and near state-of-the-art results for other languages. In the competition, our team achieved first place in the automated evaluation with a score of 0.52 and second place in the final human evaluation with a score of 0.74.
What problem does this paper attempt to address?
This paper addresses the Multilingual Text Detoxification task: building a system that transforms toxic text into non-toxic text. The system must support nine languages, some of them low-resource: Amharic, Arabic, German, Spanish, Hindi, Chinese, Russian, Ukrainian, and English.
### Problem Background
Multilingual text detoxification is a challenging task, especially when adapting to low-resource languages. The goal of the PAN-2024 Multilingual Text Detoxification task is to develop an efficient detoxification system for these nine languages. The main challenges include:
1. **Data Scarcity**: For some languages, the available parallel datasets are very limited.
2. **Model Adaptability**: The model must perform well across different languages, especially low-resource ones.
3. **Translation Quality**: When data is expanded through machine translation, the translated text must retain the original meaning and its toxicity label must remain correct.
### Solutions
To address these challenges, the SmurfCat team adopted the following strategies:
1. **Data Augmentation**:
- Use machine translation to extend the English data to other languages. The original English data was translated with the GoogleTranslator model, yielding an additional 19,700 samples.
- Apply post-processing to check that each translated pair retains the original meaning and keeps a correct toxicity label: the LaBSE model scores the similarity between the original and its translation, and the XLM-R toxicity classifier detects changes in toxicity.
2. **Model Fine-Tuning**:
- Fine-tune models from the mT0 series, chosen for their strong multilingual capabilities. The experiments also included Aya-101, a multilingual instruction-tuned model based on mT5-xxl.
- Fine-tune these models with supervised learning, adjusting hyperparameters such as the learning rate and batch size.
3. **Best Candidate Selection**:
- At inference time, generate multiple hypotheses with diverse beam search, then score each with a correlation measure (the product of its similarity and toxicity scores) and select the highest-scoring candidate.
4. **ORPO Alignment**:
- Further optimize the model with Odds Ratio Preference Optimization (ORPO). Hypotheses generated on the test set were annotated with correlation scores to build a dataset of prompts, rejected samples, and selected samples, which was used for the final alignment.
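
The filtering, candidate-selection, and preference-dataset steps above can be sketched as follows. This is a minimal illustration rather than the team's implementation: the function names, the thresholds `sim_min` and `tox_cut`, and the toy scorers are all hypothetical, and in practice the scorers would wrap models such as LaBSE and an XLM-R toxicity classifier. The summary describes the correlation measure only as a product of similarity and toxicity scores; the sketch assumes the toxicity term is oriented so that less toxic is better, modeled here as `1 - toxicity`. Pairing the best-ranked hypothesis as chosen with the worst-ranked as rejected is likewise an assumption.

```python
from typing import Callable, Dict, List, Sequence

# Scorer signatures are assumptions made for this sketch.
Similarity = Callable[[str, str], float]  # assumed in [0, 1], higher = closer meaning
Toxicity = Callable[[str], float]         # assumed in [0, 1], higher = more toxic


def keep_translated_pair(src_toxic: str, src_neutral: str,
                         tr_toxic: str, tr_neutral: str,
                         similarity: Similarity, toxicity: Toxicity,
                         sim_min: float = 0.8, tox_cut: float = 0.5) -> bool:
    """Keep a translated (toxic, neutral) pair only if meaning and labels survive.

    sim_min and tox_cut are hypothetical thresholds, not values from the paper.
    """
    meaning_ok = (similarity(src_toxic, tr_toxic) >= sim_min
                  and similarity(src_neutral, tr_neutral) >= sim_min)
    labels_ok = toxicity(tr_toxic) >= tox_cut and toxicity(tr_neutral) < tox_cut
    return meaning_ok and labels_ok


def correlation(source: str, candidate: str,
                similarity: Similarity, toxicity: Toxicity) -> float:
    """Assumed form of the score: similarity times (1 - toxicity)."""
    return similarity(source, candidate) * (1.0 - toxicity(candidate))


def rank_candidates(source: str, candidates: Sequence[str],
                    similarity: Similarity, toxicity: Toxicity) -> List[str]:
    """Sort hypotheses from best to worst by the correlation score."""
    return sorted(candidates,
                  key=lambda c: correlation(source, c, similarity, toxicity),
                  reverse=True)


def preference_triple(source: str, candidates: Sequence[str],
                      similarity: Similarity, toxicity: Toxicity) -> Dict[str, str]:
    """Build one prompt/chosen/rejected record from ranked hypotheses."""
    ranked = rank_candidates(source, candidates, similarity, toxicity)
    if len(ranked) < 2:
        raise ValueError("need at least two candidates to form a preference pair")
    return {"prompt": source, "chosen": ranked[0], "rejected": ranked[-1]}


# Toy scorers so the sketch runs without downloading any model.
BAD_WORDS = {"damn", "stupid"}


def toy_similarity(a: str, b: str) -> float:
    """Jaccard overlap of lowercase tokens (stand-in for embedding similarity)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def toy_toxicity(text: str) -> float:
    """1.0 if any listed word occurs, else 0.0 (stand-in for a classifier)."""
    return 1.0 if set(text.lower().split()) & BAD_WORDS else 0.0


source = "this is a damn stupid idea"
candidates = ["this is a stupid idea", "this is a bad idea", "this idea is not good"]
triple = preference_triple(source, candidates, toy_similarity, toy_toxicity)
print(triple)
```

Passing the scorers in as plain callables keeps the selection logic independent of any particular model, so the same functions can be reused for filtering translated data, picking the final output, and building the alignment dataset.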
### Results
With these methods, the SmurfCat team placed first in the automatic evaluation with an average score of 0.52 and second in the human evaluation with an average score of 0.74. The model performed especially well on Ukrainian, reaching state-of-the-art results.
In summary, this paper aims to develop an efficient text detoxification system applicable to multiple languages through data augmentation, model fine-tuning, and optimization techniques, solving the key problems in multilingual text detoxification.