Abstract: This paper presents the SmurfCat team's solution for the Multilingual Text Detoxification task in the PAN-2024 competition. Using data augmentation through machine translation and a special filtering procedure, we collected an additional multilingual parallel dataset for text detoxification. Using the obtained data, we fine-tuned several multilingual sequence-to-sequence models, such as mT0 and Aya, on a text detoxification task. We applied the ORPO alignment technique to the final model. Our final model has only 3.7 billion parameters and achieves state-of-the-art results for the Ukrainian language and near state-of-the-art results for other languages. In the competition, our team achieved first place in the automated evaluation with a score of 0.52 and second place in the final human evaluation with a score of 0.74.

What problem does this paper attempt to address?

This paper addresses the Multilingual Text Detoxification task. Specifically, the researchers aim to develop a system that transforms toxic texts into non-toxic texts. The system must support nine languages, some of them low-resource: Amharic, Arabic, German, Spanish, Hindi, Chinese, Russian, Ukrainian, and English.

### Problem Background

Multilingual text detoxification is a challenging task, especially when adapting to low-resource languages. The goal of the PAN-2024 Multilingual Text Detoxification task is to develop an efficient detoxification system for these nine languages. The main challenges include:

1. **Data Scarcity**: For some languages, the available parallel datasets are very limited.
2. **Model Adaptability**: The model must perform well across different languages, especially low-resource ones.
3. **Translation Quality**: When data is expanded through machine translation, the translated text must retain the original meaning and the toxicity label must remain correct.

### Solutions

To address these challenges, the SmurfCat team adopted the following strategies:

1. **Data Augmentation**:
   - Use machine translation to expand the dataset from English to other languages. The original English data was translated into the other languages with GoogleTranslator, yielding an additional 19,700 samples.
   - Apply post-processing steps to ensure that the translated data retains the original meaning and that the toxicity labels stay accurate. The LaBSE model is used to evaluate the similarity between translation pairs, and the XLM-R toxicity classifier is used to check for toxicity changes.
2. **Model Fine-Tuning**:
   - Select the mT0 series of models for fine-tuning because of their strong multilingual capabilities. The experiments also tried the Aya-101 model, a multilingual instruction-tuned model based on mT5-xl.
   - Fine-tune these models with supervised learning, adjusting hyperparameters such as the learning rate and batch size.
3. **Best Candidate Selection**:
   - During inference, generate multiple hypotheses with diverse beam search.
   - Compute a correlation measure for each hypothesis (i.e., the product of similarity and toxicity scores) and select the best candidate.
4. **ORPO Alignment**:
   - Use Odds Ratio Preference Optimization (ORPO) to further optimize the model. Hypotheses were generated on the test set and annotated with correlation scores, producing a dataset of prompts, rejected samples, and selected samples that was used for the final alignment.

### Results

With these methods, the SmurfCat team achieved first place in the automatic evaluation, with an average score of 0.52, and second place in the human evaluation, with an average score of 0.74. For Ukrainian in particular, their model reached the state-of-the-art level.

In summary, this paper develops a text detoxification system for multiple languages by combining data augmentation, model fine-tuning, candidate selection, and ORPO alignment.
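The translation filtering, candidate selection, and preference-pair steps described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the thresholds (`sim_threshold`, `tox_threshold`), the use of `1 - toxicity` in the score, and the choice of the lowest-scoring hypothesis as the rejected sample are all hypothetical. The scorers are passed in as plain callables; in practice they would wrap LaBSE embeddings and an XLM-R toxicity classifier, while the toy versions here exist only so the example runs without downloading any model.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical scorer interfaces (assumptions for this sketch):
#   similarity(a, b) -> value in [0, 1], e.g. LaBSE cosine similarity
#   toxicity(text)   -> value in [0, 1], e.g. XLM-R classifier probability
SimilarityFn = Callable[[str, str], float]
ToxicityFn = Callable[[str], float]


def keep_translated_pair(
    src_toxic: str,
    src_neutral: str,
    tr_toxic: str,
    tr_neutral: str,
    similarity: SimilarityFn,
    toxicity: ToxicityFn,
    sim_threshold: float = 0.8,  # assumed value, not from the paper
    tox_threshold: float = 0.5,  # assumed value, not from the paper
) -> bool:
    """Keep a translated (toxic, neutral) pair only if meaning and labels survive."""
    meaning_kept = (
        similarity(src_toxic, tr_toxic) >= sim_threshold
        and similarity(src_neutral, tr_neutral) >= sim_threshold
    )
    labels_kept = (
        toxicity(tr_toxic) >= tox_threshold
        and toxicity(tr_neutral) < tox_threshold
    )
    return meaning_kept and labels_kept


def score_candidate(
    source: str, candidate: str, similarity: SimilarityFn, toxicity: ToxicityFn
) -> float:
    """Product of similarity to the source and (assumed) non-toxicity."""
    return similarity(source, candidate) * (1.0 - toxicity(candidate))


def rank_candidates(
    source: str,
    candidates: List[str],
    similarity: SimilarityFn,
    toxicity: ToxicityFn,
) -> List[str]:
    """Sort hypotheses (e.g. from diverse beam search) from best to worst."""
    return sorted(
        candidates,
        key=lambda c: score_candidate(source, c, similarity, toxicity),
        reverse=True,
    )


def build_preference_example(
    source: str,
    candidates: List[str],
    similarity: SimilarityFn,
    toxicity: ToxicityFn,
) -> Optional[Dict[str, str]]:
    """Build one (prompt, chosen, rejected) record for preference training."""
    if len(candidates) < 2:
        return None  # a preference pair needs at least two hypotheses
    ranked = rank_candidates(source, candidates, similarity, toxicity)
    # Assumption: best-scoring hypothesis is chosen, worst-scoring is rejected.
    return {"prompt": source, "chosen": ranked[0], "rejected": ranked[-1]}


# Toy scorers, for demonstration only.
def toy_similarity(a: str, b: str) -> float:
    """Jaccard overlap of lowercased word sets."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def toy_toxicity(text: str) -> float:
    """Return 1.0 if the text contains a word from a tiny toy lexicon."""
    return 1.0 if set(text.lower().split()) & {"stupid"} else 0.0


if __name__ == "__main__":
    example = build_preference_example(
        "this stupid plan will fail",
        ["this stupid plan will fail", "this plan will fail"],
        toy_similarity,
        toy_toxicity,
    )
    print(example)
```

Keeping the scorers as injected callables means the same selection logic can be reused for both inference-time reranking and preference-dataset construction, with real models swapped in for the toy functions.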

SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification
