Toxic Comments Hunter : Score Severity of Toxic Comments

Zhichang Wang,Qipeng Zhu
DOI: https://doi.org/10.48550/arXiv.2203.03548
2022-02-15
Computation and Language
Abstract:The detection and identification of toxic comments are conducive to creating a civilized and harmonious Internet environment. In this experiment, we collected various data sets related to toxic comments. Because of the characteristics of comment data, we perform data cleaning and feature extraction operations on it from different angles to obtain different toxic comment training sets. In terms of model construction, we used the training set to train the models based on TFIDF and finetuned the Bert model separately. Finally, we encapsulated the code into software to score toxic comments in real-time.
What problem does this paper attempt to address?