Abstract:With the rapid development and wide application of deep learning technology, AI-generated text detection plays an increasingly important role in various fields. In this study, we developed an efficient AI-generated text detection model based on the BERT algorithm, which provides new ideas and methods for solving related problems. In the data preprocessing stage, a series of steps were taken to process the text, including operations such as converting to lowercase, word splitting, removing stop words, stemming extraction, removing digits, and eliminating redundant spaces, to ensure data quality and accuracy. By dividing the dataset into a training set and a test set in the ratio of 60% and 40%, and observing the changes in the accuracy and loss values during the training process, we found that the model performed well during the training process. The accuracy increases steadily from the initial 94.78% to 99.72%, while the loss value decreases from 0.261 to 0.021 and converges gradually, which indicates that the BERT model is able to detect AI-generated text with high accuracy and the prediction results are gradually approaching the real classification results. Further analysis of the results of the training and test sets reveals that in terms of loss value, the average loss of the training set is 0.0565, while the average loss of the test set is 0.0917, showing a slightly higher loss value. As for the accuracy, the average accuracy of the training set reaches 98.1%, while the average accuracy of the test set is 97.71%, which is not much different from each other, indicating that the model has good generalisation ability. In conclusion, the AI-generated text detection model based on the BERT algorithm proposed in this study shows high accuracy and stability in experiments, providing an effective solution for related fields. In the future, the model performance can be further optimised and its potential for application in a wider range of fields can be explored to promote the development and application of AI technology in the field of text detection.

The Automatic Text Classification Method Based on BERT and Feature Union

Chinese Text Classification Using BERT and Flat-Lattice Transformer.

A Chinese Text Classification Method Based on BERT and Convolutional Neural Network

A BERT-Based Hybrid Short Text Classification Model Incorporating CNN and Attention-Based BiGRU

News text classification based on hybrid model of Bidirectional Encoder Representation from Transformers and Convolutional Neural Network

Chinese text classification method based on sentence information enhancement and feature fusion

A Long-Text Classification Method of Chinese News Based on BERT and CNN

Research on sentiment classification for netizens based on the BERT-BiLSTM-TextCNN model

A text classification network model combining machine learning and deep learning

Chinese Text Classification Model Based On Bert And Capsule Network Structure

A Method of Sustainable Development for Three Chinese Short-Text Datasets Based on BERT-CAM

Research on Text Classification Based on BERT-BiGRU Model

Feature-Enhanced Nonequilibrium Bidirectional Long Short-Term Memory Model for Chinese Text Classification

DCNN-BiGRU Text Classification Model Based on BERT Embedding

Short Text Classification Model based on Pre-trained Language Model with Feature Fusion

BERT-based Chinese Text Classification for Emergency Domain with a Novel Loss Function

Improved Chinese Short Text Classification Method Based on ERNIE_BiGRU Model

AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm