AI vs. Human -- Differentiation Analysis of Scientific Content Generation

Yongqiang Ma, Jiawei Liu, Fan Yi, Qikai Cheng, Yong Huang, Wei Lu, Xiaozhong Liu
DOI: https://doi.org/10.48550/arXiv.2301.10416
2023-02-12
Abstract: Recent neural language models have taken a significant step forward in producing remarkably controllable, fluent, and grammatical text. Although studies have found that crowd-sourcing workers cannot distinguish AI-generated text from human-written text, AI-generated text still contains errors that are subtler and harder to spot. We primarily focus on the scenario in which a scientific AI writing assistant is deeply involved. First, we construct a feature description framework to distinguish between AI-generated text and human-written text from syntax, semantics, and pragmatics based on human evaluation. Then we utilize the features, i.e., writing style, coherence, consistency, and argument logistics, from the proposed framework to analyze the two types of content. Finally, we adopt several publicly available methods to investigate the gap between AI-generated scientific text and human-written scientific text using AI-generated scientific text detection models. The results suggest that while AI has the potential to generate scientific content that is as accurate as human-written content, there is still a gap in terms of depth and overall quality. AI-generated scientific content is more likely to contain factual errors. We find that there exists a "writing style" gap between AI-generated scientific text and human-written scientific text. Based on the analysis results, we summarize a series of model-agnostic and distribution-agnostic features for detection tasks in other domains. Findings in this paper contribute to guiding the optimization of AI models to produce high-quality content and addressing related ethical and security concerns.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper attempts to identify the differences between scientific texts generated by AI and those written by humans. Specifically, the researchers are concerned with how to distinguish AI-generated scientific texts from human-written ones when AI writing assistants are deeply involved in scientific writing. They construct a feature description framework to analyze the differences between the two types of text from the perspectives of syntax, semantics, and pragmatics, and use these features to evaluate and detect AI-generated scientific texts. The paper also explores existing problems in current AI-generated scientific texts, such as factual errors and differences in writing style, and the potential threats these problems pose to scientific publishing and research integrity. Through this research, the authors hope to provide guidance for optimizing AI models, improving the quality of AI-generated content, and addressing related ethical and security concerns.