Abstract:Recent advances in Artificial Intelligence (AI), such as the development of large language models like ChatGPT, have blurred the boundaries between human and AI-generated text. This has led to a pressing need for tools that can determine whether text has been created or revised using AI. A general and universally effective detection model would be extremely useful, but appears to be beyond the reach of current technology and detection methods. The research described in this study adopts a domain and task specific approach and shows that specialized detection models can attain high accuracy. The study focuses on the higher education graduate admissions process, with the specific goal of identifying AI-generated and AI-revised Letters of Recommendation (LORs) and Statements of Intent (SOIs). Detecting such application materials is essential to ensure that applicants are evaluated on their true merits and abilities, and to foster an equitable and trustworthy admissions process. Our research is based on 3755 LORs and 1973 SOIs extracted from the application records of Fordham University's Master's programs in Computer Science and Data Science. To facilitate the construction and evaluation of detection models, we generated AI counterparts for each LOR and SOI using the GPT-3.5 Turbo API. The prompts for AI-generation text were derived from the admission data of the respective applicants, and the AI-revised LORs and SOIs were generated directly from the human-authored versions. We also utilize an open-access GPT-wiki-intro dataset to further validate our hypothesis regarding the feasibility of constructing domain-specific AI content detectors. Our experiments yield promising results in developing classifiers tailored to a specific domain when provided with sufficient training samples. Additionally, we present a comparative analysis of the word frequency and statistical characteristics of the text, which provides convincing evidence that ChatGPT employs distinctive vocabulary and paragraph structure compared to human-authored text. The code for this study is available on GitHub, and the models can be executed on user-provided data via an interactive web interface.

Detecting AI-generated essays: the ChatGPT challenge

Academics' perceptions of ChatGPT-generated written outputs: A practical application of Turing’s Imitation Game

Student Mastery or AI Deception? Analyzing ChatGPT's Assessment Proficiency and Evaluating Detection Strategies

The imitation game: Detecting human and AI-generated texts in the era of ChatGPT and BARD

Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text

Detecting LLM-Generated Text in Computing Education: A Comparative Study for ChatGPT Cases

Will ChatGPT get you caught? Rethinking of Plagiarism Detection

Classification of Human- and AI-Generated Texts: Investigating Features for ChatGPT

An Empirical Study of AI Generated Text Detection Tools

The Effectiveness of Software Designed to Detect AI-Generated Writing: A Comparison of 16 AI Text Detectors

To ChatGPT, or not to ChatGPT: That is the question!

Testing the Ability of Teachers and Students to Differentiate between Essays Generated by ChatGPT and High School Students

The ChatGPT conundrum: Human-generated scientific manuscripts misidentified as AI creations by AI text detection tool

ChatGPT versus human essayists: an exploration of the impact of artificial intelligence for authorship and academic integrity in the humanities

Admissions in the age of AI: detecting AI-generated application materials in higher education

Comparing ai detectors: evaluating performance and efficiency

Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?

AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays

Detecting ChatGPT: A Survey of the State of Detecting ChatGPT-Generated Text

Distinguishing academic science writing from humans or ChatGPT with over 99% accuracy using off-the-shelf machine learning tools