Abstract: The United States has experienced a significant increase in violent extremism, creating a need for automated tools to detect and limit the spread of extremist ideology online. This study evaluates the performance of Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Trained Transformers (GPT) in detecting and classifying online domestic extremist posts. We collected social media posts containing "far-right" and "far-left" ideological keywords and manually labeled them as extremist or non-extremist. Extremist posts were further classified into one or more of five contributing elements of extremism based on a working definitional framework. The BERT model's performance was evaluated as a function of training data size and knowledge transfer between categories. We also compared the performance of GPT 3.5 and GPT 4 models using four prompt types: naïve, layperson-definition, role-playing, and professional-definition. Results showed that the best-performing GPT models outperformed the best-performing BERT models, with more detailed prompts generally yielding better results. However, overly complex prompts may impair performance. Different GPT versions have distinct sensitivities to what they consider extremist: GPT 3.5 performed better at classifying far-left extremist posts, while GPT 4 performed better at classifying far-right extremist posts. Large language models, represented by GPT models, hold significant potential for online extremism classification tasks, surpassing traditional BERT models in a zero-shot setting. Future research should explore human-computer interactions in optimizing GPT models for extremist detection and classification tasks to develop more efficient (e.g., quicker, less effort) and effective (e.g., fewer errors or mistakes) methods for identifying extremist content.
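
The four prompt types named in the abstract can be illustrated with a minimal sketch of zero-shot prompt construction. The abstract does not give the prompt wording, so everything here is an assumption made for illustration: the template text, the names `PROMPT_TEMPLATES`, `build_prompt`, and `parse_label`, and the yes/no answer format are all hypothetical. The definition text is left as a caller-supplied argument rather than invented, and no model API is called.

```python
from typing import Dict, Optional

# Hypothetical templates: the study names these four prompt styles, but the
# wording below is illustrative only and is NOT the authors' actual prompts.
PROMPT_TEMPLATES: Dict[str, str] = {
    "naive": (
        "Is the following post extremist? Answer yes or no.\n\nPost: {post}"
    ),
    "layperson_definition": (
        "Use this plain-language definition of extremism: {definition}\n"
        "Is the following post extremist? Answer yes or no.\n\nPost: {post}"
    ),
    "role_playing": (
        "You are an analyst who reviews social media posts for extremist "
        "content.\n"
        "Is the following post extremist? Answer yes or no.\n\nPost: {post}"
    ),
    "professional_definition": (
        "Use this professional definition of extremism: {definition}\n"
        "Is the following post extremist? Answer yes or no.\n\nPost: {post}"
    ),
}


def build_prompt(style: str, post: str, definition: str = "") -> str:
    """Fill the chosen template; the definition is supplied by the caller."""
    if style not in PROMPT_TEMPLATES:
        raise ValueError(f"unknown prompt style: {style!r}")
    # Templates without a {definition} field simply ignore that argument.
    return PROMPT_TEMPLATES[style].format(post=post, definition=definition)


def parse_label(reply: str) -> Optional[bool]:
    """Map a free-text model reply to True/False, or None if unclear."""
    text = reply.strip().lower()
    if text.startswith("yes"):
        return True
    if text.startswith("no"):
        return False
    return None


if __name__ == "__main__":
    # Print each zero-shot prompt for one placeholder post.
    for style in PROMPT_TEMPLATES:
        print(f"--- {style} ---")
        print(build_prompt(style, "example post", "<definition goes here>"))
```

Keeping the templates in one dictionary lets the same post be run through each style and the parsed labels compared across styles, which mirrors the kind of prompt comparison the abstract describes.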

Detection of Conspiracy Theories Beyond Keyword Bias in German-Language Telegram Using Large Language Models

More than Memes: A Multimodal Topic Modeling Approach to Conspiracy Theories on Telegram

Classifying Conspiratorial Narratives At Scale: False Alarms and Erroneous Connections

Assessing the Impact of Conspiracy Theories Using Large Language Models

Large Language Models for Propaganda Detection

Unveiling Online Conspiracy Theorists: a Text-Based Approach and Characterization

Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge

An iterative topic model filtering framework for short and noisy user-generated data: analyzing conspiracy theories on twitter

Efficacy of Utilizing Large Language Models to Detect Public Threat Posted Online

Unveiling the Potential of BERTopic for Multilingual Fake News Analysis -- Use Case: Covid-19

Detecting COVID-19 Conspiracy Theories with Transformers and TF-IDF

Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation

The anatomy of conspiracy theorists: Unveiling traits using a comprehensive twitter dataset

ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model

Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames

The Anatomy of Conspirators: Unveiling Traits using a Comprehensive Twitter Dataset

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Automated stance detection in complex topics and small languages: The challenging case of immigration in polarizing news media

Unmasking the Imposters: How Censorship and Domain Adaptation Affect the Detection of Machine-Generated Tweets

Large Language Models for Propaganda Span Annotation