CPL-NoViD: Context-Aware Prompt-based Learning for Norm Violation Detection in Online Communities

Zihao He, Jonathan May, Kristina Lerman
2024-04-17
Abstract: Detecting norm violations in online communities is critical to maintaining healthy and safe spaces for online discussions. Existing machine learning approaches often struggle to adapt to the diverse rules and interpretations across different communities due to the inherent challenges of fine-tuning models for such context-specific tasks. In this paper, we introduce Context-aware Prompt-based Learning for Norm Violation Detection (CPL-NoViD), a novel method that employs prompt-based learning to detect norm violations across various types of rules. CPL-NoViD outperforms the baseline by incorporating context through natural language prompts and demonstrates improved performance across different rule types. Significantly, it not only excels in cross-rule-type and cross-community norm violation detection but also exhibits adaptability in few-shot learning scenarios. Most notably, it establishes a new state-of-the-art in norm violation detection, surpassing existing benchmarks. Our work highlights the potential of prompt-based learning for context-sensitive norm violation detection and paves the way for future research on more adaptable, context-aware models to better support online community moderators.
Computation and Language, Social and Information Networks
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

The paper aims to address the challenges of norm violation detection in online communities. Specifically, existing machine learning methods struggle to adapt to the diverse rules and interpretations of different communities, as these tasks are highly context-specific. This leads to the following major issues:

1. **Diverse Rules and Interpretations**: Each online community has its own culture, norms, and interpretations of rules, making it very difficult to develop a universal detection method.
2. **Context Understanding**: Existing methods often fail to adequately capture and understand the contextual information of comments, which reduces detection accuracy.
3. **Challenges of Data Annotation**: Obtaining a large amount of annotated data for model training can be both expensive and time-consuming in practical applications.

To address these issues, the paper proposes a new method: **Context-aware Prompt-based Learning for Norm Violation Detection (CPL-NoViD)**. This method uses natural language prompts to detect different types of norm violations, not only improving detection performance but also excelling in cross-rule-type and cross-community norm violation detection, and demonstrating good adaptability in few-shot learning scenarios.
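
To make the idea of incorporating context through natural language prompts more concrete, here is a minimal sketch of how a community, a rule, and a conversation might be folded into a single cloze-style prompt. The template wording, the `[MASK]` placeholder, the `ModerationExample` type, and the `build_prompt` and `is_violation` helpers are all illustrative assumptions, not the template or code used by the authors. No model is called here; `is_violation` only shows how scores for the words "yes" and "no" at the masked position could be turned into a decision.

```python
from dataclasses import dataclass
from typing import Dict, List

MASK_TOKEN = "[MASK]"  # assumed placeholder; the real token depends on the model used


@dataclass
class ModerationExample:
    """Hypothetical container for one moderation decision."""
    community: str        # e.g. a subreddit name
    rule: str             # text of the rule being checked
    comments: List[str]   # conversation in order; the last comment is the one judged


def build_prompt(example: ModerationExample, mask_token: str = MASK_TOKEN) -> str:
    """Fold community, rule, and conversation context into one cloze-style prompt."""
    if not example.comments:
        raise ValueError("at least one comment is required")
    *context, target = example.comments
    lines = [f"Community: {example.community}", f"Rule: {example.rule}"]
    lines += [f"Earlier comment {i}: {text}" for i, text in enumerate(context, start=1)]
    lines.append(f"Final comment: {target}")
    lines.append(f"Does the final comment violate the rule? {mask_token}")
    return "\n".join(lines)


def is_violation(mask_word_scores: Dict[str, float]) -> bool:
    """Verbalizer step: compare the scores assigned to 'yes' and 'no' at the mask."""
    return mask_word_scores.get("yes", 0.0) > mask_word_scores.get("no", 0.0)


example = ModerationExample(
    community="r/example",
    rule="Be civil to other users.",
    comments=["I think the patch is fine.", "Only an idiot would think that."],
)
prompt = build_prompt(example)
print(prompt)
```

Because the rule and community are plain text inside the prompt, the same template can in principle be reused for a different rule type or community by changing the field values, which is one way to read the cross-rule-type and cross-community claims in the summary.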