Abstract:The vast majority of online media rely heavily on the revenues generated by their readers' views, and due to the abundance of such outlets, they must compete for reader attention. It is a common practise for publishers to employ attention-grabbing headlines as a means to entice users to visit their websites. These headlines, commonly referred to as clickbaits, strategically leverage the curiosity gap experienced by users, enticing them to click on hyperlinks that frequently fail to meet their expectations. Therefore, the identification of clickbaits is a significant NLP application. Previous studies have demonstrated that language models can effectively detect clickbaits. Deep learning models have attained great success in text-based assignments, but these are vulnerable to adversarial modifications. These attacks involve making undetectable alterations to a small number of words or characters in order to create a deceptive text that misleads the machine into making incorrect predictions. The present work introduces " Non-Alpha-Num ", a newly proposed textual adversarial assault that functions in a black box setting, operating at the character level. The primary goal is to manipulate a certain NLP model in a manner that the alterations made to the input data are undetectable by human observers. A series of comprehensive tests were conducted to evaluate the efficacy of the suggested attack approach on several widely-used models, including Word-CNN, BERT, DistilBERT, ALBERTA, RoBERTa, and XLNet. These models were fine-tuned using the clickbait dataset, which is commonly employed for clickbait detection purposes. The empirical evidence suggests that the attack model being offered routinely achieves much higher attack success rates (ASR) and produces high-quality adversarial instances in comparison to traditional adversarial manipulations. The findings suggest that the clickbait detection system has the potential to be circumvented, which might have significant implications for current policy efforts.

Low-Resource Clickbait Spoiling for Indonesian via Question Answering

Clickbait Spoiling via Question Answering and Passage Retrieval

Clickbait Classification and Spoiling Using Natural Language Processing

Mitigating Clickbait: An Approach to Spoiler Generation Using Multitask Learning

Generating clickbait spoilers with an ensemble of large language models

CLICK-ID: A novel dataset for Indonesian clickbait headlines

Clickbait Headline Detection in Indonesian News Sites using Multilingual Bidirectional Encoder Representations from Transformers (M-BERT)

Clickbait Detection of Indonesian News Headlines using Fine-Tune Bidirectional Encoder Representations from Transformers (BERT)

Web-based Application for Detecting Indonesian Clickbait Headlines using IndoBERT

Clickbait Detection via Large Language Models

Clickbait detection on WeChat: A deep model integrating semantic and syntactic information

BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset

Clickbait Detection Via Prompt-Tuning with Titles Only

Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews

Detecting Clickbait in Chinese Social Media by Prompt Learning.

Prompt-tuning for Clickbait Detection via Text Summarization

Multi-modal Soft Prompt-Tuning for Chinese Clickbait Detection

NoticIA: A Clickbait Article Summarization Dataset in Spanish

Detecting Clickbait in Online Social Media: You Won't Believe How We Did It

Non-Alpha-Num: a novel architecture for generating adversarial examples for bypassing NLP-based clickbait detection mechanisms

Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch