Abstract: Online social networks serve as fertile ground for harmful behavior, ranging from hate speech to the dissemination of disinformation. Malicious actors now have unprecedented freedom to misbehave, leading to severe societal unrest and dire consequences, as exemplified by events such as the Capitol assault during the US presidential election and the anti-vaccine movement during the COVID-19 pandemic. Understanding online language has become more pressing than ever. While existing works predominantly focus on content analysis, we aim to shift the focus towards understanding harmful behaviors by relating content to its authors. Numerous novel approaches attempt to learn the stylistic features of authors from their texts, but many of them are constrained by small datasets or sub-optimal training losses. To overcome these limitations, we introduce the Style Transformer for Authorship Representations (STAR), trained on a large corpus, derived from public sources, of 4.5 × 10^6 authored texts by 70k heterogeneous authors. The model is trained with a Supervised Contrastive Loss that minimizes the distance between texts authored by the same individual. This author-based pretext pre-training task yields competitive zero-shot performance on the PAN challenges for attribution and clustering. Additionally, we attain promising results on the PAN verification challenges using a single dense layer, with our model serving as an embedding encoder. Finally, we present results from our test partition on Reddit. Using a support base of 8 documents of 512 tokens, we can discern authors from sets of up to 1616 authors with at least 80% accuracy.
We share our pre-trained model on Hugging Face (https://huggingface.co/AIDA-UPM/star), and our code is available at https://github.com/jahuerta92/star.
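To make the training objective mentioned in the abstract more concrete, the following is a minimal NumPy sketch of a supervised contrastive loss in its commonly used form, where texts sharing an author label act as positives for one another. It is an illustration only and is not taken from the authors' code: the function name `supervised_contrastive_loss`, the temperature of 0.1, and the toy embeddings are all assumptions, and in practice the embeddings would come from the text encoder.

```python
import numpy as np

def supervised_contrastive_loss(embeddings, author_ids, temperature=0.1):
    """Illustrative supervised contrastive loss over one batch.

    embeddings: (n, d) array of text embeddings (hypothetical encoder output).
    author_ids: length-n array of author labels.
    temperature: assumed value, not taken from the paper.
    """
    z = np.asarray(embeddings, dtype=np.float64)
    labels = np.asarray(author_ids)

    # L2-normalise so that dot products are cosine similarities.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    logits = z @ z.T / temperature

    n = len(labels)
    not_self = ~np.eye(n, dtype=bool)
    # Positives: other texts in the batch written by the same author.
    positives = (labels[:, None] == labels[None, :]) & not_self

    # Log-softmax over all other texts in the batch (self excluded).
    logits = logits - logits.max(axis=1, keepdims=True)
    exp_logits = np.exp(logits) * not_self
    log_prob = logits - np.log(exp_logits.sum(axis=1, keepdims=True))

    # Mean log-probability of the positives, for anchors that have any.
    # If no anchor in the batch has a positive, the result is NaN.
    pos_count = positives.sum(axis=1)
    has_pos = pos_count > 0
    mean_log_prob_pos = (log_prob * positives).sum(axis=1)[has_pos] / pos_count[has_pos]
    return float(-mean_log_prob_pos.mean())


# Toy usage: 4 synthetic "authors" with 3 texts each, tightly clustered.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(4), 3)
toy_embeddings = np.eye(4)[labels] + 0.01 * rng.normal(size=(12, 4))
print(supervised_contrastive_loss(toy_embeddings, labels))
```

The loss is low when texts by the same author lie close together and far from texts by other authors, which is the behavior the abstract describes. Assigning the same embeddings to mismatched author labels should produce a noticeably higher value.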

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer

UATST: Towards Unpaired Arbitrary Text-Guided Style Transfer with Cross-Space Modulation

APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations

Language Style Transfer from Non-Parallel Text with Arbitrary Styles

Language Style Transfer from Sentences with Arbitrary Unknown Styles

A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images

Style Transfer in Text: Exploration and Evaluation

Transductive Learning for Unsupervised Text Style Transfer

Contextual Text Style Transfer

Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer

Unsupervised offensive speech detection for multimedia based on multilingual BERT

MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer

Style Transfer as Unsupervised Machine Translation

Understanding writing style in social media with a supervised contrastively pre-trained transformer

Text Detoxification using Large Pre-trained Neural Models

Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer

Multilingual Text Style Transfer: Datasets & Models for Indian Languages

Low-Level Linguistic Controls for Style Transfer and Content Preservation

Text Style Transfer Via Learning Style Instance Supported Latent Space

Fair Transfer of Multiple Style Attributes in Text