4TCT, A 4chan Text Collection Tool

Jack H. Culbert
2023-07-07
Abstract: 4chan is a popular online imageboard that has been widely studied because of an observed concentration of far-right, antisemitic, racist, misogynistic, and otherwise hateful material posted to the site, and because of the political movements and memes that have emerged and evolved there (discussed in Section 1.1). We have developed a Python tool that uses the 4chan API to collect data from a selection of boards. This paper accompanies the release of the code via the GitHub repository https://github.com/jhculb/4TCT. We believe the tool will be useful to academics studying 4chan, in particular sociological researchers who need to collect data from the site, and that it may contribute to GESIS' Digital Behavioural Data project.
Digital Libraries, Social and Information Networks
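
As an illustration of the kind of collection the abstract describes, the following is a minimal sketch of polling 4chan's public read-only JSON API for one board. It is not the 4TCT implementation: the function names (`threads_url`, `thread_url`, `extract_thread_numbers`, `fetch_json`, `collect_board`) are hypothetical, and the sketch assumes the `a.4cdn.org` endpoints `/{board}/threads.json` and `/{board}/thread/{no}.json` together with the API's guideline of at most one request per second.

```python
import json
import time
import urllib.error
import urllib.request

API_ROOT = "https://a.4cdn.org"  # host of 4chan's read-only JSON API


def threads_url(board: str) -> str:
    """URL of the thread index for a board such as 'pol' or 'g'."""
    return f"{API_ROOT}/{board}/threads.json"


def thread_url(board: str, thread_no: int) -> str:
    """URL of the full post list for a single thread."""
    return f"{API_ROOT}/{board}/thread/{thread_no}.json"


def extract_thread_numbers(pages: list) -> list:
    """Flatten a threads.json payload (a list of pages) into thread numbers."""
    return [t["no"] for page in pages for t in page.get("threads", [])]


def fetch_json(url: str, delay: float = 1.0):
    """Sleep for `delay` seconds (rate limiting), then fetch and decode JSON."""
    time.sleep(delay)
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp)


def collect_board(board: str) -> dict:
    """Return {thread_no: thread_json} for every thread currently on a board."""
    collected = {}
    for no in extract_thread_numbers(fetch_json(threads_url(board))):
        try:
            collected[no] = fetch_json(thread_url(board, no))
        except urllib.error.HTTPError:
            # Assumption: a thread that was pruned between the index request
            # and this request is simply skipped.
            continue
    return collected
```

Calling `collect_board("g")` would issue one request for the thread index and then one per live thread, so a full pass over a busy board takes a few minutes at this rate. A long-running collector would more likely compare the `last_modified` values in the index between passes and re-fetch only threads that changed.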