Says who? Automatic Text-Based Content Analysis of Television News

Carlos Castillo,Gianmarco De Francisci Morales,Marcelo Mendoza,Nasir Khan
DOI: https://doi.org/10.48550/arXiv.1307.4879
2013-07-18
Computation and Language
Abstract:We perform an automatic analysis of television news programs, based on the closed captions that accompany them. Specifically, we collect all the news broadcasted in over 140 television channels in the US during a period of six months. We start by segmenting, processing, and annotating the closed captions automatically. Next, we focus on the analysis of their linguistic style and on mentions of people using NLP methods. We present a series of key insights about news providers, people in the news, and we discuss the biases that can be uncovered by automatic means. These insights are contrasted by looking at the data from multiple points of view, including qualitative assessment.
What problem does this paper attempt to address?