Online News Media Website Ranking Using User Generated Content

Samaneh Karimi,Azadeh Shakery,Rakesh Verma
DOI: https://doi.org/10.48550/arXiv.1910.12441
2019-10-28
Abstract:News media websites are important online resources that have drawn great attention of text mining researchers. The main aim of this study is to propose a framework for ranking online news websites from different viewpoints. The ranking of news websites is useful information, which can benefit many news-related tasks such as news retrieval and news recommendation. In the proposed framework, the ranking of news websites is obtained by calculating three measures introduced in the paper and based on user-generated content. Each proposed measure is concerned with the performance of news websites from a particular viewpoint including the completeness of news reports, the diversity of events being covered by the website and its speed. The use of user-generated content in this framework, as a partly-unbiased, real-time and low cost content on the web distinguishes the proposed news website ranking framework from the literature. The results obtained for three prominent news websites, BBC, CNN, NYTimes, show that BBC has the best performance in terms of news completeness and speed, and NYTimes has the best diversity in comparison with the other two websites.
Information Retrieval,Computation and Language,Social and Information Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use user - generated content (such as Twitter) to rank online news websites. Specifically, the author proposes a framework aimed at evaluating and ranking online news websites from different perspectives, which include: 1. **News Diversity**: Measure the ability of a news website to report different events within a specific time period. A higher news diversity value indicates that the news website performs better in meeting the needs of reader groups seeking diverse news. 2. **News Completeness**: Evaluate the detail and comprehensiveness of news articles when reporting events. 3. **Speed**: Measure the speed at which a news website publishes relevant news after an event occurs. In the fierce news competition, rapid news reporting is very important. Through these measures, this framework can help readers select the most suitable news website according to their needs. In addition, the framework also proposes a search - engine - based website ranking method for ranking websites for the same set of events and analyzes the news website rankings using two different methods. The main contributions of the paper include: - Proposing a news website ranking framework based on events detected from Twitter. - Proposing news - specific ranking indicators using the language modeling method. - Proposing a search - engine - based website ranking method for a known set of events. In summary, this paper aims to provide an objective, real - time, and low - cost method to evaluate and rank news websites by using user - generated content, thereby providing support for tasks such as news retrieval and recommendation.