TimeMachine: Entity-centric Search and Visualization of News Archives

Pedro Saleiro, Jorge Teixeira, Carlos Soares, Eugénio Oliveira
DOI: https://doi.org/10.48550/arXiv.1601.00855
2016-01-05
Information Retrieval
Abstract: We present a dynamic web tool that allows interactive search and visualization of large news archives using an entity-centric approach. Users can search for entities using keyword phrases that describe news stories or events, and the system retrieves the entities most relevant to the query based on automatically extracted and indexed entity profiles. From a computational journalism perspective, TimeMachine allows users to explore media content through time using automatic identification of entity names, jobs, quotations, and relations between entities derived from co-occurrence networks extracted from the news articles. The TimeMachine demo is available at http://maquinadotempo.sapo.pt/
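As a rough illustration of the co-occurrence networks mentioned in the abstract, the sketch below counts how often pairs of entities appear in the same article. It is only a minimal example under stated assumptions, not the TimeMachine implementation: the function name `cooccurrence_network`, the placeholder entity names, and the choice of article-level co-occurrence as the edge weight are all hypothetical, and entity extraction is assumed to have already happened.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_network(articles):
    """Count how often each unordered pair of entities shares an article.

    `articles` is an iterable of entity-name lists, one list per article.
    Returns a Counter mapping (entity_a, entity_b) to a co-occurrence count.
    """
    edges = Counter()
    for entities in articles:
        # Deduplicate and sort so each unordered pair has one canonical key.
        for a, b in combinations(sorted(set(entities)), 2):
            edges[(a, b)] += 1
    return edges


# Hypothetical input: entity names already extracted from three articles.
articles = [
    ["Entity A", "Entity B", "Entity C"],
    ["Entity A", "Entity B"],
    ["Entity B", "Entity C", "Entity B"],
]
network = cooccurrence_network(articles)
```

Each key in `network` is an edge between two entities and each value is its weight; repeated mentions within one article count once because of the deduplication step.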