A multi-source entity-level sentiment corpus for the financial domain: the FinLin corpus

Tobias Daudert
DOI: https://doi.org/10.1007/s10579-021-09555-3
2021-08-16
Language Resources and Evaluation
Abstract:Abstract We introduce FinLin, a novel corpus containing investor reports, company reports, news articles, and microblogs from StockTwits, targeting multiple entities stemming from the automobile industry and covering a 3-month period. FinLin was annotated with a sentiment score and a relevance score in the range [− 1.0, 1.0] and [0.0, 1.0], respectively. The annotations also include the text spans selected for the sentiment, thus, providing additional insight into the annotators’ reasoning. Overall, FinLin aims to complement the current knowledge by providing a novel and publicly available financial sentiment corpus and to foster research on the topic of financial sentiment analysis and potential applications in behavioural science.
computer science, interdisciplinary applications
What problem does this paper attempt to address?