Unlocking the power of the topic content in news headlines: BERTopic for predicting Chinese corporate bond defaults

Wenjin Tang,Hui Bu,Yuan Zuo,Junjie Wu
DOI: https://doi.org/10.1016/j.frl.2024.105062
IF: 9.848
2024-02-22
Finance Research Letters
Abstract:This study explores the thematic content of news headlines and assesses their predictive power for corporate bond defaults. It establishes a data-driven framework, that emphasizes transparency and interpretability through the incorporation of explainable AI (xAI) techniques. The interpretable AI method, BERTopic, is applied to analyze news headlines from Jan. 1, 2014, to Aug. 31, 2021. A total of 18 economically inrerpretable topic measures are derived by combining similar topics among 78 original topics, offering insights into bond issuers' operational behavior and associated risks. Integrating the BERTopic model, various machine learning prediction models, the SHapley Additive exPlanations (SHAP) approach, and feature combination approaches, this study uncovers the incremental information contributed by news headlines beyond financial ratios and economic variables. The inclusion of these topic measures significantly enhances the predictive performance of first-time corporate bond defaults within a 3-month horizon. Additionally, the robustness of news headlines' information value is validated by extending the sample and employing an alternative study sample with differing credit risk scenarios, diverse markets, and even distinct news sources.
business, finance
What problem does this paper attempt to address?