ESGPDE: an ESG Performance Data Extraction Model

X. Wang, Zhengzheng Yang, Le Zhang, Zhuo Li, Zhenzhi Lin
DOI: https://doi.org/10.3905/jfds.2023.1.148
2023-01-01
Abstract: Manually acquiring environmental, social, and governance (ESG) performance data is costly and prone to inconsistency, and research on extracting exact ESG performance data remains limited. The authors introduce a natural language processing model that extracts numerical data from sentences or short paragraphs across a range of ESG performance categories. Specifically, they use prompt learning to build two ESG performance data extraction (ESGPDE) models, based on the pretrained BERT-large and DeBERTa-V3-large models respectively, using a set of predefined ESG performance categories from Refinitiv data. They then compare the accuracy of the ESGPDEs with that of ChatGPT given a simple prompt. The DeBERTa-V3-large–based ESGPDE performs best, with 73.99% overall accuracy, and both ESGPDEs significantly outperform the ChatGPT benchmark. In particular, the DeBERTa-V3-large–based ESGPDE attains at least 75% accuracy on 50% of the granular metrics with sufficient test data and at least 80% accuracy on six specific metrics.
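The abstract reports both an overall accuracy and per-metric accuracies measured against thresholds (75% and 80%). The sketch below shows one way such figures could be computed. It assumes accuracy means an exact match between the extracted and the reference value, which the abstract does not confirm. The metric names, the records, and the helper functions `per_metric_accuracy` and `share_meeting` are hypothetical and are not taken from the paper or from Refinitiv.

```python
from collections import defaultdict


def per_metric_accuracy(records):
    """Return {metric: accuracy} for (metric, predicted, gold) records.

    Assumption: a prediction counts as correct only on an exact match.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for metric, predicted, gold in records:
        totals[metric] += 1
        hits[metric] += int(predicted == gold)
    return {metric: hits[metric] / totals[metric] for metric in totals}


def share_meeting(accuracies, threshold):
    """Return the fraction of metrics whose accuracy is >= threshold."""
    if not accuracies:
        return 0.0
    meeting = sum(acc >= threshold for acc in accuracies.values())
    return meeting / len(accuracies)


# Made-up evaluation records: (metric, extracted value, reference value).
records = [
    ("co2_emissions", "1200", "1200"),
    ("co2_emissions", "950", "950"),
    ("co2_emissions", "310", "301"),
    ("co2_emissions", "88", "88"),
    ("women_employees_pct", "41.5", "41.5"),
    ("women_employees_pct", "38", "39"),
]

accuracies = per_metric_accuracy(records)
# Overall accuracy pools every record, regardless of metric.
overall = sum(pred == gold for _, pred, gold in records) / len(records)

print(accuracies)
print(share_meeting(accuracies, 0.75))
print(overall)
```

Because overall accuracy pools all records, metrics with more test data weigh more heavily, so it can differ from the average of the per-metric accuracies.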