Explainable Natural Language Processing for Corporate Sustainability Analysis

Keane Ong, Rui Mao, Ranjan Satapathy, Ricardo Shirota Filho, Erik Cambria, Johan Sulaeman, Gianmarco Mengaldo
2024-10-16
Abstract: Sustainability commonly refers to entities, such as individuals, companies, and institutions, having a non-detrimental (or even positive) impact on the environment, society, and the economy. With sustainability becoming a synonym of acceptable and legitimate behaviour, it is being increasingly demanded and regulated. Several frameworks and standards have been proposed to measure the sustainability impact of corporations, including United Nations' sustainable development goals and the recently introduced global sustainability reporting framework, amongst others. However, the concept of corporate sustainability is complex due to the diverse and intricate nature of firm operations (i.e. geography, size, business activities, interlinks with other stakeholders). As a result, corporate sustainability assessments are plagued by subjectivity both within data that reflect corporate sustainability efforts (i.e. corporate sustainability disclosures) and the analysts evaluating them. This subjectivity can be distilled into distinct challenges, such as incompleteness, ambiguity, unreliability and sophistication on the data dimension, as well as limited resources and potential bias on the analyst dimension. Put together, subjectivity hinders effective cost attribution to entities non-compliant with prevailing sustainability expectations, potentially rendering sustainability efforts and their associated regulations futile. To this end, we argue that Explainable Natural Language Processing (XNLP) can significantly enhance corporate sustainability analysis. Specifically, linguistic understanding algorithms (lexical, semantic, syntactic), integrated with XAI capabilities (interpretability, explainability, faithfulness), can bridge gaps in analyst resources and mitigate subjectivity problems within data.
Computers and Society, Computation and Language
What problem does this paper attempt to address?
The main problem this paper addresses is **subjectivity in corporate sustainability analysis**. Specifically, the authors point out that current corporate sustainability assessment faces challenges along two dimensions: the data dimension and the analyst dimension.

### Challenges in the data dimension

1. **Incompleteness**:
   - Companies can selectively disclose only the sustainability information they consider important, which leaves the data incomplete.
   - Companies may overlook some key sustainability dimensions, which reduces the accuracy of the assessment.
2. **Unreliability**:
   - Companies may engage in "greenwashing", that is, exaggerating or misrepresenting their environmental efforts.
   - Companies may rely on opaque carbon offset products that do not deliver the promised emission reductions.
3. **Ambiguity**:
   - The data may be ambiguous and lack appropriate context, making it difficult for analysts to understand a company's actual sustainability efforts.
   - Ambiguity differs from unreliability: it does not directly involve inaccurate or misleading data.
4. **Sophistication**:
   - Sustainability reports are usually long and complex, and understanding them requires professional knowledge, which increases analysts' workload.

### Challenges in the analyst dimension

1. **Limited human-hours**:
   - Analysts need to spend a great deal of time reading and understanding complex sustainability reports.
   - To compensate for incomplete disclosures, analysts also need to analyze additional data sources, which further increases the workload.
2. **Potential bias**:
   - Analysts are easily influenced by personal biases when interpreting sustainability data.
   - Ambiguous and unreliable data exacerbate this bias, making it difficult for analysts to reach clear judgments.

### Solutions

To address these problems, the authors propose using **Explainable Natural Language Processing (XNLP)** to enhance corporate sustainability analysis. XNLP combines language understanding algorithms (lexical, semantic, syntactic) with Explainable Artificial Intelligence (XAI) capabilities (explainability, interpretability, faithfulness), and can:

- **Reduce subjectivity**: Automating the processing and analysis of large amounts of text data reduces the influence of human factors on the assessment results.
- **Improve efficiency**: Automatically extracting key information reduces the time and effort required of analysts.
- **Enhance transparency**: Explainable results help analysts better understand and interpret the data.

In conclusion, this paper aims to overcome the subjectivity problems in the data and analyst dimensions of corporate sustainability analysis by introducing XNLP, thereby improving the accuracy and reliability of the assessment.
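
The paper stays at a conceptual level and does not prescribe an implementation. As a rough illustration of what "lexical" analysis with explainable output might look like, here is a minimal sketch: it tags disclosure sentences with sustainability topics and returns the matched terms as evidence for each tag. The lexicon, the topic names, and the functions `tag_sentence` and `coverage_gaps` are all assumptions made up for this example, not taken from the paper.

```python
import re
from collections import defaultdict

# Hypothetical topic lexicon, invented for illustration only.
LEXICON = {
    "emissions": ["carbon", "emissions", "offset", "net zero"],
    "labour": ["workforce", "safety", "wages"],
    "governance": ["board", "audit", "compliance"],
}


def tag_sentence(sentence):
    """Return {topic: [matched terms]} so every label carries its evidence."""
    text = sentence.lower()
    evidence = defaultdict(list)
    for topic, terms in LEXICON.items():
        for term in terms:
            # Whole-word match; inflected forms such as "offsets" are not matched.
            if re.search(r"\b" + re.escape(term) + r"\b", text):
                evidence[topic].append(term)
    return dict(evidence)


def coverage_gaps(sentences):
    """List lexicon topics that no sentence mentions (a crude incompleteness signal)."""
    seen = set()
    for sentence in sentences:
        seen.update(tag_sentence(sentence))
    return sorted(set(LEXICON) - seen)


if __name__ == "__main__":
    report = [
        "We purchased carbon offsets to reach net zero.",
        "The board reviewed the audit.",
    ]
    for sentence in report:
        print(sentence, "->", tag_sentence(sentence))
    print("Topics not covered:", coverage_gaps(report))
```

A keyword lexicon is far weaker than the semantic and syntactic methods the paper has in mind, but it makes the transparency idea concrete: an analyst can see exactly which words produced each label and which topics a disclosure never touches.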