Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach

Andrea Ferrario,Alberto Termine,Alessandro Facchini
2024-03-27
Abstract:Human-centered explainable AI (HCXAI) advocates for the integration of social aspects into AI explanations. Central to the HCXAI discourse is the Social Transparency (ST) framework, which aims to make the socio-organizational context of AI systems accessible to their users. In this work, we suggest extending the ST framework to address the risks of social misattributions in Large Language Models (LLMs), particularly in sensitive areas like mental health. In fact LLMs, which are remarkably capable of simulating roles and personas, may lead to mismatches between designers' intentions and users' perceptions of social attributes, risking to promote emotional manipulation and dangerous behaviors, cases of epistemic injustice, and unwarranted trust. To address these issues, we propose enhancing the ST framework with a fifth 'W-question' to clarify the specific social attributions assigned to LLMs by its designers and users. This addition aims to bridge the gap between LLM capabilities and user perceptions, promoting the ethically responsible development and use of LLM-based technology.
Artificial Intelligence
What problem does this paper attempt to address?
This paper aims to address the issue of social attribute misattribution in large language models (LLMs). Specifically, since LLMs can simulate different roles and personality traits, this may lead to inconsistencies between the designer's intentions and the user's perceptions, thereby causing issues such as emotional manipulation, dangerous behavior, and cognitive biases. The researchers propose extending the Social Transparency framework by adding an additional "W question" to clarify the social attributes assigned to LLMs by designers and those actually perceived by users, in order to reduce the risk of such misattributions. Furthermore, the paper discusses how to develop relevant strategies and technical means to detect and prevent these misattribution phenomena, to promote the ethical development and responsible use of LLM technology.