Abstract:This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguistics, and social sciences. We outline six specific risk areas: I. Discrimination, Exclusion and Toxicity, II. Information Hazards, III. Misinformation Harms, V. Malicious Uses, V. Human-Computer Interaction Harms, VI. Automation, Access, and Environmental Harms. The first area concerns the perpetuation of stereotypes, unfair discrimination, exclusionary norms, toxic language, and lower performance by social group for LMs. The second focuses on risks from private data leaks or LMs correctly inferring sensitive information. The third addresses risks arising from poor, false or misleading information including in sensitive domains, and knock-on risks such as the erosion of trust in shared information. The fourth considers risks from actors who try to use LMs to cause harm. The fifth focuses on risks specific to LLMs used to underpin conversational agents that interact with human users, including unsafe use, manipulation or deception. The sixth discusses the risk of environmental harm, job automation, and other challenges that may have a disparate effect on different social groups or communities. In total, we review 21 risks in-depth. We discuss the points of origin of different risks and point to potential mitigation approaches. Lastly, we discuss organisational responsibilities in implementing mitigations, and the role of collaboration and participation. We highlight directions for further research, particularly on expanding the toolkit for assessing and evaluating the outlined risks in LMs.

Risks of Cultural Erasure in Large Language Models

Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models

Cultural Fidelity in Large-Language Models: An Evaluation of Online Language Resources as a Driver of Model Performance in Value Representation

Ethical and social risks of harm from Language Models

From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models

Investigating Cultural Alignment of Large Language Models

Geographical Erasure in Language Generation

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Cultural Bias and Cultural Alignment of Large Language Models

Extrinsic Evaluation of Cultural Competence in Large Language Models

Double Jeopardy and Climate Impact in the Use of Large Language Models: Socio-economic Disparities and Reduced Utility for Non-English Speakers

Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede's Cultural Dimensions

Socially Responsible Data for Large Multilingual Language Models

Large Language Models Humanize Technology

Towards Measuring and Modeling "Culture" in LLMs: A Survey

Fairness in Language Models Beyond English: Gaps and Challenges

"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations

Having Beer after Prayer? Measuring Cultural Bias in Large Language Models

Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting

The Application of Large Language Models Reducing Cultural Barriers in International Trade: A Perspective from Cultural Conflicts, Potential and Obstacles