Abstract:Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) unravel new capabilities for AI personal assistants to overcome cognitive bandwidth limitations of humans, providing decision support or even direct representation of human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision-making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating with high realism more than >50K LLM voting personas in 81 real-world voting elections, we disentangle the nature of different biases in LLMS (GPT 3, GPT 3.5, and Llama2). Complex preferential ballot formats exhibit significant inconsistencies compared to simpler majoritarian elections that show higher consistency. Strikingly though, by demonstrating for the first time in real-world a proportional representation of voters in direct democracy, we are also able to show that fair ballot aggregation methods, such as equal shares, prove to be a win-win: fairer voting outcomes for humans with fairer AI representation. This novel underlying relationship proves paramount for democratic resilience in progressives scenarios with low voters turnout and voter fatigue supported by AI representatives: abstained voters are mitigated by recovering highly representative voting outcomes that are fairer. These interdisciplinary insights provide remarkable foundations for science, policymakers, and citizens to develop safeguards and resilience for AI risks in democratic innovations.

Automated Parliaments: A Solution to Decision Uncertainty and Misalignment in Language Models

Large Legislative Models: Towards Efficient AI Policymaking in Economic Simulations

Deliberating with AI: Improving Decision-Making for the Future through Participatory AI Design and Stakeholder Deliberation

From Experts to the Public: Governing Multimodal Language Models in Politically Sensitive Video Analysis

AI Language Models Could Both Help and Harm Equity in Marine Policymaking: The Case Study of the BBNJ Question-Answering Bot

Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models

Envisioning a Human-AI collaborative system to transform policies into decision models

Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies

Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

Can We Trust AI Agents? An Experimental Study Towards Trustworthy LLM-Based Multi-Agent Systems for AI Ethics

Large Language Models in Politics and Democracy: A Comprehensive Survey

Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates

Constitutional AI: Harmlessness from AI Feedback

Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making

Collective Constitutional AI: Aligning a Language Model with Public Input

LLM Voting: Human Choices and AI Collective Decision Making

Artificial Intelligence for EU Decision-Making. Effects on Citizens Perceptions of Input, Throughput and Output Legitimacy

Computer Says I Don’t Know: An Empirical Approach to Capture Moral Uncertainty in Artificial Intelligence

Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory