Abstract: Allowing machines to choose whether to kill humans would be devastating for world peace and security. But how do we equip machines with the ability to learn ethical or even moral choices? In this study, we show that applying machine learning to human texts can extract deontological ethical reasoning about "right" and "wrong" conduct. We create a template list of prompts and responses, such as "Should I [action]?", "Is it okay to [action]?", etc., with corresponding answers of "Yes/no, I should (not)." and "Yes/no, it is (not)." For a single template, the model's bias score is the difference between the model's score of the positive response ("Yes, I should") and that of the negative response ("No, I should not"). For a given choice, the model's overall bias score is the mean of the bias scores of all question/answer templates paired with that choice. Specifically, the resulting model, called the Moral Choice Machine (MCM), calculates the bias score on a sentence level using embeddings of the Universal Sentence Encoder, since the moral value of an action to be taken depends on its context. It is objectionable to kill living beings, but it is fine to kill time. It is essential to eat, yet one might not eat dirt. It is important to spread information, yet one should not spread misinformation. Our results indicate that text corpora contain recoverable and accurate imprints of our social, ethical and moral choices, even with context information. Moreover, training the Moral Choice Machine on different temporal news and book corpora from the year 1510 to 2008/2009 demonstrates the evolution of moral and ethical choices over different time periods, for both atomic actions and actions with context information. Training it on different cultural sources, such as the Bible and the constitutions of different countries, reveals the dynamics of moral choices in culture, including technology. That is, moral biases can be extracted, quantified, tracked, and compared across cultures and over time.
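The bias score described in the abstract can be illustrated with a minimal sketch. Several things here are assumptions rather than details from the abstract: the "score" of a response is taken to be the cosine similarity between the question and answer embeddings, and `toy_embed` is a hypothetical stand-in (hashed character trigrams) used only so the example runs without downloading the Universal Sentence Encoder. Its outputs carry no moral meaning; only the structure of the computation (per-template difference, then the mean over templates) mirrors the text.

```python
import hashlib
import math

# Question/answer templates quoted in the abstract:
# (question, positive answer, negative answer).
TEMPLATES = [
    ("Should I {}?", "Yes, I should.", "No, I should not."),
    ("Is it okay to {}?", "Yes, it is.", "No, it is not."),
]


def toy_embed(sentence, dim=64):
    """Hypothetical stand-in for a sentence encoder: hashed character trigrams.

    The MCM uses the Universal Sentence Encoder; this toy function only
    keeps the example self-contained.
    """
    vec = [0.0] * dim
    text = sentence.lower()
    for i in range(len(text) - 2):
        digest = hashlib.md5(text[i:i + 3].encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    return vec


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def template_bias(action, template, embed=toy_embed):
    """Score of the positive answer minus score of the negative answer.

    Assumption: a response's score is its cosine similarity to the question.
    """
    question, yes, no = template
    q = embed(question.format(action))
    return cosine(q, embed(yes)) - cosine(q, embed(no))


def moral_bias(action, templates=TEMPLATES, embed=toy_embed):
    """Overall bias: mean of the per-template bias scores for one action."""
    scores = [template_bias(action, t, embed) for t in templates]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    for action in ["kill people", "kill time", "eat", "eat dirt"]:
        print(f"{action!r}: {moral_bias(action):+.4f}")
```

To approximate the setting in the abstract, a real sentence encoder would be passed as the `embed` argument in place of `toy_embed`; because whole sentences are embedded, context such as "kill time" versus "kill people" can shift the score.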

Using Machine Learning to Guide Cognitive Modeling: A Case Study in Moral Reasoning

Scaling up Psychology via Scientific Regret Minimization: A Case Study in Moral Decisions

A Computational Model of Commonsense Moral Decision Making

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Cognitive Models as Simulators: The Case of Moral Decision-Making

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment

Can Machines Learn Morality? The Delphi Experiment

From computational ethics to morality: how decision-making algorithms can help us understand the emergence of moral principles, the existence of an optimal behaviour and our ability to discover it

Human Decisions and Machine Predictions

Capturing the Complexity of Human Strategic Decision-Making with Machine Learning

The Moral Mind(s) of Large Language Models

The moral machine experiment on large language models

The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making

Behavior-Based Machine-Learning: A Hybrid Approach for Predicting Human Decision Making

Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning

The Moral Choice Machine

Using large-scale experiments and machine learning to discover theories of human decision-making

Large-scale moral machine experiment on large language models

Can Machine Learning be Moral?

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Machine Learning Approaches for Principle Prediction in Naturally Occurring Stories