Measuring Human and AI Values based on Generative Psychometrics with Large Language Models

Haoran Ye,Yuhang Xie,Yuanyi Ren,Hanjun Fang,Xin Zhang,Guojie Song
2024-09-19
Abstract:Human values and their measurement are long-standing interdisciplinary inquiry. Recent advances in AI have sparked renewed interest in this area, with large language models (LLMs) emerging as both tools and subjects of value measurement. This work introduces Generative Psychometrics for Values (GPV), an LLM-based, data-driven value measurement paradigm, theoretically grounded in text-revealed selective perceptions. We begin by fine-tuning an LLM for accurate perception-level value measurement and verifying the capability of LLMs to parse texts into perceptions, forming the core of the GPV pipeline. Applying GPV to human-authored blogs, we demonstrate its stability, validity, and superiority over prior psychological tools. Then, extending GPV to LLM value measurement, we advance the current art with 1) a psychometric methodology that measures LLM values based on their scalable and free-form outputs, enabling context-specific measurement; 2) a comparative analysis of measurement paradigms, indicating response biases of prior methods; and 3) an attempt to bridge LLM values and their safety, revealing the predictive power of different value systems and the impacts of various values on LLM safety. Through interdisciplinary efforts, we aim to leverage AI for next-generation psychometrics and psychometrics for value-aligned AI.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Measurement of Human Values**: Traditional psychological measurement tools (such as self-report questionnaires) have issues like response bias, high resource demands, and difficulty in capturing real behavior. This paper proposes a generative psychometric method (GPV) based on large language models (LLMs) to measure personal values by analyzing perceived information in text. 2. **Measurement of LLMs' Values**: As LLMs become more prevalent in public applications, it is crucial to reliably measure their values. Existing measurement methods (such as self-report questionnaires) are not fully applicable to LLMs and have static, non-scalable issues. GPV addresses these problems by dynamically generating perceived information and enabling context-relevant measurement based on LLMs' outputs. The paper demonstrates the effectiveness and superiority of GPV through the following specific steps: - **Model Training and Validation**: Fine-tuning the Llama 3 model to achieve perception-level value measurement. Experimental results show that this model outperforms other advanced models in perception relevance and tendency classification. - **Application to Human Blog Data**: Analyzing 791 blog posts to validate GPV's performance in terms of stability, construct validity, concurrent validity, and predictive validity, showing it to be superior to traditional tools. - **Measurement of LLMs' Values**: Evaluating 17 LLMs under four different value theories, the results show that GPV significantly outperforms existing tools in construct validity and reveals the impact of different value systems on the safety of LLMs. In summary, the paper aims to improve the accuracy and flexibility of measuring human and LLMs' values by introducing a new LLM-driven psychometric method.