Machine Mindset: An MBTI Exploration of Large Language Models

Jiaxi Cui,Liuzhenghao Lv,Jing Wen,Rongsheng Wang,Jing Tang,YongHong Tian,Li Yuan
2024-06-02
Abstract:We present a novel approach for integrating Myers-Briggs Type Indicator (MBTI) personality traits into large language models (LLMs), addressing the challenges of personality consistency in personalized AI. Our method, "Machine Mindset," involves a two-phase fine-tuning and Direct Preference Optimization (DPO) to embed MBTI traits into LLMs. This approach ensures that models internalize these traits, offering a stable and consistent personality profile. We demonstrate the effectiveness of our models across various domains, showing alignment between model performance and their respective MBTI traits. The paper highlights significant contributions in the development of personality datasets and a new training methodology for personality integration in LLMs, enhancing the potential for personalized AI applications. We also open-sourced our model and part of the data at \url{<a class="link-external link-https" href="https://github.com/PKU-YuanGroup/Machine-Mindset" rel="external noopener nofollow">this https URL</a>}.
Computation and Language
What problem does this paper attempt to address?
This paper proposes a new method to integrate Myers-Briggs Type Indicator (MBTI) personality traits into Large Language Models (LLMs) to address the issue of personality consistency in personalized AI. In this study, the authors present the "Machine Mindset" framework, which embeds MBTI characteristics through two-stage fine-tuning and Direct Preference Optimization (DPO) to ensure the internalization of these traits and the formation of a stable and consistent personality profile. The experiments demonstrate that the model's performance aligns with its corresponding MBTI features, validating the effectiveness of this approach. The main contributions of this paper include: 1. Establishing a personality dataset based on MBTI, including behavioral and self-awareness datasets. 2. Proposing a training method to inject specific personalities into the model, including two-stage supervised fine-tuning and Direct Preference Optimization. 3. The trained model is capable of learning behavioral patterns specific to certain personalities and acquiring corresponding self-awareness of these personalities. The research also involves constructing datasets from different domains such as law, patents, and general aptitude tests, and the experimental results verify the alignment between the performance of different personality models and their corresponding personality traits. Furthermore, the consistency in data processing and training processes reduces reliance on specific LLMs, facilitating the integration of new models or general LLMs. The paper discusses the shortcomings of previous work in data construction and model training, and compares it with traditional methods. Finally, the authors explore potential application areas and future research directions.