Toward best research practices in AI Psychology

Anna A. Ivanova
2024-10-29
Abstract:Language models have become an essential part of the burgeoning field of AI Psychology. I discuss 14 methodological considerations that can help design more robust, generalizable studies evaluating the cognitive abilities of language-based AI systems, as well as to accurately interpret the results of these studies.
Artificial Intelligence,Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the methodological challenges faced when conducting research in Artificial Intelligence Psychology (AI Psychology). With the wide application of language models in cognitive ability assessment, how to design more robust and generalizable research methods and how to accurately interpret these research results have become important issues that current researchers need to face. Specifically, the paper discusses 14 methodological considerations, aiming to help researchers better assess the cognitive abilities of language - based artificial intelligence systems and accurately interpret the results of these assessments. The paper points out that although it is relatively easy to conduct cognitive tests on AI systems, interpreting these test results is very complicated. For example, when a model passes a test that humans use to assess a certain cognitive ability, it does not necessarily mean that the model actually has this ability. Therefore, the paper proposes multiple "should - do" (DOs) and "should - not - do" (DON'Ts) suggestions to guide the design, implementation of research and the interpretation of results. These suggestions cover multiple aspects, from determining what the model may have learned during the training process, considering alternative strategies for the model to solve problems, to the design of control conditions, etc., to ensure the validity and reliability of the research. In summary, this paper aims to provide a set of methodological guidelines to help researchers overcome various challenges encountered in AI psychology research, thereby promoting the healthy development of this field.