MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Valerio Biscione,Dong Yin,Gaurav Malhotra,Marin Dujmovic,Milton L. Montero,Guillermo Puebla,Federico Adolfi,Rachel F. Heaton,John E. Hummel,Benjamin D. Evans,Karim Habashy,Jeffrey S. Bowers
2024-04-08
Abstract:Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the problem of how to more effectively evaluate the similarity between deep neural networks (DNNs) and human visual perception and object recognition capabilities. Specifically, the paper introduces a toolbox named MindSet: Vision, which includes a series of image datasets and related scripts for testing DNN performance on 30 psychological findings. The stimuli under these experimental conditions are systematically manipulated to test specific hypotheses about human visual perception and object recognition. Additionally, the toolbox provides pre-generated datasets and code, allowing users to customize parameters based on different research contexts, thereby greatly expanding the applicability of the datasets. It also offers three different methods to test DNNs (i.e., similarity judgment analysis, out-of-distribution classification, and decoder methods). In this way, the authors hope to bridge the gap between computational modeling and psychological research, promoting further exploration of other human psychological domains in machines, such as memory, language, and speech perception.