MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Valerio Biscione,Dong Yin,Gaurav Malhotra,Marin Dujmovic,Milton L. Montero,Guillermo Puebla,Federico Adolfi,Rachel F. Heaton,John E. Hummel,Benjamin D. Evans,Karim Habashy,Jeffrey S. Bowers

2024-04-08

Abstract:Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at

Computer Vision and Pattern Recognition,Artificial Intelligence

What problem does this paper attempt to address?

The paper aims to address the problem of how to more effectively evaluate the similarity between deep neural networks (DNNs) and human visual perception and object recognition capabilities. Specifically, the paper introduces a toolbox named MindSet: Vision, which includes a series of image datasets and related scripts for testing DNN performance on 30 psychological findings. The stimuli under these experimental conditions are systematically manipulated to test specific hypotheses about human visual perception and object recognition. Additionally, the toolbox provides pre-generated datasets and code, allowing users to customize parameters based on different research contexts, thereby greatly expanding the applicability of the datasets. It also offers three different methods to test DNNs (i.e., similarity judgment analysis, out-of-distribution classification, and decoder methods). In this way, the authors hope to bridge the gap between computational modeling and psychological research, promoting further exploration of other human psychological domains in machines, such as memory, language, and speech perception.

MindSet: Vision. A toolbox for testing DNNs on key psychological experiments

Seeing eye-to-eye? A comparison of object recognition performance in humans and deep convolutional neural networks under image manipulation

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Controversial stimuli: Pitting neural networks against each other as models of human cognition

Controversial stimuli: pitting neural networks against each other as models of human recognition

NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models

Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex

Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study

The Neural Representation Benchmark and its Evaluation on Brain and Machine

Partial success in closing the gap between human and machine vision

BOLD5000: A public fMRI dataset of 5000 images

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors

Wake Vision: A Tailored Dataset and Benchmark Suite for TinyML Computer Vision Applications

The 3D-PC: a benchmark for visual perspective taking in humans and machines

A large-scale examination of inductive biases shaping high-level visual representation in brains and machines

Brain-like emergent properties in deep networks: impact of network architecture, datasets and training

UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling

Deep Learning Human Mind for Automated Visual Classification

The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects

Soft tissue case 61.

DNNBrain: A Unifying Toolbox for Mapping Deep Neural Networks and Brains