Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception?

Felix A. Wichmann, Robert Geirhos
2023-05-26
Abstract: Deep neural networks (DNNs) are machine learning algorithms that have revolutionised computer vision due to their remarkable successes in tasks like object classification and segmentation. The success of DNNs as computer vision algorithms has led to the suggestion that DNNs may also be good models of human visual perception. We here review evidence regarding current DNNs as adequate behavioural models of human core object recognition. To this end, we argue that it is important to distinguish between statistical tools and computational models, and to understand model quality as a multidimensional concept where clarity about modelling goals is key. Reviewing a large number of psychophysical and computational explorations of core object recognition performance in humans and DNNs, we argue that DNNs are highly valuable scientific tools but that as of today DNNs should only be regarded as promising -- but not yet adequate -- computational models of human core object recognition behaviour. On the way we dispel a number of myths surrounding DNNs in vision science.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning, Neurons and Cognition
What problem does this paper attempt to address?
This paper asks whether deep neural networks (DNNs) are adequate behavioral models of human visual perception, specifically of core object recognition. The authors review a large number of psychophysical experiments and computational studies that compare DNN and human performance on core object recognition tasks. They stress the distinction between statistical tools and computational models, and argue that model quality is a multidimensional concept that can only be judged against clearly stated modeling goals. Their assessment is that DNNs are highly valuable scientific tools but, as of today, only promising and not yet adequate computational models of human core object recognition behavior. The paper also dispels several myths about DNNs in vision science and suggests using more modern DNN architectures instead of the outdated AlexNet for research.