Abstract:Capacity limitations in visual tasks can be observed when the number of task-related objects increases. An influential idea is that such capacity limitations are determined by competition at the neural level: two objects that are encoded by shared neural populations interfere more in behavior (e.g., visual search) than two objects encoded by separate neural populations. However, the neural representational similarity of objects varies across brain regions and across time, raising the question of where and when competition determines task performance. Furthermore, it is unclear whether the association between neural representational similarity and task performance is common or unique across tasks. Here, we used neural representational similarity derived from fMRI, MEG, and deep neural networks (DNN) to predict performance on two visual search tasks involving the same objects and requiring the same responses but differing in instructions: cued visual search and oddball visual search. Separate groups of human participants (both sexes) viewed the individual objects in neuroimaging experiments to establish the neural representational similarity between those objects. Results showed that performance on both search tasks could be predicted by neural representational similarity throughout the visual system (fMRI), from 80 msec after onset (MEG), and in all DNN layers. Stepwise regression analysis, however, revealed task-specific associations, with unique variability in oddball search performance predicted by early/posterior neural similarity, and unique variability in cued search task performance predicted by late/anterior neural similarity. These results reveal that capacity limitations in superficially similar visual search tasks may reflect competition at different stages of visual processing. Significance Statement Visual search for target objects is slowed down by the presence of distractors, but not all distractors are equally distracting – the more similar a distractor is to the target, the more it slows down search. Here, we used fMRI, MEG, and a deep neural network to reveal where, when, and how neural similarity between targets and distractors predicts visual search performance across two search tasks: oddball visual search (locating the different-looking object) and cued visual search (locating the cued object). Results also revealed brain regions, time points, and feature levels that predicted task-unique performance. These results provide a neural basis for similarity theories of visual search and show that this neural basis differs across visual search tasks.

Visual search and real-image similarity: An empirical assessment through the lens of deep learning

Predicting cued and oddball visual search performance from fMRI, MEG, and DNN neural representational similarity

Seeing eye-to-eye? A comparison of object recognition performance in humans and deep convolutional neural networks under image manipulation

Predicting cued and oddball visual search performance from neural representational similarity

Modeling Human Visual Search Performance on Realistic Webpages Using Analytical and Deep Learning Methods

A high-throughput approach for the efficient prediction of perceived similarity of natural objects

Visual Attention driven by Convolutional Features

Deep Learning for Content-Based Image Retrieval: A Comprehensive Study

Top-Down Priors Disambiguate Target and Distractor Features in Simulated Covert Visual Search

Learn and Search: An Elegant Technique for Object Lookup using Contrastive Learning

Predicting Visual Attention and Distraction During Visual Search Using Convolutional Neural Networks

Target-distractor similarity predicts visual search efficiency but only for highly similar features

Visual search as an embodied process: The effects of perspective change and external reference on search performance

Efficient Discovery and Effective Evaluation of Visual Perceptual Similarity: A Benchmark and Beyond

Learning an Adaptation Function to Assess Image Visual Similarities

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data

Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study

Modelling Visual Search with the Selective Attention for Identification Model (VS-SAIM): A Novel Explanation for Visual Search Asymmetries

Visual search: Attentional neurodynamics at work

A computational model of serial and parallel processing in visual search

Correlation of Object Detection Performance with Visual Saliency and Depth Estimation