Abstract:According to a classical view of face perception (Bruce and Young, 1986; Haxby et al., 2000), face identity and facial expression recognition are performed by separate neural substrates (ventral and lateral temporal face-selective regions, respectively). However, recent studies challenge this view, showing that expression valence can also be decoded from ventral regions (Skerry and Saxe, 2014; Li et al., 2019), and identity from lateral regions (Anzellotti and Caramazza, 2017). These findings could be reconciled with the classical view if regions specialized for one task (either identity or expression) contain a small amount of information for the other task (that enables above-chance decoding). In this case, we would expect representations in lateral regions to be more similar to representations in deep convolutional neural networks (DCNNs) trained to recognize facial expression than to representations in DCNNs trained to recognize face identity (the converse should hold for ventral regions). We tested this hypothesis by analyzing neural responses to faces varying in identity and expression. Representational dissimilarity matrices (RDMs) computed from human intracranial recordings (n = 11 adults; 7 females) were compared with RDMs from DCNNs trained to label either identity or expression. We found that RDMs from DCNNs trained to recognize identity correlated with intracranial recordings more strongly in all regions tested-even in regions classically hypothesized to be specialized for expression. These results deviate from the classical view, suggesting that face-selective ventral and lateral regions contribute to the representation of both identity and expression.SIGNIFICANCE STATEMENT Previous work proposed that separate brain regions are specialized for the recognition of face identity and facial expression. However, identity and expression recognition mechanisms might share common brain regions instead. We tested these alternatives using deep neural networks and intracranial recordings from face-selective brain regions. Deep neural networks trained to recognize identity and networks trained to recognize expression learned representations that correlate with neural recordings. Identity-trained representations correlated with intracranial recordings more strongly in all regions tested, including regions hypothesized to be expression specialized in the classical hypothesis. These findings support the view that identity and expression recognition rely on common brain regions. This discovery may require reevaluation of the roles that the ventral and lateral neural pathways play in processing socially relevant stimuli.

Face recognition depends on specialized mechanisms tuned to view-invariant facial features: Insights from deep neural networks optimized for face or object recognition

Seeing eye-to-eye? A comparison of object recognition performance in humans and deep convolutional neural networks under image manipulation

Face processing emerges from object-trained convolutional neural networks

Humans and deep networks largely agree on which kinds of variation make object recognition harder

Modeling Biological Face Recognition with Deep Convolutional Neural Networks

Concurrent emergence of view invariance, sensitivity to critical features, and identity face classification through visual experience: Insights from deep learning algorithms

Deep Convolutional Neural Network Features and the Original Image

Improved object recognition using neural networks trained to mimic the brain's statistical properties

Brain-like functional specialization emerges spontaneously in deep neural networks

Comparing object recognition in humans and deep convolutional neural networks -- An eye tracking study

Deep learning algorithms reveal a new visual-semantic representation of familiar faces in human perception and memory

Implementation-independent Representation for Deep Convolutional Neural Networks and Humans in Processing Faces

Optimized Visual Recognition Through a Deep Convolutional Neural Network With Hierarchical Modular Organization

The Face Inversion Effect in Deep Convolutional Neural Networks

Explaining face representation in the primate brain using different computational models

Pursuing Face Identity From View-Specific Representation To View-Invariant Representation

Intracranial Electroencephalography and Deep Neural Networks Reveal Shared Substrates for Representations of Face Identity and Expressions

Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition

Single Unit Status in Deep Convolutional Neural Network Codes for Face Identification: Sparseness Redefined

Understanding of Facial Features in Face Perception: Insights from Deep Convolutional Neural Networks

Common Sequential Organization of Face Processing in the Human Brain and Convolutional Neural Networks