Unsigned temporal difference errors in cortical L5 dendrites during learning
Gwendolin Schoenfeld,Sepp Kollmorgen,Matthias Chinyen Tsai,Christopher M Lewis,Shuting Han,Philipp Bethge,Anna Maria Reuss,Adriano Aguzzi,Walter Senn,Valerio Mante,Fritjof Helmchen
DOI: https://doi.org/10.1101/2021.12.28.474360
2024-10-05
Abstract:Learning goal-directed behavior requires association of pertinent sensory stimuli with behaviorally relevant outcomes. In the mammalian neocortex, dendrites of pyramidal neurons are suitable association sites but how their activities adapt during learning remains elusive. Here, we track calcium signals in apical dendrites of layer 5 (L5) pyramidal neurons in mouse barrel cortex during texture discrimination learning. We observe diverse task-related responses, either localized to branches or widespread throughout the apical tuft. However, even in expert mice, the tufts' capability to discriminate go/no-go stimuli remains poor. Yet, we identify two prevailing response types in dendritic branches: 1) responses to unexpected outcome (reward) in naïve mice that decrease with growing task proficiency, and 2) responses associated with salient sensory stimuli, especially the outcome-predicting texture touch, that strengthen upon learning. We demonstrate that these response types match distinct unsigned components of the temporal difference error5 by replicating our results with a reinforcement learning model. Moreover, optogenetic apical inhibition of L5 neurons during the outcome time window prevents naïve animals from learning the task, consistent with the effect of equivalent model perturbation. Our findings suggest that salience signals in L5 apical dendrites facilitate the recruitment of task-relevant neurons via dendritic gain modulation.
Neuroscience