Malarone-donation programme

W. Foege

DOI: https://doi.org/10.1016/S0140-6736(05)64045-7

IF: 202.731

1997-11-29

The Lancet

Abstract:

What problem does this paper attempt to address?

Surprise-minimization as a solution to the structural credit assignment problem

Franz Wurm,Benjamin Ernst,Marco Steinhauser

DOI: https://doi.org/10.1371/journal.pcbi.1012175

2024-05-29

PLoS Computational Biology

Abstract:The structural credit assignment problem arises when the causal structure between actions and subsequent outcomes is hidden from direct observation. To solve this problem and enable goal-directed behavior, an agent has to infer structure and form a representation thereof. In the scope of this study, we investigate a possible solution in the human brain. We recorded behavioral and electrophysiological data from human participants in a novel variant of the bandit task, where multiple actions lead to multiple outcomes. Crucially, the mapping between actions and outcomes was hidden and not instructed to the participants. Human choice behavior revealed clear hallmarks of credit assignment and learning. Moreover, a computational model which formalizes action selection as the competition between multiple representations of the hidden structure was fit to account for participants data. Starting in a state of uncertainty about the correct representation, the central mechanism of this model is the arbitration of action control towards the representation which minimizes surprise about outcomes. Crucially, single-trial latent-variable analysis reveals that the neural patterns clearly support central quantitative predictions of this surprise minimization model. The results suggest that neural activity is not only related to reinforcement learning under correct as well as incorrect task representations but also reflects central mechanisms of credit assignment and behavioral arbitration. In naturalistic environments, causal relationships between actions and their consequences are often hidden from direct observation. To overcome this structural credit-assignment problem, agents have to infer causal structures from experience. Here, we developed a computational model which formalizes action selection as the competition between structural representations, while action control is arbitrated towards the representation that minimizes surprise over time. To validate this model, we recorded behavioral and electrophysiological data from human participants in a novel task in which independent decisions are followed by outcomes, whereby the decision-outcome mapping is unknown. The model could account for patterns of choice behavior revealing clear hallmarks of credit assignment. Model-based analysis of EEG activity confirmed central model characteristics of concurrent prediction errors and a signature of evidence accumulation and behavioral arbitration. These findings highlight a key role of surprise minimization for both value and representation learning and reveal neural correlates of credit assignment.

biochemical research methods,mathematical & computational biology
Policy adjustment in a dynamic economic game

Jian Li,Samuel M McClure,Brooks King-Casas,P Read Montague

DOI: https://doi.org/10.1371/journal.pone.0000103

IF: 3.7

2006-12-20

PLoS ONE

Abstract:Making sequential decisions to harvest rewards is a notoriously difficult problem. One difficulty is that the real world is not stationary and the reward expected from a contemplated action may depend in complex ways on the history of an animal's choices. Previous functional neuroimaging work combined with principled models has detected brain responses that correlate with computations thought to guide simple learning and action choice. Those works generally employed instrumental conditioning tasks with fixed action-reward contingencies. For real-world learning problems, the history of reward-harvesting choices can change the likelihood of rewards collected by the same choices in the near-term future. We used functional MRI to probe brain and behavioral responses in a continuous decision-making task where reward contingency is a function of both a subject's immediate choice and his choice history. In these more complex tasks, we demonstrated that a simple actor-critic model can account for both the subjects' behavioral and brain responses, and identified a reward prediction error signal in ventral striatal structures active during these non-stationary decision tasks. However, a sudden introduction of new reward structures engages more complex control circuitry in the prefrontal cortex (inferior frontal gyrus and anterior insula) and is not captured by a simple actor-critic model. Taken together, these results extend our knowledge of reward-learning signals into more complex, history-dependent choice tasks. They also highlight the important interplay between striatum and prefrontal cortex as decision-makers respond to the strategic demands imposed by non-stationary reward environments more reminiscent of real-world tasks.
Hyperglycemia enhances kidney cell injury in HIVAN through down-regulation of vitamin D receptors.

Partab Rai,Tejinder Singh,Rivka Lederman,Amrita Chawla,Dileep Kumar,K. Cheng,G. Valecha,P. Mathieson,M. Saleem,A. Malhotra,P. Singhal

DOI: https://doi.org/10.1016/j.cellsig.2014.12.011

IF: 4.85

2015-03-01

Cellular Signalling

Abstract:
Fast adaptation to rule switching using neuronal surprise

Martin L L R Barry,Wulfram Gerstner,Martin L. L. R. Barry

DOI: https://doi.org/10.1371/journal.pcbi.1011839

2024-02-21

PLoS Computational Biology

Abstract:In humans and animals, surprise is a physiological reaction to an unexpected event, but how surprise can be linked to plausible models of neuronal activity is an open problem. We propose a self-supervised spiking neural network model where a surprise signal is extracted from an increase in neural activity after an imbalance of excitation and inhibition. The surprise signal modulates synaptic plasticity via a three-factor learning rule which increases plasticity at moments of surprise. The surprise signal remains small when transitions between sensory events follow a previously learned rule but increases immediately after rule switching. In a spiking network with several modules, previously learned rules are protected against overwriting, as long as the number of modules is larger than the total number of rules—making a step towards solving the stability-plasticity dilemma in neuroscience. Our model relates the subjective notion of surprise to specific predictions on the circuit level. Everybody knows the subjective feeling of surprise and behavioral reactions to surprising events such as startle response and pupil dilation are widely studied—but how can surprise arise from neural activity? And why is surprise useful? To answer these questions we use a modeling approach. We design a self-supervised spiking neural network capable of extracting surprising information from its own activity. Surprise is measured by a mismatch between the representation of the current stimulus inside the model and the expectations of the model given previous stimuli. We propose a specific network architecture which allows the network—in combination with a three-factor NeoHebbian learning rule—to detect rule changes, signal these changes as a surprise signal, and in turn use the surprise signal to rapidly re-adapt the model's predictions of possible next stimuli. Our bottom-up model presents a concrete hypothesis of a bio-plausible implementation of surprise and makes several specific experimental predictions for future in vivo studies.

biochemical research methods,mathematical & computational biology
Cognitive mechanisms of learning in sequential decision-making under uncertainty: an experimental and theoretical approach

Gloria Cecchini,Michael DePass,Emre Baspinar,Marta Andujar,Surabhi Ramawat,Pierpaolo Pani,Stefano Ferraina,Alain Destexhe,Rubén Moreno-Bote,Ignasi Cos

DOI: https://doi.org/10.3389/fnbeh.2024.1399394

2024-08-12

Abstract:Learning to make adaptive decisions involves making choices, assessing their consequence, and leveraging this assessment to attain higher rewarding states. Despite vast literature on value-based decision-making, relatively little is known about the cognitive processes underlying decisions in highly uncertain contexts. Real world decisions are rarely accompanied by immediate feedback, explicit rewards, or complete knowledge of the environment. Being able to make informed decisions in such contexts requires significant knowledge about the environment, which can only be gained via exploration. Here we aim at understanding and formalizing the brain mechanisms underlying these processes. To this end, we first designed and performed an experimental task. Human participants had to learn to maximize reward while making sequences of decisions with only basic knowledge of the environment, and in the absence of explicit performance cues. Participants had to rely on their own internal assessment of performance to reveal a covert relationship between their choices and their subsequent consequences to find a strategy leading to the highest cumulative reward. Our results show that the participants' reaction times were longer whenever the decision involved a future consequence, suggesting greater introspection whenever a delayed value had to be considered. The learning time varied significantly across participants. Second, we formalized the neurocognitive processes underlying decision-making within this task, combining mean-field representations of competing neural populations with a reinforcement learning mechanism. This model provided a plausible characterization of the brain dynamics underlying these processes, and reproduced each aspect of the participants' behavior, from their reaction times and choices to their learning rates. In summary, both the experimental results and the model provide a principled explanation to how delayed value may be computed and incorporated into the neural dynamics of decision-making, and to how learning occurs in these uncertain scenarios.
Neural Mechanisms of Human Decision-Making

Seth Herd,Kai Krueger,Ananta Nair,Jessica Mollick,Randall OReilly

DOI: https://doi.org/10.48550/arXiv.1912.07660

2019-12-17

Abstract:We present a computational and theoretical model of the neural mechanisms underlying human decision-making. We propose a detailed model of the interaction between brain regions, under a proposer-predictor-actor-critic framework. Task-relevant areas of cortex propose a candidate plan using fast, model-free, parallel constraint-satisfaction computations. Other areas of cortex and medial temporal lobe can then predict likely outcomes of that plan in this situation. This step is optional. This prediction-(or model-) based computation produces better accuracy and generalization, at the expense of speed. Next, linked regions of basal ganglia act to accept or reject the proposed plan based on its reward history in similar contexts. Finally the reward-prediction system acts as a critic to determine the value of the outcome relative to expectations, and produce dopamine as a training signal for cortex and basal ganglia. This model gains many constraints from the hypothesis that the mechanisms of complex human decision-making are closely analogous to those that have been empirically studied in detail for animal action-selection. We argue that by operating sequentially and hierarchically, these same mechanisms are responsible for the most complex human plans and decisions. Finally, we use the computational model to generate novel hypotheses on causes of human risky decision-making, and compare this to other theories of human decision-making.

Neurons and Cognition
Signals in Human Striatum Are Appropriate for Policy Update Rather Than Value Prediction

Jian Li,Nathaniel D. Daw

DOI: https://doi.org/10.1523/jneurosci.6316-10.2011

2011-01-01

Journal of Neuroscience

Abstract:Influential reinforcement learning theories propose that prediction error signals in the brain's nigrostriatal system guide learning for trial-and-error decision-making. However, since different decision variables can be learned from quantitatively similar error signals, a critical question is: what is the content of decision representations trained by the error signals? We used fMRI to monitor neural activity in a two-armed bandit counterfactual decision task that provided human subjects with information about forgone and obtained monetary outcomes so as to dissociate teaching signals that update expected values for each action, versus signals that train relative preferences between actions (a policy). The reward probabilities of both choices varied independently from each other. This specific design allowed us to test whether subjects' choice behavior was guided by policy-based methods, which directly map states to advantageous actions, or value-based methods such as Q-learning, where choice policies are instead generated by learning an intermediate representation (reward expectancy). Behaviorally, we found human participants' choices were significantly influenced by obtained as well as forgone rewards from the previous trial. We also found subjects' blood oxygen level-dependent responses in striatum were modulated in opposite directions by the experienced and forgone rewards but not by reward expectancy. This neural pattern, as well as subjects' choice behavior, is consistent with a teaching signal for developing habits or relative action preferences, rather than prediction errors for updating separate action values.
Dissociable Neural Representations of Reinforcement and Belief Prediction Errors Underlie Strategic Learning

Lusha Zhu,Kyle E. Mathewson,Ming Hsu

DOI: https://doi.org/10.1073/pnas.1116783109

IF: 11.1

2012-01-01

Proceedings of the National Academy of Sciences

Abstract:Decision-making in the presence of other competitive intelligent agents is fundamental for social and economic behavior. Such decisions require agents to behave strategically, where in addition to learning about the rewards and punishments available in the environment, they also need to anticipate and respond to actions of others competing for the same rewards. However, whereas we know much about strategic learning at both theoretical and behavioral levels, we know relatively little about the underlying neural mechanisms. Here, we show using a multi-strategy competitive learning paradigm that strategic choices can be characterized by extending the reinforcement learning (RL) framework to incorporate agents’ beliefs about the actions of their opponents. Furthermore, using this characterization to generate putative internal values, we used model-based functional magnetic resonance imaging to investigate neural computations underlying strategic learning. We found that the distinct notions of prediction errors derived from our computational model are processed in a partially overlapping but distinct set of brain regions. Specifically, we found that the RL prediction error was correlated with activity in the ventral striatum. In contrast, activity in the ventral striatum, as well as the rostral anterior cingulate (rACC), was correlated with a previously uncharacterized belief-based prediction error. Furthermore, activity in rACC reflected individual differences in degree of engagement in belief learning. These results suggest a model of strategic behavior where learning arises from interaction of dissociable reinforcement and belief-based inputs.
Dynamic reinforcement learning reveals time-dependent shifts in strategy during reward learning.

Sarah Jo C Venditto,Kevin J Miller,Carlos D Brody,Nathaniel D Daw

DOI: https://doi.org/10.1101/2024.02.28.582617

2024-10-07

Abstract:Different brain systems have been hypothesized to subserve multiple "experts" that compete to generate behavior. In reinforcement learning, two general processes, one model-free (MF) and one model-based (MB), are often modeled as a mixture of agents (MoA) and hypothesized to capture differences between automaticity vs. deliberation. However, shifts in strategy cannot be captured by a static MoA. To investigate such dynamics, we present the mixture-of-agents hidden Markov model (MoA-HMM), which simultaneously learns inferred action values from a set of agents and the temporal dynamics of underlying "hidden" states that capture shifts in agent contributions over time. Applying this model to a multi-step, reward-guided task in rats reveals a progression of within-session strategies: a shift from initial MB exploration to MB exploitation, and finally to reduced engagement. The inferred states predict changes in both response time and OFC neural encoding during the task, suggesting that these states are capturing real shifts in dynamics.

Neuroscience
How Instructed Knowledge Modulates the Neural Systems of Reward Learning

Jian Li,Mauricio R. Delgado,Elizabeth A. Phelps

DOI: https://doi.org/10.1073/pnas.1014938108

2010-01-01

Abstract:Recent research in neuroeconomics has demonstrated that the reinforcement learning model of reward learning captures the patterns of both behavioral performance and neural responses during a range of economic decision-making tasks. However, this powerful theoretical model has its limits. Trial-and-error is only one of the means by which individuals can learn the value associated with different decision options. Humans have also developed efficient, symbolic means of communication for learning without the necessity for committing multiple errors across trials. In the present study, we observed that instructed knowledge of cue-reward probabilities improves behavioral performance and diminishes reinforcement learning-related blood-oxygen level-dependent (BOLD) responses to feedback in the nucleus accumbens, ventromedial prefrontal cortex, and hippocampal complex. The decrease in BOLD responses in these brain regions to reward-feedback signals was functionally correlated with activation of the dorsolateral prefrontal cortex (DLPFC). These results suggest that when learning action values, participants use the DLPFC to dynamically adjust outcome responses in valuation regions depending on the usefulness of action-outcome information.
How cortico-basal ganglia-thalamic subnetworks can shift decision policies to maximize reward rate

Jyotika Bahuguna,Timothy V Verstynen,Jonathan Rubin

DOI: https://doi.org/10.1101/2024.05.21.595174

2024-05-22

Abstract:All mammals exhibit flexible decision policies that depend, at least in part, on the cortico-basal ganglia-thalamic (CBGT) pathways. Yet understanding how the complex connectivity, dynamics, and plasticity of CBGT circuits translates into experience-dependent shifts of decision policies represents a longstanding challenge in neuroscience. Here we used a computational approach to address this problem. Specifically, we simulated decisions driven by CBGT circuits under baseline, unrewarded conditions using a spiking neural network, and fit the resulting behavior to an evidence accumulation model. Using canonical correlation analysis, we then replicated the existence of three recently identified control ensembles (responsiveness, pliancy and choice) within CBGT circuits, with each ensemble mapping to a specific configuration of the evidence accumulation process. We subsequently simulated learning in a simple two-choice task with one optimal (i.e., rewarded) target. We find that value-based learning, via dopaminergic signals acting on cortico-striatal synapses, effectively manages the speed-accuracy tradeoff so as to increase reward rate over time. Within this process, learning-related changes in decision policy can be decomposed in terms of the contributions of each control ensemble, and these changes are driven by sequential reward prediction errors on individual trials. Our results provide a clear and simple mechanism for how dopaminergic plasticity shifts specific subnetworks within CBGT circuits so as to strategically modulate decision policies in order to maximize effective reward rate.

Neuroscience
Intracranial electroencephalography reveals effector-independent evidence accumulation dynamics in multiple human brain regions

Sabina Gherman,Noah Markowitz,Gelana Tostaeva,Elizabeth Espinal,Ashesh D. Mehta,Redmond G. O’Connell,Simon P. Kelly,Stephan Bickel

DOI: https://doi.org/10.1038/s41562-024-01824-9

IF: 24.252

2024-02-17

Nature Human Behaviour

Abstract:Neural representations of perceptual decision formation that are abstracted from specific motor requirements have previously been identified in humans using non-invasive electrophysiology; however, it is currently unclear where these originate in the brain. Here we capitalized on the high spatiotemporal precision of intracranial EEG to localize such abstract decision signals. Participants undergoing invasive electrophysiological monitoring for epilepsy were asked to judge the direction of random-dot stimuli and respond either with a speeded button press ( N = 24), or vocally, after a randomized delay ( N = 12). We found a widely distributed motor-independent network of regions where high-frequency activity exhibited key characteristics consistent with evidence accumulation, including a gradual buildup that was modulated by the strength of the sensory evidence, and an amplitude that predicted participants' choice accuracy and response time. Our findings offer a new view on the brain networks governing human decision-making.

psychology, experimental,neurosciences,multidisciplinary sciences
Action-modulated midbrain dopamine activity arises from distributed control policies

Jack Lindsey,Ashok Litwin-Kumar

DOI: https://doi.org/10.48550/arXiv.2207.00636

2022-07-01

Neurons and Cognition

Abstract:Animal behavior is driven by multiple brain regions working in parallel with distinct control policies. We present a biologically plausible model of off-policy reinforcement learning in the basal ganglia, which enables learning in such an architecture. The model accounts for action-related modulation of dopamine activity that is not captured by previous models that implement on-policy algorithms. In particular, the model predicts that dopamine activity signals a combination of reward prediction error (as in classic models) and "action surprise," a measure of how unexpected an action is relative to the basal ganglia's current policy. In the presence of the action surprise term, the model implements an approximate form of Q-learning. On benchmark navigation and reaching tasks, we show empirically that this model is capable of learning from data driven completely or in part by other policies (e.g. from other brain regions). By contrast, models without the action surprise term suffer in the presence of additional policies, and are incapable of learning at all from behavior that is completely externally driven. The model provides a computational account for numerous experimental findings about dopamine activity that cannot be explained by classic models of reinforcement learning in the basal ganglia. These include differing levels of action surprise signals in dorsal and ventral striatum, decreasing amounts movement-modulated dopamine activity with practice, and representations of action initiation and kinematics in dopamine activity. It also provides further predictions that can be tested with recordings of striatal dopamine activity.
Downstream processing of insect cell cultures

A. Bernard,M. Lusti‐Narasimhan,Kathryn M. Radford,R. Hale,E. Sebille,P. Graber

DOI: https://doi.org/10.1007/BF00350404

Abstract:
Brain network dynamics predict moments of surprise across contexts

Ziwei Zhang,Monica D. Rosenberg

DOI: https://doi.org/10.1101/2023.12.01.569271

2024-09-03

Abstract:We experience surprise when reality conflicts with our expectations. When we encounter such expectation violations in psychological tasks and daily life, are we experiencing completely different forms of surprise? Or is surprise a fundamental psychological process with shared neural bases across contexts? To address this question, we identified a brain network model, the surprise edge-fluctuation-based predictive model (EFPM), whose regional interaction dynamics measured with functional magnetic resonance imaging (fMRI) predicted surprise in an adaptive learning task. The same model generalized to predict surprise as a separate group of individuals watched suspenseful basketball games and as a third group watched videos violating psychological expectations. The surprise EFPM also uniquely predicts surprise, capturing expectation violations better than models built from other brain networks, fMRI measures, and behavioral metrics. These results suggest that shared neurocognitive processes underlie surprise across contexts and that distinct experiences can be translated into the common space of brain dynamics.

Neuroscience
Competing neural representations of choice shape evidence accumulation in humans

Krista Bond,Javier Rasero,Raghav Madan,Jyotika Bahuguna,Jonathan Rubin,Timothy Verstynen

DOI: https://doi.org/10.7554/eLife.85223

IF: 7.7

2023-10-11

eLife

Abstract:Making adaptive choices in dynamic environments requires flexible decision policies. Previously, we showed how shifts in outcome contingency change the evidence accumulation process that determines decision policies. Using in silico experiments to generate predictions, here we show how the cortico-basal ganglia-thalamic (CBGT) circuits can feasibly implement shifts in decision policies. When action contingencies change, dopaminergic plasticity redirects the balance of power, both within and between action representations, to divert the flow of evidence from one option to another. When competition between action representations is highest, the rate of evidence accumulation is the lowest. This prediction was validated in in vivo experiments on human participants, using fMRI, which showed that (1) evoked hemodynamic responses can reliably predict trial-wise choices and (2) competition between action representations, measured using a classifier model, tracked with changes in the rate of evidence accumulation. These results paint a holistic picture of how CBGT circuits manage and adapt the evidence accumulation process in mammals.
Decision-Making with Predictions of Others' Likely and Unlikely Choices in the Human Brain

Ning Ma,Norihiro Harasawa,Kenichi Ueno,Kang Cheng,Hiroyuki Nakahara

DOI: https://doi.org/10.1523/JNEUROSCI.2236-23.2024

2024-08-23

Abstract:For better decisions in social interactions, humans often must understand the thinking of others and predict their actions. Since such predictions are uncertain, multiple predictions may be necessary for better decision-making. However, the neural processes and computations underlying such social decision-making remain unclear. We investigated this issue by developing a behavioral paradigm and performing functional magnetic resonance imaging and computational modeling. In our task, female and male participants were required to predict others' choices in order to make their own value-based decisions, as the outcome depended on others' choices. Results showed, to make choices, the participants mostly relied on a value difference (primary) generated from the case where others would make a likely choice, but sometimes they additionally used another value difference (secondary) from the opposite case where others make an unlikely choice. We found that the activations in the posterior cingulate cortex (PCC) correlated with the primary difference while the activations in the right dorsolateral prefrontal cortex (rdlPFC) correlated with the secondary difference. Analysis of neural coupling and temporal dynamics suggested a three-step processing network, beginning with the left amygdala signals for predictions of others' choices. Modulated by these signals, the PCC and rdlPFC reflect the respective value differences for self-decisions. Finally, the medial prefrontal cortex integrated these decision signals for a final decision. Our findings elucidate the neural process of constructing value-based decisions by predicting others and illuminate their key variables with social modulations, providing insight into the differential functional roles of these brain regions in this process.Significance Statement In daily life, to adjust our decisions, we constantly predict others' choices, but the inherent uncertainty means we face multiple scenarios for different choices by others. Using computational modeling-based fMRI, we identified a network in three-stage computations for such decision-making. Amygdala signals represent predictions of others' choices. These signals then interact with the posterior cingulate cortex and dorsolateral prefrontal cortex, representing the decision variables for the prediction of others' likely and unlikely choices, respectively. Finally, these signals modulate the medial prefrontal cortex, influencing our final choices. These pivotal variables and their corresponding brain signals play a fundamental role in a broad range of social cognitive processes. Our findings shed light on underlying mechanisms for complex social interactions in human behavior.
Beyond Reward Prediction Errors: Human Striatum Updates Rule Values During Learning

Ian Ballard,Eric M Miller,Steven T Piantadosi,Noah D Goodman,Samuel M McClure

DOI: https://doi.org/10.1093/cercor/bhx259

IF: 4.861

2017-10-13

Cerebral Cortex

Abstract:Abstract Humans naturally group the world into coherent categories defined by membership rules. Rules can be learned implicitly by building stimulus-response associations using reinforcement learning or by using explicit reasoning. We tested if the striatum, in which activation reliably scales with reward prediction error, would track prediction errors in a task that required explicit rule generation. Using functional magnetic resonance imaging during a categorization task, we show that striatal responses to feedback scale with a “surprise” signal derived from a Bayesian rule-learning model and are inconsistent with RL prediction error. We also find that striatum and caudal inferior frontal sulcus (cIFS) are involved in updating the likelihood of discriminative rules. We conclude that the striatum, in cooperation with the cIFS, is involved in updating the values assigned to categorization rules when people learn using explicit reasoning.

neurosciences
A multi-stage anticipated surprise model with dynamic expectation for economic decision-making

Ho Ka Chan,Taro Toyoizumi

DOI: https://doi.org/10.1038/s41598-023-50529-y

IF: 4.6

2024-01-06

Scientific Reports

Abstract:There are many modeling works that aim to explain people's behaviors that violate classical economic theories. However, these models often do not take into full account the multi-stage nature of real-life problems and people's tendency in solving complicated problems sequentially. In this work, we propose a descriptive decision-making model for multi-stage problems with perceived post-decision information. In the model, decisions are chosen based on an entity which we call the 'anticipated surprise'. The reference point is determined by the expected value of the possible outcomes, which we assume to be dynamically changing during the mental simulation of a sequence of events. We illustrate how our formalism can help us understand prominent economic paradoxes and gambling behaviors that involve multi-stage or sequential planning. We also discuss how neuroscience findings, like prediction error signals and introspective neuronal replay, as well as psychological theories like affective forecasting, are related to the features in our model. This provides hints for future experiments to investigate the role of these entities in decision-making.

multidisciplinary sciences
Parallel Contributions of Distinct Human Memory Systems During Probabilistic Learning.

Kathryn C. Dickerson,Jian Li,Mauricio R. Delgado

DOI: https://doi.org/10.1016/j.neuroimage.2010.10.080

IF: 5.7

2011-01-01

NeuroImage

Abstract:Regions within the medial temporal lobe and basal ganglia are thought to subserve distinct memory systems underlying declarative and nondeclarative processes, respectively. One question of interest is how these multiple memory systems interact during learning to contribute to goal directed behavior. While some hypotheses suggest that regions such as the striatum and the hippocampus interact in a competitive manner, alternative views posit that these structures may operate in a parallel manner to facilitate learning. In the current experiment, we probed the functional connectivity between regions in the striatum and hippocampus in the human brain during an event related probabilistic learning task that varied with respect to type of difficulty (easy or hard cues) and type of learning (via feedback or observation). We hypothesized that the hippocampus and striatum would interact in a parallel manner during learning. We identified regions of interest (ROI) in the striatum and hippocampus that showed an effect of cue difficulty during learning and found that such ROIs displayed a similar pattern of blood oxygen level dependent (BOLD) responses, irrespective of learning type, and were functionally correlated as assessed by a Granger causality analysis. Given the connectivity of both structures with dopaminergic midbrain centers, we further applied a reinforcement learning algorithm often used to highlight the role of dopamine in human reward related learning paradigms. Activity in both the striatum and hippocampus positively correlated with a prediction error signal during feedback learning. These results suggest that distinct human memory systems operate in parallel during probabilistic learning, and may act synergistically particularly when a violation of expectation occurs, to jointly contribute to learning and decision making.

Malarone-donation programme

Surprise-minimization as a solution to the structural credit assignment problem

Policy adjustment in a dynamic economic game

Hyperglycemia enhances kidney cell injury in HIVAN through down-regulation of vitamin D receptors.

Fast adaptation to rule switching using neuronal surprise

Cognitive mechanisms of learning in sequential decision-making under uncertainty: an experimental and theoretical approach

Neural Mechanisms of Human Decision-Making

Signals in Human Striatum Are Appropriate for Policy Update Rather Than Value Prediction

Dissociable Neural Representations of Reinforcement and Belief Prediction Errors Underlie Strategic Learning

Dynamic reinforcement learning reveals time-dependent shifts in strategy during reward learning.

How Instructed Knowledge Modulates the Neural Systems of Reward Learning

How cortico-basal ganglia-thalamic subnetworks can shift decision policies to maximize reward rate

Intracranial electroencephalography reveals effector-independent evidence accumulation dynamics in multiple human brain regions

Action-modulated midbrain dopamine activity arises from distributed control policies

Downstream processing of insect cell cultures

Brain network dynamics predict moments of surprise across contexts

Competing neural representations of choice shape evidence accumulation in humans

Decision-Making with Predictions of Others' Likely and Unlikely Choices in the Human Brain

Beyond Reward Prediction Errors: Human Striatum Updates Rule Values During Learning

A multi-stage anticipated surprise model with dynamic expectation for economic decision-making

Parallel Contributions of Distinct Human Memory Systems During Probabilistic Learning.