Greater reliance on model-free learning in adolescent anorexia nervosa: An examination of dual-system reinforcement learning

Carina S. Brown, Sean Devine, A. Ross Otto, Amanda Bischoff-Grethe, Christina E. Wierenga
DOI: https://doi.org/10.1101/2024.01.31.24302097
2024-02-07
Abstract: Alterations in learning and decision-making systems are thought to contribute to core features of anorexia nervosa (AN), a psychiatric disorder characterized by persistent dietary restriction and weight loss. Instrumental learning theory identifies a dual-system of habit and goal-directed decision-making, linked to model-free and model-based reinforcement learning algorithms. Difficulty arbitrating between these systems, resulting in an over-reliance on one strategy over the other, has been implicated in compulsivity and extreme goal pursuit, both of which are observed in AN. Characterizing alterations in model-free and model-based systems, and their neural correlates, in AN may clarify mechanisms contributing to symptom heterogeneity (e.g., binge/purge symptoms). This study tested whether adolescents with restricting AN (AN-R; n = 36) and binge/purge AN (AN-BP; n = 20) differentially utilized model-based and model-free learning systems compared to a healthy control group (HC; n = 28) during a Markov two-step decision-making task under conditions of reward and punishment. Associations between model-free and model-based learning and resting-state functional connectivity between neural regions of interest, including orbitofrontal cortex (OFC), nucleus accumbens (NAcc), putamen, and sensory motor cortex (SMC), were examined. AN-R showed higher utilization of model-free learning compared to HC for reward, but attenuated model-free and model-based learning for punishment. In AN-R only, higher model-based learning was associated with stronger OFC-to-left NAcc functional connectivity, regions linked to goal-directed behavior. Greater utilization of model-free learning for reward in AN-R may differentiate this group, particularly during adolescence, and facilitate dietary restriction by prioritizing habitual control in rewarding contexts.
Psychiatry and Clinical Psychology
What problem does this paper attempt to address?
The paper aims to explore the differences in model-based and model-free reinforcement learning systems under reward and punishment conditions in different types of adolescent anorexia nervosa (AN) and to investigate the relationship between these differences and cortico-striatal functional connectivity. Specifically, the study focuses on the following aspects:

1. **Different subtypes of adolescent AN**: The study includes AN-R with only dietary restriction (n=36) and AN-BP with binge-eating/purging symptoms (n=20), and compares them with a healthy control group (HC; n=28).
2. **Learning mechanisms under reward and punishment conditions**: Participants' learning behaviors under reward and punishment conditions were examined using a Markov two-step decision-making task.
3. **Neural mechanisms**: Resting-state functional magnetic resonance imaging (rsfMRI) analysis was used to explore the functional connectivity between the orbitofrontal cortex (OFC), nucleus accumbens (NAcc), putamen, and sensorimotor cortex (SMC) and their relationship with different learning strategies.

The main findings include:

- **Performance of AN-R under reward conditions**: Compared to the healthy control group, AN-R showed higher dependence on model-free learning under reward conditions but significantly reduced model-free and model-based learning under punishment conditions.
- **Performance of AN-BP under punishment conditions**: AN-BP also showed weakened model-free learning under punishment conditions, but the degree was not as significant as in AN-R.
- **Neural connectivity**: In AN-R, higher model-based learning was associated with stronger functional connectivity from the OFC to the left NAcc.

These findings contribute to a better understanding of the differences in learning strategies among AN patients and their potential neural mechanisms, providing new perspectives for future treatments.
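
For readers unfamiliar with the dual-system framing, the contrast between model-free and model-based valuation in a two-step task can be illustrated with a minimal sketch. This is not the authors' computational model: the 0.7/0.3 transition probabilities, the learning rate `alpha`, the mixing weight `w`, and all function names are assumptions chosen for illustration. The sketch shows one trial in which a first-stage choice leads, through a rare transition, to a rewarded second-stage state. A purely model-free learner credits the chosen action directly, while a purely model-based learner uses the transition structure and favors the other action.

```python
import numpy as np

# Assumed (hypothetical) transition matrix: rows are first-stage actions,
# columns are second-stage states. 0.7/0.3 is a common choice for this kind
# of task, not a value reported in the abstract.
TRANSITIONS = np.array([[0.7, 0.3],
                        [0.3, 0.7]])

def model_based_values(q_stage2, transitions=TRANSITIONS):
    """First-stage values from planning: expected best second-stage value."""
    return transitions @ q_stage2.max(axis=1)

def delta_update(q, index, outcome, alpha=0.5):
    """Return a copy of q with one entry moved toward the outcome (delta rule)."""
    q = q.copy()
    q[index] += alpha * (outcome - q[index])
    return q

def hybrid_values(q_mf, q_stage2, w):
    """Weighted mix: w = 1 is purely model-based, w = 0 is purely model-free."""
    return w * model_based_values(q_stage2) + (1.0 - w) * q_mf

# One illustrative trial: the agent chooses first-stage action 0, a rare
# transition leads to second-stage state 1, and action 0 there is rewarded.
q_mf = np.zeros(2)            # model-free first-stage values
q_stage2 = np.zeros((2, 2))   # second-stage values, indexed [state, action]
q_mf = delta_update(q_mf, 0, 1.0)
q_stage2 = delta_update(q_stage2, (1, 0), 1.0)

# The model-free learner repeats the rewarded first-stage action (0);
# the model-based learner prefers the action that commonly reaches state 1.
model_free_choice = int(np.argmax(hybrid_values(q_mf, q_stage2, w=0.0)))
model_based_choice = int(np.argmax(hybrid_values(q_mf, q_stage2, w=1.0)))
print(model_free_choice, model_based_choice)
```

In this toy setup, a punishment condition could be represented by passing a negative outcome to `delta_update`; how the study itself parameterized reward and punishment is not specified in the text above.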