A Neuro-computational Account of Arbitration between Choice Imitation and Goal Emulation during Human Observational Learning

When individuals learn from observing the behavior of others, they deploy at least two distinct strategies. Choice imitation involves repeating other agents’ previous actions, whereas emulation proceeds from inferring their goals and intentions. Despite the prevalence of observational learning in hu...

Full description

Saved in:
Bibliographic Details
Published inNeuron (Cambridge, Mass.) Vol. 106; no. 4; pp. 687 - 699.e7
Main Authors Charpentier, Caroline J., Iigaya, Kiyohito, O’Doherty, John P.
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 20.05.2020
Elsevier Limited
Subjects
Online AccessGet full text
ISSN0896-6273
1097-4199
1097-4199
DOI10.1016/j.neuron.2020.02.028

Cover

More Information
Summary:When individuals learn from observing the behavior of others, they deploy at least two distinct strategies. Choice imitation involves repeating other agents’ previous actions, whereas emulation proceeds from inferring their goals and intentions. Despite the prevalence of observational learning in humans and other social animals, a fundamental question remains unaddressed: how does the brain decide which strategy to use in a given situation? In two fMRI studies (the second a pre-registered replication of the first), we identify a neuro-computational mechanism underlying arbitration between choice imitation and goal emulation. Computational modeling, combined with a behavioral task that dissociated the two strategies, revealed that control over behavior was adaptively and dynamically weighted toward the most reliable strategy. Emulation reliability, the model’s arbitration signal, was represented in the ventrolateral prefrontal cortex, temporoparietal junction, and rostral cingulate cortex. Our replicated findings illuminate the computations by which the brain decides to imitate or emulate others. •Replicable evidence of arbitration between choice imitation and goal emulation•Control over behavior is adaptively weighted toward the most reliable strategy•Distinct brain networks implement each strategy’s learning signals in parallel•Arbitration is driven by variations in emulation reliability in rvlPFC, ACC, and TPJ Charpentier et al. show, in two independent studies, that people learn from observing others by flexibly deploying one of two strategies, imitation or emulation, depending on the conditions of the environment. By tracking changes in the reliability of emulation, fronto-parietal brain regions assign control to the most reliable strategy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
C.C. and J.O. were responsible for conceptualization and methodology. C.C. carried out the investigations. C.C. and K.I. performed the analyses. C.C. and J.O. wrote the original manuscript draft. C.C., K.I. and J.O. reviewed and edited the manuscript. J.O. supervised the study and acquired funding.
Author contributions
ISSN:0896-6273
1097-4199
1097-4199
DOI:10.1016/j.neuron.2020.02.028