Removal of reinforcement improves instrumental performance in humans by decreasing a general action bias rather than unmasking learnt associations

Performance during instrumental learning is commonly believed to reflect the knowledge that has been acquired up to that point. However, recent work in rodents found that instrumental performance was enhanced during periods when reinforcement was withheld, relative to periods when reinforcement was...

Full description

Saved in:

Bibliographic Details
Published in	PLoS computational biology Vol. 18; no. 12; p. e1010201
Main Authors	Kurtenbach, Hannah, Ort, Eduard, Froböse, Monja Isabel, Jocham, Gerhard
Format	Journal Article
Language	English
Published	United States Public Library of Science 01.12.2022 Public Library of Science (PLoS)
Subjects	Biology and Life Sciences Conditioning, Operant - physiology Engineering and Technology Humans Learning Reinforcement, Psychology Research and Analysis Methods Social Sciences
Online Access	Get full text
ISSN	1553-7358 1553-734X 1553-7358
DOI	10.1371/journal.pcbi.1010201

Cover

More Information
Summary:	Performance during instrumental learning is commonly believed to reflect the knowledge that has been acquired up to that point. However, recent work in rodents found that instrumental performance was enhanced during periods when reinforcement was withheld, relative to periods when reinforcement was provided. This suggests that reinforcement may mask acquired knowledge and lead to impaired performance. In the present study, we investigated whether such a beneficial effect of removing reinforcement translates to humans. Specifically, we tested whether performance during learning was improved during non-reinforced relative to reinforced task periods using signal detection theory and a computational modelling approach. To this end, 60 healthy volunteers performed a novel visual go/no-go learning task with deterministic reinforcement. To probe acquired knowledge in the absence of reinforcement, we interspersed blocks without feedback. In these non-reinforced task blocks, we found an increased d’ , indicative of enhanced instrumental performance. However, computational modelling showed that this improvement in performance was not due to an increased sensitivity of decision making to learnt values, but to a more cautious mode of responding, as evidenced by a reduction of a general response bias. Together with an initial tendency to act, this is sufficient to drive differential changes in hit and false alarm rates that jointly lead to an increased d’ . To conclude, the improved instrumental performance in the absence of reinforcement observed in studies using asymmetrically reinforced go/no-go tasks may reflect a change in response bias rather than unmasking latent knowledge.
Bibliography:	new_version ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 The authors have declared that no competing interests exist.
ISSN:	1553-7358 1553-734X 1553-7358
DOI:	10.1371/journal.pcbi.1010201