Valence-dependent influence of serotonin depletion on model-based choice strategy

Y Worbe, S Palminteri, G Savulich, N D Daw, E Fernandez-Egea, T W Robbins, V Voon

Abstract

Human decision-making arises from both reflective and reflexive mechanisms, which underpin goal-directed and habitual behavioural control. Computationally, these two systems of behavioural control have been described by different learning algorithms, model-based and model-free learning, respectively. Here, we investigated the effect of diminished serotonin (5-hydroxytryptamine) neurotransmission, induced by dietary tryptophan depletion (TD) in healthy volunteers, on the performance of a two-stage decision-making task, which allows discrimination between model-free and model-based behavioural strategies. A novel version of the task was used, which examined choice balance not only for monetary reward but also for punishment (monetary loss). TD impaired goal-directed (model-based) behaviour in the reward condition, but promoted it under punishment. This effect on appetitive and aversive goal-directed behaviour is likely mediated by alteration of the average reward representation produced by TD, which is consistent with previous studies. Overall, the major implication of this study is that serotonin differentially affects goal-directed learning as a function of affective valence. These findings are relevant for a further understanding of psychiatric disorders associated with breakdown of goal-directed behavioural control, such as obsessive-compulsive disorders or addictions.

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Two-stage decision-making task. (a) On each trial, the first-stage choice between two stimuli (left-right randomised) led with fixed probabilities (transition) to one of two pairs of stimuli in stage 2. Each of the four second-stage stimuli was associated with a probabilistic outcome: monetary reward in the reward version or monetary loss in the punishment version of the task. These outcome probabilities changed slowly and independently across trials. (b) Model-based and model-free strategies predict different patterns by which the outcome obtained after the second stage affects subsequent first-stage choices. In the model-free system, choices are driven by the outcome alone: a reward or an avoided loss increases the chance of choosing the same stimulus on the next trial, independently of the transition type (upper row). In a model-based system, the choice on the next trial also takes the transition type into account (lower row).
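The stay-shift logic described in panel (b) can be illustrated with a minimal Python sketch. It tabulates how often the first-stage choice is repeated, split by the previous trial's outcome and transition type. The trial format (keys `choice`, `common`, `good`) and the function name are assumptions made for illustration and are not taken from the study.

```python
from collections import defaultdict


def stay_probabilities(trials):
    """Return P(stay) for each (good outcome, common transition) cell.

    Each trial is a dict with hypothetical keys:
      'choice' : first-stage choice (0 or 1)
      'common' : True if the transition was the common (more probable) one
      'good'   : True if a reward was obtained (or a loss was avoided)
    """
    counts = defaultdict(lambda: [0, 0])  # cell -> [stays, total]
    # Pair every trial with the one that follows it.
    for prev, nxt in zip(trials, trials[1:]):
        cell = (prev["good"], prev["common"])
        counts[cell][1] += 1
        if nxt["choice"] == prev["choice"]:
            counts[cell][0] += 1
    return {cell: stays / total for cell, (stays, total) in counts.items()}


# Tiny hand-made sequence (not real data).
demo = [
    {"choice": 0, "common": True, "good": True},
    {"choice": 0, "common": True, "good": False},
    {"choice": 1, "common": False, "good": True},
    {"choice": 0, "common": False, "good": False},
    {"choice": 0, "common": True, "good": True},
]
probs = stay_probabilities(demo)
```

In such a table, a purely model-free agent would show a main effect of outcome only, whereas a model-based agent would show an outcome × transition interaction: staying after good outcomes that followed common transitions, and also after bad outcomes that followed rare transitions.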
Figure 2
(a) Factorial (stay-shift) behavioural results. Separate analysis of task valence showed a mixed choice strategy in BAL and a shift to a model-free choice strategy in the TD group in the reward condition. In the loss condition, the significant outcome × transition interaction in the TD group indicates a shift of behavioural choice towards a model-based strategy. (b) Computationally fitted behavioural results before arcsine transformation. Compared with BAL, the TD group showed a significant difference in the weighting factor ω in the reward condition. BAL=control group; TD=TRP-depleted group. *P<0.05.
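The role of the weighting factor ω can be sketched as follows. This text does not give the fitted model's equations, so the sketch assumes the hybrid formulation commonly used with two-stage tasks, in which the first-stage value is a weighted mix of model-based and model-free values. The function name and the example numbers are hypothetical.

```python
def hybrid_value(q_model_based, q_model_free, omega):
    """Mix model-based and model-free action values.

    Assumed form: Q = omega * Q_MB + (1 - omega) * Q_MF, so that
    omega = 1 is purely model-based and omega = 0 is purely model-free.
    """
    if not 0.0 <= omega <= 1.0:
        raise ValueError("omega must lie in [0, 1]")
    return omega * q_model_based + (1.0 - omega) * q_model_free


# Illustrative values only (not fitted parameters from the study).
mostly_model_free = hybrid_value(q_model_based=1.0, q_model_free=0.0, omega=0.25)
balanced = hybrid_value(q_model_based=0.8, q_model_free=0.2, omega=0.5)
```

Under this assumed form, a lower ω in the TD group in the reward condition would correspond to choices relying more on the model-free value.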

Source: PubMed