How to be patient. The ability to wait for a reward depends on menstrual cycle phase and feedback-related activity

Luise Reimers, Christian Büchel, Esther K Diekhof, Luise Reimers, Christian Büchel, Esther K Diekhof

Abstract

Dopamine (DA) plays a major role in reinforcement learning with increases promoting reward sensitivity (Go learning) while decreases facilitate the avoidance of negative outcomes (NoGo learning). This is also reflected in adaptations of response time: higher levels of DA enhance speeding up to get a reward, whereas lower levels favor slowing down. The steroid hormones estradiol and progesterone have been shown to modulate dopaminergic tone. Here, we tested 14 women twice during their menstrual cycle, during the follicular (FP) and the luteal phase (LP), applying functional magnetic resonance imaging while they performed a feedback learning task. Subsequent behavioral testing assessed response time preferences with a clock task, in which subjects had to explore the optimal response time (RT) to maximize reward. In the FP subjects displayed a greater learning-related change of their RT than during the LP, when they were required to slow down. Final RTs in the slow condition were also predicted by feedback-related brain activation, but only in the FP. Increased activation of the inferior frontal junction and rostral cingulate zone was thereby predictive of slower and thus better adapted final RTs. Conversely, final RT was faster and less optimal for reward maximization if activation in the ventromedial prefrontal cortex was enhanced. These findings show that hormonal shifts across the menstrual cycle affect adaptation of response speed during reward acquisition with higher RT adjustment in the FP in the condition that requires slowing down. Since high estradiol levels during the FP increase synaptic DA levels, this conforms well to our hypothesis that estradiol supports Go learning at the expense of NoGo learning. Brain-behavior correlations further indicated that the compensatory capacity to counteract the follicular Go bias may be linked to the ability to more effectively monitor action outcomes and suppress bottom-up reward desiring during feedback processing.

Keywords: RCZ; VMPFC; dopamine; estradiol; fMRI; menstrual cycle; reinforcement learning; time perception.

Figures

Figure 1
Figure 1
Clock task. Reward values varied as a cosine function of response time. Subjects were instructed to figure out the optimal time point during a whole clock-arm turn of 5 s to stop the ticking clock and achieve maximum reward. In the fast clock condition, fast responses yielded the highest reward value (FAST clock for “Go learning”), whereas in the slow clock subjects had to wait longer to win maximum points (SLOW clock for “NoGo learning”). The RANDOM clock acted as a control condition, in which there existed no relation between response time and reward value. In all clock conditions random noise ranging between −5 and 4 points was added to the reward value in order to prevent subjects from memorizing an exact reward value at a specific response time. The three different clock types were presented in three separate runs for each clock consisting of 50 trials, which were counterbalanced across subjects and cycle phases.
Figure 2
Figure 2
Learning outcome from the probabilistic learning task predicts response speed in the first block of the fast clock condition during the FP. (A) Subjects who showed a higher percentage of choosing the best option in the second test session of the probabilistic learning task (i.e., “Go learners”) were also better able to speed up during the fast condition at the very beginning. (B) In contrast, subjects who showed a higher tendency to avoid the most punished option (i.e., “NoGo learners”) took longer to respond.
Figure 3
Figure 3
Functional opponency of slowing and speeding. Relative slowing and speeding were negatively correlated in both cycle phases (FP: R = −0.918, p < 0.001; LP: R = −0.637, p = 0.014).
Figure 4
Figure 4
RT change from first to last block of the clock task in both cycle phases. In both clock conditions subjects showed greater RT change in the course of the experimental run during the FP (*p < 0.05). Error bars represent standard error of the mean (sem).
Figure 5
Figure 5
Feedback-related activity during probabilistic learning predicted performance in the last block of the SLOW clock during the FP. (A) Correlations between neural responses based on the contrast positive feedback > baseline in the probabilistic learning task and optimized RTs of the last block during the SLOW clock condition. (B) Correlations between neural responses based on the contrast negative feedback > baseline in the probabilistic learning task and optimized RTs of the last block during the SLOW clock condition. Displayed positive correlations survived FWE correction for multiple comparisons at cluster-level (p < 0.05). The negative correlations are reported at p < 0.05, SVC (based on the coordinates reported in McClure et al., 2004).
Figure 6
Figure 6
Correlations between RTs of the last block in the SLOW clock and the parameter estimates of peak voxels in feedback-related clusters found to be associated with response time adjustment in the SLOW clock during FP. (A) Increased activation in RCZ and IFJ predicted effective response time adaption in the SLOW clock. (B) Increased activation in the VMPFC was associated with compromised RT adaptation in the SLOW clock.

References

    1. Bazzett T. J., Becker J. B. (1994). Sex differences in the rapid and acute effects of estrogen on striatal D2 dopamine receptor binding. Brain Res. 637, 163–172. 10.1016/0006-8993(94)91229-7
    1. Becker J. B. (1999). Gender differences in dopaminergic function in striatum and nucleus accumbens. Pharmacol. Biochem. Behav. 64, 803–812. 10.1016/S0091-3057(99)00168-9
    1. Brass M., Derrfuss J., Forstmann B., von Cramon D. Y. (2005). The role of the inferior frontal junction area in cognitive control. Trends Cogn. Sci. 9, 314–316. 10.1016/j.tics.2005.05.001
    1. Buhusi C. V., Meck W. H. (2005). What makes us tick? Functional and neural mechanisms of interval timing. Nat. Rev. Neurosci. 6, 755–765. 10.1038/nrn1764
    1. Bussi I. L., Levín G., Golombek D. A., Agostino P. V. (2014). Involvement of dopamine signaling in the circadian modulation of interval timing. Eur. J. Neurosci. 40, 2299–2310. 10.1111/ejn.12569
    1. Cai J. X., Arnsten A. F. (1997). Dose-dependent effects of the dopamine D1 receptor agonists A77636 or SKF81297 on spatial working memory in aged monkeys. J. Pharmacol. Exp. Ther. 283, 183–189.
    1. Caldú X., Dreher J. C. (2007). Hormonal and genetic influences on processing reward and social information. Ann. N.Y. Acad. Sci. 1118, 43–73. 10.1196/annals.1412.007
    1. Colzato L. S., Hertsig G., van den Wildenberg W. P., Hommel B. (2010). Estrogen modulates inhibitory control in healthy human females: evidence from the stop-signal paradigm. Neuroscience 167, 709–715. 10.1016/j.neuroscience.2010.02.029
    1. Colzato L. S., Hommel B. (2014). Effects of estrogen on higher-order cognitive functions in unstressed human females may depend on individual variation in dopamine baseline levels. Front. Neurosci. 8:65. 10.3389/fnins.2014.00065
    1. Cools R. (2008). Role of dopamine in the motivational and cognitive control of behavior. Neuroscientist 14, 381–395. 10.1177/1073858408317009
    1. Cosgrove K. P., Mazure C. M., Staley J. K. (2007). Evolving knowledge of sex differences in brain structure, function, and chemistry. Biol. Psychiatry 62, 847–855. 10.1016/j.biopsych.2007.03.001
    1. Derrfuss J., Brass M., Neumann J., von Cramon D. Y. (2005). Involvement of the inferior frontal junction in cognitive control: meta-analyses of switching and Stroop studies. Hum. Brain Mapp. 25, 22–34. 10.1002/hbm.20127
    1. Dluzen D. E., Ramirez V. D. (1984). Bimodal effect of progesterone on in vitro dopamine function of the rat corpus striatum. Neuroendocrinology 39, 149–155. 10.1159/000123971
    1. Dreher J. C., Schmidt P. J., Kohn P., Furman D., Rubinow D., Berman K. F. (2007). Menstrual cycle phase modulates reward-related neural function in women. Proc. Natl. Acad. Sci. U.S.A. 104, 2465–2470. 10.1073/pnas.0605569104
    1. Droit-Volet S., Gil S. (2009). The time-emotion paradox. Philos. Trans. R. Soc. Lond. B Biol. Sci. 364, 1943–1953. 10.1098/rstb.2009.0013
    1. Fellows L. K. (2007). Advances in understanding ventromedial prefrontal function: the accountant joins the executive. Neurology 68, 991–995. 10.1212/01.wnl.0000257835.46290.57
    1. Fiehler K., Ullsperger M., von Cramon D. Y. (2004). Neural correlates of error detection and error correction: is there a common neuroanatomical substrate? Eur. J. Neurosci. 19, 3081–3087. 10.1111/j.0953-816X.2004.03414.x
    1. Frank M. J., Seeberger L. C., O'reilly R. C. (2004). By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science 306, 1940–1943. 10.1126/science.1102941
    1. Gasbarri A., Pompili A., D'Onofrio A., Cifariello A., Tavares M. C., Tomaz C. (2008). Working memory for emotional facial expressions: role of the estrogen in young women. Psychoneuroendocrinology 33, 964–972. 10.1016/j.psyneuen.2008.04.007
    1. Gibbs S. E. B., D'Esposito M. (2005). Individual capacity differences predict working memory performance and prefrontal activity following dopamine receptor stimulation. Cogn. Affect. Behav. Neurosci. 5, 212–221. 10.3758/CABN.5.2.212
    1. Grabenhorst F., Rolls E. T. (2011). Value, pleasure and choice in the ventral prefrontal cortex. Trends Cogn. Sci. 15, 56–67. 10.1016/j.tics.2010.12.004
    1. Hare T. A., O'Doherty J., Camerer C. F., Schultz W., Rangel A. (2008). Dissociating the role of the orbitofrontal cortex and the striatum in the computation of goal values and prediction errors. J. Neurosci. 28, 5623–5630. 10.1523/JNEUROSCI.1309-08.2008
    1. Hariri A. R., Brown S. M., Williamsom D. E., Flory J. D., de Wit H., Manuck S. B. (2006). Preference for immediate over delayed rewards is associated with magnitude of ventral striatal activity. J. Neurosci. 26, 13213–13217. 10.1523/JNEUROSCI.3446-06.2006
    1. Jacobs E., D'Esposito M. (2011). Estrogen shapes dopamine-dependent cognitive processes: implications for women's health. J. Neurosci. 31, 5286–5293. 10.1523/JNEUROSCI.6394-10.2011
    1. Jocham G., Klein T. A., Ullsperger M. (2011). Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. J. Neurosci. 31, 1606–1613. 10.1523/JNEUROSCI.3904-10.2011
    1. Jocham G., Neumann J., Klein T. A., Danielmeier C., Ullsperger M. (2009). Adaptive coding of action values in the human rostral cingulate zone. J. Neurosci. 29, 7489–7496. 10.1523/JNEUROSCI.0349-09.2009
    1. Klein T. A., Endrass T., Kathmann N., Neumann J., von Cramon D. Y., Ullsperger M. (2007a). Neural correlates of error awareness. Neuroimage 34, 1774–1781. 10.1016/j.neuroimage.2006.11.014
    1. Klein T. A., Neumann J., Reuter M., Hennig J., von Cramon D. Y., Ullsperger M. (2007b). Genetically determined differences in learning from errors. Science 318, 1642–1645. 10.1126/science.1145044
    1. Kringelbach M. L., Rolls E. T. (2004). The functional neuroanatomy of the human orbitofrontal cortex: evidence from neuroimaging and neuropsychology. Prog. Neurobiol. 72, 341–372. 10.1016/j.pneurobio.2004.03.006
    1. Kuriyama K., Uchiyama M., Suzuki H., Tagaya H., Ozaki A., Aritake S., et al. . (2003). Circadian fluctuation of time perception in healthy human subjects. Neurosci. Res. 46, 23–31. 10.1016/S0168-0102(03)00025-7
    1. Lancaster J. L., Tordesillas-Gutiérrez D., Martinez M., Salinas F., Evans A., Zilles K., et al. . (2007). Bias between MNI and Talairach coordinates analyzed using the ICBM-152 brain template. Hum. Brain Mapp. 28, 1194–1205. 10.1002/hbm.20345
    1. Lévesque D., Gagnon S., Di Paolo T. (1989). Striatal D1 dopamine receptor density fluctuates during the rat estrous cycle. Neurosci. Lett. 98, 345–350. 10.1016/0304-3940(89)90426-6
    1. Luine V. N., Rhodes J. C. (1983). Gonadal hormone regulation of MAO and other enzymes in hypothalamic areas. Neuroendocrinology 36, 235–241. 10.1159/000123461
    1. Lustig C., Meck W. H. (2001). Paying attention to time as one gets older. Psychol. Sci. 12, 478–484. 10.1111/1467-9280.00389
    1. Majewska M. D., Harrison N. L., Schwartz R. D., Barker J. L., Paul S. M. (1986). Steroid hormone metabolites are barbiturate-like modulators of the GABA receptor. Science 232, 1004–1007. 10.1126/science.2422758
    1. Maricq A. V., Church R. M. (1983). The differential effects of haloperidol and methamphetamine on time estimation in the rat. Psychopharmacology (Berl.) 79, 10–15.
    1. Matell M. S., King G. R., Meck W. H. (2004). Differential modulation of clock speed by the administration of intermittent versus continuous cocaine. Behav. Neurosci. 118, 150–156. 10.1037/0735-7044.118.1.150
    1. Mauvais-Jarvis P., Kuttenn F., Gompel A. (1986). Antiestrogen action of progesterone in breast tissue. Breast Cancer Res. Treat. 8, 179–188. 10.1007/BF01807330
    1. McClure S. M., Ericson K. M., Laibson D. I., Loewenstein G., Cohen J. D. (2007). Time discounting for primary rewards. J. Neurosci. 27, 5796–5804. 10.1523/JNEUROSCI.4246-06.2007
    1. McClure S. M., Laibson D. I., Loewenstein G., Cohen J. D. (2004). Separate neural systems value immediate and delayed monetary rewards. Science 306, 503–507. 10.1126/science.1100907
    1. Meck W. H., Cheng R. K., MacDonald C. J., Gainetdinov R. R., Caron M. G., Cevik M. Ö. (1996). Gene-dose dependent effects of methamphetamine on interval timing in dopamine-transporter knockout mice. Neuropharmacology 62, 1221–1229. 10.1016/j.neuropharm.2011.01.042
    1. Moustafa A. A., Cohen M. X., Sherman S. J., Frank M. J. (2008). A role for dopamine in temporal decision making and reward maximization in parkinsonism. J. Neurosci. 28, 12294–12304. 10.1523/JNEUROSCI.3116-08.2008
    1. Peters J., Büchel C. (2011). Neural representations of subjective reward value. Behav. Brain Res. 213, 135–141. 10.1016/j.bbr.2010.04.031
    1. Pleil K. E., Cordes S., Meck W. H., Williams C. L. (2011). Rapid and acute effects of estrogen on time perception in male and female rats. Front. Integr. Neurosci. 5:63. 10.3389/fnint.2011.00063
    1. Rammsayer T. H. (1993). On dopaminergic modulation of temporal information processing. Biol. Psychol. 36, 209–222. 10.1016/0301-0511(93)90018-4
    1. Ridderinkhof K. R., Ullsperger M., Crone E. A., Nieuwenhuis S. (2004). The role of the medial frontal cortex in cognitive control. Science 306, 443–447. 10.1126/science.1100301
    1. Sandstrom N. J. (2007). Estradiol modulation of the speed of an internal clock. Behav. Neurosci. 121, 422–432. 10.1037/0735-7044.121.2.422
    1. Smith C. T., Sierra Y., Oppler S. H., Boettiger C. A. (2014). Ovarian cycle effects on immediate reward selection bias in humans: a role for estradiol. J. Neurosci. 34, 5468–5476. 10.1523/JNEUROSCI.0014-14.2014
    1. Takahashi T. (2004). Cortisol levels and time-discounting of monetary gain in humans. Neuroreport 15, 2145–2147. 10.1097/00001756-200409150-00029
    1. Treisman M. (1963). Temporal discrimination and the indifference interval. Implications for a model of the “internal clock.” Psychol. Monogr. 77, 1–31. 10.1037/h0093864
    1. Weinberger D. R., Egan M. F., Bertolino A., Callicott J. H., Mattay V. S., Lipska B. K., et al. . (2001). Prefrontal neurons and the genetics of schizophrenia. Biol. Psychiatry 50, 825–844. 10.1016/S0006-3223(01)01252-5
    1. Winstanley C. A. (2011). The utility of rat models of impulsivity in developing pharmacotherapies for impulse control disorders. Br. J. Pharmacol. 164, 1301–1321. 10.1111/j.1476-5381.2011.01323.x
    1. Worsley K. J., Marrett S., Neelin P., Vandal A. C., Friston K. J., Evans A. C. (1996). A unified statistical approach for determining significant signals in images of cerebral activation. Hum. Brain Mapp. 4, 58–73. 10.1002/(SICI)1097-0193(1996)4:1<58::AID-HBM4>3.0CO;2-O

Source: PubMed

3
Tilaa