Representation of aversive prediction errors in the human periaqueductal gray

Mathieu Roy, Daphna Shohamy, Nathaniel Daw, Marieke Jepma, G Elliott Wimmer, Tor D Wager, Mathieu Roy, Daphna Shohamy, Nathaniel Daw, Marieke Jepma, G Elliott Wimmer, Tor D Wager

Abstract

Pain is a primary driver of learning and motivated action. It is also a target of learning, as nociceptive brain responses are shaped by learning processes. We combined an instrumental pain avoidance task with an axiomatic approach to assessing fMRI signals related to prediction errors (PEs), which drive reinforcement-based learning. We found that pain PEs were encoded in the periaqueductal gray (PAG), a structure important for pain control and learning in animal models. Axiomatic tests combined with dynamic causal modeling suggested that ventromedial prefrontal cortex, supported by putamen, provides an expected value-related input to the PAG, which then conveys PE signals to prefrontal regions important for behavioral regulation, including orbitofrontal, anterior mid-cingulate and dorsomedial prefrontal cortices. Thus, pain-related learning involves distinct neural circuitry, with implications for behavior and pain dynamics.

Figures

Figure 1
Figure 1
Pain avoidance learning task, behavioral and brain imaging results. (A) One experimental trial. Participants had 1.8 seconds to make their choice, after which their choice was displayed for 0.2 seconds. After an anticipation period of 4 seconds, participants either received a painful stimulation or nothing. The stimulation period was marked by a different fixation point. Trials were separated by a 6.6 – 11.4 seconds inter-trial interval. (B) Data from one participant. The blue and green lines depict the probability of pain associated with each option over the 150 trials (one of four possible pairs of random walks). Blue and green dots represent the selected option, and black triangles represent pain delivery. (C) Logistic regression model results (number or participants = 23). Probability of switching as a function of pain 1 – 6 trials back decays exponentially and is significantly different from zero at one (t(22) = 9.20, p

Figure 2

Pain avoidance learning model and…

Figure 2

Pain avoidance learning model and axiomatic tests in ventral striatum and periaqueductal gray…

Figure 2
Pain avoidance learning model and axiomatic tests in ventral striatum and periaqueductal gray (PAG) regions of interest (ROI) (A) Regions encoding aversive prediction errors (in gray) should display higher activity for pain vs. No-Stimulus trials (axiom 1), higher activity for low expected probability of pain, regardless of outcome (axiom 2), and no difference in activity between highly predicted pain or No-Stimulus outcomes (axiom 3). Regions encoding the expected probability of avoidance (in green) should only display higher activity for low expected probability of pain, regardless of outcome. Aversive prediction errors result from the integration of pain-related information with prior expectations. This signal is then used to update future predictions. (B) Activity in a priori ventral striatum and PAG matter ROIs per quartile of expected probability of pain (number or participants = 23). Activity in the PAG satisfies all 3 axioms for aversive prediction errors (Axiom #1 (pain > no stimulus): t(22) = 3.67, p <0.05; Axiom # 2 (significance of slope for pain trials, sign permutation test): t(22)=– 2.05, p < 0.05; Axiom # 2 (significance of slope for no stimulus trials, sign permutation test): t(22)=–1.98, p < 0.05; Axiom # 3 (no difference between highly predicted pain or no stimulus outcomes, Bayesian analyses, odds in favor of the null: 5.48). By contrast, activity in the ventral striatum only satisfies axiom # 3 (no difference between highly predicted pain or no stimulus outcomes, Bayesian analyses, odds in favor of the null: 7.07). Asterisks with horizontal bars indicate significant differences between pain and no stimulus outcomes. Asterisks with vertical bars indicate significant slopes (* = p<0.05).

Figure 3

Results from the whole-brain conjunctive…

Figure 3

Results from the whole-brain conjunctive search (number or participants = 23). (A) Conjunction…

Figure 3
Results from the whole-brain conjunctive search (number or participants = 23). (A) Conjunction analysis of pain > no stimulus effects (axiom #1) and expected avoidance probability (1– pain probability) effects for both pain and no stimulus outcomes (axiom #2). Clusters used for the conjunction analysis were cluster-thresholded (p

Figure 4

Dynamic causal model (DCM) of…

Figure 4

Dynamic causal model (DCM) of aversive prediction errors at outcome onset (number or…

Figure 4
Dynamic causal model (DCM) of aversive prediction errors at outcome onset (number or participants = 23). (A–B) This model was identified as the most likely of all the models tested (see Supplementary Fig. 3–6) by a Bayesian model selection (BMS) process. The regions included in the model were identified by the previous conjunction analysis (Fig. 3) as reflecting 1– expected probability of avoidance regardless of outcome (green), aversive prediction errors (black), or avoidance value updating (blue).
Comment in
Similar articles
Cited by
References
    1. McNally GP, Johansen JP, Blair HT. Placing prediction into the fear circuit. Trends Neurosci. 2011;34:283–92. - PMC - PubMed
    1. Seymour B, et al. Nature. 2004;Temporal difference models describe higher-order learning in humans.429:664–667. - PubMed
    1. Hollerman JR, Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nat. Neurosci. 1998;1:304–9. - PubMed
    1. O'Doherty JP, Hampton A, Kim H. Model-based fMRI and its application to reward learning and decision making. Ann. N. Y. Acad. Sci. 2007;1104:35–53. - PubMed
    1. Daw ND. In: Decis. Making, Affect. Learn. Delgado MR, Phelps EA, Robbins TW, editors. Oxford University Press; 2011. pp. 3–38.
Show all 50 references
Publication types
MeSH terms
[x]
Cite
Copy Download .nbib
Format: AMA APA MLA NLM
Figure 2
Figure 2
Pain avoidance learning model and axiomatic tests in ventral striatum and periaqueductal gray (PAG) regions of interest (ROI) (A) Regions encoding aversive prediction errors (in gray) should display higher activity for pain vs. No-Stimulus trials (axiom 1), higher activity for low expected probability of pain, regardless of outcome (axiom 2), and no difference in activity between highly predicted pain or No-Stimulus outcomes (axiom 3). Regions encoding the expected probability of avoidance (in green) should only display higher activity for low expected probability of pain, regardless of outcome. Aversive prediction errors result from the integration of pain-related information with prior expectations. This signal is then used to update future predictions. (B) Activity in a priori ventral striatum and PAG matter ROIs per quartile of expected probability of pain (number or participants = 23). Activity in the PAG satisfies all 3 axioms for aversive prediction errors (Axiom #1 (pain > no stimulus): t(22) = 3.67, p <0.05; Axiom # 2 (significance of slope for pain trials, sign permutation test): t(22)=– 2.05, p < 0.05; Axiom # 2 (significance of slope for no stimulus trials, sign permutation test): t(22)=–1.98, p < 0.05; Axiom # 3 (no difference between highly predicted pain or no stimulus outcomes, Bayesian analyses, odds in favor of the null: 5.48). By contrast, activity in the ventral striatum only satisfies axiom # 3 (no difference between highly predicted pain or no stimulus outcomes, Bayesian analyses, odds in favor of the null: 7.07). Asterisks with horizontal bars indicate significant differences between pain and no stimulus outcomes. Asterisks with vertical bars indicate significant slopes (* = p<0.05).
Figure 3
Figure 3
Results from the whole-brain conjunctive search (number or participants = 23). (A) Conjunction analysis of pain > no stimulus effects (axiom #1) and expected avoidance probability (1– pain probability) effects for both pain and no stimulus outcomes (axiom #2). Clusters used for the conjunction analysis were cluster-thresholded (p

Figure 4

Dynamic causal model (DCM) of…

Figure 4

Dynamic causal model (DCM) of aversive prediction errors at outcome onset (number or…

Figure 4
Dynamic causal model (DCM) of aversive prediction errors at outcome onset (number or participants = 23). (A–B) This model was identified as the most likely of all the models tested (see Supplementary Fig. 3–6) by a Bayesian model selection (BMS) process. The regions included in the model were identified by the previous conjunction analysis (Fig. 3) as reflecting 1– expected probability of avoidance regardless of outcome (green), aversive prediction errors (black), or avoidance value updating (blue).
Figure 4
Figure 4
Dynamic causal model (DCM) of aversive prediction errors at outcome onset (number or participants = 23). (A–B) This model was identified as the most likely of all the models tested (see Supplementary Fig. 3–6) by a Bayesian model selection (BMS) process. The regions included in the model were identified by the previous conjunction analysis (Fig. 3) as reflecting 1– expected probability of avoidance regardless of outcome (green), aversive prediction errors (black), or avoidance value updating (blue).

References

    1. McNally GP, Johansen JP, Blair HT. Placing prediction into the fear circuit. Trends Neurosci. 2011;34:283–92.
    1. Seymour B, et al. Nature. 2004;Temporal difference models describe higher-order learning in humans.429:664–667.
    1. Hollerman JR, Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nat. Neurosci. 1998;1:304–9.
    1. O'Doherty JP, Hampton A, Kim H. Model-based fMRI and its application to reward learning and decision making. Ann. N. Y. Acad. Sci. 2007;1104:35–53.
    1. Daw ND. In: Decis. Making, Affect. Learn. Delgado MR, Phelps EA, Robbins TW, editors. Oxford University Press; 2011. pp. 3–38.
    1. Behrens TEJ, Hunt LT, Woolrich MW, Rushworth MFS. Associative learning of social value. Nature. 2008;456:245–9.
    1. Li J, Daw ND. Signals in human striatum are appropriate for policy update rather than value prediction. J. Neurosci. 2011;31:5504–11.
    1. Niv Y, Edlund JA, Dayan P, O'Doherty JP. Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain. J. Neurosci. 2012;32:551–62.
    1. Rutledge RB, Dean M, Caplin A, Glimcher PW. Testing the reward prediction error hypothesis with an axiomatic model. J. Neurosci. 2010;30:13525–36.
    1. Seymour B, et al. Opponent appetitive-aversive neural processes underlie predictive learning of pain relief. Nat. Neurosci. 2005;8:1234–40.
    1. Seymour B, Daw ND, Roiser JP, Dayan P, Dolan R. Serotonin selectively modulates reward value in human decision-making. J. Neurosci. 2012;32:5833–42.
    1. Yacubian J, et al. Dissociable systems for gain- and loss-related value predictions and errors of prediction in the human brain. J. Neurosci. 2006;26:9530–7.
    1. Ploghaus A, et al. Learning about pain: the neural substrate of the prediction error for aversive events. Proc. Natl. Acad. Sci. U. S. A. 2000;97:9281–6.
    1. Li J, Schiller D, Schoenbaum G, Phelps EA, Daw ND. Differential roles of human striatum and amygdala in associative learning. Nat. Neurosci. 2011;14:1250–2.
    1. Schiller D, Levy I, Niv Y, LeDoux JE, Phelps EA. From fear to safety and back: reversal of fear in the human brain. J. Neurosci. 2008;28:11517–25.
    1. Delgado MR, Li J, Schiller D, Phelps EA. The role of the striatum in aversive learning and aversive prediction errors. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 2008;363:3787–800.
    1. Hindi Attar C, Finckh B, Büchel C. The influence of serotonin on fear learning. PLoS One. 2012;7:e42397.
    1. Johansen JP, Tarpley JW, LeDoux JE, Blair HT. Neural substrates for expectation-modulated fear learning in the amygdala and periaqueductal gray. Nat. Neurosci. 2010;13:979–86.
    1. Stephan KE, et al. Dynamic causal models of neural system dynamics:current state and future extensions. J. Biosci. 2007;32:129–44.
    1. Schönberg T, Daw ND, Joel D, O'Doherty JP. Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. J. Neurosci. 2007;27:12860–7.
    1. Gallistel CR. The importance of proving the null. Psychol. Rev. 2009;116:439–53.
    1. Garrison J, Erdeniz B, Done J. Prediction error in reinforcement learning: a meta-analysis of neuroimaging studies. Neurosci. Biobehav. Rev. 2013;37:1297–310.
    1. Wimmer GE, Daw ND, Shohamy D. Generalization of value in reinforcement learning by humans. Eur. J. Neurosci. 2012;35:1092–104.
    1. Satpute AB, et al. Identification of discrete functional subregions of the human periaqueductal gray. Proc. Natl. Acad. Sci. U. S. A. 2013;110:17101–6.
    1. Beissner F, Baudrexel S. Investigating the human brainstem with structural and functional MRI. Front. Hum. Neurosci. 2014;8:116.
    1. Keay KA, Bandler R. Parallel circuits mediating distinct emotional coping reactions to different types of stress. Neurosci. Biobehav. Rev. 2001;25:669–78.
    1. Schmidt L, Lebreton M, Cléry-Melin M-L, Daunizeau J, Pessiglione M. Neural mechanisms underlying motivation of mental versus physical effort. PLoS Biol. 2012;10:e1001266.
    1. Millan MJ. The induction of pain: an integrative review. Prog. Neurobiol. 1999;57:1–164.
    1. Brooks AM, Berns GS. Aversive stimuli and loss in the mesocorticolimbic dopamine system. Trends Cogn. Sci. 2013;17:281–286.
    1. Price JL. Definition of the orbital cortex in relation to specific connections with limbic and visceral structures and other cortical regions. Ann. N. Y. Acad. Sci. 2007;1121:54–71.
    1. Rangel A, Hare T. Neural computations associated with goal-directed choice. Curr. Opin. Neurobiol. 2010;20:262–70.
    1. Herrero MT, Insausti R, Gonzalo LM. Cortically projecting cells in the periaqueductal gray matter of the rat. A retrograde fluorescent tracer study Cortical injections. 1991;543:201–212.
    1. Shackman AJ, et al. The integration of negative affect, pain and cognitive control in the cingulate cortex. Nat. Rev. Neurosci. 2011;12:154–67.
    1. Krasne FB, Fanselow MS, Zelikowsky M. Design of a neurally plausible model of fear learning. Front. Behav. Neurosci. 2011;5:41.
    1. Reynolds SM, Berridge KC. Emotional environments retune the valence of appetitive versus fearful functions in nucleus accumbens. Nat. Neurosci. 2008;11:423–5.
    1. Tom SM, Fox CR, Trepel C, Poldrack RA. The neural basis of loss aversion in decision-making under risk. Science. 2007;315:515–8.
    1. Kim H, Shimojo S, O'Doherty JP. Is avoiding an aversive outcome rewarding? Neural substrates of avoidance learning in the human brain. PLoS Biol. 2006;4:e233.
    1. Boll S, Gamer M, Gluth S, Finsterbusch J, Büchel C. Eur. J. Neurosci. 1–10: 2012. Separate amygdala subregions signal surprise and predictiveness during associative fear learning in humans.
    1. Paton JJ, Belova MA, Morrison SE, Salzman CD. The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature. 2006;439:865–70.
    1. Linnman C, Moulton EA, Barmettler G, Becerra L, Borsook D. Neuroimaging of the periaqueductal gray: state of the field. Neuroimage. 2012;60:505–22.
    1. Buhle JT, et al. Cognitive Reappraisal of Emotion: A Meta-Analysis of Human Neuroimaging Studies. Cereb. Cortex. 2013:1–10. doi:10.1093/cercor/bht154.
    1. Buhle JT, et al. Common representation of pain and negative emotion in the midbrain periaqueductal gray. Soc. Cogn. Affect. Neurosci. 2012 doi:10.1093/scan/nss038.
    1. Wager TD, et al. Brain mediators of cardiovascular responses to social threat, part II: Prefrontal subcortical pathways and relationship with anxiety. Neuroimage. 2009;47:836–51.
    1. Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage. 2013;76:412–27.
    1. Roy M, Shohamy D, Wager TD. Ventromedial prefrontal-subcortical systems and the generation of affective meaning. Trends Cogn. Sci. 2012;16:147–156.
    1. Chib VS, Rangel A, Shimojo S, O'Doherty JP. Evidence for a common representation of decision values for dissimilar goods in human ventromedial prefrontal cortex. J. Neurosci. 2009;29:12315–20.
    1. Milad MR, et al. Recall of fear extinction in humans activates the ventromedial prefrontal cortex and hippocampus in concert. Biol. Psychiatry. 2007;62:446–54.
    1. Wunderlich K, Dayan P, Dolan RJ. Mapping value based planning and extensively trained choice in the human brain. Nat. Neurosci. 2012;15:786–91.
    1. Caplin A, Dean M. Axiomatic methods, dopamine and reward prediction error. Curr. Opin. Neurobiol. 2008;18:197–202.
    1. Jepma M, Jones M, Wager TD. The dynamics of pain: evidence for simultaneous site-specific habituation and site-nonspecific sensitization in thermal pain. J. Pain. 2014;15:734–46.

Source: PubMed

3
Abonner