Representation of aversive prediction errors in the human periaqueductal gray
Mathieu Roy, Daphna Shohamy, Nathaniel Daw, Marieke Jepma, G Elliott Wimmer, Tor D Wager
Abstract
Pain is a primary driver of learning and motivated action. It is also a target of learning, as nociceptive brain responses are shaped by learning processes. We combined an instrumental pain avoidance task with an axiomatic approach to assessing fMRI signals related to prediction errors (PEs), which drive reinforcement-based learning. We found that pain PEs were encoded in the periaqueductal gray (PAG), a structure important for pain control and learning in animal models. Axiomatic tests combined with dynamic causal modeling suggested that ventromedial prefrontal cortex, supported by putamen, provides an expected value-related input to the PAG, which then conveys PE signals to prefrontal regions important for behavioral regulation, including orbitofrontal, anterior mid-cingulate and dorsomedial prefrontal cortices. Thus, pain-related learning involves distinct neural circuitry, with implications for behavior and pain dynamics.
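To make the notion of a prediction error concrete, the following is a minimal sketch of a delta-rule (Rescorla-Wagner style) learner, in which the PE is the difference between the received outcome and the expected outcome. It is an illustration only and not the authors' model: the function names, the learning rate, and the coding of outcomes (1 for pain, 0 for no pain) are assumptions introduced here. The three checks in the usage lines loosely paraphrase the kind of properties an axiomatic test asks of a candidate PE signal; the abstract does not list the axioms, so their wording here is also an assumption.

```python
def update_expectation(expected_pain, outcome, learning_rate=0.3):
    """One delta-rule step (hypothetical helper, not from the paper).

    Returns the prediction error and the updated expectation.
    outcome is coded 1.0 for pain and 0.0 for no pain (an assumption).
    """
    prediction_error = outcome - expected_pain
    new_expectation = expected_pain + learning_rate * prediction_error
    return prediction_error, new_expectation


def simulate(outcomes, learning_rate=0.3, initial_expectation=0.0):
    """Run the delta rule over a sequence of outcomes.

    Returns the list of trial-wise prediction errors and the final expectation.
    """
    expectation = initial_expectation
    errors = []
    for outcome in outcomes:
        pe, expectation = update_expectation(expectation, outcome, learning_rate)
        errors.append(pe)
    return errors, expectation


# Repeated pain: the expectation rises and the PE shrinks trial by trial.
errors, final_expectation = simulate([1.0, 1.0, 1.0, 1.0], learning_rate=0.5)

# Axiom-like properties of this toy PE (illustrative assumptions):
pe_pain, _ = update_expectation(0.5, 1.0)      # pain, moderate expectation
pe_no_pain, _ = update_expectation(0.5, 0.0)   # no pain, same expectation
pe_high_exp, _ = update_expectation(0.75, 1.0) # pain, higher expectation
pe_predicted, _ = update_expectation(1.0, 1.0) # fully predicted pain
```

In this sketch the PE is larger for pain than for no pain at a fixed expectation, smaller when pain was more strongly expected, and zero when the outcome is fully predicted. A signal that carries only the outcome, or only the expectation, would fail at least one of these checks, which is what makes such tests more specific than a simple correlation with a model-derived PE regressor.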
Source: PubMed