Explicit neural signals reflecting reward uncertainty
Wolfram Schultz, Kerstin Preuschoff, Colin Camerer, Ming Hsu, Christopher D Fiorillo, Philippe N Tobler, Peter Bossaerts, Wolfram Schultz, Kerstin Preuschoff, Colin Camerer, Ming Hsu, Christopher D Fiorillo, Philippe N Tobler, Peter Bossaerts
Abstract
The acknowledged importance of uncertainty in economic decision making has stimulated the search for neural signals that could influence learning and inform decision mechanisms. Current views distinguish two forms of uncertainty, namely risk and ambiguity, depending on whether the probability distributions of outcomes are known or unknown. Behavioural neurophysiological studies on dopamine neurons revealed a risk signal, which covaried with the standard deviation or variance of the magnitude of juice rewards and occurred separately from reward value coding. Human imaging studies identified similarly distinct risk signals for monetary rewards in the striatum and orbitofrontal cortex (OFC), thus fulfilling a requirement for the mean variance approach of economic decision theory. The orbitofrontal risk signal covaried with individual risk attitudes, possibly explaining individual differences in risk perception and risky decision making. Ambiguous gambles with incomplete probabilistic information induced stronger brain signals than risky gambles in OFC and amygdala, suggesting that the brain's reward system signals the partial lack of information. The brain can use the uncertainty signals to assess the uncertainty of rewards, influence learning, modulate the value of uncertain rewards and make appropriate behavioural choices between only partly known options.
Figures
References
- Abel A.B. Asset prices under habit formation and catching up with the Joneses. Am. Econ. Rev. 1990;80:38–42.
- Basso M.A, Wurtz R.H. Modulation of neuronal activity by target uncertainty. Nature. 1997;398:66–69.
- Bechara A, Damasio A.R, Damasio H, Anderson S.W. Insensitivity to future consequences following damage to human prefrontal cortex. Cognition. 1994;50:7–15.
- Bechara A, Damasio H, Damasio A.R. Emotion, decision-making and the orbitofrontal cortex. Cereb. Cortex. 2000;10:295–307.
- Bolla K.I, Eldreth D.A, Matochik J.A, Cadet J.L. Neural substrates of faulty decision-making in abstinent marijuana users. Neuroimage. 2005;26:480–492.
- Bossaerts P, Plott C. Basic principles of asset pricing theory: evidence from large-scale experimental financial markets. Rev. Finance. 2004;8:135–169.
- Caraco T, Martindale S, Whitham T.S. An empirical demonstration of risk-sensitive foraging preferences. Anim. Behav. 1980;28:820–830.
- Caraco T, Blankenhorn W.U, Gregory G.M, Newman J.A, Recer G.M, Zwicker S.M. Risk-sensitivity: ambient temperaure effects foraging choice. Anim. Behav. 1990;39:338–345.
- Cromwell H.C, Schultz W. Effects of expectations for different reward magnitudes on neuronal activity in primate striatum. J. Neurophysiol. 2003;89:2823–2838.
- Delgado M.R, Nystrom L.E, Fissell C, Noll D.C, Fiez J.A. Tracking the hemodynamic responses to reward and punishment in the striatum. J. Neurophysiol. 2000;84:3072–3077.
- Dunn B.D, Dalgleish T, Lawrence A.D. The somatic marker hypothesis: a critical evaluation. Neurosci. Biobehav. Rev. 2006;30:239–271.
- Elliott R, Newman J.L, Longe O.A, Deakin J.F.W. Differential response pattern in the striatum and orbitofrontal cortex to financial rewards in humans: a parametric functional magnetic resonance imaging study. J. Neurosci. 2003;23:303–307.
- Ellsberg D. Risk, ambiguity and the Savage axioms. Quart. J. Econ. 1961;75:643–649.
- Ersche K.D, Fletcher P.C, Lewis S.J, Clark L, Stocks-Gee G, London M, Deakin J.B, Robbins T.W, Sahakian B.J. Abnormal frontal activations related to decision-making in current and former amphetamine and opiate dependent individuals. Psychopharmacology. 2005;180:612–623.
- Fiorillo C.D, Tobler P.N, Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science. 2003;299:1898–1902.
- Harper D.G.C. Competitive foraging in mallards: ‘ideal free’ ducks. Anim. Behav. 1982;30:575–584.
- Holt C.A, Laury S.K. Risk aversion and incentive effects. Am. Econ. Rev. 2002;92:1644–1655.
- Hsu M, Bhatt M, Adolphs R, Tranel D, Camerer C.F. Neural systems responding to degrees of uncertainty in human decision-making. Science. 2005;310:1680–1683.
- Huang C.-F, Litzenberger R.H. Prentice-Hall; Upper Saddle River, NJ: 1988. Foundations for financial economics.
- Huettel S.A, Song A, McCarthy G. Decisions under uncertainty: probabilistic context influences activation of prefrontal and parietal cortices. J. Neurosci. 2005;25:3304–3311.
- Huettel S.A, Stowe C.J, Gordon E.M, Warner B.T, Platt M.L. Neural signatures of economic preferences for risk and ambiguity. Neuron. 2006;49:765–775.
- Kahneman D, Tversky A. Prospect theory: an analysis of decision under risk. Econometrica. 1979;47:263–291.
- Knutson B, Fong G.W, Bennett S.M, Adams C.M, Hommer D. A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: characterization with rapid event-related fMRI. Neuroimage. 2003;18:263–272.
- Knutson B, Taylor J, Kaufman M, Peterson R, Glover G. Distributed neural representation of expected value. J. Neurosci. 2005;25:4806–4812.
- Levy H, Markowitz H.M. Approximating expected utility by a function of mean and variance. Am. Econ. Rev. 1979;69:308–317.
- Livingstone M, Hubel D. Segregation of form, folor, movement, and depth: anatomy, physiology, and perception. Science. 1988;240:740–749.
- Logothetis N.K, Pauls J, Augath M, Trinath T, Oeltermann A. Neurophysiological investigation of the basis of the fMRI signal. Nature. 2001;412:150–157.
- Mackintosh N.J. A theory of attention: variations in the associability of stimulus with reinforcement. Psychol. Rev. 1975;82:276–298.
- Maia T.V, McClelland J.L. A reexamination of the evidence for the somatic marker hypothesis: what participants really know in the Iowa gambling task. Proc. Natl Acad. Sci. USA. 2004;101:16 075–16 080.
- Markowitz H. Portfolio selection. J. Finance. 1952;7:77–91.
- McCoy A.N, Platt M.L. Risk-sensitive neurons in macaque posterior cingulate cortex. Nat. Neurosci. 2005;8:1220–1227.
- McCoy A.N, Crowley J.C, Haghighian G, Dean H.L, Platt M.L. Saccade reward signals in posterior cingulate cortex. Neuron. 2003;40:1031–1040.
- McNamara J, Houston A. The application of statistical decision theory to animal behaviour. J. Theor. Biol. 1980;85:673–960.
- Mobini S, Body S, Ho M.-Y, Bradshaw C.M, Szabadi E, Deakin J.F.W, Anderson I.M. Effects of lesions of the orbitofrontal cortex on sensitivity to delayed and probabilistic reinforcement. Psychopharmacology. 2002;160:290–298.
- Musallam S, Corneil B.D, Greger B, Scherberger H, Andersen R.A. Cognitive control signals for neural prosthetics. Science. 2004;305:258–262.
- Padoa-Schioppa C, Assad J.A. Neurons in the orbitofrontal cortex encode economic value. Nature. 2006;441:223–226.
- Paton J.J, Belova M.A, Morrison S.E, Salzman C.D. The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature. 2006;439:865–870.
- Pearce J.M, Hall G. A model for Pavlovian conditioning: variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychol. Rev. 1980;87:532–552.
- Platt M.L, Glimcher P.W. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238.
- Preuschoff K, Bossaerts P. Adding prediction risk to the theory of reward learning. Ann. NY Acad. Sci. 2007;1104:135–146.
- Preuschoff K, Bossaerts P, Quartz S.R. Neural differentiation of expected reward and risk in human subcortical structures. Neuron. 2006;51:381–390.
- Real L.A. Animal choice behavior and the evolution of cognitive architecture. Science. 1991;253:980–986.
- Rescorla R.A, Wagner A.R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: Black A.H, Prokasy W.F, editors. Classical conditioning II: current research and theory. Appleton Century Crofts; New York, NY: 1972. pp. 64–99.
- Samejima K, Ueda Y, Doya K, Kimura M. Representation of action-specific reward values in the striatum. Science. 2005;310:1337–1340.
- Sanfey A.G, Hastie R, Colvin M.K, Grafman J. Phineas gauged: decision making and the human prefrontal cortex. Neuropsychologia. 2003;41:1218–1229.
- Schultz W, Dayan P, Montague R.R. A neural substrate of prediction and reward. Science. 1997;275:1593–1599.
- Shidara M, Richmond B.J. Anterior cingulate: single neuron signals related to degree of reward expectancy. Science. 2002;296:1709–1711.
- Stephens J.W, Krebs J.R. Princeton University Press; Princeton, NJ: 1986. Foraging theory.
- Sutton R.S, Barto A.G. MIT Press; Cambridge, MA: 1998. Reinforcement learning.
- Tobin J. Liquidity preference as behavior towards risk. Rev. Econ. Stud. 1958;25:65–86.
- Tobler P.N, Fiorillo C.D, Schultz W. Adaptive coding of reward value by dopamine neurons. Science. 2005;307:1642–1645.
- Tobler P.N, O'Doherty J.P, Dolan R, Schultz W. Reward value coding distinct from risk attitude-related uncertainty coding in human reward systems. J. Neurophysiol. 2007;97:1621–1632.
- von Neumann J, Morgenstern O. Princeton University Press; Princeton, NJ: 1944. The theory of games and economic behavior.
- Watanabe M. Reward expectancy in primate prefrontal neurons. Nature. 1996;382:629–632.
- Weber E.U, Milliman R.A. Perceived risk attitudes: relating risk perception to risky choice. Manage. Sci. 1997;43:123–144.
- Weber E.U, Shafir S, Blais A.-R. Predicting risk sensitivity in humans and lower animals: risk as variance or coefficient of variation. Psychol. Rev. 2004;111:430–445.
Source: PubMed