The neural correlates of subjective value during intertemporal choice

Joseph W Kable, Paul W Glimcher, Joseph W Kable, Paul W Glimcher

Abstract

Neuroimaging studies of decision-making have generally related neural activity to objective measures (such as reward magnitude, probability or delay), despite choice preferences being subjective. However, economic theories posit that decision-makers behave as though different options have different subjective values. Here we use functional magnetic resonance imaging to show that neural activity in several brain regions--particularly the ventral striatum, medial prefrontal cortex and posterior cingulate cortex--tracks the revealed subjective value of delayed monetary rewards. This similarity provides unambiguous evidence that the subjective value of potential rewards is explicitly represented in the human brain.

Figures

Figure 1
Figure 1
Intertemporal choice task. The sequence of events within a trial is shown. On each trial, subjects chose between an immediate and a delayed reward. The immediate reward was the same ($20) on every trial and was never presented visually. A red dot signaled the beginning of a trial, after which the delayed reward for that trial was presented for 2 s and then replaced again by the red dot. Subjects then had 6 s to consider their choice. Throughout the trial, subjects were required to hold down a button, and they indicated their decision by releasing or continuing to hold the button when the dot turned green. For half of the session, a button release indicated a choice of the delayed reward; for the other half, a button release indicated a choice of the immediate reward. If subjects released the button before the green light appeared, that trial was considered abandoned and removed from the analysis. The inter-trial interval was 2 s for behavioral sessions and 12 s for scanning sessions.
Figure 2
Figure 2
Subject-specific discount functions. (ac) Choice data from three subjects during a single scanning session. Points are shaded according to the imposed delay to the delayed reward, and denote the fraction of times the subject chose the delayed reward over an immediate reward of $20 as a function of the objective amount of the delayed reward. The smooth curves are logistic functions fit to these data. Data from different delays are slightly offset so that all data are visible. (df) Indifference points, plotted as a function of the imposed delay to the delayed reward. Indifference points were estimated from the logistic fit in ac as the amount for each delay at which the subject would choose the immediate and delayed rewards with equal frequency. The increase in indifference amounts with delay was fit by a line with a fixed intercept at $20. Delays are shaded as in ac. (gi) Indifference points from (df), divided into $20 to obtain a discount function. The decrease in subjective value with delay was fit with a single-parameter hyperbolic function. Delays are shaded as in ac. Data from three subjects are shown (YH in a,d,g; JH in b,e,h; CH in c,f,i) to illustrate the observed heterogeneity in discount functions across subjects. CH was our most impulsive subject (k = 0.1189), YH was our most patient subject (k = 0.0005), and JH was near the median discount rate (k = 0.0097).
Figure 3
Figure 3
Group analysis showing areas in which activity is correlated with subjective value. (a) Areas in which neural activity was correlated with subjective value (during the 6–10-s window) in a random-effects group analysis, overlayed on the mean normalized anatomical image. Areas of correlation can be seen in the medial prefrontal cortex and posterior cingulate cortex (sagittal and axial images) and in the ventral striatum (coronal image). The color scale represents the t-value of the contrast testing for a significant effect of subjective value at time points 4–6 in the trial. (be) Activity in the ventral striatum, medial prefrontal cortex and posterior cingulate cortex was better correlated with subjective value than with (b) the objective amount of the delayed reward, (c) the inverse delay of the delayed reward, (d) the choice of the subject (chose delayed > chose immediate), or (e) the value of the delayed reward calculated using a single fixed discount rate for all subjects (k = 0.01, near the median for our subjects). Areas in which activity correlated with subjective value are shown in yellow, areas in which activity correlated with the other variables are shown in red, and areas of overlap are shown in orange. All maps are thresholded at P < 0.005 (uncorrected), spatial extent > 100 mm3. Data are shown in radiological convention, with the right hemisphere on the left.
Figure 4
Figure 4
Single-subject analyses showing areas in which activity correlated with subjective value. Data from five subjects are shown to illustrate that subjective value effects were evident for subjects spanning the entire range of behavioral discount rates. (ae) Discount functions for five subjects, as measured behaviorally during a scanning session. These subjects include one of our more patient subjects (HM) and our most impulsive subject (CH). (fj) Maps showing areas in which neural activity was correlated with subjective value in each subject. The sagittal overlay shows areas of activity in the medial prefrontal cortex, the coronal overlay areas of activity in the ventral striatum, and the axial overlay areas of activity in the posterior cingulate cortex. The color scale represents the t-value of the contrast testing for a significant effect of subjective value at time points 4–6 in the trial. These maps are thresholded at P < 0.01 (uncorrected), and the color scale ranges from this value to P < 0.05 (corrected for false discovery rate). Data are shown in radiological convention, with the right hemisphere on the left.
Figure 5
Figure 5
Single-subject time courses and neural discount functions. (ac) Data from three subjects (HM, see Fig. 4a,f; RA, see Fig. 4c,h; and CH, see Fig. 4e,j) are shown. Data were averaged over all voxels that showed a correlation between activity and subjective value in the ventral striatum, medial prefrontal cortex and posterior cingulate cortex (from the individual-level analyses shown in Fig. 4), and then re-plotted as trial averages. Trial averages are color-coded by the imposed delay to the delayed reward. The 6–10-s window in which we observed significant effects is shown in gray. The largest standard error is shown on the right. The arrows indicate the point in the trial at which the delayed option was presented. (df) Data from ac, summed over the 6–10-s window and re-plotted as a function of delay. The solid black line represents average predicted activity at each delay, from the fit of the subjective value regression using a subject-specific discount rate. Predicted activity is simply a scaled and shifted version of each subject’s behavioral discount function. This regression is also used to scale the y-axis across subjects (see Supplementary Methods).
Figure 6
Figure 6
Psychometric-neurometric comparisons. (a,d) A measure of the neural effect of delay is plotted against the subject’s behavioral discount rate for both (a) ROIs defined on the basis of the subjective value regression and (d) value ROIs defined in an unbiased manner. The neural effect of delay used is the ratio of the regression slope of neural activity against delay compared to that against amount. (b,e) The neural discount rate is plotted against the subject’s behavioral discount rate for both (b) subjective value ROIs and (e) unbiased value ROIs. The neural discount rate is estimated using a nonlinear version of the subjective value regression where k can vary. Colored lines show the robust linear fit for each ROI; black line shows the fit collapsed across all ROIs. Because of their skewed distribution, the ratio and neural and behavioral discount rates are log-transformed. All three data points from the most patient subject (bottom left) overlap in panels a,b,d and e. (c,f) Difference between neural discount rate and behavioral discount rate for both (c) subjective value ROIs and (f) unbiased value ROIs. Colored triangles indicate the median differences for each ROI. Panels ac exclude two and df exclude one subject-ROI pair where an ROI could not be defined. Panel d also excludes two subject-ROI pairs where the correlation with amount was negative and ef exclude five subject-ROI pairs where no discount rate accounted for a significant amount of variance in neural activity (see Methods).
Figure 7
Figure 7
Single-subject example of unbiased value ROIs and resulting neural discount functions. (a) ROIs in the ventral striatum, medial prefrontal cortex and posterior cingulate cortex are shown for one subject (HM, same subject shown in Fig. 4a,f and Fig. 5a,d), who demonstrated a relatively close psychometric-neurometric match in each region. Voxels were selected within these anatomically defined regions which showed either greater activity for trials involving the largest objective amount of the delayed reward than for trials involving the smallest amount, or greater activity for trials involving the shortest delay to the delayed reward than for those involving the longest delay. (bd) A psychometric-neurometric comparison is shown for each ROI for this subject. Mean neural activity and standard error, summed over the 6–10-s window, are plotted as a function of the imposed delay to the delayed reward. The red line represents average predicted activity at each delay, from the fit of the subjective value regression using the subject’s behavioral discount rate (k = 0.0042). This regression is also used to scale the y-axis across ROIs (see Methods). The black line represents average predicted activity at each delay, from the fit of a nonlinear version of the subjective value regression where the discount rate is allowed to vary. The discount rates estimated from the neural data are 0.0052 in the ventral striatum, 0.0010 in the medial prefrontal cortex and 0.0099 in the posterior cingulate cortex.
Figure 8
Figure 8
Neural activity tracks subjective value, and not a more impulsive (β) or more patient (δ) estimate of value. (a,b) Difference between the neural (single exponential) discount rate and the behavioral (single exponential) discount rate, calculated separately for each ROI in each subject. The difference between neural and behavioral discount rates is centered on zero for both (a) ROIs defined on the basis of the subjective value regression and (b) value ROIs defined in an unbiased manner. (c,d) Difference between the neural (single exponential) discount rate and β, the steeper exponential from the sum of exponentials discount function estimated behaviorally. On average, β is larger than the neural discount rate for both (c) subjective value ROIs and (d) unbiased value ROIs. (e,f) Difference between the neural (single exponential) discount rate and δ, the shallower exponential from the sum of exponentials discount function estimated behaviorally. On average, the neural discount rate is larger than δ for both (e) subjective value ROIs and (f) unbiased value ROIs. Colored triangles in each panel indicate the medians for each ROI separately. These data exclude two subjects (six subject-ROI pairs) for which the fit of the β-δ model collapsed to a single exponential function (that is, β = δ). Panels b,d and f also exclude two subject-ROI pairs for which no discount rate accounted for a significant amount of variance in neural activity (see Methods).

References

    1. Weber EH. De Pulsu, Resorptione, Audita et Tactu. Annotationes Anatomicae et Physiologicae. CF Koehler; Lipsiae: 1834.
    1. Fechner GT. Elemente der Psychophysik. Breitkopf & Hartel; Leipzig: 1860.
    1. Stevens S. Neural events and the psychophysical law. Science. 1970;170:1043–1050.
    1. Bernoulli D. The Works. Birkhauser; Boston: 1982.
    1. Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238.
    1. Abler B, Walter H, Erk S, Kammerer H, Spitzer M. Prediction error as a linear function of reward probability is coded in human nucleus accumbens. Neuroimage. 2006;31:790–795.
    1. Cromwell HC, Schultz W. Effects of expectations for different reward magnitudes on neuronal activity in primate striatum. J Neurophysiol. 2003;89:2823–2838.
    1. Delgado MR, Nystrom LE, Fissell C, Noll DC, Fiez JA. Tracking the hemodynamic responses to reward and punishment in the striatum. J Neurophysiol. 2000;84:3072–3077.
    1. McCoy AN, Crowley JC, Haghighian G, Dean HL, Platt ML. Saccade reward signals in posterior cingulate cortex. Neuron. 2003;40:1031–1040.
    1. O’Doherty J, Critchley H, Deichmann R, Dolan RJ. Dissociating valence of outcome from behavioral control in human orbital and ventral prefrontal cortices. J Neurosci. 2003;23:7931–7939.
    1. Breiter HC, Aharon I, Kahneman D, Dale A, Shizgal P. Functional imaging of neural responses to expectancy and experience of monetary gains and losses. Neuron. 2001;30:619–639.
    1. Nieuwenhuis S, et al. Activity in human reward-sensitive brain areas is strongly context dependent. Neuroimage. 2005;25:1302–1309.
    1. Rogers RD, et al. Distinct portions of anterior cingulate cortex and medial prefrontal cortex are activated by reward processing in separable phases of decision-making cognition. Biol Psychiatry. 2004;55:594–602.
    1. Kuhnen CM, Knutson B. The neural basis of financial risk taking. Neuron. 2005;47:763–770.
    1. Knutson B, Taylor J, Kaufman M, Peterson R, Glover G. Distributed neural representation of expected value. J Neurosci. 2005;25:4806–4812.
    1. Bowman EM, Aigner TG, Richmond BJ. Neural signals in the monkey ventral striatum related to motivation for juice and cocaine rewards. J Neurophysiol. 1996;75:1061–1073.
    1. Cromwell HC, Hassani OK, Schultz W. Relative reward processing in primate striatum. Exp Brain Res. 2005;162:520–525.
    1. McCoy AN, Platt ML. Risk-sensitive neurons in macaque posterior cingulate cortex. Nat Neurosci. 2005;8:1220–1227.
    1. Dorris MC, Glimcher PW. Activity in posterior parietal cortex is correlated with the relative subjective desirability of action. Neuron. 2004;44:365–378.
    1. McClure SM, et al. Neural correlates of behavioral preference for culturally familiar drinks. Neuron. 2004;44:379–387.
    1. Gottfried JA, O’Doherty J, Dolan RJ. Encoding predictive reward value in human amygdala and orbitofrontal cortex. Science. 2003;301:1104–1107.
    1. Tanaka SC, et al. Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat Neurosci. 2004;7:887–893.
    1. Padoa-Schioppa C, Assad JA. Neurons in the orbitofrontal cortex encode economic value. Nature. 2006;441:223–226.
    1. Daw ND, O’Doherty JP, Dayan P, Seymour B, Dolan RJ. Cortical substrates for exploratory decisions in humans. Nature. 2006;441:876–879.
    1. O’Doherty JP, Buchanan TW, Seymour B, Dolan RJ. Predictive neural coding of reward preference involves dissociable responses in human ventral midbrain and ventral striatum. Neuron. 2006;49:157–166.
    1. Johnson KO, Hsiao SS, Yoshioka T. Neural coding and the basic law of psychophysics. Neuroscientist. 2002;8:111–121.
    1. Parker AJ, Newsome WT. Sense and the single neuron: probing the physiology of perception. Annu Rev Neurosci. 1998;21:227–277.
    1. Werner G, Mountcastle VB. Quantitative relations between mechanical stimuli to the skin and neural responses evoked by them. In: Kenshalo DR, editor. The Skin Senses. Charles C Thomas Publisher; Springfield, Illinois: 1968. pp. 112–138.
    1. Newsome WT, Britten KH, Movshon JA. Neuronal correlates of a perceptual decision. Nature. 1989;341:52–54.
    1. Frederick S, Loewenstein G, O’Donoghue T. Time discounting and time preference: a critical review. J Econ Lit. 2002;40:351–401.
    1. Mazur JE. An adjusting procedure for studying delayed reinforcement. In: Commons ML, Mazur JE, Nevin JA, Rachlin H, editors. Quantitative Analysis of Behavior: The Effects of Delay and Intervening Events on Reinforcement Value. Lawrence Erlbaum Associates, Publishers; Hillsdale, New Jersey: 1987. pp. 55–73.
    1. Laibson D. Golden eggs and hyperbolic discounting. Q J Econ. 1997;12:443–477.
    1. Green L, Myerson J. A discounting framework for choice with delayed and probabilistic rewards. Psychol Bull. 2004;130:769–792.
    1. Benhabib J, Bisin A, Schotter A. Working Paper. New York University Department of Economics; New York: 2006. Present-bias, quasi-hyperbolic discounting and fixed costs. < >.
    1. McClure SM, Laibson DI, Loewenstein G, Cohen JD. Separate neural systems value immediate and delayed monetary rewards. Science. 2004;306:503–507.
    1. Teller DY, Pugh EN. Linking propositions in color vision. In: Mollon JD, Sharpe LT, editors. Colour Vision: Physiology and Psychophysics. Academic Press; New York: 1983. pp. 577–589.
    1. von Neumann J, Morgenstern O. The Theory of Games and Economic Behavior. Princeton University Press; Princeton, New Jersey: 1944.
    1. Samuelson PA. Foundations of Economic Analysis. Harvard University Press; Cambridge, Massachusetts: 1947.
    1. Gul F, Pesendorfer W. Working Paper. Princeton University Department of Economics; Princeton, New Jersey: 2005. The case for mindless economics. < >.
    1. Gusnard DA, Raichle ME. Searching for a baseline: functional imaging and the resting human brain. Nat Rev Neurosci. 2001;2:685–694.
    1. Zysset S, Huber O, Ferstl E, von Cramon DY. The anterior frontomedian cortex and evaluative judgment: an fMRI study. Neuroimage. 2002;15:983–991.
    1. Hsu M, Bhatt M, Adolphs R, Tranel D, Camerer CF. Neural systems responding to degrees of uncertainty in human decision-making. Science. 2005;310:1680–1683.
    1. Paulus MP, Frank LR. Anterior cingulate activity modulates nonlinear decision weight function of uncertain prospects. Neuroimage. 2006;30:668–677.
    1. Tom SM, Fox CR, Trepel C, Poldrack RA. The neural basis of loss aversion in decision-making under risk. Science. 2007;315:515–518.
    1. Sugrue LP, Corrado GS, Newsome WT. Matching behavior and the representation of value in the parietal cortex. Science. 2004;304:1782–1787.
    1. Shadlen MN, Newsome WT. Motion perception: seeing and deciding. Proc Natl Acad Sci USA. 1996;93:628–633.
    1. Collier M, Williams MB. Eliciting individual discount rates. Exp Econ. 1999;2:107–127.
    1. Camerer CF, Hogarth RM. The effects of financial incentives in experiments: a review and capital-labor-production framework. J Risk Uncertain. 1999;19:7–42.
    1. Smith VL, Walker JM. Monetary rewards and decision cost in experimental economics. Econ Inq. 1993;31:245–261.

Source: PubMed

3
Abonneren