Neural Mechanisms Behind Identification of Leptokurtic Noise and Adaptive Behavioral Response

Mathieu d'Acremont, Peter Bossaerts, Mathieu d'Acremont, Peter Bossaerts

Abstract

Large-scale human interaction through, for example, financial markets causes ceaseless random changes in outcome variability, producing frequent and salient outliers that render the outcome distribution more peaked than the Gaussian distribution, and with longer tails. Here, we study how humans cope with this evolutionary novel leptokurtic noise, focusing on the neurobiological mechanisms that allow the brain, 1) to recognize the outliers as noise and 2) to regulate the control necessary for adaptive response. We used functional magnetic resonance imaging, while participants tracked a target whose movements were affected by leptokurtic noise. After initial overreaction and insufficient subsequent correction, participants improved performance significantly. Yet, persistently long reaction times pointed to continued need for vigilance and control. We ran a contrasting treatment where outliers reflected permanent moves of the target, as in traditional mean-shift paradigms. Importantly, outliers were equally frequent and salient. There, control was superior and reaction time was faster. We present a novel reinforcement learning model that fits observed choices better than the Bayes-optimal model. Only anterior insula discriminated between the 2 types of outliers. In both treatments, outliers initially activated an extensive bottom-up attention and belief network, followed by sustained engagement of the fronto-parietal control network.

Keywords: anterior insula; fronto-parietal control network; leptokurtic noise; outliers; reinforcement learning.

© The Author 2016. Published by Oxford University Press.

Figures

Figure 1.
Figure 1.
(a) Random shifting in (“mixing” of) the variance of a Gaussian distribution (left) generates an outcome distribution that is leptokurtic (right); outliers are frequent and salient. (b) Histogram and fitted Gaussian curve, daily rates of return (rates of movement of value from market close in 1 day to the next) of Standard and Poor's (S&P) index of 500 major US common stock, 1 June 1988 to 28 June 2013. (c) Cover of Time magazine for the week after the October 1987 stock market crash.
Figure 2.
Figure 2.
(a) Sequence of events in one trial. Left: Following a movement of the target (old position: open red circle; new position: filled red circle; movement direction: red arrow), the arc between the position of the robot (filled blue circle) and the target is indicated in yellow. This is also the “prediction error,” that is, the size of the mistake of the participant's guess of the new position of the target. Middle: Participant adjusts position of robot by moving a slider (old position: open white rectangle; new position: filled white rectangle) somewhere between 0 (robot stays at current location) to 1 (robot moves all the way to target position). Here, participant instructs the robot to move about 80% toward the target. The goal is to move the robot as close as possible to the position of the target in the next trial. Right: Robot executes the instruction and moves about 80% toward the target (blue arrow; old position indicated by open blue circle; new position indicated by closed blue circle). (b) Sample paths of locations of target in one Run, Transitory Treatment (top), and Fundamental Treatment (bottom). Outliers are salient, showing as sudden large movements. In the Transitory Treatment, outliers revert in the subsequent trial. They are permanent in the Fundamental Treatment. (c) Empirical distribution of target movements in the Transitory Treatment (top) and in the Fundamental Treatment (bottom). Movements are expressed in radians; one full circle corresponds to 2π radians, or approximately 6.28 radians.
Figure 3.
Figure 3.
(a) Average (±1 standard error [SE]) learning rates (fraction of length of arc spanning pretrial position of robot and new position of target that participant instructs robot to cover; a learning rate of 1 instructs robot to fully catch up with the target). Trials are divided into 3 categories: Nonoutlier (regular) trials; outlier trials; postoutlier trial (Fundamental Treatment; this refers to the trial following an outlier) or reversal trial (Transitory Treatment; this refers to the trial following an outlier, but only if the movement back covers at least the size of an outlier). First Run of 200 trials in yellow; second Run in orange. Green line segments indicate Bayes-optimal learning rates. (b) Average (±1 SE) prediction errors (length of arc spanning pretrial position of robot and new position of target), stratified by Treatment, Run, and Trial types. Green line segments show prediction errors from implementing the Bayes-optimal strategy. (c) Average (±1 SE) reaction times (in seconds), stratified by Treatment, Run, and Trial types. (d) Average learning rates in event time (outlier trial = “0”), stratified by treatment (Transitory: learning rates decrease in outlier trial; Fundamental: learning rates increase upon an outlier). Bayes-optimal model averages: green; Contrarian RL model averages: magenta; participant averages: black.
Figure 4.
Figure 4.
(a) Statistical parametric brain maps of significant BOLD activation correlating with outlier trials (early onset), whole-brain corrected, all Runs of both Treatments. Significant activation emerges in a wide network engaged in bottom-up attention to sensory stimuli. (b) Statistical parametric brain maps of significant BOLD activation correlating with outlier trials (late onset), whole-brain corrected, all Runs of both Treatments. Significant delayed activation emerges in the fronto-parietal control network. (c) Statistical parametric brain maps (left) and time courses (right) of activation in the attentional (red) and fronto-parietal (green) networks correlating with outlier trials, all Runs of both Treatments. Time courses aligned with initiation of target movement (time = 0).
Figure 5.
Figure 5.
Localization of significant BOLD activation correlating with outlier trials (early onset), separated by treatment. BOLD signals in anterior insula and medial wall of the anterior cingulate cortex were significant (whole-brain corrected) in the transitory treatment (bottom left) while they were not in the Fundamental Treatment (top left). Right: Only anterior insula survived (at P < 0.001 uncorrected) direct testing of differential activation across treatments.

References

    1. Allman JM, Tetreault NA, Hakeem AY, Manaye KF, Semendeferi K, Erwin JM, Park S, Goubert V, Hof PR. 2011. The von Economo neurons in the frontoinsular and anterior cingulate cortex. Ann N Y Acad Sci. 1225:59–71.
    1. Ayton P, Fischer I. 2004. The hot hand fallacy and the gambler's fallacy: two faces of subjective randomness? Mem Cognit. 32:1369–1378.
    1. Bai X, Russell JR, Tiao GC. 2003. Kurtosis of GARCH and stochastic volatility models with non-normal innovations. J Econom. 114:349–360.
    1. Behrens TE, Woolrich MW, Walton ME, Rushworth MF. 2007. Learning the value of information in an uncertain world. Nat Neurosci. 10:1214–1221.
    1. Berkman ET, Kahn LE, Merchant JS. 2014. Training-induced changes in inhibitory control network activity. J Neurosci. 34:149–157.
    1. Blakemore SJ, Sarfati Y, Bazin N, Decety J. 2003. The detection of intentional contingencies in simple animations in patients with delusions of persecution. Psychol Med. 33:1433–1441.
    1. Brass M, Haggard P. 2007. To do or not to do: the neural signature of self-control. J Neurosci. 27:9141–9145.
    1. Brett M, Anton JL, Valabregue R, Poline JB. 2002. Region of interest analysis using an SPM toolbox. Presented at the 8th International Conference on Functional Mapping of the Human Brain, June 2–6, Sendai, Japan.
    1. Brown SD, Steyvers M. 2009. Insensitivity to future consequences following damage to human prefrontal cortex. Cogn Psych. 58:49–67.
    1. Brugger P. 2001. From haunted brain to haunted science: a cognitive neuroscience view of paranormal and pseudoscientific thought. In: Houran J, Lange R, editors. Hauntings and poltergeists: multidisciplinary perspectives. London: McFarland; p. 195–213.
    1. Carter CS, Braver TS, Barch DM, Botvinick MM, Noll D, Cohen JD. 1998. Anterior cingulate cortex, error detection, and the online monitoring of performance. Science. 280:747–749.
    1. Chumbley J, Friston KJ. 2009. False discovery rate revisited: FDR and topological inference using Gaussian random fields. Neuroimage. 44:62–70.
    1. Chumbley J, Worsley K, Flandin G, Friston K. 2010. Topological FDR for neuroimaging. Neuroimage. 49:3057–3064.
    1. Cohen JD, McClure SM, Yu AJ. 2007. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos Trans R Soc, B. 362:933–942.
    1. Cole MW, Reynolds JR, Power JD, Repovs G, Anticevic A, Braver TS. 2013. Multi-task connectivity reveals flexible hubs for adaptive task control. Nat Neurosci. 16:1348–1355.
    1. Craig AD. 2009. How do you feel—now? The anterior insula and human awareness. Nat Rev Neurosci. 10:59–70.
    1. Craig AD. 2011. Significance of the insula for the evolution of human awareness of feelings from the body. Ann N Y Acad Sci. 1225:72–82.
    1. d'Acremont M, Fornari E, Bossaerts P. 2013. Activity in inferior parietal and medial prefrontal cortex signals the accumulation of evidence in a probability learning task. PLoS Comput Biol. 9: e1002895.
    1. d'Acremont M, Lu Z, Li X, Van der Linden M, Bechara A. 2009. Neural correlates of risk prediction error during reinforcement learning in humans. Neuroimage. 47:1929–1939.
    1. d'Acremont M, Schultz W, Bossaerts P. 2013. The human brain encodes event frequencies while forming subjective beliefs. J Neurosci. 33:10887–10897.
    1. Damsa C, Kosel M, Moussally J. 2009. Current status of brain imaging in anxiety disorders. Curr Opin Psych. 2:96–110.
    1. Daw ND, O'Doherty JP, Dayan P, Seymour B, Dolan RJ. 2006. Cortical substrates for exploratory decisions in humans. Nature. 441:876–879.
    1. De Bondt WF, Thaler RH. 1990. Do security analysts overreact? Am Econ Rev. 90:52–57.
    1. Eichele T, Debener S, Calhoun VD, Specht K, Engel AK, Hugdahl K, Cramon DYV, Ullsperger M. 2008. Prediction of human errors by maladaptive changes in event-related brain networks. Proc Natl Acad Sci USA. 105:6173–6178.
    1. Embrechts P, Klueppelber C, Mikosch T. 1997. Modelling extremal events for insurance and finance. Berlin and Heidelberg (Germany): Springer.
    1. Etkin A, Wager TD. 2007. Functional neuroimaging of anxiety: a meta-analysis of emotional processing in PTSD, social anxiety disorder, and specific phobia. Am J Psychiatry. 164:1476–1488.
    1. Fischer M. 2010. Generalized Tukey-type distributions with application to financial and teletraffic data. Stat Pap. 51:41–56.
    1. Friston K. 2010. The free-energy principle: a unified brain theory? Nat Rev Neurosci. 11:127–138.
    1. Gilovich T, Vallone R, Tversky A. 1985. The hot hand in basketball: on the misperception of random sequences. Cog Psych. 17:295–314.
    1. Glaescher J, Adolphs R, Damasio H, Bechara A, Rudrauf D, Calamia M, Paul LK, Tranel D. 2012. Lesion mapping of cognitive control and value-based decision making in the prefrontal cortex. Proc Natl Acad Sci USA. 109:14681–14686.
    1. Hampton A, Bossaerts P, O'Doherty J. 2006. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J Neurosci. 26:8360–8367.
    1. Hsu DA. 1979. Detecting shifts of parameter in gamma sequences with applications to stock price and air traffic flow analysis. J Am Stat Assoc. 74:31–40.
    1. Kim H. 2014. Involvement of the dorsal and ventral attention networks in oddball stimulus processing: a meta-analysis. Hum Brain Mapp. 35:2265–2284.
    1. Kitagawa G. 1987. Non-Gaussian state-space modeling of nonstationary time series. J Am Stat Assoc. 82:1032–1041.
    1. Klein TA, Endrass T, Kathmann N, Neumann J, Von Cramon DY, Ullsperger M. 2007. Neural correlates of error awareness. Neuroimage. 34:1774–1781.
    1. Kriegeskorte N, Simmons WK, Bellgowan PSF, Baker CI. 2009. Circular analysis in systems neuroscience—the dangers of double dipping. Nat Neurosci. 12:535–540.
    1. Mandelbrot B. 1963. The variation of certain speculative prices. J Bus. 36:394–419.
    1. Mandelbrot B, Van Ness J. 1968. Fractional Brownian motions, fractional noises and applications. SIAM Rev. 10:422–437.
    1. McGuire JT, Nassar MR, Gold JI, Kable JW. 2014. Functionally dissociable influences on learning rate in a dynamic environment. Neuron. 84:870–881.
    1. Menon V, Uddin L. 2010. Saliency, switching, attention and control: a network model of insula function. Brain Struct Funct. 214:655–667.
    1. Nassar MR, Rumsey KM, Wilson RC, Parikh K, Heasly B, Gold JI. 2012. Rational regulation of learning dynamics by pupil-linked arousal systems. Nat Neurosci. 15:1040–1046.
    1. O'Reilly JX, Schüffelgen U, Cuell SF, Behrens TE, Mars RB, Rushworth MF. 2013. Dissociable effects of surprise and model update in parietal and anterior cingulate cortex. Proc Natl Acad Sci USA. 110:3660–3669.
    1. Payzan-LeNestour E, Bossaerts P. 2010. Risk, estimation uncertainty, and unexpected uncertainty: Bayesian learning in unstable settings. PLoS Comput Biol. 6:29–30.
    1. Payzan-LeNestour E, Dunne S, Bossaerts P, O'Doherty JP. 2013. The neural representation of unexpected uncertainty during value-based decision making. Neuron. 79:191–201.
    1. Preuschoff K, Marius't Hart B, Einhäuser W. 2011. Pupil dilation signals surprise: evidence for noradrenaline's role in decision making. Front Neurosci. 5:115 .
    1. Preuschoff K, Quartz S, Bossaerts P. 2008. Human insula activation reflects risk prediction errors as well as risk. J Neurosci. 28:2745–2752.
    1. R Core Team. 2015. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
    1. Shenhav A, Botvinick MM, Cohen JD. 2013. The expected value of control: an integrative theory of anterior cingulate cortex function. Neuron. 79:217–240.
    1. Sutton RS. 1992. Adapting bias by gradient descent: an incremental version of delta-bar-delta. In: Proceedings of the 10th National Conference on Artificial Intelligence. Cambridge: (MA: ): MIT Press; p. 171–176.
    1. Tervo DG, Proskurin M, Manakov M, Kabra M, Vollmer A, Branson K, Karpova AY. 2014. Behavioral variability through stochastic choice and its gating by anterior cingulate cortex. Cell. 159(1):21–32.
    1. Wolpert RL, Taqqu MS. 2005. Fractional Ornstein–Uhlenbeck Lévy processes and the Telecom process: upstairs and downstairs. Signal Process. 85:1523–1545.
    1. Yu A, Dayan P. 2003. Expected and unexpected uncertainty: ACh and NE in the neocortex. Adv Neur Inf Process Syst. 173–180.
    1. Yu A, Dayan P. 2005. Uncertainty, neuromodulation, and attention. Neuron. 46:681–692.

Source: PubMed

3
Sottoscrivi