Beyond linearity in neuroimaging: Capturing nonlinear relationships with application to longitudinal studies

Gang Chen, Tiffany A Nash, Katherine M Cole, Philip D Kohn, Shau-Ming Wei, Michael D Gregory, Daniel P Eisenberg, Robert W Cox, Karen F Berman, J Shane Kippenhan, Gang Chen, Tiffany A Nash, Katherine M Cole, Philip D Kohn, Shau-Ming Wei, Michael D Gregory, Daniel P Eisenberg, Robert W Cox, Karen F Berman, J Shane Kippenhan

Abstract

The ubiquitous adoption of linearity for quantitative predictors in statistical modeling is likely attributable to its advantages of straightforward interpretation and computational feasibility. The linearity assumption may be a reasonable approximation especially when the variable is confined within a narrow range, but it can be problematic when the variable's effect is non-monotonic or complex. Furthermore, visualization and model assessment of a linear fit are usually omitted because of challenges at the whole brain level in neuroimaging. By adopting a principle of learning from the data in the presence of uncertainty to resolve the problematic aspects of conventional polynomial fitting, we introduce a flexible and adaptive approach of multilevel smoothing splines (MSS) to capture any nonlinearity of a quantitative predictor for population-level neuroimaging data analysis. With no prior knowledge regarding the underlying relationship other than a parsimonious assumption about the extent of smoothness (e.g., no sharp corners), we express the unknown relationship with a sufficient number of smoothing splines and use the data to adaptively determine the specifics of the nonlinearity. In addition to introducing the theoretical framework of MSS as an efficient approach with a counterbalance between flexibility and stability, we strive to (a) lay out the specific schemes for population-level nonlinear analyses that may involve task (e.g., contrasting conditions) and subject-grouping (e.g., patients vs controls) factors; (b) provide modeling accommodations to adaptively reveal, estimate and compare any nonlinear effects of a predictor across the brain, or to more accurately account for the effects (including nonlinear effects) of a quantitative confound; (c) offer the associated program 3dMSS to the neuroimaging community for whole-brain voxel-wise analysis as part of the AFNI suite; and (d) demonstrate the modeling approach and visualization processes with a longitudinal dataset of structural MRI scans.

Trial registration: ClinicalTrials.gov NCT01132885 NCT01434368.

Published by Elsevier Inc.

Figures

Fig. 1.
Fig. 1.
Comparisons of data fitting among three polynomial models and a smoothing spline approach. Linear fitting (black line) between BOLD response and age in a region, as typically adopted in neuroimaging analysis, sometimes may render a roughly acceptable performance with a general sign (positive or negative) for the association between x and y such as (a) here, but other times it could fail badly as shown in (b). Quadratic fitting (green) performs better, and the cubic model (orange) is a further improvement. Furthermore, the curve estimated through smoothing splines (blue) provides the best fit without any prior knowledge of parameters such as the order of polynomials. More crucially, smoothing spline modeling is largely self-adaptive without the daunting undertaking of order selection in polynomial fitting.
Fig. 2.
Fig. 2.
Basis functions b0(x), b1(x), …, bK−1(x) with K = 6. Two spline types are discussed here: cardinal cubic splines (a, c) and thin plate splines (b, d). Cubic splines are knot-based; with 6 knots at 0, 0.2, 0.4, 0.6, 0.8, and 1.0 within [0, 1], there are 6 cardinal basis functions (a). Each cardinal basis function (e.g., b4(x) in blue) peaks at the associated knot (x = 0.6) with a value of 1, and takes the value 0 at all other knots. Therefore, the fitted value at a knot corresponds to the weight for the associated basis function. In contrast, as shown in (b, d), each of the thin plate basis functions within [0, 1] is not knot-specific. As the baseline (intercept) is usually modeled separately in real practice, we only utilize K − 1 basis functions to maintain model identifiability (c, d).
Fig. 3.
Fig. 3.
Statistical evidence of asymmetry in age trends of cortical gray matter volume (GMV) shown at voxelwise p < .05 (top) and three selected regions (highlighted in cyan) filtered at voxelwise p < 10−5 (below), based on smoothing spline modeling of trends with age. Trajectories are averaged across voxels within the three selected regions, and their uncertainty bands extend one standard error above and below the trajectories. Adjusted R2 values (Radj2) indicating goodness-of-fit were higher for the smoothing spline model than for a linear model (ranges for voxels within regions are shown below). (a) Heschl’s gyrus, demonstrating left-lateralized asymmetry with trajectories for left and right hemispheres that have roughly similar shape across the age range 7–20 (Radj2 in (0.786, 0.997) for MSS vs. (−0.020, 0.353) for linearity); (b) Inferior temporal gyrus, also showing left-lateralized asymmetry, but with trajectories that reveal decreasing asymmetry over the same age range (Radj2 in (0.978, 0.998) for MSS vs. (0.065, 0.116) for linearity); (c) Anterior insula, where trajectories of right and left hemispheres can be seen to “cross over” (Radj2 in (0.951, 0.983) for MSS vs. (−0.049, −0.022) for linearity).
Fig. 4.
Fig. 4.
Statistical evidence of regions for which asymmetries were captured by a nonlinear trajectory (top, voxelwise threshold of p < 0.05) of cortical gray matter volume (GMV) and three selected regions (highlighted in cyan) (below, voxelwise p < 0.005). These regions were a subset of the general asymmetries shown in Fig. 3. The nonlinear trends are averaged within the three selected regions, and their uncertainty bands extend one standard error above and below the trajectories. (a) shows a right-lateralized region within the intraparietal sulcus where both hemispheres exhibit an increased rate of gray matter reduction in the 10–15 year range (Radj2 in (0.993, 0.996) for MSS vs. (0.001, 0.044) for linearity). (b) and (c) illustrate similarly right-lateralized regions within the temporo-parietal junction (Radj2 in (0.993, 0.998) for MSS vs. (−0.086, 0.018) for linearity) and frontal pole (Radj2 in (0.984, 0.991) for MSS vs. (0.051, 0.102) for linearity), respectively, with an increased rate of reduction over a similar age range.

References

    1. Ashburner J, Ridgway GR, 2012. Symmetric diffeomorphic modeling of longitudinal structural MRI. Front Neurosci 6, 197.
    1. Bay X, Grammont L, Maatouk H, 2016. Generalization of the kimeldorf-wahba correspondence for constrained interpolation. Electron. J. Statist 10, 1580–1595.
    1. Benzécri JP, 1973. L’analyse des données. Tome II: L’analyse des correspondances. Dunod; 1973.
    1. Chen G, Adleman NE, Saad ZS, Leibenluft E, Cox RW, 2014. Applications of multivariate modeling to neuroimaging group analysis: a comprehensive alternative to univariate general linear model. Neuroimage 99, 571–588.
    1. Chen G, Bürkner P-C, Taylor PA, Li Z, Yin L, Glen DR, Kinnison J, Cox RW, Pessoa L, 2019b. An integrative approach to matrix-based analyses in neuroimaging. Hum. Brain Mapp 40 (14), 14–4072.
    1. Chen G, Padmala S, Chen Y, Taylor PA, Cox RW, Pessoa L, 2020b. To pool or not to pool: can we ignore cross-trial variability in FMRI? under review. NeuroImage. In press. doi:10.1016/j.neuroimage.2020.117496.
    1. Chen G, Saad ZS, Adleman NE, Leibenluft E, Cox RW, 2015. Detecting the subtle shape differences in hemodynamic responses at the group level. Front. Neurosci 9, 375. doi:10.3389/fnins.2015.00375.
    1. Chen G, Saad ZS, Britton JC, Pine DS, Cox RW, 2013. Linear mixed-effects modeling approach to FMRI group analysis. Neuroimage 73, 176–190.
    1. Chen G, Taylor PA, Cox RW, Pessoa L, 2020a. Fighting or embracing multiplicity in neuroimaging? neighborhood leverage versus global calibration. Neuroimage 206. doi:10.1016/j.neuroimage.2019.116320.
    1. Chen G, Xiao Y, Taylor PA, Riggins T, Geng F, Redcay E, 2019a. Handling multiplicity in neuroimaging through bayesian lenses with multilevel modeling. Neuroinformatics 17 (4), 515–545.
    1. Cohen MS, 1997. Parametric analysis of fMRI data using linear systems methods. Neuroimage 6 (2), 2–93.
    1. Cole KM, Wei SM, Martinez PE, Nguyuen TV, Gregory M, Kippenhan JS, Kohn P, Soldin S, Nieman LK, Yanovski JA, Schmidt PJ, Berman KF, 2021. The NIMH intramural longitudinal study of the endocrine and neurobiological events accompanying puberty: methods, measures, and maps. Neuroimage (this issue).
    1. Cox RW, 1996. AFNI: Software for analysis and visualization of functional magnetic resonance neuroimages. Comput. Biomed. Res. 29, 162–173.
    1. Dorsaint-Pierre R, Penhune VB, Watkins KE, Neelin P, Lerch JP, Bouffard M, Zatorre RJ, 2006. Asymmetries of the planum temporale and heschl’s gyrus: relationship to language lateralization. Brain 129, 1164–1176.
    1. Duchon J, 1977. Splines Minimizing Rotation Invariant Semi-norms in Sobolev Spaces. In: Schempp W, Zeller K (Eds.), In: Constructive Theory of Functions of Several Variables.. Lecture Notes in Math 571:85–100, Springer, Berlin.
    1. Faghiri A, Stephen JM, Wang Y-P, Wilson TW, Calhoun VD, 2019. Brain development includes linear and multiple nonlinear trajectories: a cross-sectional resting-state functional magnetic resonance imaging study. Brain Connect. 9, 777–788. doi:10.1089/brain.2018.0641.
    1. Fjell AM, Westlye LT, Grydeland H, Amlien I, Espeseth T, Reinvang I, Raz N, Holland D, Dale AM, Walhovd KB, 2013. Critical ages in the life-course of the adult brain: nonlinear subcortical aging. Neurobiol Aging 34, 2239–2247. doi:10.1016/j.neurobiolaging.2013.04.006.
    1. Gelman A, Imbens G, 2019. Why high-order polynomials should not be used in regression discontinuity designs. Journal of Business & Economic Statistics 37 (3), 3–447.
    1. Gregory MD, Mervis CB, Elliott ML, Kippenhan JS, Nash T, Czarapata JB, Prabhakaran R, Roe K, Eisenbert D, Kohn PD, Berman KF, 2019. Williams syndrome hemideletion and LIMK1 variation both affect dorsal stream functional connectivity. Brain. 142 (12), 12–3963.
    1. Hastie TJ, Tibshirani RJ, 1990. Generalized additive models. Chapman & HallCRC.
    1. Kitada R, Kito T, Saito DN, Kochiyama T, Matsumura M, Sadato N, Lederman SJ, 2006. Multisensory activation of the intraparietal area when classifying grating orientation: a functional magnetic resonance imaging study. J Neurosci. 26 (28), 28–7491.
    1. Kong XZ, Mathias SR, Guadalupe T, Glahn DC, Franke B, Crivello F, Tzourio-Mazoyer N, Fisher SE, Thompson PM, Francks C, 2018. Enigma laterality working group,mapping cortical brain asymmetry in 17,141 healthy individuals worldwide via the ENIGMA consortium. Proc Natl Acad Sci USA 115 (2), E5154–E5163.
    1. McFarquhar M, 2019. Modeling group-level repeated measurements of neuroimaging data using the univariate general linear model. Front Neurosci 13, 352.
    1. Pomponio R, Erus G, Habes M, Doshi J, Srinivasan D, Mamourian E, Bashyam V, Nasrallah IM, Satterthwaite TD, Fan Y, Launer LJ, Masters CL, Maruff P, Zhuo C, Völzke H, Johnson SC, Fripp J, Koutsouleris N, Wolf DH, Gur R, Gur R, Morris J, Albert MS, Grabe HJ, Resnick SM, Bryan RN, Wolk DA, Shinohara RT, Shou H, Davatzikos C, 2020. Harmonization of large MRI datasets for the analysis of brain imaging patterns throughout the lifespan. Neuroimage 208, 116450. 10.1016/j.neuroimage.2019.116450
    1. Rasmussen CE, Williams CKI, 2006. Gaussian processes for machine learning. Cambridge, MA: MIT Press.
    1. Runge C, 1901. Uber empirische funktionen und die interpolation zwischen quidistanten ordinaten. Zeitschrift fr Mathematik und Physik 46, 224–243.
    1. de Schotten M. T, Dell’Acqua F, Forkel SJ, Simmons A, Vergani F, Murphy DG, Catani M, 2011. A lateralized brain network for visuospatial attention. Nat Neurosci 14 (10), 10–1245.
    1. Sørensena O, Brandmaier AM, Bros DM, Ebmeier K, Ghisletta P, Kievit RA, M. MA, Walhovd KB, Westerhausen R, Fjell A, 2020. Meta-analysis of generalized additive models in neuroimaging studies. ArXiv: 2002.02627.
    1. Team RC, 2020. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL .
    1. Telzer EH, McCormick EM, Peters S, Cosme D, Pfeifer JH, van Duijvenvoorde ACK, 2018. Methodological considerations for developmental longitudinal fMRI research. Dev. Cogn. Neurosci 33, 149–160.
    1. Wang J-L, Chiou J-M, Müller HG, 2016. Functional data analysis. Annu. Rev. Stat. Appl 3, 257–295.
    1. Wood SN, 2003. Thin plate regression splines. J. R. Statist. Soc. B 65 (1), 1–95.
    1. Wood SN, 2017. Generalized additive models: An introduction with r (2nd ed). Chapman & HallCRC.
    1. Wood SN, Scheipl F, 2020. Gamm4: generalized additive mixed models using ‘mgcv’ and ‘lme4’. R package. Version 0.2–6. .

Source: PubMed

3
Sottoscrivi