Multicollinear physical activity accelerometry data and associations to cardiometabolic health: challenges, pitfalls, and potential solutions

Eivind Aadland, Olav Martin Kvalheim, Sigmund Alfred Anderssen, Geir Kåre Resaland, Lars Bo Andersen, Eivind Aadland, Olav Martin Kvalheim, Sigmund Alfred Anderssen, Geir Kåre Resaland, Lars Bo Andersen

Abstract

Background: The analysis of associations between accelerometer-derived physical activity (PA) intensities and cardiometabolic health is a major challenge due to multicollinearity between the explanatory variables. This challenge has facilitated the application of different analytic approaches within the field. The aim of the present study was to compare association patterns of PA intensities with cardiometabolic health in children obtained from multiple linear regression, compositional data analysis, and multivariate pattern analysis.

Methods: A sample of 841 children (age 10.2 ± 0.3 years; BMI 18.0 ± 3.0; 50% boys) provided valid accelerometry and cardiometabolic health data. Accelerometry (ActiGraph GT3X+) data were characterized into traditional (four PA intensity variables) and more detailed categories (23 PA intensity variables covering the intensity spectrum; 0-99 to ≥10,000 counts per minute). Several indices of cardiometabolic health were used to create a composite cardiometabolic health score. Multiple linear regression and multivariate pattern analyses were used to analyze both raw and compositional data.

Results: Besides a consistent negative (favorable) association between vigorous PA and the cardiometabolic health measure using the traditional description of PA data, associations between PA intensities and cardiometabolic health differed substantially depending on the analytic approaches used. Multiple linear regression lead to instable and spurious associations, while compositional data analysis showed distorted association patterns. Multivariate pattern analysis appeared to handle the raw PA data correctly, leading to more plausible interpretations of the associations between PA intensities and cardiometabolic health.

Conclusions: Future studies should consider multivariate pattern analysis without any transformation of PA data when examining relationships between PA intensity patterns and health outcomes.

Trial registration: The study was registered in Clinicaltrials.gov 7th of April 2014 with identification number NCT02132494 .

Keywords: Accelerometer; Children; Compositional data analysis; Intensity; Multicollinearity; Multiple linear regression; Multivariate pattern analysis; Statistics.

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Bivariate correlations (95% confidence intervals) between physical activity intensities and a composite cardiometabolic health score using the traditional description of four physical activity variables. Raw data (upper panel), compositional data (lower panel)
Fig. 2
Fig. 2
Association patterns between physical activity intensities and a composite cardiometabolic health score using the traditional description of four physical activity variables using different analytic approaches. Multiple linear regression with raw data (upper left panel), multiple linear regression with compositional data using the ilr-transformation (lower left panel), multivariate pattern analysis with raw data (upper right panel), and multivariate pattern analysis with compositional data using the clr-transformation (lower right panel). Selectivity ratio is calculated as the ratio of explained to total variance on the predictive (target projected) component. R2 = explained variance of the model
Fig. 3
Fig. 3
Bivariate correlations (95% confidence intervals) between physical activity intensities and a composite cardiometabolic health score using the spectrum description of 23 physical activity variables. Raw data (upper panel), compositional data (lower panel)
Fig. 4
Fig. 4
Association patterns between physical activity intensities and a composite cardiometabolic health score using the spectrum description of 23 physical activity variables using different analytic approaches. Multiple linear regression with raw data (upper left panel), multiple linear regression with compositional data using the ilr-transformation (lower left panel), multivariate pattern analysis with raw data (upper right panel), and multivariate pattern analysis with compositional data using the clr-transformation (lower right panel). Selectivity ratio is calculated as the ratio of explained to total variance on the predictive (target projected) component. R2 = explained variance of the model

References

    1. Ekelund U, Luan JA, Sherar LB, Esliger DW, Griew P, Cooper A, et al. Moderate to vigorous physical activity and sedentary time and Cardiometabolic risk factors in children and adolescents. JAMA. 2012;307(7):704–712. doi: 10.1001/jama.2012.156.
    1. Andersen LB, Harro M, Sardinha LB, Froberg K, Ekelund U, Brage S, et al. Physical activity and clustered cardiovascular risk in children: a cross-sectional study (the European youth heart study) Lancet. 2006;368(9532):299–304. doi: 10.1016/S0140-6736(06)69075-2.
    1. Janssen Ian, LeBlanc Allana G. Systematic review of the health benefits of physical activity and fitness in school-aged children and youth. International Journal of Behavioral Nutrition and Physical Activity. 2010;7(1):40. doi: 10.1186/1479-5868-7-40.
    1. Poitras VJ, Gray CE, Borghese MM, Carson V, Chaput JP, Janssen I, et al. Systematic review of the relationships between objectively measured physical activity and health indicators in school-aged children and youth. Appl Physiol Nutr Metab. 2016;41(6):S197–S239. doi: 10.1139/apnm-2015-0663.
    1. van der Ploeg HP, Hillsdon M. Is sedentary behaviour just physical inactivity by another name? Int J Behav Nutr Phys Act. 2017;14:8. doi: 10.1186/s12966-017-0601-0.
    1. Aadland E, Kvalheim OM, Anderssen SA, Resaland GK, Andersen LB. The multivariate physical activity signature associated with metabolic health in children. Int J Behav Nutr Phys Act. 2018;15:77. doi: 10.1186/s12966-018-0707-z.
    1. Pedisic Z. Measurement issues and poor adjustments for physical activity and sleep undermine sedentary behaviour research - the focus should shift to the balance between sleep, sedentary behaviour, standing and activity. Kinesiology. 2014;46(1):135–146.
    1. Cohen J, Cohen P, West SG, Aiken LS. Applied multiple regression/correlation analysis for the bahavioral sciences. 3. New York: Routledge; 2003.
    1. Saunders TJ, Gray CE, Poitras VJ, Chaput JP, Janssen I, Katzmarzyk PT, et al. Combinations of physical activity, sedentary behaviour and sleep: relationships with health indicators in school-aged children and youth. Appl Physiol Nutr Metab. 2016;41(6):S283–SS93. doi: 10.1139/apnm-2015-0626.
    1. Mekary RA, Willett WC, Hu FB, Ding EL. Isotemporal substitution paradigm for physical activity epidemiology and weight change. Am J Epidemiol. 2009;170(4):519–527. doi: 10.1093/aje/kwp163.
    1. Hansen BH, Anderssen SA, Andersen LB, Hildebrand M, Kolle E, Steene-Johannessen J, et al. Cross-sectional associations of reallocating time between sedentary and active Behaviours on Cardiometabolic risk factors in young people: an international Children’s Accelerometry database (ICAD) analysis. Sports Med. 2018;48(10):2401–2412. doi: 10.1007/s40279-018-0909-1.
    1. Chastin Sebastien F. M., Palarea-Albaladejo Javier, Dontje Manon L., Skelton Dawn A. Combined Effects of Time Spent in Physical Activity, Sedentary Behaviors and Sleep on Obesity and Cardio-Metabolic Health Markers: A Novel Compositional Data Analysis Approach. PLOS ONE. 2015;10(10):e0139984. doi: 10.1371/journal.pone.0139984.
    1. Dumuid D, Stanford TE, Martin-Fernandez JA, Pedisic Z, Maher CA, Lewis LK, et al. Compositional data analysis for physical activity, sedentary time and sleep research. Stat Methods Med Res. 2018;27(12):3726–3738. doi: 10.1177/0962280217710835.
    1. Aadland E, Andersen LB, Anderssen SA, Resaland GK, Kvalheim OM. Associations of volumes and patterns of physical activity with metabolic health in children: a multivariate pattern analysis approach. Prev Med. 2018;115:12–18. doi: 10.1016/j.ypmed.2018.08.001.
    1. Aitchison J. The statistial analysis of compositional data. J Royal Stat Soc. 1982;44(2):139–177.
    1. Hron K, Filzmoser P, Thompson K. Linear regression with compositional explanatory variables. J Appl Stat. 2012;39(5):1115–1128. doi: 10.1080/02664763.2011.644268.
    1. Skala W. A mathematical model to investigate distortions of correlation coefficients in closed arrays. Math Geol. 1977;9(5):519–528. doi: 10.1007/BF02100963.
    1. Skala W. Some effects of the constant-sum problem in geochemistry. Chem Geol. 1979;27:1–9. doi: 10.1016/0009-2541(79)90099-8.
    1. Kvalheim OM, Brakstad F, Liang Y. Preprocessing of analytical profiles in the presence of homoscedastic or heteroscedastic noise. Anal Chem. 1994;66:43–51. doi: 10.1021/ac00073a010.
    1. Rajalahti T, Kvalheim OM. Multivariate data analysis in pharmaceutics: a tutorial review. Int J Pharm. 2011;417(1–2):280–290. doi: 10.1016/j.ijpharm.2011.02.019.
    1. Madsen R, Lundstedt T, Trygg J. Chemometrics in metabolomics - a review in human disease diagnosis. Anal Chim Acta. 2010;659(1–2):23–33. doi: 10.1016/j.aca.2009.11.042.
    1. Rajalahti T, Kroksveen AC, Arneberg R, Berven FS, Vedeler CA, Myhr K-M, et al. A multivariate approach to reveal biomarker signatures for disease classification: application to mass spectral profiles of cerebrospinal fluid from patients with multiple sclerosis. J Proteome Res. 2010;9(7):3608–3620. doi: 10.1021/pr100142m.
    1. Wold S, Ruhe A, Wold H, Dunn WJ. The collinearity problem in linear-regression - the partial least-squares (PLS) approach to generalized inverses. SIAM J Sci Comput. 1984;5(3):735–743. doi: 10.1137/0905052.
    1. Resaland GK, Moe VF, Aadland E, Steene-Johannessen J, Glosvik Ø, Andersen JR, et al. Active smarter kids (ASK): rationale and design of a cluster-randomized controlled trial investigating the effects of daily physical activity on children's academic performance and risk factors for non-communicable diseases. BMC Public Health. 2015;15:709. doi: 10.1186/s12889-015-2049-y.
    1. Resaland GK, Aadland E, Moe VF, Aadland KN, Skrede T, Stavnsbo M, et al. Effects of physical activity on schoolchildren's academic performance: the active smarter kids (ASK) cluster-randomized controlled trial. Prev Med. 2016;91:322–328. doi: 10.1016/j.ypmed.2016.09.005.
    1. John D, Freedson P. ActiGraph and Actical physical activity monitors: a peek under the hood. Med Sci Sports Exerc. 2012;44(1 Suppl 1):S86–SS9. doi: 10.1249/MSS.0b013e3182399f5e.
    1. Froberg A, Berg C, Larsson C, Boldemann C, Raustorp A. Combinations of epoch durations and cut-points to estimate sedentary time and physical activity among adolescents. Meas Phys Educ Exerc Sci. 2017;21(3):154–160. doi: 10.1080/1091367x.2017.1309657.
    1. Aadland E, Andersen LB, Anderssen SA, Resaland GK. A comparison of 10 accelerometer non-wear time criteria and logbooks in children. BMC Public Health. 2018;18:9. doi: 10.1186/s12889-018-5212-4.
    1. Aadland Eivind, Andersen Lars Bo, Skrede Turid, Ekelund Ulf, Anderssen Sigmund Alfred, Resaland Geir Kåre. Reproducibility of objectively measured physical activity and sedentary time over two seasons in children; Comparing a day-by-day and a week-by-week approach. PLOS ONE. 2017;12(12):e0189304. doi: 10.1371/journal.pone.0189304.
    1. Evenson KR, Catellier DJ, Gill K, Ondrak KS, McMurray RG. Calibration of two objective measures of physical activity for children. J Sports Sci. 2008;26(14):1557–1565. doi: 10.1080/02640410802334196.
    1. Trost SG, Loprinzi PD, Moore R, Pfeiffer KA. Comparison of accelerometer cut points for predicting activity intensity in youth. Med Sci Sports Exerc. 2011;43(7):1360–1368. doi: 10.1249/MSS.0b013e318206476e.
    1. Aadland E, Terum T, Mamen A, Andersen LB, Resaland GK. The Andersen aerobic fitness test: reliability and validity in 10-year-old children. PLoS One. 2014;9(10):e110492. doi: 10.1371/journal.pone.0110492.
    1. Friedewald WT, Levy RI, Fredrickson DS. Estimation of the concentration of low-density lipoprotein cholesterol in plasma, without use of the preparative ultracentrifuge. Clin Chem. 1972;18:499–502.
    1. Matthews DR, Hosker JP, Rudenski AS, Naylor BA, Treacher DF, Turner RC. Homeostasis model assessment: insulin resistance and β-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia. 1985;28(7):412–419. doi: 10.1007/bf00280883.
    1. Kvalheim Olav Martin, Arneberg Reidar, Grung Bjørn, Rajalahti Tarja. Determination of optimum number of components in partial least squares regression from distributions of the root-mean-squared error obtained by Monte Carlo resampling. Journal of Chemometrics. 2018;32(4):e2993. doi: 10.1002/cem.2993.
    1. Kvalheim OM, Karstang TV. Interpretation of latent-variable regression-models. Chemometr Intell Lab Syst. 1989;7(1–2):39–51. doi: 10.1016/0169-7439(89)80110-8.
    1. Rajalahti T, Arneberg R, Berven FS, Myhr KM, Ulvik RJ, Kvalheim OM. Biomarker discovery in mass spectral profiles by means of selectivity ratio plot. Chemometr Intell Lab Syst. 2009;95(1):35–48. doi: 10.1016/j.chemolab.2008.08.004.
    1. Rajalahti T, Arneberg R, Kroksveen AC, Berle M, Myhr KM, Kvalheim OM. Discriminating variable test and selectivity ratio plot: quantitative tools for interpretation and variable (biomarker) selection in complex spectral or chromatographic profiles. Anal Chem. 2009;81(7):2581–2590. doi: 10.1021/ac802514y.
    1. Howard B, Winkler EAH, Sethi P, Carson V, Ridgers ND, Salmon J, et al. Associations of low- and high-intensity light activity with Cardiometabolic biomarkers. Med Sci Sports Exerc. 2015;47(10):2093–2101. doi: 10.1249/mss.0000000000000631.
    1. Cliff DP, Hesketh KD, Vella SA, Hinkley T, Tsiros MD, Ridgers ND, et al. Objectively measured sedentary behaviour and health and development in children and adolescents: systematic review and meta-analysis. Obes Rev. 2016;17(4):330–344. doi: 10.1111/obr.12371.

Source: PubMed

Подписаться