Power and sample size analysis for longitudinal mixed models of health in populations exposed to environmental contaminants: a tutorial

Kylie K Harrall, Keith E Muller, Anne P Starling, Dana Dabelea, Kelsey E Barton, John L Adgate, Deborah H Glueck, Kylie K Harrall, Keith E Muller, Anne P Starling, Dana Dabelea, Kelsey E Barton, John L Adgate, Deborah H Glueck

Abstract

Background: When evaluating the impact of environmental exposures on human health, study designs often include a series of repeated measurements. The goal is to determine whether populations have different trajectories of the environmental exposure over time. Power analyses for longitudinal mixed models require multiple inputs, including clinically significant differences, standard deviations, and correlations of measurements. Further, methods for power analyses of longitudinal mixed models are complex and often challenging for the non-statistician. We discuss methods for extracting clinically relevant inputs from literature, and explain how to conduct a power analysis that appropriately accounts for longitudinal repeated measures. Finally, we provide careful recommendations for describing complex power analyses in a concise and clear manner.

Methods: For longitudinal studies of health outcomes from environmental exposures, we show how to [1] conduct a power analysis that aligns with the planned mixed model data analysis, [2] gather the inputs required for the power analysis, and [3] conduct repeated measures power analysis with a highly-cited, validated, free, point-and-click, web-based, open source software platform which was developed specifically for scientists.

Results: As an example, we describe the power analysis for a proposed study of repeated measures of per- and polyfluoroalkyl substances (PFAS) in human blood. We show how to align data analysis and power analysis plan to account for within-participant correlation across repeated measures. We illustrate how to perform a literature review to find inputs for the power analysis. We emphasize the need to examine the sensitivity of the power values by considering standard deviations and differences in means that are smaller and larger than the speculated, literature-based values. Finally, we provide an example power calculation and a summary checklist for describing power and sample size analysis.

Conclusions: This paper provides a detailed roadmap for conducting and describing power analyses for longitudinal studies of environmental exposures. It provides a template and checklist for those seeking to write power analyses for grant applications.

Keywords: Free software; General linear mixed model; Longitudinal study design; Persistent chemicals; Power analysis; Repeated measurements; Sample size.

Conflict of interest statement

The other authors declare they have no actual or potential competing financial interests.

© 2023. The Author(s).

Figures

Fig. 1
Fig. 1
A power analysis check-list
Fig. 2
Fig. 2
Power for time-by-group interaction, as a function of mean difference [(μ T1,A - μ T3,A) -(μ T1,C- μ T3,C)] and standard deviation of PFHxS. Data are back-transformed for interpretability. The standard deviation shown is 3.02 ng/mL. This mean difference is scaled by factors of 0.5, 1, 1.5, and 2 along the x-axis. The solid line shows how differences in the mean difference may affect power. Additionally, the two dashed lines show how the relationship between the mean difference and power changes when the standard deviation is smaller by half (1.51 ng/mL) or doubled in size (6.04 ng/mL)

References

    1. Bartell SM, Calafat AM, Lyu C, Kato K, Ryan PB, Steenland K. Rate of decline in serum PFOA concentrations after granular activated carbon filtration at two public water systems in Ohio and West Virginia. Environ Health Perspect. 2010;118(2):222–228. doi: 10.1289/ehp.0901252.
    1. Brede E, Wilhelm M, Göen T, Müller J, Rauchfuss K, Kraft M, et al. Two-year follow-up biomonitoring pilot study of residents’ and controls’ PFC plasma levels after PFOA reduction in public water system in Arnsberg. Germany Int J Hyg Environ Health. 2010;213(3):217–223. doi: 10.1016/j.ijheh.2010.03.007.
    1. Worley RR, Moore SM, Tierney BC, Ye X, Calafat AM, Campbell S, et al. Per- and polyfluoroalkyl substances in human serum and urine samples from a residentially exposed community. Environ Int. 2017;106:135–143. doi: 10.1016/j.envint.2017.06.007.
    1. Barton KE, Starling AP, Higgins CP, McDonough CA, Calafat AM, Adgate JL. Sociodemographic and behavioral determinants of serum concentrations of per- and polyfluoroalkyl substances in a community highly exposed to aqueous film-forming foam contaminants in drinking water. Int J Hyg Environ Health 2019 Aug 20 [cited 2019 Nov 1]; Available from: .
    1. Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics. 1982;38(4):963–974. doi: 10.2307/2529876.
    1. Muller KE, Lavange LM, Ramey SL, Ramey CT. Power calculations for general linear multivariate models including repeated measures applications. J Am Stat Assoc. 1992;87(420):1209–1226. doi: 10.1080/01621459.1992.10476281.
    1. Guo Y, Logan HL, Glueck DH, Muller KE. Selecting a sample size for studies with repeated measures. BMC Med Res Methodol. 2013;31(13):100. doi: 10.1186/1471-2288-13-100.
    1. Guo Y, Pandis N. Sample-size calculation for repeated-measures and longitudinal studies. Am J Orthod Dentofac Orthop. 2015;147(1):146–149. doi: 10.1016/j.ajodo.2014.10.009.
    1. Nordgren R. Calculating a sample size for a study with repeated measures. Journal of Molecular and Cellular Cardiology [Internet]. 2019 Mar [cited 2019 Apr 23]; Available from: .
    1. Cheng J, Edwards LJ, Maldonado-Molina MM, Komro KA, Muller KE. Real longitudinal data analysis for real people: building a good enough mixed model. Stat Med. 2010;29(4):504–20. doi: 10.1002/sim.3775.
    1. Muller KE, Benignus VA. Increasing scientific power with statistical power. Neurotoxicol Teratol. 1992;14(3):211–219. doi: 10.1016/0892-0362(92)90019-7.
    1. Chi YY, Glueck DH, Muller KE. Power and sample size for fixed-effects inference in reversible linear mixed models. Am Stat. 2018;15:1–10.
    1. Hedeker D, Gibbons RD, Waternaux C. Sample size estimation for longitudinal designs with attrition: comparing time-related contrasts between two groups. J Educ Behav Stat. 1999;24(1):70–93. doi: 10.3102/10769986024001070.
    1. Tu X, Kowalski J, Zhang J, Lynch K, Crits-Christoph P. Power analyses for longitudinal trials and other clustered designs. - PubMed - NCBI. Stat Med. 2007;23(18):2799–2815. doi: 10.1002/sim.1869.
    1. Murray DM, Blitstein JL, Hannan PJ, Baker WL, Lytle LA. Sizing a trial to alter the trajectory of health behaviours: methods, parameter estimates, and their application. Stat Med. 2007;26(11):2297–2316. doi: 10.1002/sim.2714.
    1. Basagana X, Spiegelman D. Power and sample size calculations for longitudinal studies comparing rates of change with a time-varying exposure. Stat Med. 2010;29(2):181–192. doi: 10.1002/sim.3772.
    1. Basagana X, Liao X, Spiegelman D. Power and sample size calculations for longitudinal studies estimating a main effect of a time-varying exposure. Stat Methods Med Res. 2011;20(5):471–487. doi: 10.1177/0962280210371563.
    1. Wang C, Hall CB, Kim M. A comparison of power analysis methods for evaluating effects of a predictor on slopes in longitudinal designs with missing data. Stat Methods Med Res. 2015;24(6):1009–1029. doi: 10.1177/0962280212437452.
    1. Stroup WW. Power analysis based on spatial effects mixed models: a tool for comparing design and analysis strategies in the presence of spatial variability. JABES. 2002;7(4):491–511. doi: 10.1198/108571102780.
    1. Stroup WW. Generalized linear mixed models: modern concepts, methods and applications. 2013.
    1. Kleinman K, Huang SS. Calculating power by bootstrap, with an application to cluster-randomized trials. EGEMS (Wash DC) [Internet]. 2017 Feb 9 [cited 2019 Apr 24];4(1). Available from: .
    1. Landau S, Stahl D. Sample size and power calculations for medical studies by simulation when closed form expressions are not available. Stat Methods Med Res. 2013;22(3):324–345. doi: 10.1177/0962280212439578.
    1. Lu K. Sample size calculations with multiplicity adjustment for longitudinal clinical trials with missing data. Stat Med. 2012;31(1):19–28. doi: 10.1002/sim.4415.
    1. Siemer M, Joormann J. Power and measures of effect size in analysis of variance with fixed versus random nested factors. Psychol Methods. 2003;8(4):497–517. doi: 10.1037/1082-989X.8.4.497.
    1. Power analysis and sample size software [internet]. Kaysville, Utah: NCSS, LLC; 2019. Available from: .
    1. Johnson JL, Muller KE, Slaughter JC, Gurka MJ, Gribbin MJ, Simpson SL. POWERLIB: SAS/IML software for computing power in multivariate linear models. J Stat Softw 2009 Apr 1 [cited 2019 Jan 25];30(5). Available from: .
    1. Kreidler SM, Muller KE, Grunwald GK, Ringham BM, Coker-Dukowitz ZT, Sakhadeo UR, et al. GLIMMPSE: online power computation for linear models with and without a baseline covariate. J Stat Softw. 2013;54(10):1–26.
    1. Kreidler SM. GLIMMPSE software 3.0: Free sample size tools for multilevel and longitudinal data. [Internet]. [cited 2022 Aug 12]. Available from: .
    1. Kairalla JA, Coffey CS, Thomann MA, Muller KE. Adaptive trial designs: a review of barriers and opportunities. Trials. 2012;23(13):145. doi: 10.1186/1745-6215-13-145.
    1. Taylor DJ, Muller KE. Computing confidence bounds for power and sample size of the general linear univariate model. Am Stat. 1995;49(1):43–47.
    1. Taylor DJ, Muller KE. Bias in linear model power and sample size calculation due to estimating noncentrality. Commun Stat Theory Methods [Internet]. 1996 [cited 2019 Sep 16];25(7). Available from: .
    1. Anderson SF, Kelley K, Maxwell SE. Sample-size planning for more accurate statistical power: a method adjusting sample effect sizes for publication bias and uncertainty. Psychol Sci. 2017;28:1547–1562. doi: 10.1177/0956797617723724.
    1. Gurka MJ, Edwards L. J, Muller KE. Avoiding bias in mixed model inference for fixed effects. Stat Med 2011 Sep ;30(22):2696–707.
    1. Simpson SL, Edwards LJ, Muller KE, Sen PK, Styner MA. A linear exponent AR(1) family of correlation structures. Stat Med. 2010;29(17):1825–1838. doi: 10.1002/sim.3928.
    1. Littell RC, Pendergast J, Natarajan R. Modelling covariance structure in the analysis of repeated measures data. Stat Med. 2000;19(13):1793–1819. doi: 10.1002/1097-0258(20000715)19:13<1793::AID-SIM482>;2-Q.
    1. Hemming K, Kasza J, Hooper R, Forbes A, Taljaard M. A tutorial on sample size calculation for multiple-period cluster randomized parallel, cross-over and stepped-wedge trials using the shiny CRT calculator. Int J Epidemiol. 2020;49(3):979–995. doi: 10.1093/ije/dyz237.
    1. Ouyang Y, Li F, Preisser JS, Taljaard M. Sample size calculators for planning stepped-wedge cluster randomized trials: a review and comparison. Int J Epidemiol. 2022;51(6):2000–2013. 10.1093/ije/dyac123.
    1. Gustavson K, von Soest T, Karevold E, Røysamb E. Attrition and generalizability in longitudinal studies: findings from a 15-year population-based study and a Monte Carlo simulation study. BMC Public Health. 2012;29(12):918. doi: 10.1186/1471-2458-12-918.
    1. Little RJA, Rubin DB. Statistical analysis with missing data. 2nd ed: Wiley-Interscience; 2002. p. 408.
    1. Gawande A. Checklist Manifesto. 1. New York, NY: Picador Paper; 2011. p. 240.
    1. Collins FS, Tabak LA. Policy: NIH plans to enhance reproducibility. Nature. 2014;505(7485):612–613. doi: 10.1038/505612a.
    1. Mueller R, Yingling V. History and use of per-and polyfluoroalkyl substances (PFAS). Interstate Technology & Regulatory Council [Internet]. 2017 [cited 2020 Apr 29]; Available from: .
    1. Persistent organic pollutants review committee. The 16 new persistent organic chemicals under the Stockholm Convention. [Internet]. Secretariat of the Stockholm Convention; 2017 Jun. Available from: .
    1. Prevedouros K, Cousins IT, Buck RC, Korzeniowski SH. Sources, fate and transport of perfluorocarboxylates. Environ Sci Technol. 2006;40(1):32–44. doi: 10.1021/es0512475.
    1. Domingo JL, Nadal M. Per- and polyfluoroalkyl substances (PFASs) in food and human dietary intake: a review of the recent scientific literature. J Agric Food Chem. 2017;65(3):533–543. doi: 10.1021/acs.jafc.6b04683.
    1. Gyllenhammar I, Berger U, Sundström M, McCleaf P, Eurén K, Eriksson S, et al. Influence of contaminated drinking water on perfluoroalkyl acid levels in human serum – a case study from Uppsala. Sweden Environmental Research. 2015;1(140):673–683. doi: 10.1016/j.envres.2015.05.019.
    1. Haug LS, Thomsen C, Brantsæter AL, Kvalem HE, Haugen M, Becher G, et al. Diet and particularly seafood are major sources of perfluorinated compounds in humans. Environ Int. 2010;36(7):772–778. doi: 10.1016/j.envint.2010.05.016.
    1. Hu XC, Dassuncao C, Zhang X, Grandjean P, Weihe P, Webster GM, et al. Can profiles of poly- and Perfluoroalkyl substances (PFASs) in human serum provide information on major exposure sources? Environ Health [Internet]. 2018 Feb 1 [cited 2019 Sep 18];17. Available from: .
    1. Kaboré HA, Vo Duy S, Munoz G, Méité L, Desrosiers M, Liu J, et al. Worldwide drinking water occurrence and levels of newly-identified perfluoroalkyl and polyfluoroalkyl substances. Sci Total Environ. 2018;1(616–617):1089–1100. doi: 10.1016/j.scitotenv.2017.10.210.
    1. USEPA (United States Environmental Protection Agency). Drinking water health advisories for PFOA and PFOS. [Internet]. 2016. Available from: .
    1. EPA (Environmental Protection, Agency). Lifetime Drinking Water Health Advisories for Four Perfluoroalkyl Substances [Internet]. Federal Register. Sect. Vol. 87 Jun 17, 2022 p. 36848–9. Available from: .
    1. ATSDR . Agency for Toxic Substances and Disease Registry. Toxicological profile for Perfluoroalkyls. (draft for public comment) P.H.S. U.S. Department of Health and Human Services, Editor: Atlanta, GA; 2018.
    1. CDPHE (Colorado Department of Public Health and Environment). Chemicals from firefighting foam and other sources [Internet]. Department of Public Health and Environment. 2016 [cited 2019 Sep 18]. Available from: .
    1. El Paso County Health Department. Air Force PFOS/PFOA snapshot Peterson AFB [Internet]. 2017. Available from: (IST).pdf.
    1. Finley, B., 2016 September 23. Drinking water in three Colorado cities contaminated with toxic chemicals above EPA limits. Denver Post Retrieved December 26, 2018, Available from: .
    1. Olsen GW, Burris JM, Ehresman DJ, Froehlich JW, Seacat AM, Butenhoff JL, et al. Half-life of serum elimination of perfluorooctanesulfonate, perfluorohexanesulfonate, and perfluorooctanoate in retired fluorochemical production workers. Environ Health Perspect. 2007;115(9):1298–1305. doi: 10.1289/ehp.10009.
    1. Li Y, Fletcher T, Mucs D, Scott K, Lindh CH, Tallving P, et al. Half-lives of PFOS, PFHxS and PFOA after end of exposure to contaminated drinking water. Occup Environ Med. 2018;75(1):46–51. doi: 10.1136/oemed-2017-104651.
    1. Kato K, Basden BJ, Needham LL, Calafat AM. Improved selectivity for the analysis of maternal serum and cord serum for polyfluoroalkyl chemicals. J Chromatogr A. 2011;1218(15):2133–2137. doi: 10.1016/j.chroma.2010.10.051.
    1. Thompson J, Lorber M, Toms LML, Kato K, Calafat AM, Mueller JF. Use of simple pharmacokinetic modeling to characterize exposure of Australians to perfluorooctanoic acid and perfluorooctane sulfonic acid. Environ Int. 2010;36(4):390–397. doi: 10.1016/j.envint.2010.02.008.
    1. Kenward MG, Roger JH. Small sample inference for fixed effects from restricted maximum likelihood. Biometrics. 1997;53(3):983–997. doi: 10.2307/2533558.
    1. Kenward MG, Roger JH. An improved approximation to the precision of fixed effects from restricted maximum likelihood. Computational Statistics & Data Analysis. 2009;53(7):2583–2595. doi: 10.1016/j.csda.2008.12.013.
    1. Marshall WA, Tanner JM. Growth and physiological development during adolescence. Annu Rev Med. 1968;19(1):283–300. doi: 10.1146/annurev.me.19.020168.001435.
    1. Hockett CW, Bedrick EJ, Zeitler P, Crume TL, Daniels S, Dabelea D. Exposure to diabetes in utero is associated with earlier pubertal timing and faster pubertal growth in the offspring: the EPOCH study. J Pediatr. 2019;1(206):105–112. doi: 10.1016/j.jpeds.2018.10.053.
    1. Fu J, Gao Y, Cui L, Wang T, Liang Y, Qu G, et al. Occurrence, temporal trends, and half-lives of perfluoroalkyl acids (PFAAs) in occupational workers in China. Sci Rep [Internet]. 2016 Dec 1 [cited 2019 Mar 12];6. Available from: .
    1. Seals R, Bartell SM, Steenland K. Accumulation and clearance of perfluorooctanoic acid (PFOA) in current and former residents of an exposed community. Environ Health Perspect. 2011;119(1):119–124. doi: 10.1289/ehp.1002346.
    1. United States Environmental Protection Agency Office of Water. Health Effects Support Document for Perfluorooctane Sulfonate (PFOS). Vol EPA 822-R-16-002. 2016:245. .
    1. United States Environmental Protection Agency Office of Water. Health Effects Support Document for Perfluorooctanoic Acid (PFOA). Vol EPA 822-R-16-003.; 2016:322. .
    1. CDC (Centers for Disease Control and Prevention). 2 to 20 years: Boys Stature Weight-for-age percentiles [Internet]. CDC (Centers for Diseae Control and Prevention); 2000 Nov. Available from: .
    1. CDC (Centers for Disease Control and Prevention). 2 to 20 years: girls stature weight-for-age percentiles [internet]. CDC; 2000 Nov. Available from: .

Source: PubMed

3
Subscribe