Development and Replication of Objective Measurements of Social Visual Engagement to Aid in Early Diagnosis and Assessment of Autism

Warren Jones, Cheryl Klaiman, Shana Richardson, Meena Lambha, Morganne Reid, Taralee Hamner, Chloe Beacham, Peter Lewis, Jose Paredes, Laura Edwards, Natasha Marrus, John N Constantino, Sarah Shultz, Ami Klin, Warren Jones, Cheryl Klaiman, Shana Richardson, Meena Lambha, Morganne Reid, Taralee Hamner, Chloe Beacham, Peter Lewis, Jose Paredes, Laura Edwards, Natasha Marrus, John N Constantino, Sarah Shultz, Ami Klin

Abstract

Importance: Autism spectrum disorder is a common and early-emerging neurodevelopmental condition. While 80% of parents report having had concerns for their child's development before age 2 years, many children are not diagnosed until ages 4 to 5 years or later.

Objective: To develop an objective performance-based tool to aid in early diagnosis and assessment of autism in children younger than 3 years.

Design, setting, and participants: In 2 prospective, consecutively enrolled, broad-spectrum, double-blind studies, we developed an objective eye-tracking-based index test for children aged 16 to 30 months, compared its performance with best-practice reference standard diagnosis of autism (discovery study), and then replicated findings in an independent sample (replication study). Discovery and replication studies were conducted in specialty centers for autism diagnosis and treatment. Reference standard diagnoses were made using best-practice standardized protocols by specialists blind to eye-tracking results. Eye-tracking tests were administered by staff blind to clinical results. Children were enrolled from April 27, 2013, until September 26, 2017. Data were analyzed from March 28, 2018, to January 3, 2019.

Main outcomes and measures: Prespecified primary end points were the sensitivity and specificity of the eye-tracking-based index test compared with the reference standard. Prespecified secondary end points measured convergent validity between eye-tracking-based indices and reference standard assessments of social disability, verbal ability, and nonverbal ability.

Results: Data were collected from 1089 children: 719 children (mean [SD] age, 22.4 [3.6] months) in the discovery study, and 370 children (mean [SD] age, 25.4 [6.0] months) in the replication study. In discovery, 224 (31.2%) were female and 495 (68.8%) male; in replication, 120 (32.4%) were female and 250 (67.6%) male. Based on reference standard expert clinical diagnosis, there were 386 participants (53.7%) with nonautism diagnoses and 333 (46.3%) with autism diagnoses in discovery, and 184 participants (49.7%) with nonautism diagnoses and 186 (50.3%) with autism diagnoses in replication. In the discovery study, the area under the receiver operating characteristic curve was 0.90 (95% CI, 0.88-0.92), sensitivity was 81.9% (95% CI, 77.3%-85.7%), and specificity was 89.9% (95% CI, 86.4%-92.5%). In the replication study, the area under the receiver operating characteristic curve was 0.89 (95% CI, 0.86-0.93), sensitivity was 80.6% (95% CI, 74.1%-85.7%), and specificity was 82.3% (95% CI, 76.1%-87.2%). Eye-tracking test results correlated with expert clinical assessments of children's individual levels of ability, explaining 68.6% (95% CI, 58.3%-78.6%), 63.4% (95% CI, 47.9%-79.2%), and 49.0% (95% CI, 33.8%-65.4%) of variance in reference standard assessments of social disability, verbal ability, and nonverbal cognitive ability, respectively.

Conclusions and relevance: In two diagnostic studies of children younger than 3 years, objective eye-tracking-based measurements of social visual engagement quantified diagnostic status as well as individual levels of social disability, verbal ability, and nonverbal ability in autism. These findings suggest that objective measurements of social visual engagement can be used to aid in autism diagnosis and assessment.

Conflict of interest statement

Conflict of Interest Disclosures: Dr Jones reported receiving grants from the Georgia Research Alliance, the Joseph B. Whitehead Foundation, the Marcus Foundation, and the National Institute of Mental Health (NIMH) during the conduct of the study (all via his institution); being a scientific cofounder of and owning equity in EarliTec Diagnostics; holding patents licensed to EarliTec Diagnostics; and receiving personal fees for consulting from EarliTec Diagnostics and a lecture honorarium from Washington University in St Louis School of Medicine outside the submitted work. Dr Klaiman reported receiving grants from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD), the Georgia Research Alliance, the Joseph B. Whitehead Foundation, the Marcus Foundation, and the NIMH (all via her institution) during the conduct of the study and personal fees from ABA Centers of America, Dekalb County School District, Cherokee County School District, Fulton County School District, Beaming Health, and EarliTec Diagnostics outside the submitted work. Dr Richardson reported receiving funding and/or equipment from the Georgia Research Alliance, the John B. Whitehead Foundation, and the Marcus Foundation (all via her institution) during the conduct of the study. Dr Lambha reported receiving funding and/or equipment from the Georgia Research Alliance, the Joseph B. Whitehead Foundation, the Marcus Foundation, the NICHD, and the NIMH (all via her institution) during the conduct of the study. Dr Beacham reported receiving funding and/or equipment from the John B. Whitehead Foundation and the Marcus Foundation during the conduct of the study. Mr Lewis reported receiving grants from the Georgia Research Alliance, the Joseph B. Whitehead Foundation, and the Marcus Foundation (all via his institution) during the conduct of the study; being a scientific cofounder of and owning equity in EarliTec Diagnostics; and holding patents licensed to EarliTec Diagnostics outside the submitted work. Mr Paredes reported receiving funding and/or equipment from the Joseph B. Whitehead Foundation and the Marcus Foundation (both via his institution) during the conduct of the study. Dr Marrus reported receiving grants from the NIMH and from Washington University in St Louis School of Medicine during the conduct of the study. Dr Constantino reported receiving grants from the NICHD during the conduct of the study and royalties from Western Psychological Services outside the submitted work. Dr Klin reported receiving grants from the Georgia Research Alliance, the Joseph B. Whitehead Foundation, the Marcus Foundation, the NICHD, and the NIMH (all via his institution) during the conduct of the study; being a scientific cofounder of and owning equity in EarliTec Diagnostics; holding patents licensed to EarliTec Diagnostics; and receiving personal fees from the Alliance for Early Success, the Brazilian Society of Speech Therapy, EarliTec Diagnostics, the McKnight Endowment Fund for Neuroscience, the National Autism Conference, and Washington University in St Louis School of Medicine outside the submitted work. No other disclosures were reported.

Figures

Figure 1.. Participant Enrollment and Outcomes for…
Figure 1.. Participant Enrollment and Outcomes for Comparing Objective Measurements of Social Visual Engagement With Expert Clinical Diagnosis of Autism in Discovery and Replication Studies
A, Participant flow for the discovery study. B, Participant flow for the replication study. In both studies, during a single visit at the testing site, enrolled participants received expert clinical diagnosis using standardized assessments (reference standard diagnosis) as well as eye-tracking–based measurement of social visual engagement (index test). Index test quality control indicator (QCIs) failures occurred when participants’ data failed to meet automated preset data QCIs (additional details are available in eTables 2 and 3 in Supplement 1).
Figure 2.. Diagnostic Performance Comparing Eye-Tracking–Based Measurement…
Figure 2.. Diagnostic Performance Comparing Eye-Tracking–Based Measurement of Social Visual Engagement (Index Test) With Expert Clinical Diagnosis of Autism (Reference Standard)
Performance among 711 children in the discovery study and 361 children in the replication study. A, The diamond represents the optimal test positivity threshold for the discovery study (Youden index). B, The test positivity threshold determined in the discovery study was fixed and applied independently in the replication study. The diamond represents the achieved sensitivity and specificity in the replication study using the test positivity threshold from the discovery study. The solid blue circle represents the post hoc theoretical optimal threshold. C, Tabulation corresponds to the diamond in panel A. D, Tabulation corresponds to the diamond in panel B. AUC indicates area under the curve; ROC, receiver operating characteristic. Negative predictive value (NPV) and positive predictive value (PPV) estimates reported here are calculated based on study sample prevalence.
Figure 3.. Convergent Validity Between Eye-Tracking–Based Measurement…
Figure 3.. Convergent Validity Between Eye-Tracking–Based Measurement of Social Visual Engagement (Index Test) and Expert Clinician–Administered Standardized Assessments of Social Disability, Verbal Ability, and Nonverbal Cognitive Ability
A, Discovery study correlation between eye-tracking–based indices of social disability versus children’s total scores on the Autism Diagnostic Observation Schedule, second edition (ADOS-2). B, Discovery study correlation between eye-tracking–based indices of verbal ability versus children’s verbal age equivalent scores as measured by the Mullen Scales of Early Learning (Mullen). C, Discovery study correlation between eye-tracking–based indices of nonverbal cognitive ability versus children’s nonverbal age equivalent scores as measured by the Mullen. D, E, F, Replication study correlations between eye-tracking–based indices and reference standard assessments. In all scatterplots, circles represent individual data and diamonds represent regression outliers (bivariate outliers identified using Cook distance and difference-in-fits regression diagnostic assessment). The adjusted R2 values were adjusted for measurement error variance of the reference standard (yielding percentage of reference standard nonerror variance explained by the index test). Additional information is provided in the Secondary End Point Analyses subsection of the eMethods in Supplement 1.
Figure 4.. Performance-Based Measures of Children’s Individual…
Figure 4.. Performance-Based Measures of Children’s Individual Strengths, Vulnerabilities, and Opportunities for Skill Development
Measurement of social visual engagement quantifies how a child engages with social and nonsocial cues occurring continuously within naturalistic environmental contexts (left column, shown as still frames from testing videos). In relation to those contexts, normative reference measures provide objective quantification of nonautism age-expected visual engagement (middle columns, shown as density distributions in both pseudocolor format and as color to grayscale fades overlaid on corresponding still frames). The age-expected reference measures can be used to measure and visualize patient comparisons, revealing individual strengths, vulnerabilities, and opportunities for skill building (right columns, sample patient data shown as overlaid circular apertures that encompass the portion of video foveated by each patient [each aperture spans the central 5.2 degrees of a patient’s visual field]). Children with autism present as engaging with toys of interest (1, 3, 5, and 7), color and contrast cues (2, 6, 8, and 9), and objects and background elements not directly relevant to social context (4 and 10-14). Elapsed times at the bottom right of still frames highlight the rapidly changing nature of social interaction in which many hundreds of verbal and nonverbal communicative cues are presented, each eliciting age-expected patterns of engagement and offering corresponding opportunities for objective quantitative comparisons of patient behavior.

References

    1. Maenner MJ, Warren Z, Williams AR, et al. . Prevalence and characteristics of autism spectrum disorder among children aged 8 years—Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2020. MMWR Surveill Summ. 2023;72(2):1-14. doi:10.15585/mmwr.ss7202a1
    1. Chawarska K, Paul R, Klin A, Hannigen S, Dichtel LE, Volkmar F. Parental recognition of developmental problems in toddlers with autism spectrum disorders. J Autism Dev Disord. 2007;37(1):62-72. doi:10.1007/s10803-006-0330-8
    1. Young RL, Brewer N, Pattison C. Parental identification of early behavioural abnormalities in children with autistic disorder. Autism. 2003;7(2):125-143. doi:10.1177/1362361303007002002
    1. Wetherby AM, Brosnan-Maddox S, Peace V, Newton L. Validation of the Infant-Toddler Checklist as a broadband screener for autism spectrum disorders from 9 to 24 months of age. Autism. 2008;12(5):487-511. doi:10.1177/1362361308094501
    1. Christensen DL, Baio J, Van Naarden Braun K, et al. ; Centers for Disease Control and Prevention . Prevalence and characteristics of autism spectrum disorder among children aged 8 years—Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2012. MMWR Surveill Summ. 2016;65(3):1-23. doi:10.15585/mmwr.ss6503a1
    1. Shaw KA, McArthur D, Hughes MM, et al. . Progress and disparities in early identification of autism spectrum disorder: Autism and Developmental Disabilities Monitoring Network, 2002-2016. J Am Acad Child Adolesc Psychiatry. 2022;61(7):905-914. doi:10.1016/j.jaac.2021.11.019
    1. Mandell DS, Novak MM, Zubritsky CD. Factors associated with age of diagnosis among children with autism spectrum disorders. Pediatrics. 2005;116(6):1480-1486. doi:10.1542/peds.2005-0185
    1. Mandell DS, Wiggins LD, Carpenter LA, et al. . Racial/ethnic disparities in the identification of children with autism spectrum disorders. Am J Public Health. 2009;99(3):493-498. doi:10.2105/AJPH.2007.131243
    1. Constantino JN, Abbacchi AM, Saulnier C, et al. . Timing of the diagnosis of autism in African American children. Pediatrics. 2020;146(3):e20193629. doi:10.1542/peds.2019-3629
    1. Zwaigenbaum L, Warren Z. Commentary: embracing innovation is necessary to improve assessment and care for individuals with ASD: a reflection on Kanne and Bishop (2020). J Child Psychol Psychiatry. 2021;62(2):143-145. doi:10.1111/jcpp.13271
    1. MacLachlan M. Commentary: challenges and opportunities in autism assessment—a commentary on Kanne and Bishop (2020). J Child Psychol Psychiatry. 2021;62(2):146-148. doi:10.1111/jcpp.13360
    1. American Psychiatric Association . Diagnostic and Statistical Manual of Mental Disorders. 5th ed. American Psychiatric Association; 2013.
    1. Falkmer T, Anderson K, Falkmer M, Horlin C. Diagnostic procedures in autism spectrum disorders: a systematic literature review. Eur Child Adolesc Psychiatry. 2013;22(6):329-340. doi:10.1007/s00787-013-0375-0
    1. Lord C, Rutter M, DiLavore PC, Risi S, Gotham K, Bishop S. Autism Diagnostic Observation Schedule. 2nd ed. Western Psychological Services; 2012.
    1. Rutter M, Le Couteur A, Lord C. ADI-R: Autism Diagnostic Interview—Revised (ADI-R). Western Psychological Services; 2003.
    1. Hyman SL, Levy SE, Myers SM; Council on Children With Disabilities, Section on Developmental and Behavioral Pediatrics . Identification, evaluation, and management of children with autism spectrum disorder. Pediatrics. 2020;145(1):e20193447. doi:10.1542/peds.2019-3447
    1. Kanne SM, Bishop SL. Editorial perspective: the autism waitlist crisis and remembering what families need. J Child Psychol Psychiatry. 2021;62(2):140-142. doi:10.1111/jcpp.13254
    1. Wiggins LD, Baio J, Rice C. Examination of the time between first evaluation and first autism spectrum diagnosis in a population-based sample. J Dev Behav Pediatr. 2006;27(2)(suppl):S79-S87. doi:10.1097/00004703-200604002-00005
    1. Rice CE, Baio J, Morrier MJ, et al. . Changes in diagnostic testing practice for the autism spectrum disorders (ASDs) in four US populations. Paper presented at: International Meeting for Autism Research; May 9, 2009; Chicago, IL. Accessed December 6, 2022.
    1. Kim SH, Lord C. Combining information from multiple sources for the diagnosis of autism spectrum disorders for toddlers and young preschoolers from 12 to 47 months of age. J Child Psychol Psychiatry. 2012;53(2):143-151. doi:10.1111/j.1469-7610.2011.02458.x
    1. Mullen EM. Mullen Scales of Early Learning. American Guidance Service; 1995.
    1. Constantino JN, Kennon-McGill S, Weichselbaum C, et al. . Infant viewing of social scenes is under genetic control and is atypical in autism. Nature. 2017;547(7663):340-344. doi:10.1038/nature22999
    1. Wang A, Payne C, Moss S, Jones WR, Bachevalier J. Early developmental changes in visual social engagement in infant rhesus monkeys. Dev Cogn Neurosci. 2020;43:100778. doi:10.1016/j.dcn.2020.100778
    1. Klin A, Jones W, Schultz R, Volkmar F, Cohen D. Visual fixation patterns during viewing of naturalistic social situations as predictors of social competence in individuals with autism. Arch Gen Psychiatry. 2002;59(9):809-816. doi:10.1001/archpsyc.59.9.809
    1. Klin A, Lin DJ, Gorrindo P, Ramsay G, Jones W. Two-year-olds with autism orient to non-social contingencies rather than biological motion. Nature. 2009;459(7244):257-261. doi:10.1038/nature07868
    1. Jones W, Klin A. Attention to eyes is present but in decline in 2-6–month-old infants later diagnosed with autism. Nature. 2013;504(7480):427-431. doi:10.1038/nature12715
    1. Bossuyt PM, Reitsma JB, Bruns DE, et al. ; Standards for Reporting of Diagnostic Accuracy . Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative. BMJ. 2003;326(7379):41-44. doi:10.1136/bmj.326.7379.41
    1. Korevaar DA, Cohen JF, Reitsma JB, et al. . Updating standards for reporting diagnostic accuracy: the development of STARD 2015. Res Integr Peer Rev. 2016;1:7. doi:10.1186/s41073-016-0014-7
    1. Cohen JF, Korevaar DA, Altman DG, et al. . STARD 2015 guidelines for reporting diagnostic accuracy studies: explanation and elaboration. BMJ Open. 2016;6(11):e012799. doi:10.1136/bmjopen-2016-012799
    1. Lijmer JG, Mol BW, Heisterkamp S, et al. . Empirical evidence of design-related bias in studies of diagnostic tests. JAMA. 1999;282(11):1061-1066. doi:10.1001/jama.282.11.1061
    1. Leckman JF, Sholomskas D, Thompson WD, Belanger A, Weissman MM. Best estimate of lifetime psychiatric diagnosis: a methodological study. Arch Gen Psychiatry. 1982;39(8):879-883. doi:10.1001/archpsyc.1982.04290080001001
    1. Chawarska K, Klin A, Paul R, Macari S, Volkmar F. A prospective study of toddlers with ASD: short-term diagnostic and cognitive outcomes. J Child Psychol Psychiatry. 2009;50(10):1235-1245. doi:10.1111/j.1469-7610.2009.02101.x
    1. Macari SL, Campbell D, Gengoux GW, Saulnier CA, Klin AJ, Chawarska K. Predicting developmental status from 12 to 24 months in infants at risk for autism spectrum disorder: a preliminary report. J Autism Dev Disord. 2012;42(12):2636-2647. doi:10.1007/s10803-012-1521-0
    1. Zwaigenbaum L, Bryson S, Lord C, et al. . Clinical assessment and management of toddlers with suspected autism spectrum disorder: insights from studies of high-risk infants. Pediatrics. 2009;123(5):1383-1391. doi:10.1542/peds.2008-1606
    1. Risi S, Lord C, Gotham K, et al. . Combining information from multiple sources in the diagnosis of autism spectrum disorders. J Am Acad Child Adolesc Psychiatry. 2006;45(9):1094-1103. doi:10.1097/01.chi.0000227880.42780.0e
    1. Chawarska K, Klin A, Paul R, Volkmar F. Autism spectrum disorder in the second year: stability and change in syndrome expression. J Child Psychol Psychiatry. 2007;48(2):128-138. doi:10.1111/j.1469-7610.2006.01685.x
    1. Silverman BW. Density Estimation for Statistics and Data Analysis. Chapman and Hall; 1986.
    1. Good P. Permutation, Parametric, and Bootstrap Tests of Hypotheses. 3rd ed. Springer; 2000.
    1. Stone M. Cross-validatory choice and assessment of statistical predictions. J R Stat Soc Ser B Methodol. 1974;36(2):111-133. doi:10.1111/j.2517-6161.1974.tb00994.x
    1. Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32-35. doi:10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>;2-3
    1. Agresti A, Coull BA. Approximate is better than “exact” for interval estimation of binomial proportions. Am Stat. 1998;52(2):119-126. doi:10.1080/00031305.1998.10480550
    1. Linnet K. Performance of Deming regression analysis in case of misspecified analytical error ratio in method comparison studies. Clin Chem. 1998;44(5):1024-1031. doi:10.1093/clinchem/44.5.1024
    1. Martin RF. General Deming regression for estimating systematic bias and its confidence interval in method-comparison studies. Clin Chem. 2000;46(1):100-104. doi:10.1093/clinchem/46.1.100
    1. Bollen KA, Jackman RW. Regression diagnostics: an expository treatment of outliers and influential cases. Sociol Methods Res. 1985;13(4):510-542. doi:10.1177/0049124185013004004
    1. Kutner MH, Nachtsheim CJ, Neter J, Li W. Applied Linear Statistical Models. 5th ed. McGraw-Hill/Irwin; 2005.
    1. Spearman C. The proof and measurement of association between two things. by C. Spearman, 1904. Am J Psychol. 1987;100(3-4):441-471. doi:10.2307/1422689
    1. Law MR, Wald NJ, Wu T, Hackshaw A, Bailey A. Systematic underestimation of association between serum cholesterol concentration and ischaemic heart disease in observational studies: data from the BUPA study. BMJ. 1994;308(6925):363-366. doi:10.1136/bmj.308.6925.363
    1. Charles EP. The correction for attenuation due to measurement error: clarifying concepts and creating confidence sets. Psychol Methods. 2005;10(2):206-226. doi:10.1037/1082-989X.10.2.206
    1. Loomes R, Hull L, Mandy WPL. What is the male-to-female ratio in autism spectrum disorder? a systematic review and meta-analysis. J Am Acad Child Adolesc Psychiatry. 2017;56(6):466-474. doi:10.1016/j.jaac.2017.03.013
    1. Shultz S, Klin A, Jones W. Inhibition of eye blinking reveals subjective perceptions of stimulus salience. Proc Natl Acad Sci U S A. 2011;108(52):21270-21275. doi:10.1073/pnas.1109304108
    1. Zwaigenbaum L, Bishop S, Stone WL, et al. . Rethinking autism spectrum disorder assessment for children during COVID-19 and beyond. Autism Res. 2021;14(11):2251-2259. doi:10.1002/aur.2615
    1. LeDoux J. Synaptic Self: How Brains Become Who We Are. Viking Adult; 2002.
    1. Scarr S, McCartney K. How people make their own environments: a theory of genotype → environment effects. Child Dev. 1983;54(2):424-435. doi:10.1111/j.1467-8624.1983.tb03884.x
    1. Ellis BJ, Boyce WT, Belsky J, Bakermans-Kranenburg MJ, van Ijzendoorn MH. Differential susceptibility to the environment: an evolutionary–neurodevelopmental theory. Dev Psychopathol. 2011;23(1):7-28. doi:10.1017/S0954579410000611
    1. Jones W, Klaiman C, Richardson S, et al. . Eye-tracking–based measurement of social visual engagement compared with expert clinical diagnosis of autism. JAMA. Published online September 5, 2023. doi:10.1001/jama.2023.13295
    1. Burdick RK, Larsen GA. Confidence intervals on measures of variability in R&R studies. J Qual Technol. 1997;29(3):261-273. doi:10.1080/00224065.1997.11979768
    1. McDonnell CG, Bradley CC, Kanne SM, Lajonchere C, Warren Z, Carpenter LA. When are we sure? predictors of clinician certainty in the diagnosis of autism spectrum disorder. J Autism Dev Disord. 2019;49(4):1391-1401. doi:10.1007/s10803-018-3831-3
    1. Klaiman C, White S, Richardson S, et al. . Expert clinician certainty in diagnosing autism spectrum disorder in 16-30–month-olds: a multi-site trial secondary analysis. J Autism Dev Disord. 2022:1-16. doi:10.1007/s10803-022-05812-8

Source: PubMed

3
Abonnieren