Variational autoencoder provides proof of concept that compressing CDT to extremely low-dimensional space retains its ability of distinguishing dementia

Sabyasachi Bandyopadhyay, Catherine Dion, David J Libon, Catherine Price, Patrick Tighe, Parisa Rashidi, Sabyasachi Bandyopadhyay, Catherine Dion, David J Libon, Catherine Price, Patrick Tighe, Parisa Rashidi

Abstract

The clock drawing test (CDT) is an inexpensive tool to screen for dementia. In this study, we examined if a variational autoencoder (VAE) with only two latent variables can capture and encode clock drawing anomalies from a large dataset of unannotated CDTs (n = 13,580) using self-supervised pre-training and use them to classify dementia CDTs (n = 18) from non-dementia CDTs (n = 20). The model was independently validated using a larger cohort consisting of 41 dementia and 50 non-dementia clocks. The classification model built with the parsimonious VAE latent space adequately classified dementia from non-dementia (0.78 area under receiver operating characteristics (AUROC) in the original test dataset and 0.77 AUROC in the secondary validation dataset). The VAE-identified atypical clock features were then reviewed by domain experts and compared with existing literature on clock drawing errors. This study shows that a very small number of latent variables are sufficient to encode important clock drawing anomalies that are predictive of dementia.

Trial registration: ClinicalTrials.gov NCT01986577 NCT03175302.

Conflict of interest statement

The authors declare no competing interests.

© 2022. The Author(s).

Figures

Figure 1
Figure 1
Distribution of reconstructed clocks in the VAE latent space. (A) Clock drawing reconstructions are represented as a function of the two VAE latent dimensions. This shows the variety of reconstructions generated by the VAE to capture the distribution of the training dataset. (B) Scatterplot showing the distribution of the latent vectors belonging to clocks in the fine-tuning dataset divided into dementia (= 1) and control (= 0) groups. The red curve represents a possible decision boundary between the two groups. Using our neural network classifier, we aim to learn such a decision boundary to classify the two groups.
Figure 2
Figure 2
Generative factors of clock drawing are discovered using traversals over VAE latent space. Different regions of the two-dimensional manifold constructed by the VAE map to specific artistic features/anomalies of clock drawings. Latent Dimension 1 = Z0, Latent Dimension 2 = Z1. Left side of the VAE latent space: direction of eccentricity of reconstructed clocks reverses from left to right as Z1 increases from − 4 to + 4 given Z0 =  − 4. This change is correlated with a decrease in clock size shown by darkening of the reconstructed clock. Right side of the VAE latent space: distance of intersection point of clock hands from the geometrical clock center increases as Z1 changes from − 4 to + 4 given Z0 = 4. This change is correlated with a loss of the circular clock boundary. Top half of the VAE latent space: size of the clock hands increases as Z0 changes from − 4 to + 4 given Z1 = 4. This change is correlated with the loss of clock face boundary. Bottom half of the VAE latent space: angle of eccentricity of the clock decreases as Z0 changes from − 4 to + 4 with Z1 =  − 4. This change is correlated with an increase in clock size. X-axis of the VAE latent space: size of the clock increases as Z0 changes from − 4 to + 4 with Z1 = 0.
Figure 3
Figure 3
Performance of the classifier on test dataset. (A) The test dataset consists of 18 dementia and 20 control subjects. This image shows the confusion matrix. (B) Area under receiver operating characteristic is 0.77. (C) Area under precision-recall curve is 0.76.
Figure 4
Figure 4
Increasing the interpretability and clinical utility of the VAE latent space. (A) Diagnosis of misclassified clock drawings in test dataset. The latent projections of misclassified clocks from the test dataset were compared to their original clock drawings (salmon represents dementia clocks misclassified as controls; green represents control clocks misclassified as dementia). Dementia and control clocks formed two distinct clusters in the latent space with the control clocks being projected towards the anomalous side of the latent space, thus justifying their classification. Mapping these projections to the original clock drawings revealed that our model correlated eccentricity to dementia and circularity to non-dementia. It failed to encode missingness/shortening of hands as a predictive feature variable in dementia, and our preprocessing pipeline mis-cropped certain clocks, thereby introducing eccentricity into them. (B) Operationalization of the VAE latent space using k-nearest neighbor classifier with k = 13. K-nearest neighbors of each training datapoint were labeled uniformly while simultaneously varying k to find the best decision boundary between dementia and control samples. This decision boundary physically divided the VAE latent space into two regions: Red—Dementia and Blue—Control. The control region is smaller, and it consists of normal sized clocks with circular clockfaces whose hands intersect at the geometric center of the clockface. The dementia region is larger in size and encodes the various anomalies detected by the VAE.
Figure 5
Figure 5
Conceptual workflow for training VAE with unlabeled dataset and constructing classifiers with latent space of VAE. (A) Clock images are passed into the VAE encoder in the form of a 1X10,000 vector. The top part of the schema represents the VAE decoder-encoder couplet which is trained to minimize the reconstruction loss of the clock drawing using an information bottleneck of a two-dimensional latent space. The lower part of the figure shows how the latent dimensions which capture a compressed representation of a clock drawing are passed into a classifier that mimics the architecture of the VAE decoder. This classifier is fine-tuned to reduce the loss in predicting dementia. (B) This image differentiates between the self-supervised pre-training of the VAE versus the usage of the pre-trained VAE encoder for downstream classification.

References

    1. Libon DJ, Malamut BL, Swenson R, Sands LP, Cloud BS. Further analyses of clock drawings among demented and nondemented older subjects. Arch. Clin. Neuropsychol. 1996;11:193–205. doi: 10.1093/arclin/11.3.193.
    1. Dion C, et al. Cognitive correlates of digital clock drawing metrics in older adults with and without mild cognitive impairment. J. Alzheimers Dis. 2020;75:73–83. doi: 10.3233/JAD-191089.
    1. Freedman M, Leach L, Kaplan E, Shulman K, Delis DC. Clock Drawing: A Neuropsychological Analysis. Oxford University Press; 1994.
    1. Cosentino S, Jefferson A, Chute DL, Kaplan E, Libon DJ. Clock drawing errors in dementia: Neuropsychological and neuroanatomical considerations. Cogn. Behav. Neurol. 2004;17:74–84. doi: 10.1097/01.wnn.0000119564.08162.46.
    1. Penney, D. et al. in Annual Meeting of The International Neuropsychological Society.
    1. Piers RJ, et al. Age and graphomotor decision making assessed with the digital clock drawing test: The Framingham Heart Study. J. Alzheimers Dis. 2017;60:1611–1620. doi: 10.3233/jad-170444.
    1. Royall DR, Cordes JA, Polk M. CLOX: An executive clock drawing task. J. Neurol. Neurosurg. Psychiatry. 1998;64:588–594. doi: 10.1136/jnnp.64.5.588.
    1. Shulman KI, Shedletsky R, Silver IL. The challenge of time: Clock-drawing and cognitive function in the elderly. Int. J. Geriatr. Psychiatry. 1986;1:135–140. doi: 10.1002/gps.930010209.
    1. Rouleau I, Salmon DP, Butters N, Kennedy C, McGuire K. Quantitative and qualitative analyses of clock drawings in Alzheimer's and Huntington's disease. Brain Cogn. 1992;18:70–87. doi: 10.1016/0278-2626(92)90112-y.
    1. Sunderland T, et al. Clock drawing in Alzheimer's disease. A novel measure of dementia severity. J. Am. Geriatr. Soc. 1989;37:725–729. doi: 10.1111/j.1532-5415.1989.tb02233.x.
    1. Agrell B, Dehlin O. The clock-drawing test. Age Ageing. 1998;27:399–404. doi: 10.1093/ageing/27.3.399.
    1. Shulman KI. Clock-drawing: Is it the ideal cognitive screening test? Int. J. Geriatr. Psychiatry. 2000;15:548–561. doi: 10.1002/1099-1166(200006)15:6<548::aid-gps242>;2-u.
    1. Spenciere B, Alves H, Charchat-Fichman H. Scoring systems for the clock drawing test: A historical review. Dement. Neuropsychol. 2017;11:6–14. doi: 10.1590/1980-57642016dn11-010003.
    1. Price CC, et al. Clock drawing in the Montreal Cognitive Assessment: Recommendations for dementia assessment. Dement. Geriatr. Cogn. Disord. 2011;31:179–187. doi: 10.1159/000324639.
    1. Frei BW, et al. Considerations for clock drawing scoring systems in perioperative anesthesia settings. Anesth. Analg. 2019;128:e61–e64. doi: 10.1213/ANE.0000000000004105.
    1. Davis R, Libon DJ, Au R, Pitman D, Penney DL. THink: Inferring cognitive status from subtle behaviors. Proc. Conf. AAAI Artif. Intell. 2014;2014:2898–2905.
    1. Souillard-Mandar W, et al. Learning classification models of cognitive conditions from subtle behaviors in the digital clock drawing test. Mach. Learn. 2016;102:393–441. doi: 10.1007/s10994-015-5529-5.
    1. Davoudi A, et al. Classifying non-dementia and Alzheimer's disease/vascular dementia patients using kinematic, time-based, and visuospatial parameters: The digital clock drawing test. J. Alzheimers Dis. 2021;82:47–57. doi: 10.3233/JAD-201129.
    1. Binaco R, et al. Machine learning analysis of digital clock drawing test performance for differential classification of mild cognitive impairment subtypes versus Alzheimer's disease. J. Int. Neuropsychol. Soc. 2020;26:690–700. doi: 10.1017/S1355617720000144.
    1. Souillard-Mandar W, et al. DCTclock: Clinically-interpretable and automated artificial intelligence analysis of drawing behavior for capturing cognition. Front. Digit. Health. 2021;3:750661. doi: 10.3389/fdgth.2021.750661.
    1. Gomes-Osman J, et al. Aging in the digital age: Using technology to increase the reach of the clinician expert and close the gap between health span and life span. Front. Digit. Health. 2021;3:755008. doi: 10.3389/fdgth.2021.755008.
    1. Kingma, D. P. & Welling, M. Auto-encoding variational bayes. arXiv preprint (2013).
    1. Kingma, D. & Welling, M. (2019).
    1. Kingma, D. P., Mohamed, S., Rezende, D. J. & Welling, M. in Advances in Neural Information Processing Systems 3581–3589.
    1. Sønderby, C. K., Raiko, T., Maaløe, L., Sønderby, S. K. & Winther, O. How to train deep variational autoencoders and probabilistic ladder networks. arXiv preprint 3 (2016).
    1. Kim, H. & Mnih, A. in International Conference on Machine Learning 2649–2658 (PMLR).
    1. Kim, M., Wang, Y., Sahu, P. & Pavlovic, V. Relevance factor VAE: Learning and identifying disentangled factors. arXiv preprint (2019).
    1. Kim, M., Wang, Y., Sahu, P. & Pavlovic, V. in Proceedings of the IEEE/CVF International Conference on Computer Vision 2979–2987.
    1. Dosovitskiy, A. et al. An image is worth 16 × 16 words: Transformers for image recognition at scale. arXiv preprint (2020).
    1. Moons KG, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): Explanation and elaboration. Ann. Intern. Med. 2015;162:W1–73. doi: 10.7326/M14-0698.
    1. Amini S, et al. Feasibility and rationale for incorporating frailty and cognitive screening protocols in a preoperative anesthesia clinic. Anesth. Analg. 2019;129:830–838. doi: 10.1213/ANE.0000000000004190.
    1. Emrani S, et al. Alzheimer’s/vascular spectrum dementia: Classification in addition to diagnosis. J. Alzheimers Dis. 2020;73:63–71. doi: 10.3233/JAD-190654.
    1. Price CC, Jefferson AL, Merino JG, Heilman KM, Libon DJ. Subcortical vascular dementia: Integrating neuropsychological and neuroradiologic data. Neurology. 2005;65:376–382. doi: 10.1212/01.wnl.0000168877.06011.15.
    1. Price CC, et al. Leukoaraiosis severity and list-learning in dementia. Clin. Neuropsychol. 2009;23:944–961. doi: 10.1080/13854040802681664.
    1. Lawton MP, Brody EM. Assessment of older people: Self-maintaining and instrumental activities of daily living. Gerontologist. 1969;9:179–186. doi: 10.1093/geront/9.3_Part_1.179.
    1. American Psychiatric Association, D. & Association, A. P. Diagnostic and Statistical Manual of Mental Disorders, 5th edn. (American Psychiatric Association, 2013).
    1. Welsh KA, Breitner JC, Magruder-Habib KM. Detection of dementia in the elderly using telephone screening of cognitive status. Neuropsychiatry Neuropsychol. Behav. Neurol. 1993;6:103–110.
    1. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: Development and validation. J. Chronic Dis. 1987;40:373–383. doi: 10.1016/0021-9681(87)90171-8.
    1. Davis, R. et al. The digital clock drawing test (dCDT) I: Development of a new computerized quantitative system. in The International Neuropsychological Society (2011).

Source: PubMed

3
Tilaa