Predicting clinical scores in Huntington's disease: a lightweight speech test

Rachid Riad, Marine Lunven, Hadrien Titeux, Xuan-Nga Cao, Jennifer Hamet Bagnou, Laurie Lemoine, Justine Montillot, Agnes Sliwinski, Katia Youssov, Laurent Cleret de Langavant, Emmanuel Dupoux, Anne-Catherine Bachoud-Lévi, Rachid Riad, Marine Lunven, Hadrien Titeux, Xuan-Nga Cao, Jennifer Hamet Bagnou, Laurie Lemoine, Justine Montillot, Agnes Sliwinski, Katia Youssov, Laurent Cleret de Langavant, Emmanuel Dupoux, Anne-Catherine Bachoud-Lévi

Abstract

Objectives: Using brief samples of speech recordings, we aimed at predicting, through machine learning, the clinical performance in Huntington's Disease (HD), an inherited Neurodegenerative disease (NDD).

Methods: We collected and analyzed 126 samples of audio recordings of both forward and backward counting from 103 Huntington's disease gene carriers [87 manifest and 16 premanifest; mean age 50.6 (SD 11.2), range (27-88) years] from three multicenter prospective studies in France and Belgium (MIG-HD (ClinicalTrials.gov NCT00190450); BIO-HD (ClinicalTrials.gov NCT00190450) and Repair-HD (ClinicalTrials.gov NCT00190450). We pre-registered all of our methods before running any analyses, in order to avoid inflated results. We automatically extracted 60 speech features from blindly annotated samples. We used machine learning models to combine multiple speech features in order to make predictions at individual levels of the clinical markers. We trained machine learning models on 86% of the samples, the remaining 14% constituted the independent test set. We combined speech features with demographics variables (age, sex, CAG repeats, and burden score) to predict cognitive, motor, and functional scores of the Unified Huntington's disease rating scale. We provided correlation between speech variables and striatal volumes.

Results: Speech features combined with demographics allowed the prediction of the individual cognitive, motor, and functional scores with a relative error from 12.7 to 20.0% which is better than predictions using demographics and genetic information. Both mean and standard deviation of pause durations during backward recitation and clinical scores correlated with striatal atrophy (Spearman 0.6 and 0.5-0.6, respectively).

Interpretation: Brief and examiner-free speech recording and analysis may become in the future an efficient method for remote evaluation of the individual condition in HD and likely in other NDD.

Keywords: Huntington’s disease; Machine learning; Speech.

Conflict of interest statement

Nothing to report.

© 2022. The Author(s).

Figures

Fig. 1
Fig. 1
Extraction of individual clinical scores from the speech samples. (Top panel) Examples of portions of the speech signal and various types of vocalizations and segmentation are provided. Similar speech features were extracted separately from the forward and backward counting tasks yielding to 60 features (30 × 2). (Bottom panel) Illustration of the methods developments, Machine learning training and evaluation of the predictions of the clinical scores. N CAG number of CAG repeats on the Huntingtin gene, DBS Disease Burden Score. TFC Total Functional capacity, TMS Total motor score, SDMT Symbol digit modality, UHDRS IS UHDRS Independence Scale, MAE Mean absolute error, ICC Intraclass correlation coefficient, cUHDRS composite UHDRS
Fig. 2
Fig. 2
Illustration of individual predictions of the cUHDRS (Left) and the TMS (Right) based on the speech features. Each individual blue dot is the difference between the predicted and the observed score for a particular assessment of an individual of the test set. The red dashed line is the line ‘y = x’. The black line is the individual contribution of a point (individual absolute error) to obtain the Mean Absolute Error (MAE)
Fig. 3
Fig. 3
Boxplots of mean-absolute-error (MAE) on the test set for the repeated-learning testing experiment. A MAE at zero means that the predicted value equals the observed one. Horizontal lines are the medians, boxes are upper and lower quartiles, and whiskers are 1.5 × IQR (Interquartile Range). First row displays the cUHDRS, functional, and motor predicted scores; whereas the second row displays the predicted Cognitive Scores. Statistical Significance was assessed with Wilcoxon-test and was Bonferroni-corrected
Fig. 4
Fig. 4
Boxplots of intraclass correlation coefficients (ICC) on the test set for the repeated-learning testing experiment. An ICC at 1 means that the predicted value equals the observed one. Horizontal lines are the medians, boxes are upper and lower quartiles, and whiskers are 1.5 × IQR (Interquartile Range). First row displays the cUHDRS, functional, and motor predicted scores; whereas the second row displays the predicted Cognitive Scores. Statistical Significance was assessed with Wilcoxon-test and was Bonferroni-corrected. The dashed lines figure the ICCs obtained between Neurologists for the clinical scores namely: (1) ICC for cUHDRS ICC = 0.92 [49], (2) for TMS ICC = 0.847 [3], (3) for TFC ICC = 0.938, and for UHDRS IS ICC = 0.842 [4]. The ICC cannot be computed for the Mean Cohort Performance as its standard deviation is zero
Fig. 5
Fig. 5
Coefficient importance of the different speech features for the predictions of the clinical scores. Each line represents a feature of Table 2 and the rank is the order introduced in Table 2. These mean weights are obtained with a linear Elastic Net model for interpretability. The weights are z-scored per clinical score to be one the same scale. The weights for the clinical scores are reversed, so that a higher feature weight can be interpreted as a higher clinical impairment

References

    1. Ross CA, Tabrizi SJ. Huntington’s disease: from molecular pathogenesis to clinical treatment. Lancet Neurol. 2011;10(1):83–98. doi: 10.1016/S1474-4422(10)70245-3.
    1. (1996) Unified Huntington’s disease rating scale: reliability and consistency. Huntington study group. Mov Disord Off J Mov Disord. Soc 11(2): 136–142. 10.1002/mds.870110204.
    1. Winder JY, Roos RAC, Burgunder J, Marinus J, Reilmann R. Interrater reliability of the unified huntington’s disease rating scale-total motor score certification. Mov Disord Clin Pract. 2018;5(3):290–295. doi: 10.1002/mdc3.12618.
    1. Winder JY, Achterberg WP, Marinus J, Gardiner SL, Roos RAC. Assessment scales for patients with advanced Huntington’s disease: comparison of the UHDRS and UHDRS-FAP. Mov Disord Clin Pract. 2018;5(5):527–533. doi: 10.1002/mdc3.12646.
    1. Schobel SA, et al. Motor, cognitive, and functional declines contribute to a single progressive factor in early HD. Neurology. 2017;89(24):2495–2502. doi: 10.1212/WNL.0000000000004743.
    1. Stout JC, et al. HD-CAB: a cognitive assessment battery for clinical trials in Huntington’s disease 1,2,3. Mov Disord Off J Mov Disord Soc. 2014;29(10):1281–1288. doi: 10.1002/mds.25964.
    1. Mason SL, et al. Predicting clinical diagnosis in Huntington’s disease: an imaging polymarker. Ann Neurol. 2018;83(3):532–543. doi: 10.1002/ana.25171.
    1. Scahill RI, et al. Biological and clinical characteristics of gene carriers far from predicted onset in the Huntington’s disease young adult study (HD-YAS): a cross-sectional analysis. Lancet Neurol. 2020;19(6):502–512. doi: 10.1016/S1474-4422(20)30143-5.
    1. Zhan A, et al. Using smartphones and machine learning to quantify Parkinson disease severity: the mobile parkinson disease score. JAMA Neurol. 2018;75(7):876–880. doi: 10.1001/jamaneurol.2018.0809.
    1. Bechtel N, et al. Tapping linked to function and structure in premanifest and symptomatic Huntington disease. Neurology. 2010;75(24):2150–2160. doi: 10.1212/WNL.0b013e3182020123.
    1. Gajos KZ, et al. Computer mouse use captures ataxia and parkinsonism, enabling accurate measurement and detection. Mov Disord. 2020;35(2):354–358. doi: 10.1002/mds.27915.
    1. Wilkinson J, et al. Time to reality check the promises of machine learning-powered precision medicine. Lancet Digit Health. 2020;2(12):e677–e680. doi: 10.1016/S2589-7500(20)30200-4.
    1. Fernandes BS, Williams LM, Steiner J, Leboyer M, Carvalho AF, Berk M. The new field of ‘precision psychiatry’. BMC Med. 2017;15(1):80. doi: 10.1186/s12916-017-0849-x.
    1. Guenther FH. Neural control of speech. Cambridge: MIT Press; 2016.
    1. Levelt WJM. Speaking: from intention to articulation. Cambridge: MIT Press; 1993.
    1. Rusz J, et al. Objective acoustic quantification of phonatory dysfunction in Huntington’s Disease. PLoS ONE. 2013;8(6):e65881. doi: 10.1371/journal.pone.0065881.
    1. Rusz J, Saft C, Schlegel U, Hoffman R, Skodda S. Phonatory dysfunction as a preclinical symptom of Huntington Disease. PLoS ONE. 2014;9(11):e113412. doi: 10.1371/journal.pone.0113412.
    1. Rusz J, et al. Characteristics and occurrence of speech impairment in Huntington’s disease: possible influence of antipsychotic medication. J Neural Transm. 2014;121(12):1529–1539. doi: 10.1007/s00702-014-1229-8.
    1. Skodda S, et al. Two different phenomena in basic motor speech performance in premanifest Huntington disease. Neurology. 2016;86(14):1329–1335. doi: 10.1212/WNL.0000000000002550.
    1. Skodda S, Schlegel U, Hoffmann R, Saft C. Impaired motor speech performance in Huntington’s disease. J Neural Transm. 1996;121(4):399–407. doi: 10.1007/s00702-013-1115-9.
    1. Ramig LA. Acoustic analyses of phonation in patients with Huntington’s disease. Preliminary report. Ann Otol Rhinol Laryngol. 1986;95(3 Pt 1):288–293. doi: 10.1177/000348948609500315.
    1. Velasco García MJ, Cobeta I, Martín G, Alonso-Navarro H, Jimenez-Jimenez FJ. Acoustic analysis of voice in Huntington’s disease patients. J Voice Found. 2011;25(2):208–217. doi: 10.1016/j.jvoice.2009.08.007.
    1. Németh D, et al. Language deficits in Pre-Symptomatic Huntington’s Disease: Evidence from Hungarian. Brain Lang. 2012;121(3):248–253. doi: 10.1016/j.bandl.2012.04.001.
    1. Wallesch C-W, Fehrenbach RA. On the neurolinguistic nature of language abnormalities in Huntington’s disease. J Neurol Neurosurg Psychiatry. 1988;51(3):367–373. doi: 10.1136/jnnp.51.3.367.
    1. Chenery HJ, Copland DA, Murdoch BE. Complex language functions and subcortical mechanisms: evidence from Huntington’s disease and patients with non-thalamic subcortical lesions. Int J Lang Commun Disord. 2002;37(4):459–474. doi: 10.1080/1368282021000007730.
    1. Hinzen W, et al. “A systematic linguistic profile of spontaneous narrative speech in pre-symptomatic and early stage Huntington’s disease”, Cortex. J Devoted Study Nerv Syst Behav. 2018;100:71–83. doi: 10.1016/j.cortex.2017.07.022.
    1. Vogel AP, Shirbin C, Churchyard AJ, Stout JC. Speech acoustic markers of early stage and prodromal Huntington’s disease: a marker of disease onset? Neuropsychologia. 2012;50(14):3273–3278. doi: 10.1016/j.neuropsychologia.2012.09.011.
    1. Hertrich I, Ackermann H. Acoustic analysis of speech timing in Huntington’s disease. Brain Lang. 1994;47(2):182–196. doi: 10.1006/brln.1994.1048.
    1. Perez et al M (2018) Classification of huntington disease using acoustic and lexical features. In: Interspeech, ISCA, Hyderabad India, pp.1898–1902.
    1. Romana A, Bandon J, Carlozzi N, Roberts A, Provost EM. Classification of manifest Huntington disease using vowel distortion measures. Interspeech. 2020;2020:4966–4970. doi: 10.21437/interspeech.2020-2724.
    1. Chan JCS, Stout JC, Vogel AP. Speech in prodromal and symptomatic Huntington’s disease as a model of measuring onset and progression in dominantly inherited neurodegenerative diseases. Neurosci Biobehav Rev. 2019;107:450–460. doi: 10.1016/j.neubiorev.2019.08.009.
    1. Shoulson I. Huntington disease: functional capacities in patients treated with neuroleptic and antidepressant drugs. Neurology. 1981;31(10):1333–1335. doi: 10.1212/WNL.31.10.1333.
    1. Tabrizi SJ, et al. Biological and clinical manifestations of Huntington’s disease in the longitudinal TRACK-HD study: cross-sectional analysis of baseline data. Lancet Neurol. 2009;8(9):791–801. doi: 10.1016/S1474-4422(09)70170-X.
    1. Boersma P (2006) Praat: doing phonetics by computer,” .
    1. Titeux et al H (2020) Seshat: a tool for managing and verifying annotation campaigns of audio data. In: Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, France, May 2020, pp. 6976–6982. Accessed: Nov. 09, 2020. [Online]. Available:
    1. Clark HH. Using language. Cambridge: Cambridge University Press; 1996.
    1. Povey et al D (2014) The Kaldi speech recognition toolkit. In: Proc. ASRU, 2011, pp. 1–4. Accessed: Nov. 19, 2014. [Online]. Available:
    1. Riad et al R (2020) Vocal markers from sustained phonation in Huntington’s disease. Proc. Interspeech, 1893–1897, 10.21437/Interspeech.2020-1057
    1. Ludlow CL, Connor NP, Bassich CJ. Speech timing in Parkinson’s and Huntington’s disease. Brain Lang. 1987;32(2):195–214. doi: 10.1016/0093-934x(87)90124-6.
    1. Santos JF, Falk TH. Updating the SRMR-CI metric for improved intelligibility prediction for cochlear implant users. IEEEACM Trans Audio Speech Lang Process. 2014;22(12):2197–2206. doi: 10.1109/TASLP.2014.2363788.
    1. Feurer M, Klein A, Eggensperger K, Springenberg J, Blum M, Hutter F. Efficient and robust automated machine learning. Adv Neural Inf Process Syst. 2015;28:2962–2970.
    1. Rodrigues FB, et al. Mutant huntingtin and neurofilament light have distinct longitudinal dynamics in Huntington’s disease. Sci Transl Med. 2020 doi: 10.1126/scitranslmed.abc2888.
    1. Poldrack RA, Huckins G, Varoquaux G. Establishment of best practices for evidence for prediction: a review. JAMA Psychiat. 2019 doi: 10.1001/jamapsychiatry.2019.3671.
    1. Varoquaux G. Cross-validation failure: small sample sizes lead to large error bars. Neuroimage. 2017 doi: 10.1016/j.neuroimage.2017.06.061.
    1. Fischl B, et al. Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron. 2002;33(3):341–355. doi: 10.1016/S0896-6273(02)00569-X.
    1. Reshef YA, Reshef DN, Finucane HK, Sabeti PC, Mitzenmacher M. Measuring dependence powerfully and equitably. J Mach Learn Res. 2016;17(211):1–63.
    1. Albanese D, Riccadonna S, Donati C, Franceschi P. A practical tool for maximal information coefficient analysis. GigaScience. 2018 doi: 10.1093/gigascience/giy032.
    1. Nichols TE, Holmes AP. Nonparametric permutation tests for functional neuroimaging: A primer with examples. Hum Brain Mapp. 2002;15(1):1–25. doi: 10.1002/hbm.1058.
    1. Trundell D, Palermo G, Schobel S, Long JD, Leavitt BR, Tabrizi SJ. F23 Validity, reliability, ability to detect change and meaningful within-patient change of the CUHDRS. London: BMJ Publishing Group Ltd; 2018.
    1. Yi Q, Panzarella T. Estimating sample size for tests on trends across repeated measurements with missing data based on the interaction term in a mixed model. Control Clin Trials. 2002;23(5):481–496. doi: 10.1016/S0197-2456(02)00223-4.
    1. Arias-Vergara T, Klumpp P, Vasquez J, Orozco JR, Noeth E. Parkinson’s disease progression assessment from speech using a mobile device-based application. Cham: Springer; 2017. pp. 371–379.
    1. Robbins TW, James M, Owen AM, Sahakian BJ, McInnes L, Rabbitt P. Cambridge neuropsychological test automated battery (CANTAB): a factor analytic study of a large sample of normal elderly volunteers. Dement Geriatr Cogn Disord. 1994;5(5):266–281. doi: 10.1159/000106735.
    1. Lo J, et al. Dual tasking impairments are associated with striatal pathology in Huntington’s disease. Ann Clin Transl Neurol. 2020;7(9):1608–1619. doi: 10.1002/acn3.51142.
    1. Mayr U, Keele SW. Changing internal constraints on action: the role of backward inhibition. J Exp Psychol Gen. 2000;129(1):4. doi: 10.1037/0096-3445.129.1.4.
    1. Rofes A, et al. Language in individuals with left hemisphere tumors: is spontaneous speech analysis comparable to formal testing? J Clin Exp Neuropsychol. 2018;40(7):722–732. doi: 10.1080/13803395.2018.1426734.
    1. Ryant N, Church K, Cieri C, Cristia A, Du J, Ganapathy S, Liberman M(2019) The Second DIHARD Diarization Challenge: Dataset, Task, and Baselines. Interspeech 978–982. 10.21437/Interspeech.2019-1268
    1. Reilmann R, Schubert R. Motor outcome measures in Huntington disease clinical trials. Handb Clin Neurol. 2017;144:209–225. doi: 10.1016/B978-0-12-801893-4.00018-3.
    1. Rusz J, et al. Speech biomarkers in rapid eye movement sleep behavior disorder and parkinson disease. Ann Neurol. 2021 doi: 10.1002/ana.26085.

Source: PubMed

3
S'abonner