- ICH GCP
- US Clinical Trials Registry
- Clinical Trial NCT05754606
Artificial Intelligence and Benign Lesions of Vocal Folds Recognition
Artificial Intelligence for the Recognition of Benign Lesions of Vocal Folds From Audio Recordings
Study Overview
Detailed Description
The investigators will collect the audio recordings of dysphonic participants affected by BLVF. All voice samples will be divided into the following groups based on the endoscopic diagnosis: vocal fold cysts, Reinke's edema, nodules and polyps. The audio tracks will be obtained by asking to pronounce with usual voice intensity, pitch and quality the word /aiuole/ three times in a row. Voices will be acquired using a Shure model SM48 microphone (Evanston IL) positioned at an angle of 45° at a distance of 20 cm from the patient's mouth. The microphone saturation input will be fixed at 6/9 of CH1 and the environmental noise was <30 dB sound pressure level (SPL). The signals will be recorded in ".nvi" format with a high-definition audio-recorder Computerized Speech Lab, model 4300B, from Kay Elemetrics (Lincoln Park, NJ, USA) with a sampling rate of 50 kHz frequency and converted to ".wav" format. Each audio file will be anonymously labelled with gender and type of BLVF.
Analysis pipeline All the following analyses will be performed using MatLab R2019b, the MathWorks, Natick MA, USA. The analysis pipeline included signal pre-processing, features extraction, screening of the features, and model implementation.
Features extraction On the segmented signal, 66 different features in the time, frequency, and cepstral domain will be extracted. Then, seven statistical measures will be computed on the extracted features, namely: mean, standard deviation, skewness, kurtosis, 25th, 50th, and 75th percentiles. In addition, jitter, shimmer, and tilt of the power spectrum will be obtained from the whole unsegmented signal.
Features screening Features screening will be applied using biostatistical analyses on the whole dataset, to reduce the extended number of features to give as input to the classifier. Two statistical tests will be used to screen relevant features for the classification task: the one-way analysis of variance (ANOVA), when all the groups were normally distributed, and the Kruskal-Wallis test, otherwise. The groups' normality will be verified through the Kolmogorov-Smirnov test. For all the tests, a p-value <0.05 will be considered statistically significant.
A. Model implementation A non-linear Support Vector Machine (SVM) with a Gaussian kernel is the algorithm chosen for this research. The classification performance will be measured through the accuracy and the average F1-score. Both metrics will be provided for the description of the overall classification performances and those obtained on gender sub-groups.
Study Type
Enrollment (Anticipated)
Contacts and Locations
Study Contact
- Name: Maria Raffaella Marchese
- Phone Number: 3391144556
- Email: raffaellamarchese@gmail.com
Study Locations
-
-
-
Roma, Italy, 00198
- Recruiting
- Maria Raffaella Marchese
-
Contact:
- Maria Raffaella Marchese
- Phone Number: 3391144556
- Email: raffaellamarchese@gmail.com
-
-
Participation Criteria
Eligibility Criteria
Ages Eligible for Study
Accepts Healthy Volunteers
Genders Eligible for Study
Sampling Method
Study Population
Description
Inclusion criteria:
- Reinke's edema
- cyst of the vocal fold
- nodule of the vocal fold
- polyp of the vocal fold
Exclusion criteria:
- previous laryngeal or thyroid surgery
- previous speech therapy
- current pulmonary diseases
- current gastroesophageal reflux
- laryngeal movement disorder or recurrent laryngeal nerve paralysis
- Non-native Italian speakers
Study Plan
How is the study designed?
Design Details
What is the study measuring?
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
validation of ML algorithms to recognize the different BVFL
Time Frame: five years
|
The statistical measures computed on the extracted features are the following: mean, standard deviation, skewness, kurtosis, 25th, 50th, and 75th percentiles.
In addition, jitter, shimmer, and tilt of the power spectrum will be obtained from the whole unsegmented signal.
|
five years
|
Collaborators and Investigators
Study record dates
Study Major Dates
Study Start (Actual)
Primary Completion (Anticipated)
Study Completion (Anticipated)
Study Registration Dates
First Submitted
First Submitted That Met QC Criteria
First Posted (Estimate)
Study Record Updates
Last Update Posted (Estimate)
Last Update Submitted That Met QC Criteria
Last Verified
More Information
Terms related to this study
Additional Relevant MeSH Terms
Other Study ID Numbers
- 4519
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Dysphonia
-
National Institute of Neurological Disorders and...Completed
-
University of California, San FranciscoCompletedSpasmodic Dysphonia | Adductor Spasmodic Dysphonia | Voice DisordersUnited States
-
Kumamoto UniversityKyoto University; Hokkaido University Hospital; Yokohama City University HospitalCompletedAdductor Spasmodic DysphoniaJapan
-
Eastern Virginia Medical SchoolRecruitingDysphonia | Laryngeal Dystonia | Dysphonia, SpasticUnited States
-
University of Wisconsin, MadisonCompleted
-
University of MinnesotaWithdrawnAdductor Spasmodic Dysphonia | Laryngeal Dystonia | Abductor Spastic DysphoniaUnited States
-
Texas Christian UniversityNational Institute on Deafness and Other Communication Disorders (NIDCD)RecruitingDysphonia | Muscle Tension DysphoniaUnited States
-
Pusan National University Yangsan HospitalCompleted
-
University of California, San FranciscoNational Spasmodic Dysphonia AssociationCompletedSpasmodic Dysphonia | Adductor Spasmodic Dysphonia | Voice DisordersUnited States
-
Lawson Health Research InstituteUnknownAdductor Spasmodic DysphoniaCanada
Clinical Trials on Audio recordings
-
University of WashingtonNational Center for Complementary and Integrative Health (NCCIH)CompletedChronic Pain | Chronic Low-back PainUnited States
-
Memorial Sloan Kettering Cancer CenterNational Cancer Institute (NCI); National Institutes of Health (NIH)Completed
-
Mayo ClinicCompletedPostmenopausalUnited States
-
Assistance Publique - Hôpitaux de ParisNot yet recruitingAtopic DermatitisFrance
-
St. Olavs HospitalOslo University Hospital; University Hospital of North Norway; Norwegian University... and other collaboratorsCompleted
-
St. Antonius HospitalLeiden UniversityRecruiting
-
Dartmouth-Hitchcock Medical CenterNational Institutes of Health (NIH); National Institute on Aging (NIA); Vanderbilt... and other collaboratorsRecruitingDiabetesUnited States
-
Centre Hospitalier Universitaire de Pointe-a-PitreGroupe Hospitalier Pitie-Salpetriere; University Hospital Center of MartiniqueCompletedProgressive Supranuclear Palsy
-
Siriraj HospitalNot yet recruitingHypnosis, Mindfulness Meditation
-
Dartmouth-Hitchcock Medical CenterNational Library of Medicine (NLM)CompletedMultimorbidityUnited States