Artificial Intelligence and Benign Lesions of Vocal Folds Recognition

February 21, 2023 updated by: Marchese Maria Raffaella, Fondazione Policlinico Universitario Agostino Gemelli IRCCS

Artificial Intelligence for the Recognition of Benign Lesions of Vocal Folds From Audio Recordings

The development of Artificial Intelligence (AI), the evolution of voice technology, progresses in audio signal analysis, and natural language processing/understanding methods have opened the way to numerous potential applications of voice, such as the identification of vocal biomarkers for diagnosis, classification or to enhance clinical practice. More recently, researches focused on the role of the audio signal of the voice as a signature of the pathogenic process. Dysphonia indicates that some negative changes have occurred in the voice production. The overall prevalence of dysphonia is approximately 1% even if the actual rates may be higher depending on the population studied and the definition of the specific voice disorder. Voice health may be assessed by several acoustic parameters. The relationship between voice pathology and acoustic voice features has been clinically established and confirmed both quantitatively and subjectively by speech experts. The automatic systems are designed to determine whether the sample belongs to a healthy subject or a non-healthy subject. The exactness of acoustic parameters is linked to the features used to estimate them for speech noise identification. Current voice searches are mostly restricted to basic questions even if with broad perspectives. The literature on vocal biomarkers of specific vocal fold diseases is anecdotal and related to functional vocal fold disorders or rare movement disorders of the larynx . The most common causes of dysphonia are the Benign Lesions of the Vocal Fold (BLVF). Currently, videolaryngostroboscopy, although invasive, is the gold standard for the diagnosis of BLVF. However, it is invasive and expensive procedure. The novel ML algorithms have recently improved the classification accuracy of selected features in target variables when compared to more conventional procedures thanks to the ability to combine and analyze large data-sets of voice features. Even if the majority of studies focus on the diagnosis of a disorder where they differentiate between healthy and non-healthy subjects, the investigators believe that the more important task is frequently differential diagnosis between two or more diseases. Even though this is a challenging task, it is of crucial importance to move decision support to this level. The main aim of this research would be the study, development, and validation of ML algorithms to recognize the different BVLVFL from digital voice recordings.

Study Overview

Status

Recruiting

Conditions

Dysphonia

Intervention / Treatment

Diagnostic test: Audio recordings

Detailed Description

The investigators will collect the audio recordings of dysphonic participants affected by BLVF. All voice samples will be divided into the following groups based on the endoscopic diagnosis: vocal fold cysts, Reinke's edema, nodules and polyps. The audio tracks will be obtained by asking to pronounce with usual voice intensity, pitch and quality the word /aiuole/ three times in a row. Voices will be acquired using a Shure model SM48 microphone (Evanston IL) positioned at an angle of 45° at a distance of 20 cm from the patient's mouth. The microphone saturation input will be fixed at 6/9 of CH1 and the environmental noise was <30 dB sound pressure level (SPL). The signals will be recorded in ".nvi" format with a high-definition audio-recorder Computerized Speech Lab, model 4300B, from Kay Elemetrics (Lincoln Park, NJ, USA) with a sampling rate of 50 kHz frequency and converted to ".wav" format. Each audio file will be anonymously labelled with gender and type of BLVF.

Analysis pipeline All the following analyses will be performed using MatLab R2019b, the MathWorks, Natick MA, USA. The analysis pipeline included signal pre-processing, features extraction, screening of the features, and model implementation.

Features extraction On the segmented signal, 66 different features in the time, frequency, and cepstral domain will be extracted. Then, seven statistical measures will be computed on the extracted features, namely: mean, standard deviation, skewness, kurtosis, 25th, 50th, and 75th percentiles. In addition, jitter, shimmer, and tilt of the power spectrum will be obtained from the whole unsegmented signal.

Features screening Features screening will be applied using biostatistical analyses on the whole dataset, to reduce the extended number of features to give as input to the classifier. Two statistical tests will be used to screen relevant features for the classification task: the one-way analysis of variance (ANOVA), when all the groups were normally distributed, and the Kruskal-Wallis test, otherwise. The groups' normality will be verified through the Kolmogorov-Smirnov test. For all the tests, a p-value <0.05 will be considered statistically significant.

A. Model implementation A non-linear Support Vector Machine (SVM) with a Gaussian kernel is the algorithm chosen for this research. The classification performance will be measured through the accuracy and the average F1-score. Both metrics will be provided for the description of the overall classification performances and those obtained on gender sub-groups.

Study Type

Observational

Enrollment (Anticipated)

300

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Name: Maria Raffaella Marchese
Phone Number: 3391144556
Email: raffaellamarchese@gmail.com

Study Locations

Italy
- - Roma, Italy, 00198
    - Recruiting
    - Maria Raffaella Marchese
    - Contact:
      
      Maria Raffaella Marchese
      
      Phone Number: 3391144556
      
      Email: raffaellamarchese@gmail.com

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

18 years to 65 years (Adult, Older Adult)

Accepts Healthy Volunteers

N/A

Genders Eligible for Study

All

Sampling Method

Probability Sample

Study Population

Patients affected by benign lesions of vocal fold

Description

Inclusion criteria:

Reinke's edema
cyst of the vocal fold
nodule of the vocal fold
polyp of the vocal fold

Exclusion criteria:

previous laryngeal or thyroid surgery
previous speech therapy
current pulmonary diseases
current gastroesophageal reflux
laryngeal movement disorder or recurrent laryngeal nerve paralysis
Non-native Italian speakers

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
validation of ML algorithms to recognize the different BVFL Time Frame: five years	The statistical measures computed on the extracted features are the following: mean, standard deviation, skewness, kurtosis, 25th, 50th, and 75th percentiles. In addition, jitter, shimmer, and tilt of the power spectrum will be obtained from the whole unsegmented signal.	five years

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

Fondazione Policlinico Universitario Agostino Gemelli IRCCS

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

November 1, 2021

Primary Completion (Anticipated)

November 1, 2025

Study Completion (Anticipated)

November 1, 2025

Study Registration Dates

First Submitted

February 1, 2023

First Submitted That Met QC Criteria

February 21, 2023

First Posted (Estimate)

March 6, 2023

Study Record Updates

Last Update Posted (Estimate)

March 6, 2023

Last Update Submitted That Met QC Criteria

February 21, 2023

Last Verified

February 1, 2023

More Information

Terms related to this study

Additional Relevant MeSH Terms

Other Study ID Numbers

4519

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Dysphonia

Liaquat University of Medical & Health Sciences
University of Urbino "Carlo Bo"

Not yet recruiting

Assessment of the Efficacy of a Highly Standardized Ginger and Perilla Nutraceutical (Dispepril®) in Improving Gastric and Intestinal Symptoms in Patients With Functional Dyspepsia

Functional Dysphonia

Italy
National Institute of Neurological Disorders and...

Completed

Study of Air Stream in Voice Production

Healthy | Spastic Dysphonia

United States
University of California, San Francisco

Completed

DaxibotulinumtoxinA Injection for Treatment of Adductor Spasmodic Dysphonia

Spasmodic Dysphonia | Adductor Spasmodic Dysphonia | Voice Disorders

United States
Kumamoto University
Kyoto University; Hokkaido University Hospital; Yokohama City University Hospital

Completed

Treatment for Adductor Spasmodic Dysphonia by Type 2 Thyroplasty Using Titanium Bridges

Adductor Spasmodic Dysphonia

Japan
Eastern Virginia Medical School

Recruiting

Reversal of Botulinum Neurotoxin Injection Related Dysphonia With Pyridostigmine

Dysphonia | Laryngeal Dystonia | Dysphonia, Spastic

United States
University of Wisconsin, Madison

Completed

Effects of Vocal Exercises for Spasmodic Dysphonia

Adductor Spasmodic Dysphonia

United States
University of Minnesota

Withdrawn

Laryngeal Vibro-tactile Stimulation as a Non-invasive Symptomatic Treatment for Spasmodic Dysphonia

Adductor Spasmodic Dysphonia | Laryngeal Dystonia | Abductor Spastic Dysphonia

United States
Texas Christian University
National Institute on Deafness and Other Communication Disorders (NIDCD)

Recruiting

Translaryngeal Vibration for Hyperfunctional Voice Disorders

Dysphonia | Muscle Tension Dysphonia

United States
Universidade Federal de Pernambuco

Not yet recruiting

Transcranial Magnetic Stimulation in Laryngeal Dystonia

Spasmodic Dysphonia | Laryngeal Dystonia
Pusan National University Yangsan Hospital

Completed

Artemisia and Green Tea Extracts for Gastric Mucosal Health

Functional Dysphonia

South Korea

Clinical Trials on Audio recordings

University of Washington
National Center for Complementary and Integrative Health (NCCIH)

Completed

Digital Clinical Hypnosis for Chronic Pain Management

Chronic Pain | Chronic Low-back Pain

United States
Memorial Sloan Kettering Cancer Center
National Cancer Institute (NCI); National Institutes of Health (NIH)

Completed

Physician-Patient Communication About Breast Cancer-Related Internet Information and Its Effect on Patient Satisfaction and Anxiety

Breast Cancer

United States
Mayo Clinic

Completed

Remote Mindfulness-Based Physical Activity Intervention for Postmenopausal Women

Postmenopausal

United States
Assistance Publique - Hôpitaux de Paris

Not yet recruiting

Creation of a Shared Medical Decision-making Tool for Atopic Dermatitis (SHADOW)

Atopic Dermatitis

France
St. Olavs Hospital
Oslo University Hospital; University Hospital of North Norway; Norwegian University... and other collaborators

Completed

Acceptance and Commitment Therapy With and Without Enhanced Mindfulness Training for Chronic Pain (MUST)

Chronic Pain

Norway
St. Antonius Hospital
Leiden University

Recruiting

Online Medical Hypnosis Exercises to Improve Mental Health in Primary School Children and Prevent Future Mental Health Issues (Breinbaas)

Healty Volunteers

Netherlands
Dartmouth-Hitchcock Medical Center
National Institutes of Health (NIH); National Institute on Aging (NIA); Vanderbilt... and other collaborators

Recruiting

The Effect of Clinic Visit Audio Recordings for Self-management in Older Adults (REPLAY)

Diabetes

United States
University Health Network, Toronto
University of Toronto; The Salvation Army Toronto Grace Health Centre

Recruiting

CHARM-COPD Program of Care

Chronic Obstructive Pulmonary Disease

Canada
Siriraj Hospital

Not yet recruiting

A Comparison of Hypnosis and Mindfulness on Reducing Pain in Chronic Pain Patients

Hypnosis, Mindfulness Meditation
Centre Hospitalier Universitaire de Pointe-a-Pitre
Groupe Hospitalier Pitie-Salpetriere; University Hospital Center of Martinique

Completed

Pathophysiology of Gait and Posture in Progressive Supranuclear Palsy (Gait-PSP)

Progressive Supranuclear Palsy

Artificial Intelligence and Benign Lesions of Vocal Folds Recognition

Artificial Intelligence for the Recognition of Benign Lesions of Vocal Folds From Audio Recordings

Study Overview

Status

Conditions

Intervention / Treatment

Detailed Description

Study Type

Enrollment (Anticipated)

Contacts and Locations

Study Contact

Study Locations

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Genders Eligible for Study

Sampling Method

Study Population

Description

Study Plan

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Collaborators and Investigators

Sponsor

Study record dates

Study Major Dates

Study Start (Actual)

Primary Completion (Anticipated)

Study Completion (Anticipated)

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Estimate)

Study Record Updates

Last Update Posted (Estimate)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Additional Relevant MeSH Terms

Other Study ID Numbers

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

Clinical Trials on Dysphonia

Clinical Trials on Audio recordings

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Burkina Faso

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations