Human Algorithm Interactions for Acute Respiratory Failure Diagnosis

October 17, 2023 updated by: Michael Sjoding, University of Michigan

Measuring the Impact of AI in the Diagnosis of Hospitalized Patients: A Randomized Survey Vignette Multicenter Study

Artificial intelligence (AI) shows promising in identifying abnormalities in clinical images. However, systematically biased AI models, where a model makes inaccurate predictions for entire subpopulations, can lead to errors and potential harms. When shown incorrect predictions from an AI model, clinician diagnostic accuracy can be harmed. This study aims to study the effectiveness of providing clinicians with image-based AI model explanations when provided AI model predictions to help clinicians better understand the logic of an AI model's prediction. It will evaluate whether providing clinicians with AI model explanations can improve diagnostic accuracy and help clinicians catch when models are making incorrect decisions. As a test case, the study will focus on the diagnosis of acute respiratory failure because determining the underlying causes of acute respiratory failure is critically important for guiding treatment decisions but can be clinically challenging.

To determine if providing AI explanations can improve clinician diagnostic accuracy and alleviate the potential impact of showing clinicians a systematically biased AI model, a randomized clinical vignette survey study will be conducted. During the survey, study participants will be shown clinical vignettes of patients hospitalized with acute respiratory failure, including the patient's presenting symptoms, physical exam, laboratory results, and chest X-ray. Study participants will then be asked to assess the likelihood that heart failure, pneumonia and/or Chronic Obstructive Pulmonary Disease (COPD) is the underlying diagnosis. During specific vignettes in the survey, participants will also be shown standard or systematically biased AI models that provide an estimate the likelihood that heart failure, pneumonia and/or COPD is the underlying diagnosis. Clinicians will be randomized see AI predictions alone or AI predictions with explanations when shown AI models. This survey design will allow for testing the hypothesis that systematically biased models would harm clinician diagnostic accuracy, but commonly used image-based explanations would help clinicians partially recover their performance.

Study Overview

Status

Completed

Conditions

Acute Respiratory Failure

Intervention / Treatment

Study Type

Interventional

Enrollment (Actual)

457

Phase

Not Applicable

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

United States
- Michigan
  - Ann Arbor, Michigan, United States, 48103
    - University of Michigan

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Adult
Older Adult

Accepts Healthy Volunteers

Description

Inclusion Criteria:

Physicians, nurse practitioners, and physician assistants that care for patients with acute respiratory failure as part of their clinical practice

Exclusion Criteria:

Physicians, nurse practitioners, and physician assistants that only provide patient care in outpatient settings

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Primary Purpose: Other
Allocation: Randomized
Interventional Model: Parallel Assignment
Masking: Single

Number of Arms

Arms and Interventions

Participant Group / Arm	Intervention / Treatment
Experimental: AI model biased for heart failure, no AI explanation Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.	Other: Artificial Intelligence model predictions without explanation During 6 clinical vignettes, participants will see AI model predictions without a corresponding AI explanation. The AI model will provide a score for each diagnosis (heart failure, pneumonia, COPD) on a scale of 0-100 estimating how likely the patient's presentation was due to each of these diagnoses. In 3 of the clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions, with the model specifically biased against one of the three diagnoses. Other: AI model biased against heart failure In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against heart failure, always predicting that heart failure is present with high likelihood in survey vignette patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses (pneumonia, COPD).
Experimental: AI model biased for pneumonia, no AI explanation Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.	Other: Artificial Intelligence model predictions without explanation During 6 clinical vignettes, participants will see AI model predictions without a corresponding AI explanation. The AI model will provide a score for each diagnosis (heart failure, pneumonia, COPD) on a scale of 0-100 estimating how likely the patient's presentation was due to each of these diagnoses. In 3 of the clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions, with the model specifically biased against one of the three diagnoses. Other: AI model biased against pneumonia In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against pneumonia, always predicting that pneumonia is present with high likelihood in survey vignette patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses (heart failure, COPD).
Experimental: AI model biased for COPD, no AI explanation Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.	Other: Artificial Intelligence model predictions without explanation During 6 clinical vignettes, participants will see AI model predictions without a corresponding AI explanation. The AI model will provide a score for each diagnosis (heart failure, pneumonia, COPD) on a scale of 0-100 estimating how likely the patient's presentation was due to each of these diagnoses. In 3 of the clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions, with the model specifically biased against one of the three diagnoses. Other: AI model biased against COPD In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against COPD, always predicting that COPD is present with high likelihood in survey vignette patients where a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses (heart failure, pneumonia).
Experimental: AI model biased for heart failure, Image-based AI explanation presented Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.	Other: AI model biased against heart failure In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against heart failure, always predicting that heart failure is present with high likelihood in survey vignette patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses (pneumonia, COPD). Other: Artificial intelligence model predictions with explanation During 6 clinical vignettes, participants will see AI model predictions with explanation. The AI model will provide a score for each diagnosis on a scale of 0-100. In 3 clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions with the model specifically biased against one of the three diagnoses. If the AI model provides a score above 50 an AI model explanation will be shown as gradient-weighted class activation mapping (Grad-CAM) heatmaps overlaid on the chest X-ray that highlighted which regions of the image most affecting the AI model's prediction.
Experimental: AI model biased for pneumonia, Image-based AI explanation presented Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.	Other: AI model biased against pneumonia In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against pneumonia, always predicting that pneumonia is present with high likelihood in survey vignette patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses (heart failure, COPD). Other: Artificial intelligence model predictions with explanation During 6 clinical vignettes, participants will see AI model predictions with explanation. The AI model will provide a score for each diagnosis on a scale of 0-100. In 3 clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions with the model specifically biased against one of the three diagnoses. If the AI model provides a score above 50 an AI model explanation will be shown as gradient-weighted class activation mapping (Grad-CAM) heatmaps overlaid on the chest X-ray that highlighted which regions of the image most affecting the AI model's prediction.
Experimental: AI model biased for COPD, Image-based AI explanation presented Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.	Other: AI model biased against COPD In 3 clinical vignettes, participants will be shown systematically biased AI model predictions with the model specifically biased against COPD, always predicting that COPD is present with high likelihood in survey vignette patients where a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses (heart failure, pneumonia). Other: Artificial intelligence model predictions with explanation During 6 clinical vignettes, participants will see AI model predictions with explanation. The AI model will provide a score for each diagnosis on a scale of 0-100. In 3 clinical vignettes, participants will be shown standard AI model predictions and 3 vignettes they will be shown systematically biased AI model predictions with the model specifically biased against one of the three diagnoses. If the AI model provides a score above 50 an AI model explanation will be shown as gradient-weighted class activation mapping (Grad-CAM) heatmaps overlaid on the chest X-ray that highlighted which regions of the image most affecting the AI model's prediction.

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Participant diagnostic accuracy across clinical vignette settings Time Frame: Day 0	Diagnostic accuracy is defined as the number of correct diagnostic assessments over the total number of diagnostic assessments. After reviewing each individual patient clinical vignette within the survey, participants will be asked to make three separate diagnostic assessments for each clinical vignette, one for heart failure, pneumonia, and COPD. If the participant's assessment agrees with the reference label for each vignette, the diagnostic assessment is considered correct. Diagnostic assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant diagnostic accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).	Day 0

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Treatment Selection Accuracy across clinical vignette settings Time Frame: Day 0	Treatment selection accuracy is defined as whether the participant choose the correct treatment for the patient in the clinical vignette, and could choose any combination of steroids, antibiotics, Intravenous (IV) diuretics, or none of these treatments for the patient. Treatment selection assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant treatment selection accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).	Day 0
Diagnosis specific diagnostic accuracy across clinical vignette settings Time Frame: Day 0	Diagnostic accuracy specific to heart failure, pneumonia, and COPD across vignette settings	Day 0

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

University of Michigan

Collaborators

National Heart, Lung, and Blood Institute (NHLBI)

Investigators

Principal Investigator: Michael Sjoding, MD, University of Michigan

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

April 1, 2022

Primary Completion (Actual)

January 31, 2023

Study Completion (Actual)

January 31, 2023

Study Registration Dates

First Submitted

October 17, 2023

First Submitted That Met QC Criteria

October 17, 2023

First Posted (Actual)

October 25, 2023

Study Record Updates

Last Update Posted (Actual)

October 25, 2023

Last Update Submitted That Met QC Criteria

October 17, 2023

Last Verified

October 1, 2023

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

HUM00180745
R01HL158626 (U.S. NIH Grant/Contract)

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

YES

IPD Plan Description

Data could be made available to other researchers from accredited research institutions after entering into a data use agreement with the University of Michigan

IPD Sharing Time Frame

Data will be shared indefinitely once the study is published

IPD Sharing Access Criteria

This information will be published as supplements with the study manuscript.

IPD Sharing Supporting Information Type

STUDY_PROTOCOL
SAP

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Acute Respiratory Failure

Efficacy Care R&D Ltd
Memorial Hermann Hospital; CRG Medical, Inc.

Unknown

Postoperative Respiratory Abnormalities (AI-ARF)

Shock | Shock, Septic | Respiratory Failure | Respiratory Distress Syndrome | Shock, Cardiogenic | Acute Cardiac Failure | Acute Respiratory Failure | Acute Kidney Failure | Multi Organ Failure | Respiratory Arrest | Acute Respiratory Failure With Hypoxia | Acute Respiratory Failure Requiring Reintubation | Acute... and other conditions

United States
Hospices Civils de Lyon

Recruiting

Evaluation of Perfusion and Ventilation Distribution in Children: a Pilot Study. (PELUCHE 2)

Acute Respiratory Failure

France
Assistance Publique - Hôpitaux de Paris

Recruiting

Brain-lung Interaction During Acute Respiratory Failure (BrainFlow)

Acute Respiratory Failure

France
Laval University

Recruiting

Comparison of Two SpO2 Targets With Two Different Oximeters - Impact on FiO2 During Nasal High Flow Oxygen Therapy

Acute Respiratory Failure | Hypoxemic Acute Respiratory Failure | High Flow Oxygen Therapy | Oxygen Delivery

Canada
University of Milan

Recruiting

Effect of High Flow Nasal Therapy (HFNC) Weaning Protocols vs Standard of Care on Respiratory Outcomes in Patients With Acute Respiratory Failure (WHIP)

Acute Respiratory Failure (ARF)

Italy
Peking Union Medical College Hospital

Recruiting

Lung Recruitment Assessed by 2D/3D-EIT and R/I Ratio in Acute Respiratory Failure Patients With Mechanical Ventilation

Acute Respiratory Failure (ARF)

China
Fondazione IRCCS Policlinico San Matteo di Pavia

Recruiting

Bed Side Assessment in Patients With Acute Respiratory Failure Under Invasive Mechanical Ventilation (BAM)

Acute Respiratory Failure (ARF)

Italy
Southeast University, China

Not yet recruiting

The Effects of Different Non-invasive Respiratory Support

Acute Respiratory Failure (ARF)
Dr. Behcet Uz Children's Hospital

Completed

Comparing Closed-loop FiO2 Controller With Conventional Control of FiO2

Acute Respiratory Failure | Acute Hypoxemic Respiratory Failure | Acute Hypoxemic and Hypercapnic Respiratory Failure

Turkey
Muhammed Ata Nur GEÇER
Bakirkoy Dr. Sadi Konuk Research and Training Hospital

Not yet recruiting

The Effect of Using Distractor Cards and Thermochromic Cards on Pain, Fear, Anxiety and Procedure Time in Children Receiving Inhaler Therapy (CALM-INH)

Pediatric | Pediatric Acute Upper Respiratory Tract Infection | Pediatric Acute Respiratory Failure

Clinical Trials on Artificial Intelligence model predictions without explanation

Cairo University
Future University in Egypt

Recruiting

Evaluation of Artificial Intelligence Models for Diagnosis of Anterior Open Bite Malocclusion

Artificial Intelligence | Open Bite

Egypt
Zhejiang Provincial People's Hospital
The Affiliated Hospital of Qingdao University; Women's Hospital School Of Medicine... and other collaborators

Not yet recruiting

Research and Application of Ultrasonic Intelligent Diagnosis System for Ovarian Mass

Ovarian Neoplasms | Adnexal Mass
Huashan Hospital

Recruiting

Radiomic and Pathomic Study of Pituitary Adenoma Using Machine Learning

Pituitary Neoplasms

China
Fujian Maternity and Child Health Hospital

Completed

Model Study on Cervical Cancer Screening Strategies and Risk Prediction

Artificial Intelligence | Cervical Cancer Screening | Machine Learning | Risk Assessment

China
Fondazione IRCCS Istituto Nazionale dei Tumori,...
University of Milano Bicocca

Active, not recruiting

AI for Lung Cancer Risk Definition in Computed Tomography Screening Programs

Lung Cancer | Blood Biomarkers

Italy
Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

Recruiting

Development and Prospective Validation of a Digital Pathology-based Artificial Intelligence Diagnostic Model for Pan-cancer Lymphatic Metastasis

Cancer | Lymphatic Metastasis

China
Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

Completed

Prospective Validation of Pathology-based Artificial Intelligence Diagnostic Model for Lymph Node Metastasis in Prostate Cancer

Prostatic Neoplasms | Lymphatic Metastasis

China
Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

Completed

Development and Validation of a Multimodal Fusion Artificial Intelligence Model for Predicting the Efficacy of Neoadjuvant Treatment of Bladder Cancer

Cancer | Neoadjuvant Therapy

China
Changhai Hospital
The First Affiliated Hospital with Nanjing Medical University; Shengjing Hospital and other collaborators

Recruiting

AI-Powered Precision Decision-Making for Pancreatic Diseases

Pancreatic Cancer | Chronic Pancreatitis | Pancreatic Neuroendocrine Tumor | Pancreatic Cystic Lesions | Diagnose Disease | IPMN, Pancreatic | Acute Pancreatitis (AP)

China
Sun Yat-Sen Memorial Hospital of Sun Yat-Sen University

Not yet recruiting

Development and Prospective Validation of a Pathology-Based Artificial Intelligence Model for Predicting the Time to Castration Resistance of Prostate Cancer

Prostatic Neoplasms, Castration-Resistant

China

Human Algorithm Interactions for Acute Respiratory Failure Diagnosis

Measuring the Impact of AI in the Diagnosis of Hospitalized Patients: A Randomized Survey Vignette Multicenter Study

Study Overview

Status

Conditions

Intervention / Treatment

Study Type

Enrollment (Actual)

Phase

Contacts and Locations

Study Locations

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Description

Study Plan

How is the study designed?

Design Details

Number of Arms

Arms and Interventions

Participant Group / Arm

Intervention / Treatment

What is the study measuring?

Primary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Secondary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Collaborators and Investigators

Sponsor

Collaborators

Investigators

Study record dates

Study Major Dates

Study Start (Actual)

Primary Completion (Actual)

Study Completion (Actual)

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Actual)

Study Record Updates

Last Update Posted (Actual)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

IPD Plan Description

IPD Sharing Time Frame

IPD Sharing Access Criteria

IPD Sharing Supporting Information Type

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

Clinical Trials on Acute Respiratory Failure

Clinical Trials on Artificial Intelligence model predictions without explanation

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Uzbekistan

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations