Identification of Patients Admitted With COPD Exacerbations and Predicting Readmission Risk Using Machine Learning

May 12, 2023 updated by: Robert Wu, University Health Network, Toronto

Identification of Patients Admitted With COPD Exacerbations and Stratification of Those at High Risk of Readmission Using Natural Language Processing and Machine Learning

Patients with Chronic Obstructive Pulmonary Disease (COPD) who are admitted to hospital are at high risk of readmission. While therapies have improved and there are evidence-based guidelines to reduce readmissions, there are significant challenges to implementation, including 1) identifying all patients with COPD early in admission to ensure evidence-based, high-value care is provided and 2) identifying those who are at high risk of readmission in order to target resources effectively.

Using machine learning and natural language processing, we want to develop models to 1) identify all patients with COPD exacerbations admitted to hospital and 2) stratify them to distinguish those who are at high risk of readmission.

From Toronto hospitals, we will develop a very large dataset of patient admissions for all medical conditions, including exacerbations of COPD, from the electronic health record. This data will include both structured data, such as age, gender, medications, laboratory values, and co-morbidities, and unstructured data, such as discharge summaries and physician notes.

Using the dataset, we will train models through natural language processing and machine learning to identify people admitted with a COPD exacerbation and to identify those patients who will be at high risk of readmission within 30 days. We will evaluate the predictive accuracy of these models and then test them at other institutions.

Study Overview

Status

Active, not recruiting

Conditions

Detailed Description

One fifth of patients discharged from hospital for COPD exacerbations are readmitted within 30 days.(1, 3, 4) While therapies and care guidelines have improved, guideline implementation remains poor.(5) Implementing appropriate standards through usual hospital workflow presents significant challenges. One of the top challenges is ensuring all eligible patients with COPD exacerbations are identified in a timely manner.(6) Another is that staff are often too busy to execute evidence-based practices that reduce readmissions.(6) Furthermore, intensive case management cannot be offered to everyone because of limited resources. Therefore, it is important that we are able to identify early both people who are admitted with COPD and those who are at high risk for readmission.

COPD exacerbations may, at times, not be easily recognized at first and take days to become apparent. Symptoms of exacerbations such as shortness of breath are not specific, and signs such as chest radiograph infiltrates can be due to one or more diagnoses. Furthermore, COPD exacerbations can trigger or be triggered by other diseases. As a result, it is not uncommon for admitting physicians to admit patients with multiple provisional diagnoses of heart failure, pneumonia, COPD exacerbation and more. Distinguishing people with COPD exacerbations is further confounded by Electronic Health Records (EHRs) that do not have diagnoses listed as coded elements. The end result is that it is difficult for the rest of the interprofessional team to find COPD patients early in admission. This has been addressed in some U.S. hospitals by having non-health care providers review charts to identify patients admitted with COPD.(7) An alternate approach has been machine learning and natural language processing. This has been implemented with some success for patients with heart failure, but little has been done for people with COPD.(13) In one pilot program, natural language processing helped identify patients admitted with COPD.(7)

To target scarce resources for those who need it most, it would be helpful to further identify patients at high risk of readmission. This would be the first step in determining how to implement effective strategies to reduce readmission rates. There are readmission prediction models developed for medical and surgical patients including the LACE score and the HOSPITAL score.(8, 9) Unfortunately, those that have been studied do not appear to perform well in the COPD population.(10) While factors have been identified that help predict COPD readmission, the models have not been fully validated.(11, 12) The performance could be improved through the use of unstructured data such as clinician progress notes and discharge summaries.

Early identification of people with COPD and knowledge of those who are at risk of readmission can improve health outcomes. Zafar et al. demonstrated that a comprehensive COPD care bundle that consisted of (1) inhaler assessment, (2) appropriate inhaler regimen, (3) early discharge follow-up and (4) patient-centered discharge instructions reduced readmissions.(14) Identification of those at high risk of readmission could facilitate enrollment into intensive case management. Therefore, we will conduct the current study to identify patients admitted with acute exacerbations of COPD and stratify patients according to risk of readmission.

Methods:

Using retrospective data from the University Health Network (UHN), we will create a data set of admissions to General Internal Medicine for the past 5 years. We estimate this will include approximately 40,000 admissions, of which 2,000 will have a most responsible diagnosis of a COPD exacerbation. The data set will contain both structured coded data and unstructured text data. Coded data will include age, gender, medications ordered, co-morbidities, laboratory values, and pulmonary function tests. Unstructured text data will include notes in the EHR: physician clinic notes, discharge summaries, admission diagnoses, progress notes, and notes from our signover system.
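As an illustration only, one admission holding both coded fields and free-text notes could be represented and flattened into a single feature dictionary roughly as follows. This is a minimal sketch, not the study's actual schema: the `Admission` class, its field names, the `to_features` helper, the simple bag-of-words tokenization, and the example values are all assumptions made for the example.

```python
from collections import Counter
from dataclasses import dataclass, field
import re


@dataclass
class Admission:
    """One General Internal Medicine admission (hypothetical schema)."""
    age: int
    gender: str
    medications: list[str] = field(default_factory=list)
    comorbidities: list[str] = field(default_factory=list)
    labs: dict[str, float] = field(default_factory=dict)
    notes: list[str] = field(default_factory=list)  # unstructured text


def to_features(adm: Admission) -> dict[str, float]:
    """Flatten coded fields and note text into one feature dictionary."""
    feats: dict[str, float] = {
        "age": float(adm.age),
        f"gender={adm.gender}": 1.0,
    }
    # Coded categorical data become binary indicator features.
    for med in adm.medications:
        feats[f"med={med.lower()}"] = 1.0
    for com in adm.comorbidities:
        feats[f"comorbidity={com.lower()}"] = 1.0
    # Laboratory values are kept as numeric features.
    for name, value in adm.labs.items():
        feats[f"lab={name}"] = value
    # Bag-of-words counts over all notes (assumed, very simple tokenizer).
    tokens = Counter(
        tok for note in adm.notes for tok in re.findall(r"[a-z]+", note.lower())
    )
    for tok, count in tokens.items():
        feats[f"text={tok}"] = float(count)
    return feats


# Made-up example record, for illustration only.
example = Admission(
    age=71,
    gender="F",
    medications=["Salbutamol"],
    comorbidities=["Heart failure"],
    labs={"eosinophils": 0.2},
    notes=["Increased dyspnea and wheeze.", "Wheeze improving on day 2."],
)
features = to_features(example)
```

Prefixing each key with its source (`med=`, `lab=`, `text=`) keeps coded and text-derived features distinguishable after they are merged.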

Analysis: We will use several different methods to develop the model including logistic regression, deep neural networks, and convolutional neural networks. Specifically, we will also use statistical machine learning algorithms for event detection using bi-directional long short-term memory neural networks across a variety of input types (e.g., Fourier filter banks, Mel-frequency cepstral coefficients, wavelets, and raw audio). We will also use traditional methods such as dynamic Bayes networks and conditional random fields. On the text analytics side, we will identify key phrases that predict readmission. One approach will be to use discourse analysis to single out "nucleus" phrases from background text. We will also build "joint" predictive models that combine features from the unstructured text and features from the structured coded data. We will use the standard area under the ROC curve (AUC) to assess model performance and use cross-validation to minimize the impact of overfitting. Finally, we will validate our models using a dataset from different centres to determine whether these results are valid and generalizable.
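The two evaluation tools named above, the area under the ROC curve and cross-validation, can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions rather than the study's pipeline: the function names `roc_auc` and `k_fold_indices` are hypothetical, the AUC uses the pairwise-comparison definition (which is quadratic in the number of samples and so only suited to small examples), and the folds are unstratified.

```python
import random


def roc_auc(labels: list[int], scores: list[float]) -> float:
    """AUC as the chance a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def k_fold_indices(
    n: int, k: int, seed: int = 0
) -> list[tuple[list[int], list[int]]]:
    """Return k (train, test) index splits; each sample is tested once."""
    order = list(range(n))
    random.Random(seed).shuffle(order)  # fixed seed for reproducibility
    folds = [order[i::k] for i in range(k)]
    return [
        ([j for f, fold in enumerate(folds) if f != i for j in fold], folds[i])
        for i in range(k)
    ]


# Toy scores: three of the four positive/negative pairs are ordered correctly.
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])  # 0.75
splits = k_fold_indices(n=10, k=5)
```

In a cross-validated evaluation, a model would be fitted on each `train` index list and `roc_auc` computed on the matching `test` list, with the per-fold values then averaged. With roughly one COPD exacerbation per twenty admissions in the estimated data set, stratified folds may be preferable so that every fold contains positives.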

Anticipated results: The development of two validated models based on EHR data: one to accurately identify patients with AECOPD and the second to accurately identify patients at high risk of readmission within 30 days.

Study Type

Observational

Enrollment (Actual)

65000

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

Canada
- Ontario
  - Toronto, Ontario, Canada, M5G 2C4
    - University Health Network

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

18 years and older (Adult, Older Adult)

Accepts Healthy Volunteers

Sampling Method

Non-Probability Sample

Study Population

Using retrospective data from the University Health Network (UHN), we will create a data set of admissions to General Internal Medicine for the past 7 years.

Description

Inclusion Criteria:

All admissions to General Internal Medicine between 2012-2018

Exclusion Criteria:

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Identification of COPD exacerbation	To identify COPD exacerbations, we will use the most responsible diagnosis code for that visit.	Within admission
Readmission risk	To identify readmissions, we will include all-cause readmissions within 30 days after an index admission for a COPD exacerbation, similar to previous studies.(3)	30 days
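The readmission outcome described above (any admission, for any cause, within 30 days after an index admission for a COPD exacerbation) could be derived from admission records along these lines. This is a hedged sketch: the `Stay` record, its field names, and `label_30_day_readmissions` are hypothetical, and counting the 30 days from the index discharge date, with same-day admissions included, is an assumption the record does not spell out.

```python
from datetime import date, timedelta
from typing import NamedTuple


class Stay(NamedTuple):
    """One hospital admission (hypothetical record layout)."""
    patient_id: str
    admitted: date
    discharged: date
    is_copd_exacerbation: bool  # e.g. from the most responsible diagnosis code


def label_30_day_readmissions(
    stays: list[Stay], window_days: int = 30
) -> dict[Stay, bool]:
    """Map each index COPD admission to True if any admission follows in time."""
    by_patient: dict[str, list[Stay]] = {}
    for stay in stays:
        by_patient.setdefault(stay.patient_id, []).append(stay)

    labels: dict[Stay, bool] = {}
    for patient_stays in by_patient.values():
        patient_stays.sort(key=lambda s: s.admitted)
        for i, index in enumerate(patient_stays):
            if not index.is_copd_exacerbation:
                continue  # only COPD exacerbations are index admissions
            # Assumption: the window starts at the index discharge date.
            cutoff = index.discharged + timedelta(days=window_days)
            labels[index] = any(
                index.discharged <= later.admitted <= cutoff
                for later in patient_stays[i + 1:]  # all causes count
            )
    return labels


# Made-up admissions, for illustration only.
a = Stay("p1", date(2018, 1, 1), date(2018, 1, 5), True)
b = Stay("p1", date(2018, 1, 20), date(2018, 1, 22), False)
c = Stay("p2", date(2018, 3, 1), date(2018, 3, 4), True)
d = Stay("p2", date(2018, 5, 1), date(2018, 5, 3), True)
labels = label_30_day_readmissions([a, b, c, d])
```

Here `a` is labeled as readmitted because `b` follows within 30 days even though `b` is not a COPD admission, while `c` is not, because `d` falls outside the window.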

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

University Health Network, Toronto

Collaborators

Canadian Institutes of Health Research (CIHR)

Canadian Lung Association

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

June 1, 2019

Primary Completion (Actual)

August 31, 2021

Study Completion (Anticipated)

December 31, 2023

Study Registration Dates

First Submitted

December 6, 2019

First Submitted That Met QC Criteria

December 6, 2019

First Posted (Actual)

December 10, 2019

Study Record Updates

Last Update Posted (Actual)

May 15, 2023

Last Update Submitted That Met QC Criteria

May 12, 2023

Last Verified

May 1, 2023

More Information

Terms related to this study

Additional Relevant MeSH Terms

Other Study ID Numbers

19-5124

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product


