- ICH GCP
- US Clinical Trials Registry
- Clinical Trial NCT04432961
Natural Language Processing (NLP) Analysis of Free Text Notes to Investigate Coronavirus (COVID-19)
A Database and Analytics Study of Free Text Clinical Notes and Structured Data to Investigate Phenotype Associations With Outcomes in Patients With COVID-19
Study Overview
Status
Conditions
Detailed Description
Patients admitted to Cambridge University Hospitals (CUH)with COVID-19 have undergone routine clinical documentation and specific investigation and testing for COVID-19. The pathway for these patients ranges from supportive measures on the ward to deterioration requiring Intensive therapy Unit (ITU) admission and ventilatory support. Patients are also at risk of developing complications such as Acute Kidney Injury and thromboembolism. Identification of the risk factors for these and other outcomes such as the requirement for ventilation remain a challenge and reviewing the clinical data for these patients is critical in the understanding of the relationship between patient characteristics and outcomes.
There is data available in structured fields in the EHR, however, this is sometimes incomplete and inaccurate. An assessment of the free text clinical notes provides an opportunity to fill in the gaps and provide a much richer dataset for evaluation. We plan to use Natural Language Processing (NLP) (a field of machine learning that allows computers to analyse human language) to review Discharge Summaries of patients admitted to hospital with COVID-19 and convert free text data into structured data for analysis.
The NLP techniques developed by Dr Collier's team include methods for coding of free texts to SNOMED CT and other biomedical ontologies. These methods, based on statistical machine learning from human annotated texts, have been benchmarked for scientific texts and social media. In this project we intend to adapt these techniques for patient records. The techniques will require a number of human annotated patient records in order to adapt. The NLP output will be combined with structured data from the EHR and undergo statistical analysis to identify the rates of complications in patients with COVID-19 and risk factors associated with these. This may help to guide management decisions by earlier intervention to prevent poor outcomes in these patients.
Study Type
Enrollment (Actual)
Contacts and Locations
Study Locations
-
-
-
Cambridge, United Kingdom
- Cambridge University NHS Foundation Trust
-
-
Participation Criteria
Eligibility Criteria
Ages Eligible for Study
Accepts Healthy Volunteers
Genders Eligible for Study
Sampling Method
Study Population
Description
Inclusion Criteria:
- Male and female
- Age range: 18 to 100 years
- Patients admitted to Cambridge University Hospitals with confirmed COVID-19 on lab testing
Exclusion Criteria:
Children and patients with a negative COVID test.
Study Plan
How is the study designed?
Design Details
What is the study measuring?
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
---|---|---|
research database of EHR records from COVID-19 patients processed using NLP tools for named entity recognition and linking adapted to CUH EMR data to identify variables of interest
Time Frame: 1 year
|
Our overarching hypothesis is that the NLP-extracted data from the free-text discharge summary can be combined with structured data from the EMR to yield insights into the development of complications.
Patient with severe disease requiring ITU admission and non severe disease managed on an inpatient ward will be included.
The variables of interest will include patient characteristics and specific encounter related information including length of stay and baseline investigations (e.g., blood tests) and interventions received
|
1 year
|
Secondary Outcome Measures
Outcome Measure |
Time Frame |
---|---|
A set of annotation guidelines to produce human-expert (gold) labelled data for a subset of the EHR
Time Frame: 6 months
|
6 months
|
A comparison of the NLP output to terms in the structured problem list to identify missing terms in the structured problem list
Time Frame: 1 year
|
1 year
|
Collaborators and Investigators
Collaborators
Study record dates
Study Major Dates
Study Start (Actual)
Primary Completion (Actual)
Study Completion (Actual)
Study Registration Dates
First Submitted
First Submitted That Met QC Criteria
First Posted (Actual)
Study Record Updates
Last Update Posted (Actual)
Last Update Submitted That Met QC Criteria
Last Verified
More Information
Terms related to this study
Keywords
Additional Relevant MeSH Terms
Other Study ID Numbers
- A095625
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on COVID-19
-
University of Roma La SapienzaQueen Mary University of London; Università degli studi di Roma Foro Italico; Bios Prevention SrlCompletedPost Acute Sequelae of COVID-19 | Post COVID-19 Condition | Long-COVID | Chronic COVID-19 SyndromeItaly
-
Yang I. PachankisActive, not recruitingCOVID-19 Respiratory Infection | COVID-19 Stress Syndrome | COVID-19 Vaccine Adverse Reaction | COVID-19-Associated Thromboembolism | COVID-19 Post-Intensive Care Syndrome | COVID-19-Associated StrokeChina
-
Massachusetts General HospitalRecruitingPost Acute COVID-19 Syndrome | Long COVID | Post Acute Sequelae of COVID-19 | Long COVID-19United States
-
Indonesia UniversityRecruitingPost-COVID-19 Syndrome | Long COVID | Post COVID-19 Condition | Post-COVID Syndrome | Long COVID-19Indonesia
-
Erasmus Medical CenterDa Vinci Clinic; HGC RijswijkNot yet recruitingPost-COVID-19 Syndrome | Long COVID | Long Covid19 | Post COVID-19 Condition | Post-COVID Syndrome | Post COVID-19 Condition, Unspecified | Post-COVID ConditionNetherlands
-
Dr. Soetomo General HospitalIndonesia-MoH; Universitas Airlangga; Biotis Pharmaceuticals, IndonesiaRecruitingCOVID-19 Pandemic | COVID-19 Vaccines | COVID-19 Virus DiseaseIndonesia
-
University of Witten/HerdeckeInstitut für Rehabilitationsforschung NorderneyCompletedPost-COVID-19 Syndrome | Long-COVID-19 SyndromeGermany
-
Jonathann Kuo, MDActive, not recruitingSARS-CoV2 Infection | Post-COVID-19 Syndrome | Dysautonomia | Post Acute COVID-19 Syndrome | Long COVID | Long Covid19 | COVID-19 Recurrent | Post-Acute COVID-19 | Post-Acute COVID-19 Infection | Post Acute Sequelae of COVID-19 | Dysautonomia Like Disorder | Dysautonomia Orthostatic Hypotension Syndrome | Post... and other conditionsUnited States
-
University Hospital, Ioannina1st Division of Internal Medicine, University Hospital of IoanninaRecruitingCOVID-19 Pneumonia | COVID-19 Respiratory Infection | COVID-19 Pandemic | COVID-19 Acute Respiratory Distress Syndrome | COVID-19-Associated Pneumonia | COVID 19 Associated Coagulopathy | COVID-19 (Coronavirus Disease 2019) | COVID-19-Associated ThromboembolismGreece