Performance of an OCR-Prompt-LLM Integrated Workflow for Extracting Multi-dimensional Clinical Data in Ischemic Heart Disease (OPAL-CAD)
Study Overview
Status
Status
Conditions
Conditions
Intervention / Treatment
Intervention / Treatment
Study Type
Study Type
Enrollment (Actual)
Enrollment
Contacts and Locations
Study Locations
-
-
-
Beijing, China, 100037
- Fuwai Hospital
-
-
Participation Criteria
Eligibility Criteria
Eligibility Criteria
Ages Eligible for Study
- Adult
- Older Adult
Accepts Healthy Volunteers
Sampling Method
Study Population
Description
Inclusion Criteria:
- Patients aged 18 years and older.
- Clinical records of patients who were previously enrolled in the AIM-CHD (for the pilot/prompt optimization set) or SMART-CHD (for the internal validation cohort) studies.
- Patients diagnosed with, or suspected of having, coronary artery disease (CAD), including subtypes: Unstable Angina (UA), STEMI, NSTEMI, and Chronic Coronary Syndrome (CCS).
Exclusion Criteria:
- Clinical records with severe data fragmentation or missing more than 50% of the key clinical indicators.
- Handwritten medical records or low-quality scans that are illegible for Optical Character Recognition (OCR) processing.
- Duplicate records or records with conflicting "Gold Standard" labels that cannot be reconciled by the expert committee.
Study Plan
How is the study designed?
Design Details
Number of groups / cohorts
Cohorts and Interventions
Group / CohortGroup / Cohort |
Intervention / TreatmentIntervention / Treatment |
|---|---|
|
Test Cohort
This group consists of 50 patient records from the AIM-CHD Study at Fuwai Hospital.
These data are specifically utilized for refining OCR processing and optimizing Prompt Engineering for the LLM-based workflow.
|
The intervention is an automated clinical data management system integrating Optical Character Recognition (OCR), optimized Prompt Engineering, and Large Language Models (LLMs).
The workflow processes unstructured inpatient records to extract 10 key clinical indicators (e.g., LVEF, CAD subtypes, medications) and classifies the patient into specific coronary artery disease categories (UA, STEMI, NSTEMI, CCS)
Standard manual process where experienced clinical physicians collect and interpret patient information from medical records.
This serves as the human benchmark for comparing diagnostic accuracy and operational efficiency.
|
|
Internal Validation Cohort
This cohort includes 188 clinical cases sourced from the SMART-CHD Study at Fuwai Hospital.
These records serve as the primary internal benchmark to evaluate the diagnostic and extraction accuracy of the LLM workflow against the established ground truth.
|
The intervention is an automated clinical data management system integrating Optical Character Recognition (OCR), optimized Prompt Engineering, and Large Language Models (LLMs).
The workflow processes unstructured inpatient records to extract 10 key clinical indicators (e.g., LVEF, CAD subtypes, medications) and classifies the patient into specific coronary artery disease categories (UA, STEMI, NSTEMI, CCS)
Standard manual process where experienced clinical physicians collect and interpret patient information from medical records.
This serves as the human benchmark for comparing diagnostic accuracy and operational efficiency.
|
|
External Validation Cohort
This cohort comprises 70 patient records collected from 8 independent sub-centers (excluding Fuwai Hospital) to assess the generalizability and robustness of the model across diverse clinical environments and different medical record formats.
|
The intervention is an automated clinical data management system integrating Optical Character Recognition (OCR), optimized Prompt Engineering, and Large Language Models (LLMs).
The workflow processes unstructured inpatient records to extract 10 key clinical indicators (e.g., LVEF, CAD subtypes, medications) and classifies the patient into specific coronary artery disease categories (UA, STEMI, NSTEMI, CCS)
Standard manual process where experienced clinical physicians collect and interpret patient information from medical records.
This serves as the human benchmark for comparing diagnostic accuracy and operational efficiency.
|
What is the study measuring?
Primary Outcome Measures
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Overall Diagnostic and Extraction Accuracy Rate
Time Frame: Through study completion, an average of 3 months.
|
To calculate the overall accuracy rate of the LLM-based workflow across 308 cases (including the pilot set, internal validation cohort, and external validation cohort) for 10 clinical indicators (e.g., LVEF, blood glucose, etc.) and 4 diagnostic subtypes of coronary artery disease.
Accuracy is defined as the proportion of cases where the LLM's extraction or diagnostic results are perfectly consistent with the 'Gold Standard' established by human clinical experts.
|
Through study completion, an average of 3 months.
|
Collaborators and Investigators
Sponsor
Sponsor
Study record dates
Study Major Dates
Study Start (Actual)
Study Start
Primary Completion (Actual)
Primary Completion
Study Completion (Actual)
Study Completion
Study Registration Dates
First Submitted
First Submitted
First Submitted That Met QC Criteria
First Submitted That Met QC Criteria
First Posted (Actual)
First Posted
Study Record Updates
Last Update Posted (Actual)
Last Update Posted
Last Update Submitted That Met QC Criteria
Last Update Submitted That Met QC Criteria
Last Verified
Last Verified
More Information
Terms related to this study
Additional Relevant MeSH Terms
Other Study ID Numbers
Other Study ID Numbers
- CAD-LLM-2025-01
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
IPD Plan Description
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Coronary Artery Disease
-
NCT07163858RecruitingCoronary Artery Bypass | Coronary Artery Disease(CAD) | Off Pump Coronary Artery Bypass Surgery | Hemodynamic Optimization | Hemodynamic Management | Off Pump Coronary Artery Bypass Graft | Coronary Artery Disease With Need for Bypass Surgery | Noradrenaline
-
NCT07388030Not yet recruitingCoronary Artery Disease | Coronary Artery Calcification | Severe Coronary Artery Disease
-
NCT07172308CompletedCoronary Artery Disease (CAD) | Atherosclerosis of Coronary Artery
-
NCT07491107Not yet recruitingCoronary Artery Disease (CAD) | Multivessel Coronary Artery Disease | Complex Coronary Lesions | Calcific Coronary Arteriosclerosis | Small Vessel Ischemic Disease | Stenosis Coronary
-
NCT07596706Active, not recruitingCoronary Artery Disease (CAD) | Coronary Bifurcation Lesion | Left Main Coronary Artery Stenosis
-
NCT07354399RecruitingCoronary Artery Disease With Myocardial Infarction
-
NCT07357675Not yet recruitingCoronary Artery Disease (CAD) | Coronary Artery Bypass Graft Surgery(CABG)
-
NCT03767621Active, not recruitingCoronary Artery Disease | Left Main Coronary Artery Disease | Left Main Coronary Artery Stenosis | Restenosis, Coronary
-
NCT05464147Active, not recruitingCoronary Artery Disease | Chronic Total Occlusion of Coronary Artery | Multi Vessel Coronary Artery Disease | Bifurcation of Coronary Artery | Long Lesions Coronary Artery Disease
-
NCT07407738Not yet recruitingCalcified Coronary Artery Disease | Coronary Arterial Disease
Clinical Trials on OCR-Prompt-LLM Information Extraction Workflow
-
NCT07449429Not yet recruitingAcute Coronary Syndromes | ST-segment Elevation Myocardial Infarction (STEMI) | Coronary Artery Disease (CAD) (E.G., Angina, Myocardial Infarction, and Atherosclerotic Heart Disease (ASHD)) | Non-ST-Segment Elevation Myocardial Infarction (NSTEMI)