- ICH GCP
- US Clinical Trials Registry
- Clinical Trial NCT07414966
Scalable Clinical Oversight of Large Language Models Via Uncertainty Triangulation (SCOUT)
Prospective Evaluation of a Model-Agnostic Meta-Verification Framework (SCOUT) for Scalable Clinical Oversight of Large Language Model Outputs in Coronary Heart Disease Diagnosis: A Multi-Reader, Randomized, Crossover Trial
Study Overview
Status
Conditions
Intervention / Treatment
Detailed Description
Background: Large language models are increasingly deployed in clinical workflows, yet requiring clinician review of every AI output negates the efficiency gains that motivate their adoption. SCOUT addresses this efficiency-safety paradox through algorithmic meta-verification.
The SCOUT framework triangulates three orthogonal external signals to determine case-level uncertainty: (1) Model Heterogeneity - whether a structurally different auxiliary LLM agrees with the primary model; (2) Stochastic Inconsistency - whether repeated sampling from the same model yields divergent outputs; (3) Reasoning Critique - whether an external checker model identifies logical flaws in the chain-of-thought reasoning.
In this crossover trial, 7 clinicians of varying seniority (2 junior residents, 3 senior residents, 2 attending physicians) each review all 110 cases under both standard manual review and SCOUT-assisted review workflows. The study evaluates workflow efficiency (primary endpoint) and diagnostic accuracy (secondary endpoint).
Study Type
Enrollment (Estimated)
Phase
- Not Applicable
Contacts and Locations
Study Contact
- Name: Xiaojin Gao, Dr.
- Phone Number: +86 010 88322415
- Email: sophie_gao@sina.com
Participation Criteria
Eligibility Criteria
Ages Eligible for Study
- Adult
- Older Adult
Accepts Healthy Volunteers
Description
Inclusion Criteria:
- Board-certified or in-training cardiologists at Fuwai Hospital
- Spanning three experience strata: junior residents, senior residents, attending physicians
Exclusion Criteria:
- Clinicians involved in the development or optimization of the SCOUT framework
- Clinicians involved in the gold-standard adjudication process
Study Plan
How is the study designed?
Design Details
- Primary Purpose: Diagnostic
- Allocation: Randomized
- Interventional Model: Crossover Assignment
- Masking: None (Open Label)
Arms and Interventions
Participant Group / Arm |
Intervention / Treatment |
|---|---|
|
Active Comparator: Control (Standard Manual Review)
Physicians manually review all cases in the control set (n=54) with access to AI predictions and reasoning.
No selective deferral.
|
Physicians perform a full manual review of 54 cases using raw medical records with access to the AI model's predictions and reasoning, but without SCOUT uncertainty stratification or selective deferral.
|
|
Experimental: Experimental (SCOUT-Assisted Review)
Physicians process the intervention set (n=56) through the SCOUT framework.
Low-uncertainty cases are auto-accepted; high-uncertainty cases undergo physician review with full audit trail.
|
SCOUT-Assisted Review (Intervention Arm): Physicians review 56 cases processed through the SCOUT framework.
For cases classified as low-uncertainty (D(x)=0), the AI prediction is auto-accepted without physician review.
For high-uncertainty cases (D(x)=1), the physician reviews the case with access to the main model's chain-of-thought reasoning and the meta-verification audit results.
The main model is DeepSeek-V3.1 with chain-of-thought prompting.
|
What is the study measuring?
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Mean physician review time per case (minutes)
Time Frame: Through study completion, an average of 2 hours.
|
Mean time spent by each clinician reviewing and rendering a diagnostic decision per case under each arm.
Measured in minutes.
|
Through study completion, an average of 2 hours.
|
Secondary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Diagnostic accuracy (%)
Time Frame: Through study completion, an average of 2 hours.
|
Proportion of correct CHD subtype classifications (STEMI, NSTEMI, unstable angina, chronic coronary syndromes) under each arm.
|
Through study completion, an average of 2 hours.
|
|
Computational Return on Investment (ROI)
Time Frame: Through study completion, an average of 2 hours.
|
Ratio of physician time savings (valued at standardized minute-wages from Sanming healthcare reform benchmarks) to computational cost of SCOUT inference, stratified by clinician seniority level.
|
Through study completion, an average of 2 hours.
|
Collaborators and Investigators
Study record dates
Study Major Dates
Study Start (Estimated)
Primary Completion (Estimated)
Study Completion (Estimated)
Study Registration Dates
First Submitted
First Submitted That Met QC Criteria
First Posted (Actual)
Study Record Updates
Last Update Posted (Actual)
Last Update Submitted That Met QC Criteria
Last Verified
More Information
Terms related to this study
Keywords
Additional Relevant MeSH Terms
Other Study ID Numbers
- 2025-2702-1
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
IPD Plan Description
IPD Sharing Time Frame
IPD Sharing Access Criteria
IPD Sharing Supporting Information Type
- STUDY_PROTOCOL
- SAP
- ICF
- ANALYTIC_CODE
- CSR
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Coronary Heart Disease (CHD)
-
Xinjiang Medical UniversityNot yet recruiting
-
Chinese Academy of Medical Sciences, Fuwai HospitalNot yet recruitingProteomic and Inflammatory Omics Changes With Colchicine Therapy in Coronary Heart Disease (PIC-CHD)Coronary Heart Disease (CHD)
-
Gan LijunRecruiting
-
Chinese University of Hong KongNot yet recruiting
-
Merck Sharp & Dohme LLCCompletedCoronary Heart Disease (CHD) | CHD Risk-Equivalent Disease
-
China-Japan Friendship HospitalPeking Union Medical CollegeRecruitingCoronary Heart Disease (CHD)China
-
900th Hospital of PLA Joint Logistic Support ForceCompletedPCI | Coronary Heart Disease (CHD)China
-
China National Center for Cardiovascular DiseasesNot yet recruitingIschemic Mitral Regurgitation | Coronary Heart Disease (CHD)
-
Shenyang Medical CollegeThe Second Hospital of Shenyang Medical CollegeRecruitingAnxiety Disorders | Coronary Heart Disease (CHD)China
-
Shenyang Medical CollegeThe Second Hospital of Shenyang Medical CollegeRecruitingAnxiety Disorders | Coronary Heart Disease (CHD)China
Clinical Trials on Standard Manual Review Workflow
-
China National Center for Cardiovascular DiseasesCompletedCoronary Artery Disease | Data Collection | Artificial Intelligence (AI)China
-
China National Center for Cardiovascular DiseasesNot yet recruitingAcute Coronary Syndromes | ST-segment Elevation Myocardial Infarction (STEMI) | Coronary Artery Disease (CAD) (E.G., Angina, Myocardial Infarction, and Atherosclerotic Heart Disease (ASHD)) | Non-ST-Segment Elevation Myocardial Infarction (NSTEMI)
-
Hospital El SalvadorNot yet recruitingArtificial Intelligence (AI) in DiagnosisEl Salvador
-
Can Tho Stroke International Services HospitalSiemens Healthineers AGEnrolling by invitationAcute Ischemic StrokeVietnam
-
M.D. Anderson Cancer CenterNational Cancer Institute (NCI)CompletedHematopoietic and Lymphoid Cell Neoplasm | Malignant Solid NeoplasmUnited States
-
Bispebjerg HospitalRecruiting
-
Chinese University of Hong KongRecruitingProstate CancerHong Kong
-
Sebastian KochWithdrawn
-
Conceivable Life SciencesCompleted
-
Göran PeterssonThe Kamprad Family Foundation for Entrepreneurship, Research & CharityCompletedCoronary Heart DiseaseSweden