Head-to-Head Evaluation of the Cancer Ontology Supervised Multimodal Orchestration (COSMO) AI System Versus Pathologist-Only Review (COSMO)
This study evaluates the diagnostic performance of the Cancer Ontology Supervised Multimodal Orchestration (COSMO) AI system for cancer subtype classification and compares it head-to-head with pathologist-only review. Pathologists will independently review de-identified whole-slide images derived from up to 300 patients across three anatomical sites (brain, lung, kidney) and provide diagnostic assessments. In parallel, COSMO will process the same cases offline to generate independent predictions, enabling direct comparison of diagnostic accuracy between human experts and the AI system.
The study will characterize the diagnostic accuracy of COSMO and pathologists, inter-observer agreement, and variations in performance across anatomical sites and cancer types with different incidence rates. Results will establish how COSMO compares to pathologists on identical cases and will inform the development of AI-assisted diagnostic systems in clinical practice.
Study Overview
Status
Status
Conditions
Conditions
Intervention / Treatment
Intervention / Treatment
Detailed Description
Study Rationale and Background Diagnostic accuracy in cancer subtype classification varies significantly among pathologists due to differences in expertise, experience, and access to diagnostic resources. The emergence of AI systems in pathology offers the potential to enhance diagnostic performance and consistency in cancer classification. However, direct empirical comparisons of AI-based predictions and pathologists' diagnostic performance on identical cases remain limited in the literature.
Study Aims This head-to-head comparative study aims to: (1) evaluate the diagnostic performance of the COSMO AI system in cancer subtype classification across multiple anatomical sites; (2) characterize the diagnostic accuracy of experienced pathologists on the same cases; (3) directly compare diagnostic performance metrics between COSMO and pathologists; and (4) examine concordance patterns and performance variation by anatomical site, cancer incidence category, pathologist experience, and case complexity.
Study Setting and Participants The study will involve up to 25 board-certified pathologists with 3 to 10+ years of diagnostic experience, recruited from institutions across North America, Europe, and the Asia-Pacific region. Participating pathologists will have domain expertise in neuropathology, pulmonary pathology, urologic pathology, or general anatomical pathology.
Cases and Stratification The study will employ de-identified archival whole-slide images representing up to 300 patients with confirmed reference diagnoses, including 100 brain cancers, 100 lung cancers, and 100 kidney cancers. Cases will be stratified by cancer type and incidence category (common vs. rare or uncommon), consistent with World Health Organization (WHO) guidelines.
Data Collection Pathologists will independently review each case and provide diagnostic classifications along with confidence assessments using a 5-point scale. The digital pathology interface will automatically record time-to-diagnosis metrics. COSMO will process the same cases offline to generate independent diagnostic predictions and confidence scores. Both pathologist and AI predictions will be evaluated against established reference standard diagnoses.
Analysis Framework The primary analysis will characterize diagnostic performance metrics (including accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristic curve (AUROC)) for both pathologists (at the individual and aggregated levels) and the COSMO system. Secondary analyses will assess performance stratified by anatomical site, cancer incidence category, and pathologist experience level.
Study Type
Study Type
Enrollment (Estimated)
Enrollment
Contacts and Locations
Study Locations
-
-
Massachusetts
-
Boston, Massachusetts, United States, 02115
- Harvard Medical School
-
-
Participation Criteria
Eligibility Criteria
Eligibility Criteria
Ages Eligible for Study
- Child
- Adult
- Older Adult
Accepts Healthy Volunteers
Sampling Method
Study Population
Description
Inclusion Criteria:
- Board-certified pathologist with expertise in neuropathology, pulmonary pathology, urologic pathology, or general anatomical pathology
- Minimum of 3 years of clinical diagnostic experience
- Active clinical practice involving diagnostic pathology slide review
- Willingness to independently review and diagnose up to 300 de-identified whole-slide images
- Ability to access the study platform and complete case reviews within the specified study timeline
- Provision of informed consent for study participation
Exclusion Criteria:
- Prior involvement in the design or validation of the COSMO AI system
- Inability to commit sufficient time to complete assigned case reviews
- Presence of significant financial conflicts of interest related to the study outcomes
Study Plan
How is the study designed?
Design Details
Number of groups / cohorts
Cohorts and Interventions
Group / CohortGroup / Cohort |
Intervention / TreatmentIntervention / Treatment |
|---|---|
|
AI-Based Evaluation using COSMO
|
|
|
Pathologist-Based Evaluation
|
Digital Pathology Evaluation
|
What is the study measuring?
Primary Outcome Measures
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Diagnostic performance
Time Frame: Periprocedural (at the time of slide review)
|
Diagnostic performance of the COSMO AI system and pathologists in identifying cancer subtypes across brain, lung, and kidney tumors, as assessed by accuracy, balanced accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristic curve (AUROC).
We will include both overall comparisons and stratified evaluations by anatomical site and cancer incidence category (common vs. rare or uncommon).
|
Periprocedural (at the time of slide review)
|
Secondary Outcome Measures
Secondary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Inter-Observer Agreement Among Pathologists
Time Frame: Periprocedural (at the time of slide review)
|
Diagnostic concordance among participating pathologists, measured by Fleiss' kappa, intraclass correlation coefficient (ICC), and pairwise concordance rates.
|
Periprocedural (at the time of slide review)
|
|
Pathologist-COSMO AI Concordance
Time Frame: Periprocedural (at the time of slide review)
|
Agreement patterns between pathologist diagnoses and COSMO AI predictions, including proportion of concordant cases overall and stratified by anatomical site, cancer incidence category, and pathologist experience level.
|
Periprocedural (at the time of slide review)
|
|
Diagnostic Confidence
Time Frame: Periprocedural (at the time of slide review)
|
Mean confidence scores (5-point scale) reported by pathologists during diagnostic assessment, stratified by anatomical site, cancer incidence category, and diagnostic correctness (correct vs. incorrect).
|
Periprocedural (at the time of slide review)
|
|
Time-to-Diagnosis
Time Frame: Periprocedural (at the time of slide review)
|
Mean diagnostic time (in seconds) required by pathologists to provide cancer subtype classification, stratified by anatomical site, cancer incidence category, and pathologist experience level.
|
Periprocedural (at the time of slide review)
|
|
Diagnostic Performance Stratified by Pathologist Experience
Time Frame: Periprocedural (at the time of slide review)
|
Diagnostic accuracy of pathologists stratified by years of clinical experience (3-5 years, 6-10 years, >10 years) to assess the relationship between experience level and diagnostic performance in cancer subtype classification.
|
Periprocedural (at the time of slide review)
|
Collaborators and Investigators
Sponsor
Sponsor
Investigators
Investigators
- Principal Investigator: Kun-Hsing Yu, MD, PhD, Harvard Medical School (HMS and HSDM)
Study record dates
Study Major Dates
Study Start (Actual)
Study Start
Primary Completion (Estimated)
Primary Completion
Study Completion (Estimated)
Study Completion
Study Registration Dates
First Submitted
First Submitted
First Submitted That Met QC Criteria
First Submitted That Met QC Criteria
First Posted (Actual)
First Posted
Study Record Updates
Last Update Posted (Actual)
Last Update Posted
Last Update Submitted That Met QC Criteria
Last Update Submitted That Met QC Criteria
Last Verified
Last Verified
More Information
Terms related to this study
Additional Relevant MeSH Terms
- Urogenital Diseases
- Brain Diseases
- Central Nervous System Diseases
- Nervous System Diseases
- Pathologic Processes
- Urogenital Neoplasms
- Neoplasms by Site
- Neoplasms
- Male Urogenital Diseases
- Kidney Diseases
- Urologic Diseases
- Female Urogenital Diseases
- Female Urogenital Diseases and Pregnancy Complications
- Respiratory Tract Diseases
- Lung Diseases
- Respiratory Tract Neoplasms
- Thoracic Neoplasms
- Urologic Neoplasms
- Nervous System Neoplasms
- Central Nervous System Neoplasms
- Pathological Conditions, Signs and Symptoms
- Lung Neoplasms
- Disease
- Brain Neoplasms
- Kidney Neoplasms
Other Study ID Numbers
Other Study ID Numbers
- Yu Lab COSMO Study
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
IPD Plan Description
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Brain Cancer
-
NCT02045381CompletedCancer Liver | Cancer Brain | Cancer Head &Neck | Cancer Pelvis
-
NCT03721055UnknownCancer | Metastatic Cancer | Metastatic Brain Cancer
-
NCT02334722CompletedBrain Neoplasms | Brain Cancer | Brain Tumors | Seizure | Cancer of the Brain | Cancer of Brain
-
NCT00525590Completed
-
NCT07152782Not yet recruitingHER2-positive Breast Cancer | Breast Cancer With Brain Metastasis
-
NCT05170087Active, not recruitingBreast Cancer | Head and Neck Cancer | Gynecologic Cancer | Brain Cancer
-
NCT01473485Active, not recruitingGlioma | Metastatic Brain Cancer
-
NCT00610285Completed
-
NCT05087095CompletedBrain Metastases, Adult | Cancer Metastatic to Brain
Clinical Trials on Digital Pathology Evaluation
-
NCT06055530Not yet recruitingSchistosomiasis Mansoni | Soil Transmitted Helminths
-
NCT05889481Recruiting
-
NCT06344364CompletedMetabolic Dysfunction-associated Steatotic Liver Disease
-
NCT06493253CompletedMetabolic Dysfunction-associated Steatotic Liver Disease
-
NCT01991665UnknownVacuum Extraction; Failure, Affecting Fetus or Newborn | Persistent Occiput Posterior Position During Labor | Complication of Delivery
-
NCT03636737Unknown
-
NCT05835947RecruitingAnal Cancer | Human Papilloma Virus | Anal Intraepithelial Neoplasia | Genital Neoplasm | Genital Cancer
-
NCT04383249Completed