Head-to-Head Evaluation of the Cancer Ontology Supervised Multimodal Orchestration (COSMO) AI System Versus Pathologist-Only Review (COSMO)

December 13, 2025 updated by: Kun-Hsing Yu, Harvard Medical School (HMS and HSDM)

This study evaluates the diagnostic performance of the Cancer Ontology Supervised Multimodal Orchestration (COSMO) AI system for cancer subtype classification and compares it head-to-head with pathologist-only review. Pathologists will independently review de-identified whole-slide images derived from up to 300 patients across three anatomical sites (brain, lung, kidney) and provide diagnostic assessments. In parallel, COSMO will process the same cases offline to generate independent predictions, enabling direct comparison of diagnostic accuracy between human experts and the AI system.

The study will characterize the diagnostic accuracy of COSMO and pathologists, inter-observer agreement, and variations in performance across anatomical sites and cancer types with different incidence rates. Results will establish how COSMO compares to pathologists on identical cases and will inform the development of AI-assisted diagnostic systems in clinical practice.

Study Overview

Status

Enrolling by invitation

Conditions

Intervention / Treatment

Diagnostic test: Digital Pathology Evaluation

Detailed Description

Study Rationale and Background Diagnostic accuracy in cancer subtype classification varies significantly among pathologists due to differences in expertise, experience, and access to diagnostic resources. The emergence of AI systems in pathology offers the potential to enhance diagnostic performance and consistency in cancer classification. However, direct empirical comparisons of AI-based predictions and pathologists' diagnostic performance on identical cases remain limited in the literature.

Study Aims This head-to-head comparative study aims to: (1) evaluate the diagnostic performance of the COSMO AI system in cancer subtype classification across multiple anatomical sites; (2) characterize the diagnostic accuracy of experienced pathologists on the same cases; (3) directly compare diagnostic performance metrics between COSMO and pathologists; and (4) examine concordance patterns and performance variation by anatomical site, cancer incidence category, pathologist experience, and case complexity.

Study Setting and Participants The study will involve up to 25 board-certified pathologists with 3 to 10+ years of diagnostic experience, recruited from institutions across North America, Europe, and the Asia-Pacific region. Participating pathologists will have domain expertise in neuropathology, pulmonary pathology, urologic pathology, or general anatomical pathology.

Cases and Stratification The study will employ de-identified archival whole-slide images representing up to 300 patients with confirmed reference diagnoses, including 100 brain cancers, 100 lung cancers, and 100 kidney cancers. Cases will be stratified by cancer type and incidence category (common vs. rare or uncommon), consistent with World Health Organization (WHO) guidelines.

Data Collection Pathologists will independently review each case and provide diagnostic classifications along with confidence assessments using a 5-point scale. The digital pathology interface will automatically record time-to-diagnosis metrics. COSMO will process the same cases offline to generate independent diagnostic predictions and confidence scores. Both pathologist and AI predictions will be evaluated against established reference standard diagnoses.

Analysis Framework The primary analysis will characterize diagnostic performance metrics (including accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristic curve (AUROC)) for both pathologists (at the individual and aggregated levels) and the COSMO system. Secondary analyses will assess performance stratified by anatomical site, cancer incidence category, and pathologist experience level.

Study Type

Observational

Enrollment (Estimated)

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

United States
- Massachusetts
  - Boston, Massachusetts, United States, 02115
    - Harvard Medical School

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Child
Adult
Older Adult

Accepts Healthy Volunteers

Sampling Method

Non-Probability Sample

Study Population

We will recruit pathologists from international academic medical centers, hospital systems, and diagnostic pathology practices across North America (United States), Europe (Austria, Hungary), and the Asia-Pacific (Taiwan, Hong Kong, South Korea, China, India) region. Participating sites will include major academic institutions with established pathology departments, with recruitment targeting expertise in neuropathology, pulmonary pathology, and urologic pathology.

Description

Inclusion Criteria:

Board-certified pathologist with expertise in neuropathology, pulmonary pathology, urologic pathology, or general anatomical pathology
Minimum of 3 years of clinical diagnostic experience
Active clinical practice involving diagnostic pathology slide review
Willingness to independently review and diagnose up to 300 de-identified whole-slide images
Ability to access the study platform and complete case reviews within the specified study timeline
Provision of informed consent for study participation

Exclusion Criteria:

Prior involvement in the design or validation of the COSMO AI system
Inability to commit sufficient time to complete assigned case reviews
Presence of significant financial conflicts of interest related to the study outcomes

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Number of groups / cohorts

Cohorts and Interventions

Group / Cohort	Intervention / Treatment
AI-Based Evaluation using COSMO
Pathologist-Based Evaluation	Diagnostic test: Digital Pathology Evaluation Digital Pathology Evaluation

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Diagnostic performance Time Frame: Periprocedural (at the time of slide review)	Diagnostic performance of the COSMO AI system and pathologists in identifying cancer subtypes across brain, lung, and kidney tumors, as assessed by accuracy, balanced accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristic curve (AUROC). We will include both overall comparisons and stratified evaluations by anatomical site and cancer incidence category (common vs. rare or uncommon).	Periprocedural (at the time of slide review)

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Inter-Observer Agreement Among Pathologists Time Frame: Periprocedural (at the time of slide review)	Diagnostic concordance among participating pathologists, measured by Fleiss' kappa, intraclass correlation coefficient (ICC), and pairwise concordance rates.	Periprocedural (at the time of slide review)
Pathologist-COSMO AI Concordance Time Frame: Periprocedural (at the time of slide review)	Agreement patterns between pathologist diagnoses and COSMO AI predictions, including proportion of concordant cases overall and stratified by anatomical site, cancer incidence category, and pathologist experience level.	Periprocedural (at the time of slide review)
Diagnostic Confidence Time Frame: Periprocedural (at the time of slide review)	Mean confidence scores (5-point scale) reported by pathologists during diagnostic assessment, stratified by anatomical site, cancer incidence category, and diagnostic correctness (correct vs. incorrect).	Periprocedural (at the time of slide review)
Time-to-Diagnosis Time Frame: Periprocedural (at the time of slide review)	Mean diagnostic time (in seconds) required by pathologists to provide cancer subtype classification, stratified by anatomical site, cancer incidence category, and pathologist experience level.	Periprocedural (at the time of slide review)
Diagnostic Performance Stratified by Pathologist Experience Time Frame: Periprocedural (at the time of slide review)	Diagnostic accuracy of pathologists stratified by years of clinical experience (3-5 years, 6-10 years, >10 years) to assess the relationship between experience level and diagnostic performance in cancer subtype classification.	Periprocedural (at the time of slide review)

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

Harvard Medical School (HMS and HSDM)

Investigators

Principal Investigator: Kun-Hsing Yu, MD, PhD, Harvard Medical School (HMS and HSDM)

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

June 12, 2025

Primary Completion (Estimated)

January 31, 2026

Study Completion (Estimated)

January 31, 2026

Study Registration Dates

First Submitted

December 13, 2025

First Submitted That Met QC Criteria

December 13, 2025

First Posted (Actual)

December 29, 2025

Study Record Updates

Last Update Posted (Actual)

December 29, 2025

Last Update Submitted That Met QC Criteria

December 13, 2025

Last Verified

December 1, 2025

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

Yu Lab COSMO Study

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

IPD Plan Description

Individual pathologist diagnostic assessments will not be shared to protect evaluator anonymity and privacy. De-identified case data and aggregated performance metrics will be made available through published results and supplementary materials. The protocol document will be uploaded to enable full methodological transparency.

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Brain Cancer

University of Michigan Rogel Cancer Center

Completed

Optimization of MRI for Radiation Therapy

Cancer Liver | Cancer Brain | Cancer Head &Neck | Cancer Pelvis

United States
Xinhua Hospital, Shanghai Jiao Tong University...

Unknown

Fluoroglutamine PET/CT in Imaging Patients With Malignant Tumor

Cancer | Metastatic Cancer | Metastatic Brain Cancer

China
University of Florida

Completed

1 Week Versus 6 Weeks of Levetiracetam in Surgical Brain Tumor Patients

Brain Neoplasms | Brain Cancer | Brain Tumors | Seizure | Cancer of the Brain | Cancer of Brain

United States
University Hospital, Grenoble

Unknown

MEDIR Medulloblastome (MEDIR)

Child Cancer Brain

France
Eisai Inc.

Completed

Exploratory Study, Evaluating the Treatment Effect of Surgery Plus GLIADEL® Wafer in Patients With Metastatic Brain Cancer

Metastatic Brain Cancer

United States
Fudan University

Not yet recruiting

T-DXd With or Without Neratinib for HER2 Positive Breast Cancer With Brain Metastasis (THUNDER)

HER2-positive Breast Cancer | Breast Cancer With Brain Metastasis
Sunnybrook Health Sciences Centre

Active, not recruiting

Radiomic Analysis for Predicting Treatment Response and Clinical Outcomes in Malignancies

Breast Cancer | Head and Neck Cancer | Gynecologic Cancer | Brain Cancer

Canada
InSightec

Active, not recruiting

ExAblate (Magnetic Resonance-guided Focused Ultrasound Surgery) Treatment of Brain Tumors

Glioma | Metastatic Brain Cancer

Canada
Memorial Sloan Kettering Cancer Center

Completed

A Feasibility Study of Image Guided Noninvasive Single Fraction Stereotactic Radiosurgery for the Treatment of Brain Metastases

Metastatic Brain Cancer

United States
Virginia Commonwealth University
United States Department of Defense

Completed

Managing Distress in Malignant Brain Cancer

Brain Metastases, Adult | Cancer Metastatic to Brain

United States

Clinical Trials on Digital Pathology Evaluation

Enaiblers AB
Ministry of Health, Uganda; Jimma University; Ghent University, Belgium

Not yet recruiting

Evaluation of an AI-DP for STH Deworming Programs: a Study Protocol (KAKADU)

Schistosomiasis Mansoni | Soil Transmitted Helminths
Fondazione Policlinico Universitario Agostino Gemelli...

Recruiting

Morphological, Genetic and Tumour Microenvironment Characterisation in Uveal Melanoma (MicroGenUM)

Uveal Melanoma

Italy
PharmaNest, Inc
Chinese University of Hong Kong; University of Seville; Sorbonne University; Fundacio...

Completed

Digital Pathology and AI for Liver Outcomes in MASLD (DPAILO-1)

Metabolic Dysfunction-associated Steatotic Liver Disease

Hong Kong, Spain
PharmaNest, Inc
Virginia Commonwealth University; Nonalcoholic Steatohepatitis Clinical Research...

Completed

Digital Pathology and AI for Liver Outcomes in MASLD (DPAILO-2) (DPAILO-2)

Metabolic Dysfunction-associated Steatotic Liver Disease

United States
University of Bologna

Unknown

R. I. S. POS. T. A (RISPOSTA)

Vacuum Extraction; Failure, Affecting Fetus or Newborn | Persistent Occiput Posterior Position During Labor | Complication of Delivery

Italy
Assistance Publique Hopitaux De Marseille

Unknown

Study on the Clinical Features, Comorbidities and Pathologies Associated With Pyoderma Gangrenosum (PYODERMA)

Pyoderma Gangrenosum

France
Imperial College London

Recruiting

Anal Cancer Risk In Women

Anal Cancer | Human Papilloma Virus | Anal Intraepithelial Neoplasia | Genital Neoplasm | Genital Cancer

United Kingdom
Al-Azhar University

Completed

Enterobius Vermicularis Infestation of the Appendix

Parasitic Disease

Saudi Arabia
Dr. Ersin Arslan Education and Training Hospital

Completed

Routine Pathological Examination of Hernia Sac; Is it a Workload or Necessary?

Pathology

Turkey
Mansoura University Hospital

Completed

From Presentation to Diagnosis: Patterns of Pleural Effusion

Pleural Mesothelioma

Egypt

Head-to-Head Evaluation of the Cancer Ontology Supervised Multimodal Orchestration (COSMO) AI System Versus Pathologist-Only Review (COSMO)

Study Overview

Status

Conditions

Intervention / Treatment

Detailed Description

Study Type

Enrollment (Estimated)

Contacts and Locations

Study Locations

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Sampling Method

Study Population

Description

Study Plan

How is the study designed?

Design Details

Number of groups / cohorts

Cohorts and Interventions

Group / Cohort

Intervention / Treatment

What is the study measuring?

Primary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Secondary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Collaborators and Investigators

Sponsor

Investigators

Study record dates

Study Major Dates

Study Start (Actual)

Primary Completion (Estimated)

Study Completion (Estimated)

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Actual)

Study Record Updates

Last Update Posted (Actual)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

IPD Plan Description

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

Clinical Trials on Brain Cancer

Clinical Trials on Digital Pathology Evaluation

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Jamaica

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations