Human-AI Uncertainty Callibration for Improved Skin Lesion Segmentation

March 11, 2026 updated by: Julie Renata Bjerremand, Copenhagen Academy for Medical Education and Simulation

The Effect of Human-AI Uncertainty Calibration vs. AI Uncertainty Alone on the Diagnostic Accuracy of Human Experts for Skin Lesions - a Randomized Controlled Trial.

The goal of this randomized controlled study is to compare the effect of a new, personalized uncertainty-aware decision model (FDM) to a standard image recognition model in improving the diagnostic accuracy while reducing diagnostic uncertainty in experienced dermatologists tasked with differentiating between melanomas, moles and other benign skin lesions. The main question it aims to answer: Is the FDM a feasible method for an improved human AI partnership in which trust is build, misdiagnoses are avoided, and uncertainty is duly introduced or reduced.

The investigators expect to see only a slight increase in collective diagnostic accuracy for both interventions as the the human participants are skilled dermatologist and thus have high accuracies pre-intervention.

The investigators expect to see a higher increase in diagnostic certainty for the FDM intervention compared to the diagnostic certainty in the Base Model intervention.

The investigators expect to see a higher amount of diagnosis changes from incorrect to correct in the FDM group compared to the Base Model group.

The investigators do not expect any learning effect during the study.

Participants will start by answering a series of training cases consisting of images of skin lesions. These are used to train their individual FDM (only for the FDM-intervention group). From here, the participants will be randomized into two arms determining which of the two interventions they are exposed to. The participants will solve each case withouth any intervention first, and this reply will act as a control.

Study Overview

Status

Not yet recruiting

Conditions

Skin Lesions

Intervention / Treatment

Detailed Description

A detailed description of the FDM is presented in the references.

Study Type

Interventional

Enrollment (Estimated)

Phase

Not Applicable

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Name: Julie Renata Bjerremand
Phone Number: +45 53593700
Email: julierenata@outlook.com

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Child
Adult
Older Adult

Accepts Healthy Volunteers

Yes

Description

Inclusion Criteria:

Board certified dermatologists with clinical experience in dermoscopic diagnosis.

Exclusion Criteria:

Doctors who have not yet finished their specialization and dermatologists.
Dermatologists without clinical experience in dermoscopic diagnosis.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Primary Purpose: Diagnostic
Allocation: Randomized
Interventional Model: Parallel Assignment
Masking: None (Open Label)

Number of Arms

Arms and Interventions

Participant Group / Arm	Intervention / Treatment
Active Comparator: Base Model The study participant is presented with a patient case including patient demographics (gender, age, placement of lesion) and two lesion images: 1 overview image, and 1 dermoscopic image. They are asked first to indicate an initial diagnosis along with their self-perceived uncertainty for this specific case before they receive Intervention 1. This initial diagnosis will act as the control. Intervention 1 is AI-generated multi-class probabilities (from a model trained on a large dataset of dermoscopic and overview images similar to the ones used for testing) and only the most likely diagnosis is presented accompanied by uncertainty estimates in percent. After the AI input, the study participant is given the chance to change their diagnosis and indicate any potential shift in uncertainty.	Other: Base Model See arm description. Other Names: Intervention 1
Experimental: FDM The initial diagnosis and indication of self-perceived uncertainty follows the same procedure as for Intervention 1. Intervention 2 is the most likely diagnosis accompanied by a calibrated uncertainty generated by the FDM model (i.e. trained on the study participants previous answers + the crowd annotations on the training data + the base model prediction). After the AI input, the study participant is given the chance to change their diagnosis and indicate any potential shift in uncertainty.	Other: FDM See arm description Other Names: Intervention 2 Final Decision Model

Participant Group / Arm

Intervention / Treatment

Active Comparator: Base Model

The study participant is presented with a patient case including patient demographics (gender, age, placement of lesion) and two lesion images: 1 overview image, and 1 dermoscopic image. They are asked first to indicate an initial diagnosis along with their self-perceived uncertainty for this specific case before they receive Intervention 1. This initial diagnosis will act as the control. Intervention 1 is AI-generated multi-class probabilities (from a model trained on a large dataset of dermoscopic and overview images similar to the ones used for testing) and only the most likely diagnosis is presented accompanied by uncertainty estimates in percent.

After the AI input, the study participant is given the chance to change their diagnosis and indicate any potential shift in uncertainty.

Other: Base Model

See arm description.

Other Names:

Intervention 1

Experimental: FDM

The initial diagnosis and indication of self-perceived uncertainty follows the same procedure as for Intervention 1. Intervention 2 is the most likely diagnosis accompanied by a calibrated uncertainty generated by the FDM model (i.e. trained on the study participants previous answers + the crowd annotations on the training data + the base model prediction).

After the AI input, the study participant is given the chance to change their diagnosis and indicate any potential shift in uncertainty.

Other: FDM

See arm description

Other Names:

Intervention 2
Final Decision Model

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Accuracy Time Frame: Immediately after the intervention.	Diagnostic accuracy in differentiating between melanoma, nevus, and benign keratosis. Defined as the percentage of correct diagnoses. Ground truth is based on histopathologically verified diagnoses.	Immediately after the intervention.

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Uncertainty Time Frame: Immediately after the intervention.	Changes in self-assesed uncertainty ranging from 0 (very uncertain) to 10 (very certain) from pre- to post-intervention.	Immediately after the intervention.
Cut-off uncertainty Time Frame: Immediately after the intervention.	The self-assessed uncertainty of cases where the participant has clicked a "would you like to discuss this case with a collegue"-button.	Immediately after the intervention.

Other Outcome Measures

Outcome Measure	Measure Description	Time Frame
Time Time Frame: Immediately after the intervention.	Time from the start to finish of each case with a split time corresponding to the end of the control phase (the time "Show AI input"-button is clicked).	Immediately after the intervention.

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

Copenhagen Academy for Medical Education and Simulation

Collaborators

Technical University of Denmark

Investigators

Study Chair: Martin Tolsgaard, Professor, Copenhagen Academy for Medical Education and Simulation

Publications and helpful links

The person responsible for entering information about the study voluntarily provides these publications. These may be about anything related to the study.

General Publications

Kampen, P.J.T. et al. (2026). Uncertainty-Aware Classification: A Human-Guided Bayesian Deep Learning Framework. In: Sudre, C.H., et al. Uncertainty for Safe Utilization of Machine Learning in Medical Imaging. UNSURE 2025. Lecture Notes in Computer Science, vol 16166. Springer, Cham. https://doi.org/10.1007/978-3-032-06593-3_19

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Estimated)

March 1, 2026

Primary Completion (Estimated)

July 1, 2026

Study Completion (Estimated)

November 1, 2026

Study Registration Dates

First Submitted

January 29, 2026

First Submitted That Met QC Criteria

March 11, 2026

First Posted (Actual)

March 12, 2026

Study Record Updates

Last Update Posted (Actual)

March 12, 2026

Last Update Submitted That Met QC Criteria

March 11, 2026

Last Verified

March 1, 2026

More Information

Terms related to this study

Keywords

Other Study ID Numbers

F-25076782

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Skin Lesions

Laboratoires Innothera

Terminated

Tolerance Study of the Silicone Bands on Medical Compression Stockings

Skin Lesions

France
Wright State University

Completed

The Validation of New Photographic Technology for Assessing Skin Lesions

Skin Lesions

United States
Eurofarma Laboratorios S.A.

Not yet recruiting

Evaluation of the EF192B Protection Versus No Treatment (Spray B)

Skin Care | Skin Lesions

Brazil
Eurofarma Laboratorios S.A.

Completed

Assessment of EF192A Potential Sensitization and Accumulated and Primary Irritability in Controlled/Maximized Conditions (Spray A)

Skin Lesion | Skin Care | Skin Lesions

Brazil
Candela Corporation

Completed

Nonablative Fractional Diode Laser for Treatment of Skin Resurfacing and Pigmented Lesions

Pigmented Lesions | Skin Texture

United States
Assiut University

Unknown

Dermoscopy in Diagnosis of Pigmentary Skin Lesions

Pigmentary Skin Lesions | Dermoscopy
Padagis LLC

Completed

To Compare the Safety and Efficacy of Perrigo's Product to an FDA Approved Product for the Treatment of Secondarily Infected Traumatic Skin Lesions

Secondarily Infected Traumatic Skin Lesions

United States
Cynosure, Inc.

Completed

Evaluation of the 755nm Alexandrite Laser for Skin Toning and Epidermal Pigmented Lesions in Asian Skin Types

Epidermal Pigmented Lesions | Skin Toning

United States
Taro Pharmaceuticals USA

Completed

A Study to Evaluate the Safety and Bioequivalence of Mupirocin Calcium Cream, 2% and Bactroban® Cream and Compare Both to a Vehicle in Treatment of Secondarily Infected Traumatic Skin Lesions.

Secondarily Infected Traumatic Skin Lesions
Orlucent, Inc
Clalit Health Services

Completed

Pilot Study to Determine Feasibility of Benign and Malignant Skin Lesion Detection.

Skin Lesions

Clinical Trials on Base Model

Cairo University

Unknown

Chairside Time and Bond Failure of Non Custom Versus Custom Base Orthodontic Attachments During Indirect Bonding

Orthodontic Appliance Complication
USDA Beltsville Human Nutrition Research Center

Completed

Impact of Cashew Nuts in the Human Diet: Measured Energy Value and Effects on Cardiovascular Disease Risk Factors

Healthy Volunteers

United States
Jiangsu HengRui Medicine Co., Ltd.

Completed

A Randomized, Double-blind, Placebo-controlled Phase I Clinical Trial to Evaluate the Safety and Pharmacokinetic Profile of Single and Multiple Dose Escalation Topical Dermal Administration of SHR0302 Alkali Gel in Healthy Subjects

Preoperative Sedation of Adults

China
University Health Network, Toronto

Unknown

Assessment of a Web-Based Simulation in Transesophageal Echocardiography (TEE) Views (Web-SimTEE)

Training | Education

Canada
Riphah International University

Completed

Effects of Acapella VS CHEST Physiotherapy in Post-Operative CABG Patients

CABG

Pakistan
USDA Beltsville Human Nutrition Research Center

Completed

Daily Consumption of Well-Cooked Broccoli May Affect Glucosinolate Metabolites and Inflammatory Biomarkers

Healthy Volunteers

United States
Coloplast A/S

Terminated

Multi-national, Safety and Performance Study of New Ostomy Product Compared to Standard Care

Skin Condition | Leakage

Denmark, France, Germany, Iceland
University of Illinois at Urbana-Champaign

Completed

Multiphase Activity Promotion Study (MAPS)

Sedentary Lifestyle | Health Behavior

United States
Mansoura University

Completed

A 2-year Clinical Impact of Bulk-fill Low-viscosity Resin Composite Liners in Class II Restorations.

Dental Caries Class II

Egypt
University of Toronto

Not yet recruiting

Development of a Novel Anti-caries Chewing Gum

Dental Caries | Dental Plaque | Periodontal Disease

Human-AI Uncertainty Callibration for Improved Skin Lesion Segmentation

The Effect of Human-AI Uncertainty Calibration vs. AI Uncertainty Alone on the Diagnostic Accuracy of Human Experts for Skin Lesions - a Randomized Controlled Trial.

Study Overview

Status

Conditions

Intervention / Treatment

Detailed Description

Study Type

Enrollment (Estimated)

Phase

Contacts and Locations

Study Contact

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Description

Study Plan

How is the study designed?

Design Details

Number of Arms

Arms and Interventions

Participant Group / Arm

Intervention / Treatment

What is the study measuring?

Primary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Secondary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Other Outcome Measures

Outcome Measure

Measure Description

Time Frame

Collaborators and Investigators

Sponsor

Collaborators

Investigators

Publications and helpful links

General Publications

Study record dates

Study Major Dates

Study Start (Estimated)

Primary Completion (Estimated)

Study Completion (Estimated)

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Actual)

Study Record Updates

Last Update Posted (Actual)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Keywords

Other Study ID Numbers

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

Clinical Trials on Skin Lesions

Clinical Trials on Base Model

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Suriname

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations