Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study

January 26, 2026 updated by: First Affiliated Hospital of Fujian Medical University

This multicenter retrospective study aims to evaluate the diagnostic and therapeutic performance of three large language models-ChatGPT, Gemini and Deepseek-using 800 archived inpatient medical records from urology departments across four tertiary hospitals. The study will focus on the accuracy and applicability of these models in disease recognition, preliminary diagnosis and treatment recommendation generation, in order to explore their potential value and limitations in supporting clinical decision-making in real-world settings.

Study Overview

Status

Recruiting

Conditions

Urologic Diseases

Intervention / Treatment

Other: Large Language Model Assessment (ChatGPT, Gemini, DeepSeek)

Study Type

Observational

Enrollment (Estimated)

800

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Name: Ning Xu
Phone Number: +86-13235907575
Email: drxun@fjmu.edu.cn

Study Locations

China
- - Fuzhou, China
    - Recruiting
    - The First Affiliated Hospital of Fujian Medical University

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Adult
Older Adult

Accepts Healthy Volunteers

Sampling Method

Non-Probability Sample

Study Population

The study population was drawn from the following institutions: The First Affiliated Hospital of Fujian Medical University, The Second Affiliated Hospital of Fujian Medical University,Shishi City Hospital and Shaowu City Hospital

Description

Inclusion Criteria:

The case data is sourced from the four hospitals involved in the study, with complete and authentic diagnosis and treatment records.
Patients must be 18 years or older, with no gender restrictions.
Complete medical records, including the following core information: patient' s basic information, present illness history, past medical history, physical examination, and auxiliary examinations (including laboratory and imaging tests).
A clear discharge diagnosis and treatment plan (including therapeutic measures and follow-up arrangements).
Medical records have been archived, with objective and accurate information that has not been altered.
The patient or their legal representative has provided informed consent, agreeing to the use of their anonymized medical data for research analysis.

Exclusion Criteria:

Medical records with significant missing information, such as key clinical details (present illness history, diagnostic or treatment records, etc.).
Cases where the diagnosis or treatment plan is unclear, or where treatment has not been fully completed for an initial diagnosis.
Cases where the primary diagnosis is not urological.
Cases with major errors or inconsistencies in the records that could affect further assessment.
Medical records in special formats or images that are not readable (e.g., handwritten notes, non-standard documentation).
Patients who have not signed the informed consent form or who refuse to allow their medical data to be used for research.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Diagnostic Accuracy: Assessed by Top-1 accuracy Time Frame: Through study completion, an average of 3 months	Top-1: Proportion of cases where the model's first diagnosis matches the true primary diagnosis.	Through study completion, an average of 3 months
Diagnostic Accuracy: Assessed by Top-3 accuracy Time Frame: Through study completion, an average of 3 months	Top-3: Proportion of cases where the true diagnosis appears in the model's top 3.	Through study completion, an average of 3 months
Diagnostic Completeness Time Frame: Through study completion, an average of 3 months	Proportion of the model's diagnoses that overlap with all diagnoses (primary and secondary) in the case.	Through study completion, an average of 3 months
Differential Diagnosis Quality Time Frame: Through study completion, an average of 3 months	Evaluated by experts using a Likert 5-point scale, considering factors like common disease coverage, logical clarity, and specificity	Through study completion, an average of 3 months
Treatment Plan Quality Time Frame: Through study completion, an average of 3 months	Assesses whether the model's treatment suggestions align with clinical guidelines, scored by experts on completeness, appropriateness, and safety.	Through study completion, an average of 3 months
Analysis Time Time Frame: Through study completion, an average of 3 months	5.Time taken by the AI model to provide diagnoses and treatment suggestions (in seconds), reflecting real-time capability.	Through study completion, an average of 3 months

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

First Affiliated Hospital of Fujian Medical University

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Estimated)

January 1, 2026

Primary Completion (Estimated)

April 1, 2026

Study Completion (Estimated)

June 1, 2026

Study Registration Dates

First Submitted

December 9, 2025

First Submitted That Met QC Criteria

January 26, 2026

First Posted (Actual)

January 30, 2026

Study Record Updates

Last Update Posted (Actual)

January 30, 2026

Last Update Submitted That Met QC Criteria

January 26, 2026

Last Verified

January 1, 2026

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

MRCTA,ECFAH of FMU[2025]902

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Urologic Diseases

University Hospital, Toulouse

Not yet recruiting

Interest of Magnetic Resonance Imaging in the Diagnosis of Upper Urinary Tract Invasive Tumours (UUTICaD)

Urologic Cancer | Magnetic Resonance Imaging (MRI)

France
The First Affiliated Hospital of Zhengzhou University

Completed

Comparison of the Integrated Posterior-Anterior-Lateral Approach and the Posterior Approach in Robotic Radical Prostatectomy

Urologic Cancer | Prostate | Robot Assisted Laparoscopic Radical Prostatectomy

China
University Hospital, Ghent

Completed

Unmet Supportive Care Needs in Bladder Cancer Patients Undergoing Radical Cystectomy

Surgery | Bladder Cancer | Urologic Cancer

Belgium
Chang Gung Memorial Hospital

Completed

Long-Term Lead Chelation Therapy and Progressive Renal Insufficiency

Urologic Disease

China
Wuhan Union Hospital, China

Recruiting

Exploring the Association of Imaging and Tumor Microenvironment in Urologic Cancer Using Radiogenomic Approach（Radiogenomics-Urinary）

Urologic Cancer

China
National Health Research Institutes, Taiwan
National Taiwan University Hospital; Chang Gung Memorial Hospital; Kaohsiung... and other collaborators

Active, not recruiting

The Registry of Genetic Expression of Taiwan Urologic Cancer

Urologic Cancer

Taiwan
Medical University of Graz

Recruiting

PROM Project Urology

Urologic Diseases | Surgery | Urologic Cancer

Austria
Korea University Anam Hospital
The Catholic University of Korea; Keimyung University Dongsan Medical Center; Medical AI Co., Ltd

Recruiting

Prediction of Myocardial Injury After Non-Cardiac Surgery in Urologic Cancer Patients (URO-MINS)

Urologic Cancer | Myocardial Injury After Non-cardiac Surgery | Major Adverse Cardiovascular Events (MACE)

South Korea
Fundación Pública Andaluza para la gestión de la...

Recruiting

Preoperative Carbohydrate Loading for Enhancing Recovery After Radical Cystectomy

Surgery | Anesthesia | Urologic Cancer

Spain
IRCCS San Raffaele

Recruiting

Bank of Biological Material From Patients and Healthy Donors for the Study of Urological and Uro-oncological Pathologies

Urologic Diseases | Infertility | Urologic Cancer

Italy

Clinical Trials on Large Language Model Assessment (ChatGPT, Gemini, DeepSeek)

First Affiliated Hospital, Sun Yat-Sen University
Vantive Health LLC

Enrolling by invitation

Peritoneal Dialysis (PD) Specialized LLM for PD Management (PIONEER-PD)

Peritoneal Dialysis (PD) | Large Language Models

China
MetroWest Artificial Intelligence Research Workgroup

Not yet recruiting

Point-of-Care AI Assistance and Critical Care Outcomes: A Randomized Trial (POC-AI-ICU)

Sepsis | Shock | Critical Illness | Acute Kidney Injury | Delirium Confusional State | Multi-organ Failure | Acute Respiratory Failure (ARF)

United States
Bursa City Hospital

Active, not recruiting

The Predictability of the Necessity for Cardiology Consultation in Patients Scheduled for Non-Cardiac Surgery Using Artificial Intelligence Models in Preoperative Anesthesia Assessment

USE OF ARTIFICIAL INTELLIGENCE IN ANESTHESIA | PREOPERATIVE CARDIOLOGY CONSULTATION REQUIREMENT

Turkey (Türkiye)
University of Manitoba

Recruiting

Utility of ChatGPT in Pre-vasectomy Counselling in an Office-based Setting

Contraception | Vasectomy

Canada
Tsinghua University

Not yet recruiting

A Large Language Model in Outpatient Care

Outpatient Care
Capital Medical University

Completed

Application of Large Language Models in Emergency Neurology

Emergency | Neurology

China
Kirsehir Ahi Evran Universitesi

Completed

LLM-Guided Rehabilitation in Degenerative Knee Disease (LLM-RehabKnee)

Degenerative Knee Disease

Turkey (Türkiye)
Shandong Cancer Hospital and Institute

Not yet recruiting

Concordance Between Large Language Model and Multidisciplinary Team Recommendations in Rectal Cancer

Rectal Cancer

China
First Affiliated Hospital of Wenzhou Medical University

Recruiting

Large Language Model-Assisted cTNM Annotation From Chinese PSMA PET/CT Reports (PSMA-LLM-cTNM)

Prostate Cancer

China
Zhongshan Ophthalmic Center, Sun Yat-sen University

Completed

Evaluate the Performance of Large Language Models in Ophthalmologic Patient Consultation

Non-emergency Ocular Diseases

China

Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study

Study Overview

Status

Conditions

Intervention / Treatment

Study Type

Enrollment (Estimated)

Contacts and Locations

Study Contact

Study Locations

Participation Criteria

Eligibility Criteria

Ages Eligible for Study

Accepts Healthy Volunteers

Sampling Method

Study Population

Description

Study Plan

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure

Measure Description

Time Frame

Collaborators and Investigators

Sponsor

Study record dates

Study Major Dates

Study Start (Estimated)

Primary Completion (Estimated)

Study Completion (Estimated)

Study Registration Dates

First Submitted

First Submitted That Met QC Criteria

First Posted (Actual)

Study Record Updates

Last Update Posted (Actual)

Last Update Submitted That Met QC Criteria

Last Verified

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

Clinical Trials on Urologic Diseases

Clinical Trials on Large Language Model Assessment (ChatGPT, Gemini, DeepSeek)

Search Similar Trials

Sponsors and Collaborators

Medical Conditions

Drug Interventions

CROs by country

CROs in Netherlands

Conditions

Rare Diseases

Drug Interventions

Dietary Supplements

Sponsor/Collaborators

Locations