- ICH GCP
- US Clinical Trials Registry
- Clinical Trial NCT07281066
LLM Performance in Endodontic Diagnostics
Evaluating ChatGPT-4o, Gemini and Claude 3.7 in Endodontic Diagnostics: A Prospective Clinical Study
The goal of this prospective observational study is to evaluate the ability of three large language models (ChatGPT-4o, Gemini Advanced, and Claude 3.7) to support diagnosis and treatment decision-making in adult patients presenting with common endodontic conditions.
The main questions the study aims to answer are:
Can LLMs accurately determine the endodontic diagnosis when provided with structured clinical information and periapical radiographs?
Can LLMs propose appropriate treatment plans comparable to decisions made by endodontic specialists?
To answer these questions, researchers will compare the diagnostic and treatment accuracy of three AI models using a consensus diagnosis from endodontic specialists as the reference standard.
Participants will:
Receive routine endodontic examination and periapical radiographs as part of standard clinical care.
Have their anonymized clinical histories and radiographs entered into the three AI models.
Not interact directly with any AI system; all evaluations will be performed by the research team.
This study aims to understand how large language models perform under real-world clinical conditions and whether these systems may play a supportive role in endodontic diagnostics in the future.
Study Overview
Status
Conditions
Intervention / Treatment
Detailed Description
This prospective observational study aims to evaluate the real-time diagnostic and treatment decision-making performance of three large language models-ChatGPT-4o, Gemini Advanced, and Claude 3.7-in an endodontic clinical setting. A total of 120 patients presenting to the endodontic clinic were examined, and detailed medical/dental histories, clinical findings, and periapical radiographs were collected. Each anonymized case was then presented to the three LLMs using a standardized prompt asking for the diagnosis and the appropriate treatment plan.
All models were used in their default multimodal configurations without enabling web-search functions, plug-ins, or external data retrieval. Each question was submitted only once in isolated chat sessions to prevent memory carry-over. Responses were saved verbatim and compared with the reference diagnoses and treatment plans established by a panel of endodontic specialists.
This study was designed to mimic real-world clinical conditions as closely as possible, providing a realistic assessment of how these systems might perform when used by clinicians in everyday practice. Understanding their capabilities and limitations in authentic clinical scenarios is essential, as LLMs are expected to play an increasingly vital role in future dental care particularly in decision support, triage, and patient education. By identifying where these models perform well and where they fall short, this research aims to inform safe and effective clinical integration as LLM technologies continue to advance.
Study Type
Enrollment (Actual)
Contacts and Locations
Study Locations
-
-
Istanbul
-
Maltepe, Istanbul, Turkey (Türkiye), 34856
- Faculty of Dentistry, Marmara University
-
-
Participation Criteria
Eligibility Criteria
Ages Eligible for Study
- Adult
- Older Adult
Accepts Healthy Volunteers
Sampling Method
Study Population
Description
Inclusion Criteria:
- Adult patients (≥18 years old) presenting to or referred to the Endodontic Clinic.
Patients with a clinically verified endodontic condition requiring diagnosis and treatment planning.
Patients who agreed to participate and provided informed consent.
Patients for whom a complete paper-based medical/dental history and periapical radiograph were obtained during the clinical visit.
Exclusion Criteria:
- Exclusion Criteria
Patients who declined participation or did not provide informed consent.
Pediatric patients (<18 years old) referred to the Pediatric Dentistry Clinic.
Patients attending the clinic with non-endodontic complaints (e.g., post-extraction alveolitis, third-molar extraction problems).
Cases with incomplete clinical information or missing radiographs.
Patients unable to undergo standard endodontic examination procedures.
Study Plan
How is the study designed?
Design Details
Cohorts and Interventions
Group / Cohort |
Intervention / Treatment |
|---|---|
|
Endodontic Patients Cohort
This cohort includes 120 consecutive patients presenting to the endodontic clinic with clinically verified endodontic conditions.
Clinical history and periapical radiographs were collected, and diagnostic/treatment recommendations generated by AI models were compared with expert consensus.
|
Participants' anonymized clinical information, including structured patient history and periapical radiographs, was used as input for three large language models (ChatGPT-4o, Gemini Advanced, Claude 3.7).
The models were asked to determine the endodontic diagnosis and propose an appropriate treatment plan.
No treatment, device, or drug was administered to participants.
The intervention consists solely of AI-based interpretation of pre-existing clinical data.
|
What is the study measuring?
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Clinician Diagnosis Accuracy Based on Paper-Based History and Periapical Radiograph
Time Frame: 7 july-5 august
|
Assessment of the diagnostic decision made by endodontic clinicians after reviewing a paper-based patient history form and a standardized periapical radiograph.
Accuracy is determined by comparing the clinician's diagnosis with the consensus diagnosis established by three independent endodontic specialists.
Data will be collected for all 120 patients at the time of initial clinical evaluation.
|
7 july-5 august
|
Secondary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
LLM-Generated Diagnosis and Treatment Planning Performance
Time Frame: august-september
|
Evaluation of diagnostic and treatment recommendations generated by large language models (LLMs)-ChatGPT-4o, Gemini Advanced, and Claude 3.7-after receiving the same paper-based patient history and periapical radiograph provided to clinicians.
LLM responses will be compared to the gold-standard specialist consensus for both diagnosis and treatment decisions.
|
august-september
|
Collaborators and Investigators
Sponsor
Investigators
- Study Director: ayşe karadayı, asst. prof., Marmara University Faculty of Dentistry
Publications and helpful links
General Publications
- Abd-Alrazaq A, AlSaad R, Alhuwail D, Ahmed A, Healy PM, Latifi S, Aziz S, Damseh R, Alabed Alrazak S, Sheikh J. Large Language Models in Medical Education: Opportunities, Challenges, and Future Directions. JMIR Med Educ. 2023 Jun 1;9:e48291. doi: 10.2196/48291.
- Schwendicke F, Samek W, Krois J. Artificial Intelligence in Dentistry: Chances and Challenges. J Dent Res. 2020 Jul;99(7):769-774. doi: 10.1177/0022034520915714. Epub 2020 Apr 21.
Study record dates
Study Major Dates
Study Start (Actual)
Primary Completion (Actual)
Study Completion (Actual)
Study Registration Dates
First Submitted
First Submitted That Met QC Criteria
First Posted (Actual)
Study Record Updates
Last Update Posted (Actual)
Last Update Submitted That Met QC Criteria
Last Verified
More Information
Terms related to this study
Other Study ID Numbers
- 2025-38
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Endodontic Diagnosis, Endodontic Diseases, Endodontic Treatment, Endodontic Decision-making
-
Lumendo AGAvaniaRecruitingEndodontic TreatmentTurkey
-
Kutahya Health Sciences UniversityCompletedRoot Canal Treatment | Effect of Endodontic Case Difficulty on Endodontic MishapsTurkey
-
Dow University of Health SciencesActive, not recruiting
-
Aristotle University Of ThessalonikiActive, not recruiting
-
Suez Canal UniversityCompletedEndodontic Re-treatment FailureEgypt
-
University of Sao PauloCompletedEndodontic Treatment of Primary TeethBrazil
-
PAULO VINICIUS SOARESUnknownMTA | Calcium Hydroxide | Endodontic TreatmentBrazil
-
Damascus UniversityCompletedEndodontic Disease | Endodontic Re-treatment FailureSyrian Arab Republic
-
Cairo UniversityUnknownEndodontic Disease | Endodontic Re-treatment Failure
Clinical Trials on AI-Based Diagnostic Assessment
-
Valentina CerroneFederico II University; University of Salerno, ItalyRecruitingChronic Pain | Neuropathic Pain | Cancer Pain | Pain AssessmentItaly
-
Bukret Plastic SurgeryCompletedRisk Factors | Risk AssessmentArgentina
-
Chang Gung Memorial HospitalRecruitingMild Cognitive Impairment | Subjective Cognitive Decline | Cognitive Function AssessmentTaiwan
-
Sun Yat-Sen Memorial Hospital of Sun Yat-Sen UniversityRecruiting
-
The Eye Hospital of Wenzhou Medical UniversityRecruiting
-
The Eye Hospital of Wenzhou Medical UniversityRecruitingPregnancy-Related and Neonatal DisordersChina
-
The Eye Hospital of Wenzhou Medical UniversityRecruiting
-
Second Affiliated Hospital, School of Medicine,...Recruiting
-
The Eye Hospital of Wenzhou Medical UniversityRecruiting
-
Children's Hospital of Fudan UniversityChengdu Women's and Children's Central Hospital; Xiamen Children's Hospital; Kunming... and other collaboratorsRecruiting