The Diagnostic and Triage Capacity of Laypeople-large Language Model Collaboration in China

November 25, 2025 updated by: Zhang Min, Huazhong University of Science and Technology

The Diagnostic and Triage Capacity of Laypeople-large Language Model Collaboration: a National Pretest-posttest Randomized Controlled Experiment in China

The goal of this randomized controlled trial is to evaluate the role of large language models in enhancing laypeople's ability to self-diagnose and triage common diseases. The main questions it aims to answer are:

Does using an LLM help participants make more accurate self-diagnoses and care decisions for common illnesses, compared to their first guess without any help?
How much better is it when people work together with an LLM, compared to using a regular search engine, using the LLM alone, or how doctors would decide?

Researchers will compare participants who were randomly assigned to either the LLM group (using DeepSeek) or the search engine group to see if the LLM-assisted approach leads to better clinical judgments.

Participants will:

Read one of 48 short, realistic health vignettes;
Make an initial guess about what might be wrong by listing up to three possible causes, ranked from most to least likely, and choose a care level: seek immediate care, see a doctor within one day, see a doctor within one week, or manage at home without medical care;
Use their assigned tool (either DeepSeek or a standard search engine) to look up information and update their guess and care decision;
Submit their final diagnosis and care choice after using the tool.

In addition, the study team evaluated the performance of four other AI models (GPT-4o, GPT-o1, DeepSeek-v3, and DeepSeek-r1) and 33 experienced general physicians on the same vignettes.
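The participant steps above imply one record per participant: an initial and a final answer, each holding up to three ranked candidate causes and one of four care levels. A minimal Python sketch of such a record follows. The class names, field names, and the tool labels "llm" and "search_engine" are illustrative assumptions and do not come from the study record.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class CareLevel(Enum):
    """The four care levels offered to participants."""
    IMMEDIATE = "seek immediate care"
    WITHIN_ONE_DAY = "see a doctor within one day"
    WITHIN_ONE_WEEK = "see a doctor within one week"
    HOME = "manage at home without medical care"


@dataclass
class Answer:
    """One answer: ranked possible causes plus a chosen care level."""
    ranked_causes: List[str]  # most likely first
    care_level: CareLevel

    def __post_init__(self) -> None:
        # Participants list up to three possible causes.
        if len(self.ranked_causes) > 3:
            raise ValueError("at most three ranked causes are allowed")


@dataclass
class Response:
    """Hypothetical per-participant record for a single vignette."""
    vignette_id: int  # 1..48
    tool: str         # "llm" or "search_engine" (labels assumed)
    initial: Answer   # given before using the tool
    final: Answer     # given after using the tool

    def __post_init__(self) -> None:
        if not 1 <= self.vignette_id <= 48:
            raise ValueError("vignette_id must be between 1 and 48")
```

Keeping the initial and final answers side by side in one record is what makes the pretest-posttest comparison straightforward: both answers refer to the same participant and the same vignette.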

Study Overview

Status

Completed

Conditions

Intervention / Treatment

Study Type

Interventional

Enrollment (Actual)

6360

Phase

Not Applicable

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

China
- Hubei
  - Wuhan, Hubei, China
    - Tongji Medical College of Huazhong University of Science & Technology School of Medicine and Health Management

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Adult
Older Adult

Accepts Healthy Volunteers

Description

Inclusion Criteria:

Age 18 years or older
Current resident of mainland China
History of high-quality participation in online surveys on the Credamo platform (historical survey acceptance rate ≥ 80% and personal credit score ≥ 70)

Exclusion Criteria:

Incomplete survey responses
Failure on embedded quality-check items
Implausibly short completion time (<180 seconds for search engine group; <360 seconds for LLM group)
Provision of non-diagnostic or irrelevant responses (e.g., "unknown", "don't know")
Consistent pattern of identical responses across all items
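The exclusion criteria above can be read as a per-response filter. The sketch below is one possible Python rendering, not the study team's actual procedure. The dictionary keys, the group labels, and the reason strings are assumptions, and treating a response as non-diagnostic when any listed diagnosis is "unknown" or "don't know" is also an assumption, since the record gives only those two examples.

```python
from typing import Dict, List

# Minimum plausible completion time in seconds, per group, from the criteria.
MIN_SECONDS = {"search_engine": 180, "llm": 360}

# Example non-diagnostic answers named in the criteria (compared case-insensitively).
NON_DIAGNOSTIC = {"unknown", "don't know"}


def exclusion_reasons(resp: Dict) -> List[str]:
    """Return the criteria a response violates; an empty list means keep it.

    `resp` is a hypothetical dict with the keys: group, complete,
    passed_quality_checks, seconds, diagnoses, item_answers.
    """
    reasons = []
    if not resp["complete"]:
        reasons.append("incomplete response")
    if not resp["passed_quality_checks"]:
        reasons.append("failed quality check")
    if resp["seconds"] < MIN_SECONDS[resp["group"]]:
        reasons.append("completion time too short")
    if any(d.strip().lower() in NON_DIAGNOSTIC for d in resp["diagnoses"]):
        reasons.append("non-diagnostic answer")
    answers = resp["item_answers"]
    # Identical answers across all items suggest inattentive responding.
    if len(answers) > 1 and len(set(answers)) == 1:
        reasons.append("identical answers")
    return reasons
```

Returning the list of violated criteria, rather than a single yes/no flag, makes it possible to report how many responses were dropped for each reason.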

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Primary Purpose: Health Services Research
Allocation: Randomized
Interventional Model: Parallel Assignment
Masking: Single
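The record states only that allocation was randomized across parallel arms; it does not describe the randomization procedure. As an illustration, the Python sketch below uses simple (unrestricted) randomization with equal probability for each arm and a uniformly random vignette. Both choices, along with the arm labels and function name, are assumptions rather than the study's documented method.

```python
import random
from typing import Dict, List

ARMS = ["llm", "search_engine"]  # arm labels are assumed
N_VIGNETTES = 48                 # number of vignettes stated in the record


def assign(participant_ids: List[str], seed: int = 0) -> Dict[str, Dict]:
    """Assign each participant an arm and one vignette, independently at random."""
    rng = random.Random(seed)  # seeded so the allocation can be reproduced
    allocation = {}
    for pid in participant_ids:
        allocation[pid] = {
            "arm": rng.choice(ARMS),
            "vignette": rng.randint(1, N_VIGNETTES),  # inclusive on both ends
        }
    return allocation


if __name__ == "__main__":
    for pid, slot in assign(["p001", "p002", "p003"]).items():
        print(pid, slot["arm"], slot["vignette"])
```

Simple randomization does not guarantee equal group sizes or equal use of each vignette; a blocked or stratified scheme would be needed for that.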

Number of Arms

Arms and Interventions

Participant Group / Arm

Intervention / Treatment

Experimental: layperson-LLM integrated group

After initially answering a clinical diagnosis and triage question without the aid of tools, the participants were asked to use a large language model (DeepSeek-v3 or DeepSeek-r1) to retrieve health information and then answer the same question again.

Behavioral: AI-assisted health information seeking

Participants in this group used a large language model (DeepSeek) to search for medical information related to a clinical vignette after providing initial diagnostic and triage decisions. They were instructed to interact freely with the model to gather insights and then update their diagnoses and triage recommendations. The intervention simulates real-world use of AI tools for personal health decision-making.

Active Comparator: layperson-search engine group

After initially answering a clinical diagnosis and triage question without the use of tools, the participants were required to use a search engine to retrieve health information and then answer the same question again.

Behavioral: Conventional internet search for health information

Participants in this group used mainstream internet search engines (e.g., Baidu, Google, Bing) to look up information about the clinical vignette after making initial diagnostic and triage decisions. They were allowed to search freely but were not permitted to use any named AI chatbot or large language model platform. This group represents typical self-directed online health information seeking behavior.

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Top-3 Diagnostic Accuracy	The primary diagnostic outcome was defined as the proportion of participants who included the correct diagnosis in their top three differential diagnoses after using the assigned tool (LLM or search engine). Accuracy was assessed for each of the 48 clinical vignettes and aggregated across all participants in each group.	Immediately after intervention (within the same survey session)
Triage Accuracy (4-class exact match)	Triage accuracy was defined as the proportion of participants whose selected triage level (emergent care, within one day, within one week, or self-care) matched the reference standard. There were 12 vignettes per triage category.	Immediately after intervention (within the same survey session)
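Both primary outcomes are proportions, so they can be sketched in a few lines of Python. The function names are hypothetical, and comparing diagnoses by exact string equality is an assumption: the record does not say how participants' free-text diagnoses were judged against the reference standard.

```python
from typing import Sequence


def top_k_accuracy(ranked: Sequence[Sequence[str]],
                   correct: Sequence[str],
                   k: int = 3) -> float:
    """Share of participants whose first k ranked diagnoses include the correct one."""
    if len(ranked) != len(correct) or not correct:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(c in r[:k] for r, c in zip(ranked, correct))
    return hits / len(correct)


def triage_accuracy(chosen: Sequence[str], reference: Sequence[str]) -> float:
    """Share of participants whose triage level exactly matches the reference (4-class)."""
    if len(chosen) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(c == r for c, r in zip(chosen, reference))
    return hits / len(reference)


if __name__ == "__main__":
    ranked = [["flu", "cold"], ["cold", "flu", "covid"], ["asthma"]]
    correct = ["flu", "covid", "copd"]
    print(top_k_accuracy(ranked, correct, k=3))
    print(triage_accuracy(["self-care", "within one day"],
                          ["self-care", "emergent care"]))
```

Setting `k=1` gives a top-1 accuracy under the same matching assumption. Applying either function to the initial and the final answers separately yields the pretest and posttest values for each group.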

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Top-1 Diagnostic Accuracy	The proportion of participants who selected the correct diagnosis as their top (first) diagnosis after using the assigned tool. This measures the precision of laypeople's final diagnostic judgment.	Immediately after intervention (within the same survey session)
Triage Accuracy (2-class binary match)		Immediately after intervention (within the same survey session)

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

Huazhong University of Science and Technology

Investigators

Principal Investigator: Chenxi Liu, Huazhong University of Science and Technology

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

April 27, 2025

Primary Completion (Actual)

July 1, 2025

Study Completion (Actual)

July 1, 2025

Study Registration Dates

First Submitted

November 17, 2025

First Submitted That Met QC Criteria

November 25, 2025

First Posted (Actual)

November 26, 2025

Study Record Updates

Last Update Posted (Actual)

November 26, 2025

Last Update Submitted That Met QC Criteria

November 25, 2025

Last Verified

October 1, 2025

More Information

Terms related to this study

Other Study ID Numbers

JCYJ20240813115806009

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
