Deze pagina is automatisch vertaald en de nauwkeurigheid van de vertaling kan niet worden gegarandeerd. Raadpleeg de Engelse versie voor een brontekst.

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

11 juni 2026 bijgewerkt door: Ji Xunming,MD,PhD, Capital Medical University

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

This study will evaluate whether three-minute six-dimensions education(3M-6D education) can improve the reliability of large language models as medical assistants for the general public. Participants will be randomly assigned to receive or not receive 3M-6D education and then use ChatGPT, Gemini, or non-AI information resources. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Studie Overzicht

Toestand

Nog niet aan het werven

Conditie

Relevant Conditions Identification

Interventie / Behandeling

Gedetailleerde beschrijving

This randomized, controlled, proof-of-concept simulation trial will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public.

Eligible participants will be randomly assigned in a 1:1:1:1:1 ratio to one of five study groups: the 3M-6D education GPT group, the GPT group, the 3M-6D education Gemini group, the Gemini group, or the control group. Participants in the 3M-6D education GPT and 3M-6D education Gemini groups will receive approximately three minutes of education before using ChatGPT or Gemini.Each participant will be randomly assigned one of 10 standardized clinical scenarios and complete a simulated counseling task in unrestricted natural language within approximately 10 minutes. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Studietype

Ingrijpend

Inschrijving (Geschat)

525

Fase

Niet toepasbaar

Contacten en locaties

In dit gedeelte vindt u de contactgegevens van degenen die het onderzoek uitvoeren en informatie over waar dit onderzoek wordt uitgevoerd.

Studiecontact

Naam: Xunming Ji
Telefoonnummer: 01083198962
E-mail: jixm@ccmu.edu.cn

Studie Contact Back-up

Naam: Chuanjie Wu
Telefoonnummer: 01083199439
E-mail: wuchuanjie@ccmu.edu.cn

Studie Locaties

China
- Beijing Municipality
  - Beijing, Beijing Municipality, China, 100053
    - Xuanwu Hospital, Capital Medical University
    - Contact:
      
      Chuanjie Wu
      
      Telefoonnummer: 010-83199439
      
      E-mail: wuchuanjie@ccmu.edu.cn

Deelname Criteria

Onderzoekers zoeken naar mensen die aan een bepaalde beschrijving voldoen, de zogenaamde geschiktheidscriteria. Enkele voorbeelden van deze criteria zijn iemands algemene gezondheidstoestand of eerdere behandelingen.

Geschiktheidscriteria

Leeftijden die in aanmerking komen voor studie

Volwassen
Oudere volwassene

Accepteert gezonde vrijwilligers

Beschrijving

Inclusion Criteria:

Age 18 years or greater, male or female;
Completed primary school or higher education;
Able to use a smartphone or computer to complete online interaction;
No history of acute ischemic stroke, systemic lupus erythematosus, gastric ulcer, pneumonia, acute cardiac infarction, urinary tract infection, uterine fibroids, diabetes, osteoarthritis, or migraine.
Able to understand and comply with study procedures and to provide written informed consent.

Exclusion Criteria:

Currently or previously employed as a healthcare worker;
Previously received systematic medical training;
Currently involved in concurrent research that may interfere with the results of the present trial;
The investigator considered that the participant had other conditions that might affect compliance or preclude participation.

Studie plan

Dit gedeelte bevat details van het studieplan, inclusief hoe de studie is opgezet en wat de studie meet.

Hoe is de studie opgezet?

Ontwerpdetails

Primair doel: Onderzoek naar gezondheidsdiensten
Toewijzing: Gerandomiseerd
Interventioneel model: Parallelle opdracht
Masker: Enkel

Aantal wapens

Wapens en interventies

Deelnemersgroep / Arm	Interventie / Behandeling
Experimenteel: 3M-6D education GPT Group Participants will first be trained in 3M-6D education, then use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Gedragsmatig: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, we identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so we call this approach three minutes six dimensions education (3M-6D education). Ander: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
Experimenteel: 3M-6D education Gemini Group Participants will first be trained in 3M-6D education, then use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Gedragsmatig: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, we identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so we call this approach three minutes six dimensions education (3M-6D education). Ander: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
Actieve vergelijker: GPT Group Participants will use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Ander: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
Actieve vergelijker: Gemini Group Participants will use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Ander: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
Geen tussenkomst: Control group Participants will use non-AI tools such as internet searches and medical websites to complete a consultation task in unrestricted natural language in approximately 10 minutes.

Wat meet het onderzoek?

Primaire uitkomstmaten

Uitkomstmaat	Maatregel Beschrijving	Tijdsspanne
Relevant conditions identification of the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	Relevant conditions identification is defined as the proportion of participants whose final response includes the expert-defined final diagnosis or a relevant differential diagnosis.	Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	Disposition concordance is defined as the proportion of participants whose final care recommendation matches the expert-defined level. The five levels are self-care, routine outpatient care, urgent outpatient care, emergency department visit, and emergency medical services.	Usually within 1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.

Secundaire uitkomstmaten

Uitkomstmaat	Maatregel Beschrijving	Tijdsspanne
Relevant conditions identification of the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	Red-flag identification is defined as the proportion of participants whose final response includes the key warning signs that experts defined for the assigned scenario.	Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Relevant conditions identification of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.

Andere uitkomstmaten

Uitkomstmaat	Maatregel Beschrijving	Tijdsspanne
Failure to identify red flags in the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	Failure to identify red flags is defined as the proportion of participants whose final response does not include the expert-defined red-flag symptoms or warning signs for the assigned standardized simulated clinical scenario.	Usually within 1 hour.
Failure to identify red flags in the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the GPT group Tijdsspanne: Usually within 1 hour.	Underestimation of disposition is defined as the proportion of participants whose final care recommendation is lower than the expert-defined disposition level for the assigned standardized simulated clinical scenario.	Usually within 1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the Gemini group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the control group Tijdsspanne: Usually within 1 hour.		Usually within 1 hour.

Medewerkers en onderzoekers

Hier vindt u mensen en organisaties die betrokken zijn bij dit onderzoek.

Sponsor

Capital Medical University

Medewerkers

Xuanwu Hospital, Beijing

Studie record data

Deze datums volgen de voortgang van het onderzoeksdossier en de samenvatting van de ingediende resultaten bij ClinicalTrials.gov. Studieverslagen en gerapporteerde resultaten worden beoordeeld door de National Library of Medicine (NLM) om er zeker van te zijn dat ze voldoen aan specifieke kwaliteitscontrolenormen voordat ze op de openbare website worden geplaatst.

Bestudeer belangrijke data

Studie start (Geschat)

20 juni 2026

Primaire voltooiing (Geschat)

20 juli 2026

Studie voltooiing (Geschat)

20 juli 2026

Studieregistratiedata

Eerst ingediend

11 juni 2026

Eerst ingediend dat voldeed aan de QC-criteria

11 juni 2026

Eerst geplaatst (Werkelijk)

16 juni 2026

Updates van studierecords

Laatste update geplaatst (Werkelijk)

16 juni 2026

Laatste update ingediend die voldeed aan QC-criteria

11 juni 2026

Laatst geverifieerd

1 juni 2026

Meer informatie

Termen gerelateerd aan deze studie

Trefwoorden

Andere studie-ID-nummers

LAMP-1

Plan Individuele Deelnemersgegevens (IPD)

Bent u van plan om gegevens van individuele deelnemers (IPD) te delen?

ONBESLIST

Informatie over medicijnen en apparaten, studiedocumenten

Bestudeert een door de Amerikaanse FDA gereguleerd geneesmiddel

Nee

Bestudeert een door de Amerikaanse FDA gereguleerd apparaatproduct

Nee

Deze informatie is zonder wijzigingen rechtstreeks van de website clinicaltrials.gov gehaald. Als u verzoeken heeft om uw onderzoeksgegevens te wijzigen, te verwijderen of bij te werken, neem dan contact op met register@clinicaltrials.gov. Zodra er een wijziging wordt doorgevoerd op clinicaltrials.gov, wordt deze ook automatisch bijgewerkt op onze website .

Klinische onderzoeken op Relevant Conditions Identification

Keele University

Voltooid

Zou u graag gecontacteerd worden over onderzoek? (The Who)

Aanvaardbaarheid van de patiënt om uitgenodigd te worden om deel te nemen aan onderzoek dat voor hen relevant is

Verenigd Koninkrijk

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

Studie Overzicht

Toestand

Conditie

Interventie / Behandeling

Gedetailleerde beschrijving

Studietype

Inschrijving (Geschat)

Fase

Contacten en locaties

Studiecontact

Studie Contact Back-up

Studie Locaties

Deelname Criteria

Geschiktheidscriteria

Leeftijden die in aanmerking komen voor studie

Accepteert gezonde vrijwilligers

Beschrijving

Studie plan

Hoe is de studie opgezet?

Ontwerpdetails

Aantal wapens

Wapens en interventies

Deelnemersgroep / Arm

Interventie / Behandeling

Wat meet het onderzoek?

Primaire uitkomstmaten

Uitkomstmaat

Maatregel Beschrijving

Tijdsspanne

Secundaire uitkomstmaten

Uitkomstmaat

Maatregel Beschrijving

Tijdsspanne

Andere uitkomstmaten

Uitkomstmaat

Maatregel Beschrijving

Tijdsspanne

Medewerkers en onderzoekers

Sponsor

Medewerkers

Studie record data

Bestudeer belangrijke data

Studie start (Geschat)

Primaire voltooiing (Geschat)

Studie voltooiing (Geschat)

Studieregistratiedata

Eerst ingediend

Eerst ingediend dat voldeed aan de QC-criteria

Eerst geplaatst (Werkelijk)

Updates van studierecords

Laatste update geplaatst (Werkelijk)

Laatste update ingediend die voldeed aan QC-criteria

Laatst geverifieerd

Meer informatie

Termen gerelateerd aan deze studie

Trefwoorden

Andere studie-ID-nummers

Plan Individuele Deelnemersgegevens (IPD)

Bent u van plan om gegevens van individuele deelnemers (IPD) te delen?

Informatie over medicijnen en apparaten, studiedocumenten

Bestudeert een door de Amerikaanse FDA gereguleerd geneesmiddel

Bestudeert een door de Amerikaanse FDA gereguleerd apparaatproduct

Klinische onderzoeken op Relevant Conditions Identification

Zoek naar vergelijkbare onderzoeken

Sponsors en medewerkers

Medische omstandigheden

Geneesmiddelinterventies

CROs by country

CROs in Tanzania

Voorwaarden

Zeldzame ziekten

Geneesmiddelinterventies

Voedingssupplementen

Sponsor / medewerkers

Locaties