Tato stránka byla automaticky přeložena a přesnost překladu není zaručena. Podívejte se prosím na anglická verze pro zdrojový text.

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

3. července 2026 aktualizováno: Ji Xunming,MD,PhD, Capital Medical University

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

This study will evaluate whether three-minute six-dimensions education(3M-6D education) can improve the reliability of large language models as medical assistants for the general public. Participants will be randomly assigned to receive or not receive 3M-6D education and then use ChatGPT, Gemini, or non-AI information resources. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Přehled studie

Postavení

Nábor

Podmínky

Relevant Conditions Identification

Intervence / Léčba

Detailní popis

This randomized, controlled, proof-of-concept simulation trial will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public.

Eligible participants will be randomly assigned in a 1:1:1:1:1 ratio to one of five study groups: the 3M-6D education GPT group, the GPT group, the 3M-6D education Gemini group, the Gemini group, or the control group. Participants in the 3M-6D education GPT and 3M-6D education Gemini groups will receive approximately three minutes of education before using ChatGPT or Gemini.Each participant will be randomly assigned one of 10 standardized clinical scenarios and complete a simulated counseling task in unrestricted natural language within approximately 10 minutes. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Typ studie

Intervenční

Zápis (Odhadovaný)

525

Fáze

Nelze použít

Kontakty a umístění

Tato část poskytuje kontaktní údaje pro ty, kteří studii provádějí, a informace o tom, kde se tato studie provádí.

Studijní kontakt

Jméno: Xunming Ji
Telefonní číslo: 01083198962
E-mail: jixm@ccmu.edu.cn

Studijní záloha kontaktů

Jméno: Chuanjie Wu
Telefonní číslo: 01083199439
E-mail: wuchuanjie@ccmu.edu.cn

Studijní místa

Čína
- Beijing Municipality
  - Beijing, Beijing Municipality, Čína
    - Nábor
    - Beijing Ctiy
    - Kontakt:
      
      Chuanjie Wu
      
      Telefonní číslo: 010-83199439
      
      E-mail: wuchuanjie@ccmu.edu.cn

Kritéria účasti

Výzkumníci hledají lidi, kteří odpovídají určitému popisu, kterému se říká kritéria způsobilosti. Některé příklady těchto kritérií jsou celkový zdravotní stav osoby nebo předchozí léčba.

Kritéria způsobilosti

Věk způsobilý ke studiu

Dospělý
Starší dospělý

Přijímá zdravé dobrovolníky

Ano

Popis

Inclusion Criteria:

Age 18 years or greater, male or female;
Completed primary school or higher education;
Able to use a smartphone or computer to complete online interaction;
No history of acute ischemic stroke, systemic lupus erythematosus, gastric ulcer, pneumonia, acute cardiac infarction, urinary tract infection, uterine fibroids, diabetes, osteoarthritis, or migraine.
Able to understand and comply with study procedures and to provide written informed consent.

Exclusion Criteria:

Currently or previously employed as a healthcare worker;
Previously received systematic medical training;
Currently involved in concurrent research that may interfere with the results of the present trial;
The investigator considered that the participant had other conditions that might affect compliance or preclude participation.

Studijní plán

Tato část poskytuje podrobnosti o studijním plánu, včetně toho, jak je studie navržena a co studie měří.

Jak je studie koncipována?

Detaily designu

Primární účel: Výzkum zdravotnických služeb
Přidělení: Randomizované
Intervenční model: Paralelní přiřazení
Maskování: Singl

Počet zbraní

Zbraně a zásahy

Skupina účastníků / Arm	Intervence / Léčba
Experimentální: 3M-6D education GPT Group Participants will first be trained in 3M-6D education, then use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Jiný: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language. Behaviorální: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three minutes six dimensions education (3M-6D education).
Experimentální: 3M-6D education Gemini Group Participants will first be trained in 3M-6D education, then use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Jiný: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language. Behaviorální: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three minutes six dimensions education (3M-6D education).
Aktivní komparátor: GPT Group Participants will use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Jiný: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
Aktivní komparátor: Gemini Group Participants will use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Jiný: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
Žádný zásah: Control group Participants will use non-AI tools such as internet searches and medical websites to complete a consultation task in unrestricted natural language in approximately 10 minutes.

Co je měření studie?

Primární výstupní opatření

Měření výsledku	Popis opatření	Časové okno
Relevant conditions identification of the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	Relevant conditions identification is defined as the proportion of participants whose final response includes the expert-defined final diagnosis or a relevant differential diagnosis.	1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	Disposition concordance is defined as the proportion of participants whose final care recommendation matches the expert-defined level. The five levels are self-care, routine outpatient care, urgent outpatient care, emergency department visit, and emergency medical services.	1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.		1 hour.

Sekundární výstupní opatření

Měření výsledku	Popis opatření	Časové okno
Relevant conditions identification of the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.		1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	Red-flag identification is defined as the proportion of participants whose final response includes the key warning signs that experts defined for the assigned scenario.	1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.		1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
Relevant conditions identification of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Časové okno: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Časové okno: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the 3M-6D education Gemini group Časové okno: 1 hour.		1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Časové okno: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.

Další výstupní opatření

Měření výsledku	Popis opatření	Časové okno
Failure to identify red flags in the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	Failure to identify red flags is defined as the proportion of participants whose final response does not include the expert-defined red-flag symptoms or warning signs for the assigned standardized simulated clinical scenario.	1 hour.
Failure to identify red flags in the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.		1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.		1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the GPT group Časové okno: 1 hour.	Underestimation of disposition is defined as the proportion of participants whose final care recommendation is lower than the expert-defined disposition level for the assigned standardized simulated clinical scenario.	1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the control group Časové okno: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the Gemini group Časové okno: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the control group Časové okno: 1 hour.		1 hour.

Spolupracovníci a vyšetřovatelé

Zde najdete lidi a organizace zapojené do této studie.

Sponzor

Capital Medical University

Spolupracovníci

Xuanwu Hospital, Beijing

Termíny studijních záznamů

Tato data sledují průběh záznamů studie a předkládání souhrnných výsledků na ClinicalTrials.gov. Záznamy ze studií a hlášené výsledky jsou před zveřejněním na veřejné webové stránce přezkoumány Národní lékařskou knihovnou (NLM), aby se ujistily, že splňují specifické standardy kontroly kvality.

Hlavní termíny studia

Začátek studia (Aktuální)

3. července 2026

Primární dokončení (Odhadovaný)

20. července 2026

Dokončení studie (Odhadovaný)

20. července 2026

Termíny zápisu do studia

První předloženo

11. června 2026

První předloženo, které splnilo kritéria kontroly kvality

11. června 2026

První zveřejněno (Aktuální)

16. června 2026

Aktualizace studijních záznamů

Poslední zveřejněná aktualizace (Aktuální)

7. července 2026

Odeslaná poslední aktualizace, která splnila kritéria kontroly kvality

3. července 2026

Naposledy ověřeno

1. července 2026

Více informací

Termíny související s touto studií

Klíčová slova

Další identifikační čísla studie

LAMP-1

Plán pro data jednotlivých účastníků (IPD)

Plánujete sdílet data jednotlivých účastníků (IPD)?

NEROZHODNÝ

Informace o lécích a zařízeních, studijní dokumenty

Studuje lékový produkt regulovaný americkým FDA

Studuje produkt zařízení regulovaný americkým úřadem FDA

Tyto informace byly beze změn načteny přímo z webu clinicaltrials.gov. Máte-li jakékoli požadavky na změnu, odstranění nebo aktualizaci podrobností studie, kontaktujte prosím register@clinicaltrials.gov. Jakmile bude změna implementována na clinicaltrials.gov, bude automaticky aktualizována i na našem webu .

Klinické studie na ChatGPT

Istituto Clinico Humanitas
Fondazione I.R.C.C.S. Istituto Neurologico Carlo Besta

Dokončeno

ChatGPT v diagnostice a léčbě komplexních polyneuropatií: Srovnávací analýza s neurology využívající reálné případy (REASON)

Polyneuropatie

Itálie
Philipps University Marburg

Dokončeno

Al ke zlepšení diagnostiky vzácných revmatických onemocnění (AIDRARER)

Revmatická onemocnění

Německo
Chang Gung University of Science and Technology
National Science and Technology Council, Taiwan

Zatím nenabíráme

Chatgpt -zásah založené na sociální slabosti u starších žen s CHF: Genderové rozdíly

Sociální komunikace | CHF – městnavé srdeční selhání | 65 let starší
Boston Intelligent Medical Research Center, Shenzhen...
Tsinghua University

Zatím nenabíráme

ChatGPT v.s. Člověk při psaní listu předoperační návštěvy

Předoperační péče
Charite University, Berlin, Germany
German Research Foundation; Max Planck Institute for Human Development

Zatím nenabíráme

Screening rakoviny vaječníků a umělá inteligence (AI-OCS-Gyn)

Doporučení gynekologů pro screening rakoviny vaječníků

Německo
Lahore University of Management Sciences
King Edward Medical University

Dokončeno

Vliv velkých jazykových modelů na diagnostické uvažování mezi lékaři

Diagnóza

Pákistán
Hartford Hospital
Boston Scientific Corporation

Aktivní, ne nábor

Využití umělé inteligence u urogynekologických pacientů

Příznaky dolních močových cest | Únik moči | Uterovaginální prolaps

Spojené státy
Chang Gung University of Science and Technology

Zatím nenabíráme

Porovnání sarkopenie, fyzické, psychologické a sociální křehkosti u hospitalizovaných starších žen CHF v metropolitním a venkovském prostředí

Sociální komunikace | CHF – městnavé srdeční selhání | 65 let starší | Sarkopenie u seniorů
Centre Hospitalier Universitaire de Nice

Zápis na pozvánku

Vyhodnocení diagnózy provedené umělou inteligencí v reakci na žádost praktického lékaře na OMNIDOC dermatologovi CHU Nice. Srovnání této odpovědi s odpovědí dermatologa.

Kožní choroby | Umělá inteligence

Francie
Saglik Bilimleri Universitesi

Dokončeno

Vliv školení ošetřovatelského procesu založeného na ChatGPT

Vztahy sestra-pacient

Krocan

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

Přehled studie

Postavení

Podmínky

Intervence / Léčba

Detailní popis

Typ studie

Zápis (Odhadovaný)

Fáze

Kontakty a umístění

Studijní kontakt

Studijní záloha kontaktů

Studijní místa

Kritéria účasti

Kritéria způsobilosti

Věk způsobilý ke studiu

Přijímá zdravé dobrovolníky

Popis

Studijní plán

Jak je studie koncipována?

Detaily designu

Počet zbraní

Zbraně a zásahy

Skupina účastníků / Arm

Intervence / Léčba

Co je měření studie?

Primární výstupní opatření

Měření výsledku

Popis opatření

Časové okno

Sekundární výstupní opatření

Měření výsledku

Popis opatření

Časové okno

Další výstupní opatření

Měření výsledku

Popis opatření

Časové okno

Spolupracovníci a vyšetřovatelé

Sponzor

Spolupracovníci

Termíny studijních záznamů

Hlavní termíny studia

Začátek studia (Aktuální)

Primární dokončení (Odhadovaný)

Dokončení studie (Odhadovaný)

Termíny zápisu do studia

První předloženo

První předloženo, které splnilo kritéria kontroly kvality

První zveřejněno (Aktuální)

Aktualizace studijních záznamů

Poslední zveřejněná aktualizace (Aktuální)

Odeslaná poslední aktualizace, která splnila kritéria kontroly kvality

Naposledy ověřeno

Více informací

Termíny související s touto studií

Klíčová slova

Další identifikační čísla studie

Plán pro data jednotlivých účastníků (IPD)

Plánujete sdílet data jednotlivých účastníků (IPD)?

Informace o lécích a zařízeních, studijní dokumenty

Studuje lékový produkt regulovaný americkým FDA

Studuje produkt zařízení regulovaný americkým úřadem FDA

Klinické studie na ChatGPT

Prohledejte podobné pokusy

Sponzoři a spolupracovníci

Zdravotní podmínky

Drogové intervence

CROs by country

CROs in Switzerland

Podmínky

Vzácné nemoci

Drogové intervence

Doplňky stravy

Sponzor / Spolupracovníci

Místa