Denne siden ble automatisk oversatt og nøyaktigheten av oversettelsen er ikke garantert. Vennligst referer til engelsk versjon for en kildetekst.

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

11. juni 2026 oppdatert av: Ji Xunming,MD,PhD, Capital Medical University

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

This study will evaluate whether three-minute six-dimensions education(3M-6D education) can improve the reliability of large language models as medical assistants for the general public. Participants will be randomly assigned to receive or not receive 3M-6D education and then use ChatGPT, Gemini, or non-AI information resources. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Studieoversikt

Status

Har ikke rekruttert ennå

Forhold

Relevant Conditions Identification

Intervensjon / Behandling

Detaljert beskrivelse

This randomized, controlled, proof-of-concept simulation trial will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public.

Eligible participants will be randomly assigned in a 1:1:1:1:1 ratio to one of five study groups: the 3M-6D education GPT group, the GPT group, the 3M-6D education Gemini group, the Gemini group, or the control group. Participants in the 3M-6D education GPT and 3M-6D education Gemini groups will receive approximately three minutes of education before using ChatGPT or Gemini.Each participant will be randomly assigned one of 10 standardized clinical scenarios and complete a simulated counseling task in unrestricted natural language within approximately 10 minutes. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Studietype

Intervensjonell

Registrering (Antatt)

525

Fase

Ikke aktuelt

Kontakter og plasseringer

Denne delen inneholder kontaktinformasjon for de som utfører studien, og informasjon om hvor denne studien blir utført.

Studiekontakt

Navn: Xunming Ji
Telefonnummer: 01083198962
E-post: jixm@ccmu.edu.cn

Studer Kontakt Backup

Navn: Chuanjie Wu
Telefonnummer: 01083199439
E-post: wuchuanjie@ccmu.edu.cn

Studiesteder

Kina
- Beijing Municipality
  - Beijing, Beijing Municipality, Kina, 100053
    - Xuanwu Hospital, Capital Medical University
    - Ta kontakt med:
      
      Chuanjie Wu
      
      Telefonnummer: 010-83199439
      
      E-post: wuchuanjie@ccmu.edu.cn

Deltakelseskriterier

Forskere ser etter personer som passer til en bestemt beskrivelse, kalt kvalifikasjonskriterier. Noen eksempler på disse kriteriene er en persons generelle helsetilstand eller tidligere behandlinger.

Kvalifikasjonskriterier

Alder som er kvalifisert for studier

Voksen
Eldre voksen

Tar imot friske frivillige

Beskrivelse

Inclusion Criteria:

Age 18 years or greater, male or female;
Completed primary school or higher education;
Able to use a smartphone or computer to complete online interaction;
No history of acute ischemic stroke, systemic lupus erythematosus, gastric ulcer, pneumonia, acute cardiac infarction, urinary tract infection, uterine fibroids, diabetes, osteoarthritis, or migraine.
Able to understand and comply with study procedures and to provide written informed consent.

Exclusion Criteria:

Currently or previously employed as a healthcare worker;
Previously received systematic medical training;
Currently involved in concurrent research that may interfere with the results of the present trial;
The investigator considered that the participant had other conditions that might affect compliance or preclude participation.

Studieplan

Denne delen gir detaljer om studieplanen, inkludert hvordan studien er utformet og hva studien måler.

Hvordan er studiet utformet?

Designdetaljer

Primært formål: Helsetjenesteforskning
Tildeling: Randomisert
Intervensjonsmodell: Parallell tildeling
Masking: Enkelt

Antall våpen

Våpen og intervensjoner

Deltakergruppe / Arm	Intervensjon / Behandling
Eksperimentell: 3M-6D education GPT Group Participants will first be trained in 3M-6D education, then use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Atferdsmessig: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, we identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so we call this approach three minutes six dimensions education (3M-6D education). Annen: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
Eksperimentell: 3M-6D education Gemini Group Participants will first be trained in 3M-6D education, then use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Atferdsmessig: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, we identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so we call this approach three minutes six dimensions education (3M-6D education). Annen: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
Aktiv komparator: GPT Group Participants will use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Annen: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
Aktiv komparator: Gemini Group Participants will use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Annen: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
Ingen inngripen: Control group Participants will use non-AI tools such as internet searches and medical websites to complete a consultation task in unrestricted natural language in approximately 10 minutes.

Hva måler studien?

Primære resultatmål

Resultatmål	Tiltaksbeskrivelse	Tidsramme
Relevant conditions identification of the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	Relevant conditions identification is defined as the proportion of participants whose final response includes the expert-defined final diagnosis or a relevant differential diagnosis.	Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	Disposition concordance is defined as the proportion of participants whose final care recommendation matches the expert-defined level. The five levels are self-care, routine outpatient care, urgent outpatient care, emergency department visit, and emergency medical services.	Usually within 1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.

Sekundære resultatmål

Resultatmål	Tiltaksbeskrivelse	Tidsramme
Relevant conditions identification of the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	Red-flag identification is defined as the proportion of participants whose final response includes the key warning signs that experts defined for the assigned scenario.	Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Relevant conditions identification of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the 3M-6D education Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.

Andre resultatmål

Resultatmål	Tiltaksbeskrivelse	Tidsramme
Failure to identify red flags in the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	Failure to identify red flags is defined as the proportion of participants whose final response does not include the expert-defined red-flag symptoms or warning signs for the assigned standardized simulated clinical scenario.	Usually within 1 hour.
Failure to identify red flags in the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the GPT group Tidsramme: Usually within 1 hour.	Underestimation of disposition is defined as the proportion of participants whose final care recommendation is lower than the expert-defined disposition level for the assigned standardized simulated clinical scenario.	Usually within 1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the Gemini group Tidsramme: Usually within 1 hour.		Usually within 1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the control group Tidsramme: Usually within 1 hour.		Usually within 1 hour.

Samarbeidspartnere og etterforskere

Det er her du vil finne personer og organisasjoner som er involvert i denne studien.

Sponsor

Capital Medical University

Samarbeidspartnere

Xuanwu Hospital, Beijing

Studierekorddatoer

Disse datoene sporer fremdriften for innsending av studieposter og sammendragsresultater til ClinicalTrials.gov. Studieposter og rapporterte resultater gjennomgås av National Library of Medicine (NLM) for å sikre at de oppfyller spesifikke kvalitetskontrollstandarder før de legges ut på det offentlige nettstedet.

Studer hoveddatoer

Studiestart (Antatt)

20. juni 2026

Primær fullføring (Antatt)

20. juli 2026

Studiet fullført (Antatt)

20. juli 2026

Datoer for studieregistrering

Først innsendt

11. juni 2026

Først innsendt som oppfylte QC-kriteriene

11. juni 2026

Først lagt ut (Faktiske)

16. juni 2026

Oppdateringer av studieposter

Sist oppdatering lagt ut (Faktiske)

16. juni 2026

Siste oppdatering sendt inn som oppfylte QC-kriteriene

11. juni 2026

Sist bekreftet

1. juni 2026

Mer informasjon

Begreper knyttet til denne studien

Nøkkelord

Andre studie-ID-numre

LAMP-1

Plan for individuelle deltakerdata (IPD)

Planlegger du å dele individuelle deltakerdata (IPD)?

UBESLUTTE

Legemiddel- og utstyrsinformasjon, studiedokumenter

Studerer et amerikansk FDA-regulert medikamentprodukt

Nei

Studerer et amerikansk FDA-regulert enhetsprodukt

Nei

Denne informasjonen ble hentet direkte fra nettstedet clinicaltrials.gov uten noen endringer. Hvis du har noen forespørsler om å endre, fjerne eller oppdatere studiedetaljene dine, vennligst kontakt register@clinicaltrials.gov. Så snart en endring er implementert på clinicaltrials.gov, vil denne også bli oppdatert automatisk på nettstedet vårt. .

Kliniske studier på Relevant Conditions Identification

Massachusetts General Hospital

Fullført

Effektiviteten av lys for forbedring av kognisjon ved transkraniell gjentatt bruk (ELECTRA) (ELECTRA)

Sykdomstype og/eller -kategori Ikke relevant

Forente stater
Keele University

Fullført

Vil du gjerne bli kontaktet om forskning? (The Who)

Pasientens aksept av å bli invitert til å delta i forskning som er relevant for dem

Storbritannia
Ospedale San Raffaele
amg International

Fullført

Intern biologisk nedbrytbar stent versus ikke-stent hos pasienter med høy risiko for å utvikle fistel etter pankreatoduodenektomi (BioSteP)

Pankreatoduodenektomi | Klinisk relevant postoperativ bukspyttkjertelfistel | Intern biologisk nedbrytbar bukspyttkjertelstent

Italia

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

Studieoversikt

Status

Forhold

Intervensjon / Behandling

Detaljert beskrivelse

Studietype

Registrering (Antatt)

Fase

Kontakter og plasseringer

Studiekontakt

Studer Kontakt Backup

Studiesteder

Deltakelseskriterier

Kvalifikasjonskriterier

Alder som er kvalifisert for studier

Tar imot friske frivillige

Beskrivelse

Studieplan

Hvordan er studiet utformet?

Designdetaljer

Antall våpen

Våpen og intervensjoner

Deltakergruppe / Arm

Intervensjon / Behandling

Hva måler studien?

Primære resultatmål

Resultatmål

Tiltaksbeskrivelse

Tidsramme

Sekundære resultatmål

Resultatmål

Tiltaksbeskrivelse

Tidsramme

Andre resultatmål

Resultatmål

Tiltaksbeskrivelse

Tidsramme

Samarbeidspartnere og etterforskere

Sponsor

Samarbeidspartnere

Studierekorddatoer

Studer hoveddatoer

Studiestart (Antatt)

Primær fullføring (Antatt)

Studiet fullført (Antatt)

Datoer for studieregistrering

Først innsendt

Først innsendt som oppfylte QC-kriteriene

Først lagt ut (Faktiske)

Oppdateringer av studieposter

Sist oppdatering lagt ut (Faktiske)

Siste oppdatering sendt inn som oppfylte QC-kriteriene

Sist bekreftet

Mer informasjon

Begreper knyttet til denne studien

Nøkkelord

Andre studie-ID-numre

Plan for individuelle deltakerdata (IPD)

Planlegger du å dele individuelle deltakerdata (IPD)?

Legemiddel- og utstyrsinformasjon, studiedokumenter

Studerer et amerikansk FDA-regulert medikamentprodukt

Studerer et amerikansk FDA-regulert enhetsprodukt

Kliniske studier på Relevant Conditions Identification

Søk i lignende forsøk

Sponsorer og samarbeidspartnere

Medisinsk tilstand

Legemiddelintervensjoner

CROs by country

CROs in Guatemala

Forhold

Sjeldne sykdommer

Legemiddelintervensjoner

Kosttilskudd

Sponsor / samarbeidspartnere

Steder