Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

Updated July 3, 2026 by: Ji Xunming, MD, PhD, Capital Medical University

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

This study will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public. Participants will be randomly assigned to receive or not receive 3M-6D education and then use ChatGPT, Gemini, or non-AI information resources. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

Study Overview

Status

Recruiting

Conditions

Relevant Conditions Identification

Intervention / Treatment

Detailed Description

This randomized, controlled, proof-of-concept simulation trial will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public.

Eligible participants will be randomly assigned in a 1:1:1:1:1 ratio to one of five study groups: the 3M-6D education GPT group, the GPT group, the 3M-6D education Gemini group, the Gemini group, or the control group. Participants in the 3M-6D education GPT and 3M-6D education Gemini groups will receive approximately three minutes of education before using ChatGPT or Gemini. Each participant will be randomly assigned one of 10 standardized clinical scenarios and complete a simulated counseling task in unrestricted natural language within approximately 10 minutes. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.
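The allocation described above can be illustrated with a short sketch. The record does not state the randomization method, so the permuted blocks of five, the fixed seed, and the names `ARMS`, `N_SCENARIOS`, and `allocate` are assumptions made for illustration only, not the investigators' procedure.

```python
import random
from collections import Counter

# Arm labels taken from the study description.
ARMS = [
    "3M-6D education GPT",
    "GPT",
    "3M-6D education Gemini",
    "Gemini",
    "Control",
]
N_SCENARIOS = 10  # the record states 10 standardized clinical scenarios


def allocate(n_participants, seed=0):
    """Return (arm, scenario) pairs using permuted blocks of five (an assumption)."""
    rng = random.Random(seed)
    allocations = []
    while len(allocations) < n_participants:
        block = ARMS[:]      # one slot per arm keeps the ratio at 1:1:1:1:1
        rng.shuffle(block)   # random order within each block
        for arm in block:
            scenario = rng.randint(1, N_SCENARIOS)  # simple random scenario draw
            allocations.append((arm, scenario))
    return allocations[:n_participants]


if __name__ == "__main__":
    schedule = allocate(525)
    print(Counter(arm for arm, _ in schedule))
```

With the estimated enrollment of 525, blocks of five give 105 participants per arm. Scenario draws here are unbalanced simple random draws; a real trial might stratify or balance them.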

Study Type

Interventional

Enrollment (Estimated)

525

Phase

Not Applicable

Contacts and Locations

This section provides contact details for the people conducting the study and information on where the study is being conducted.

Study Contact

Name: Xunming Ji
Phone Number: 01083198962
Email: jixm@ccmu.edu.cn

Study Contact Backup

Name: Chuanjie Wu
Phone Number: 01083199439
Email: wuchuanjie@ccmu.edu.cn

Study Locations

China
- Beijing Municipality
  - Beijing, Beijing Municipality, China
    - Recruiting
    - Beijing City
    - Contact: Chuanjie Wu
      Phone Number: 010-83199439
      Email: wuchuanjie@ccmu.edu.cn

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Adult
Older Adult

Accepts Healthy Volunteers

Yes

Description

Inclusion Criteria:

Age 18 years or greater, male or female;
Completed primary school or higher education;
Able to use a smartphone or computer to complete online interaction;
No history of acute ischemic stroke, systemic lupus erythematosus, gastric ulcer, pneumonia, acute cardiac infarction, urinary tract infection, uterine fibroids, diabetes, osteoarthritis, or migraine;
Able to understand and comply with study procedures and to provide written informed consent.

Exclusion Criteria:

Currently or previously employed as a healthcare worker;
Previously received systematic medical training;
Currently involved in concurrent research that may interfere with the results of the present trial;
The investigator considers that the participant has other conditions that might affect compliance or preclude participation.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Primary Purpose: Health Services Research
Allocation: Randomized
Interventional Model: Parallel Assignment
Masking: Single

Number of Arms

Arms and Interventions

Participant Group / Arm	Intervention / Treatment
Experimental: 3M-6D education GPT Group. Participants will first be trained in 3M-6D education, then use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Other: ChatGPT. Participants use ChatGPT to complete a standardized simulated clinical scenario in unrestricted natural language. Behavioral: three-minute six-dimensions education. 3M-6D education is designed based on cognitive load theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants are instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three-minute six-dimensions education (3M-6D education).
Experimental: 3M-6D education Gemini Group. Participants will first be trained in 3M-6D education, then use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Other: Gemini. Participants use Gemini to complete a standardized simulated clinical scenario in unrestricted natural language. Behavioral: three-minute six-dimensions education. 3M-6D education is designed based on cognitive load theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants are instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three-minute six-dimensions education (3M-6D education).
Active Comparator: GPT Group. Participants will use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Other: ChatGPT. Participants use ChatGPT to complete a standardized simulated clinical scenario in unrestricted natural language.
Active Comparator: Gemini Group. Participants will use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	Other: Gemini. Participants use Gemini to complete a standardized simulated clinical scenario in unrestricted natural language.
No Intervention: Control group. Participants will use non-AI tools such as internet searches and medical websites to complete a consultation task in unrestricted natural language in approximately 10 minutes.

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Relevant conditions identification of the 3M-6D education GPT group compared with the GPT group	Relevant conditions identification is defined as the proportion of participants whose final response includes the expert-defined final diagnosis or a relevant differential diagnosis.	1 hour
Disposition concordance of the 3M-6D education GPT group compared with the GPT group	Disposition concordance is defined as the proportion of participants whose final care recommendation matches the expert-defined level. The five levels are self-care, routine outpatient care, urgent outpatient care, emergency department visit, and emergency medical services.	1 hour
Relevant conditions identification of the 3M-6D education Gemini group compared with the Gemini group		1 hour
Disposition concordance of the 3M-6D education Gemini group compared with the Gemini group		1 hour
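Both primary outcomes are per-arm proportions, which the following sketch makes concrete. The `Response` record, its field names, and the toy data are hypothetical; the record does not describe how responses are stored or adjudicated.

```python
from dataclasses import dataclass


@dataclass
class Response:
    """One participant's adjudicated final response (hypothetical structure)."""
    arm: str
    identified_relevant_condition: bool  # expert-defined diagnosis or relevant differential present
    recommended_disposition: str         # participant's final care recommendation
    expert_disposition: str              # expert-defined level for the scenario


def proportion(flags):
    """Share of True values; raises if the arm has no participants."""
    flags = list(flags)
    if not flags:
        raise ValueError("no participants in this arm")
    return sum(flags) / len(flags)


def arm_outcomes(responses, arm):
    """Compute both primary outcome proportions for one arm."""
    rows = [r for r in responses if r.arm == arm]
    return {
        "relevant_conditions_identification": proportion(
            r.identified_relevant_condition for r in rows
        ),
        "disposition_concordance": proportion(
            r.recommended_disposition == r.expert_disposition for r in rows
        ),
    }


if __name__ == "__main__":
    toy = [  # made-up data, for illustration only
        Response("GPT", True, "self-care", "self-care"),
        Response("GPT", False, "self-care", "emergency department visit"),
    ]
    print(arm_outcomes(toy, "GPT"))
```

A between-arm comparison, such as 3M-6D education GPT versus GPT, would then be the difference or ratio of these proportions; the statistical test used in the trial is not stated in this record.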

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Relevant conditions identification of the 3M-6D education GPT group compared with the control group		1 hour
Relevant conditions identification of the 3M-6D education Gemini group compared with the control group		1 hour
Disposition concordance of the 3M-6D education GPT group compared with the control group		1 hour
Disposition concordance of the 3M-6D education Gemini group compared with the control group		1 hour
Red-flag identification in the 3M-6D education GPT group compared with the GPT group	Red-flag identification is defined as the proportion of participants whose final response includes the key warning signs that experts defined for the assigned scenario.	1 hour
Red-flag identification in the 3M-6D education GPT group compared with the control group		1 hour
Red-flag identification in the 3M-6D education Gemini group compared with the Gemini group		1 hour
Red-flag identification in the 3M-6D education Gemini group compared with the control group		1 hour
NASA Task Load Index score of the 3M-6D education GPT group compared with the GPT group	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour
NASA Task Load Index score of the 3M-6D education GPT group compared with the control group	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour
NASA Task Load Index score of the 3M-6D education Gemini group compared with the Gemini group	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour
NASA Task Load Index score of the 3M-6D education Gemini group compared with the control group	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour
Relevant conditions identification of the 3M-6D education GPT group compared with the 3M-6D education Gemini group		1 hour
Disposition concordance of the 3M-6D education GPT group compared with the 3M-6D education Gemini group		1 hour
Red-flag identification in the 3M-6D education GPT group compared with the 3M-6D education Gemini group		1 hour
NASA Task Load Index score of the 3M-6D education GPT group compared with the 3M-6D education Gemini group	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour
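The NASA-TLX total described in the table is an unweighted mean of six domain ratings, each from 0 to 100. A minimal sketch of that arithmetic follows; the dictionary keys and the function name `nasa_tlx_total` are illustrative choices, not part of the record.

```python
# The six domains named in the measure description.
DOMAINS = (
    "mental_demand",
    "physical_demand",
    "temporal_demand",
    "effort",
    "frustration",
    "performance",
)


def nasa_tlx_total(ratings):
    """Return the mean of the six domain ratings (each must be within 0-100)."""
    missing = [d for d in DOMAINS if d not in ratings]
    if missing:
        raise ValueError(f"missing domains: {missing}")
    for domain in DOMAINS:
        if not 0 <= ratings[domain] <= 100:
            raise ValueError(f"{domain} must be between 0 and 100")
    return sum(ratings[d] for d in DOMAINS) / len(DOMAINS)


if __name__ == "__main__":
    example = {  # made-up ratings, for illustration only
        "mental_demand": 60,
        "physical_demand": 10,
        "temporal_demand": 40,
        "effort": 50,
        "frustration": 30,
        "performance": 20,
    }
    print(nasa_tlx_total(example))  # (60 + 10 + 40 + 50 + 30 + 20) / 6 = 35.0
```

Higher totals indicate greater perceived task load, as the measure description states.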

Other Outcome Measures

Outcome Measure	Measure Description	Time Frame
Failure to identify red flags in the 3M-6D education GPT group compared with the GPT group	Failure to identify red flags is defined as the proportion of participants whose final response does not include the expert-defined red-flag symptoms or warning signs for the assigned standardized simulated clinical scenario.	1 hour
Failure to identify red flags in the 3M-6D education GPT group compared with the control group		1 hour
Failure to identify red flags in the 3M-6D education Gemini group compared with the Gemini group		1 hour
Failure to identify red flags in the 3M-6D education Gemini group compared with the control group		1 hour
Underestimation of disposition in the 3M-6D education GPT group compared with the GPT group	Underestimation of disposition is defined as the proportion of participants whose final care recommendation is lower than the expert-defined disposition level for the assigned standardized simulated clinical scenario.	1 hour
Underestimation of disposition in the 3M-6D education GPT group compared with the control group		1 hour
Underestimation of disposition in the 3M-6D education Gemini group compared with the Gemini group		1 hour
Underestimation of disposition in the 3M-6D education Gemini group compared with the control group		1 hour
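Underestimation of disposition requires an ordering of the five care levels. The sketch below assumes the levels are ranked from lowest to highest urgency in the order the record lists them (self-care through emergency medical services); that ordering, and the names `LEVELS`, `RANK`, `is_underestimated`, and `underestimation_rate`, are assumptions for illustration.

```python
# Assumed ordering: lowest to highest urgency, in the order the record lists the levels.
LEVELS = [
    "self-care",
    "routine outpatient care",
    "urgent outpatient care",
    "emergency department visit",
    "emergency medical services",
]
RANK = {level: index for index, level in enumerate(LEVELS)}


def is_underestimated(recommended, expert):
    """True when the participant's recommendation ranks below the expert-defined level."""
    return RANK[recommended] < RANK[expert]


def underestimation_rate(pairs):
    """Proportion of (recommended, expert) pairs where the recommendation is too low."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("no participants to evaluate")
    return sum(is_underestimated(rec, exp) for rec, exp in pairs) / len(pairs)


if __name__ == "__main__":
    toy_pairs = [  # made-up data, for illustration only
        ("self-care", "emergency department visit"),              # too low
        ("emergency medical services", "urgent outpatient care"),  # too high
        ("routine outpatient care", "routine outpatient care"),    # concordant
        ("routine outpatient care", "urgent outpatient care"),     # too low
    ]
    print(underestimation_rate(toy_pairs))  # 2 of 4 pairs are too low: 0.5
```

Failure to identify red flags is the complement of the red-flag identification proportion for the same arm, so it needs no separate ordering.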

Collaborators and Investigators

This is where you will find the people and organizations involved with this study.

Sponsor

Capital Medical University

Collaborators

Xuanwu Hospital, Beijing

Study Record Dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

July 3, 2026

Primary Completion (Estimated)

July 20, 2026

Study Completion (Estimated)

July 20, 2026

Study Registration Dates

First Submitted

June 11, 2026

First Submitted That Met QC Criteria

June 11, 2026

First Posted (Actual)

June 16, 2026

Study Record Updates

Last Update Posted (Actual)

July 7, 2026

Last Update Submitted That Met QC Criteria

July 3, 2026

Last Verified

July 1, 2026

More Information

Terms Related to This Study

Keywords

Other Study ID Numbers

LAMP-1

Plan for Individual Participant Data (IPD)

Plan to Share Individual Participant Data (IPD)?

UNDECIDED

Drug and Device Information, Study Documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

This information was retrieved directly from the clinicaltrials.gov website without any changes. If you want to change, remove, or update your study details, contact register@clinicaltrials.gov. As soon as a change is made on clinicaltrials.gov, it will be updated automatically on our website as well.

