이 페이지는 자동 번역되었으며 번역의 정확성을 보장하지 않습니다. 참조하십시오 영문판 원본 텍스트의 경우.

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

2026년 7월 3일 업데이트: Ji Xunming,MD,PhD, Capital Medical University

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

This study will evaluate whether three-minute six-dimensions education(3M-6D education) can improve the reliability of large language models as medical assistants for the general public. Participants will be randomly assigned to receive or not receive 3M-6D education and then use ChatGPT, Gemini, or non-AI information resources. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

연구 개요

상태

모병

정황

Relevant Conditions Identification

개입 / 치료

상세 설명

This randomized, controlled, proof-of-concept simulation trial will evaluate whether three-minute six-dimensions education (3M-6D education) can improve the reliability of large language models as medical assistants for the general public.

Eligible participants will be randomly assigned in a 1:1:1:1:1 ratio to one of five study groups: the 3M-6D education GPT group, the GPT group, the 3M-6D education Gemini group, the Gemini group, or the control group. Participants in the 3M-6D education GPT and 3M-6D education Gemini groups will receive approximately three minutes of education before using ChatGPT or Gemini.Each participant will be randomly assigned one of 10 standardized clinical scenarios and complete a simulated counseling task in unrestricted natural language within approximately 10 minutes. The study will assess relevant condition identification, disposition concordance, red-flag identification, and NASA-TLX score.

연구 유형

중재적

등록 (추정된)

525

단계

해당 없음

연락처 및 위치

이 섹션에서는 연구를 수행하는 사람들의 연락처 정보와 이 연구가 수행되는 장소에 대한 정보를 제공합니다.

연구 연락처

이름: Xunming Ji
전화번호: 01083198962
이메일: jixm@ccmu.edu.cn

연구 연락처 백업

이름: Chuanjie Wu
전화번호: 01083199439
이메일: wuchuanjie@ccmu.edu.cn

연구 장소

중국
- Beijing Municipality
  - Beijing, Beijing Municipality, 중국
    - 모병
    - Beijing Ctiy
    - 연락하다:
      
      Chuanjie Wu
      
      전화번호: 010-83199439
      
      이메일: wuchuanjie@ccmu.edu.cn

참여기준

연구원은 적격성 기준이라는 특정 설명에 맞는 사람을 찾습니다. 이러한 기준의 몇 가지 예는 개인의 일반적인 건강 상태 또는 이전 치료입니다.

자격 기준

공부할 수 있는 나이

성인
고령자

건강한 자원 봉사자를 받아들입니다

예

설명

Inclusion Criteria:

Age 18 years or greater, male or female;
Completed primary school or higher education;
Able to use a smartphone or computer to complete online interaction;
No history of acute ischemic stroke, systemic lupus erythematosus, gastric ulcer, pneumonia, acute cardiac infarction, urinary tract infection, uterine fibroids, diabetes, osteoarthritis, or migraine.
Able to understand and comply with study procedures and to provide written informed consent.

Exclusion Criteria:

Currently or previously employed as a healthcare worker;
Previously received systematic medical training;
Currently involved in concurrent research that may interfere with the results of the present trial;
The investigator considered that the participant had other conditions that might affect compliance or preclude participation.

공부 계획

이 섹션에서는 연구 설계 방법과 연구가 측정하는 내용을 포함하여 연구 계획에 대한 세부 정보를 제공합니다.

연구는 어떻게 설계됩니까?

디자인 세부사항

주 목적: 건강 서비스 연구
할당: 무작위
중재 모델: 병렬 할당
마스킹: 하나의

팔의 수

무기와 개입

참가자 그룹 / 팔	개입 / 치료
실험적: 3M-6D education GPT Group Participants will first be trained in 3M-6D education, then use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	다른: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language. 행동: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three minutes six dimensions education (3M-6D education).
실험적: 3M-6D education Gemini Group Participants will first be trained in 3M-6D education, then use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	다른: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language. 행동: three minutes six dimensions education 3M-6D education is designed based on Cognitive Load Theory to reduce the cognitive burden on patients during medical interactions with AI and to improve the clarity and completeness of symptom reporting. Guided by cognitive load theory and the natural process physicians use to take medical histories, the investigators identified candidate information dimensions and developed a structured expression framework with six dimensions for public health queries through a Delphi expert consensus process. Participants were instructed to use the framework to describe their symptoms across these six dimensions; this process can typically be completed within three minutes, so the investigators call this approach three minutes six dimensions education (3M-6D education).
활성 비교기: GPT Group Participants will use ChatGPT to complete a consultation task in unrestricted natural language in approximately 10 minutes.	다른: ChatGPT Participants use ChatGPT to complete a standardized simulated clinical scenarios in unrestricted natural language.
활성 비교기: Gemini Group Participants will use Gemini to complete a consultation task in unrestricted natural language in approximately 10 minutes.	다른: Gemini Participants use Gemini to complete a standardized simulated clinical scenarios in unrestricted natural language.
간섭 없음: Control group Participants will use non-AI tools such as internet searches and medical websites to complete a consultation task in unrestricted natural language in approximately 10 minutes.

연구는 무엇을 측정합니까?

주요 결과 측정

결과 측정	측정값 설명	기간
Relevant conditions identification of the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	Relevant conditions identification is defined as the proportion of participants whose final response includes the expert-defined final diagnosis or a relevant differential diagnosis.	1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	Disposition concordance is defined as the proportion of participants whose final care recommendation matches the expert-defined level. The five levels are self-care, routine outpatient care, urgent outpatient care, emergency department visit, and emergency medical services.	1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.		1 hour.

2차 결과 측정

결과 측정	측정값 설명	기간
Relevant conditions identification of the 3M-6D education GPT group compared with the control group 기간: 1 hour.		1 hour.
Relevant conditions identification of the 3M-6D education Gemini group compared with the control group 기간: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the control group 기간: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education Gemini group compared with the control group 기간: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	Red-flag identification is defined as the proportion of participants whose final response includes the key warning signs that experts defined for the assigned scenario.	1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the control group 기간: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education Gemini group compared with the control group 기간: 1 hour.		1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the control group 기간: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
NASA Task Load Index score of the 3M-6D education Gemini group compared with the control group 기간: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.
Relevant conditions identification of the 3M-6D education GPT group compared with the 3M-6D education Gemini group 기간: 1 hour.		1 hour.
Disposition concordance of the 3M-6D education GPT group compared with the 3M-6D education Gemini group 기간: 1 hour.		1 hour.
Red-flag identification in the 3M-6D education GPT group compared with the 3M-6D education Gemini group 기간: 1 hour.		1 hour.
NASA Task Load Index score of the 3M-6D education GPT group compared with the 3M-6D education Gemini group 기간: 1 hour.	NASA-TLX score is a self-reported task-load score measured after the simulated consultation with a physician. It includes six domains: mental demand, physical demand, temporal demand, effort, frustration, and performance. Each domain is scored from 0 to 100. The total score is the mean of the six domains. Higher scores indicate greater perceived task load.	1 hour.

기타 결과 측정

결과 측정	측정값 설명	기간
Failure to identify red flags in the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	Failure to identify red flags is defined as the proportion of participants whose final response does not include the expert-defined red-flag symptoms or warning signs for the assigned standardized simulated clinical scenario.	1 hour.
Failure to identify red flags in the 3M-6D education GPT group compared with the control group 기간: 1 hour.		1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.		1 hour.
Failure to identify red flags in the 3M-6D education Gemini group compared with the control group 기간: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the GPT group 기간: 1 hour.	Underestimation of disposition is defined as the proportion of participants whose final care recommendation is lower than the expert-defined disposition level for the assigned standardized simulated clinical scenario.	1 hour.
Underestimation of disposition in the 3M-6D education GPT group compared with the control group 기간: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the Gemini group 기간: 1 hour.		1 hour.
Underestimation of disposition in the 3M-6D education Gemini group compared with the control group 기간: 1 hour.		1 hour.

공동 작업자 및 조사자

여기에서 이 연구와 관련된 사람과 조직을 찾을 수 있습니다.

스폰서

Capital Medical University

협력자

Xuanwu Hospital, Beijing

연구 기록 날짜

이 날짜는 ClinicalTrials.gov에 대한 연구 기록 및 요약 결과 제출의 진행 상황을 추적합니다. 연구 기록 및 보고된 결과는 공개 웹사이트에 게시되기 전에 특정 품질 관리 기준을 충족하는지 확인하기 위해 국립 의학 도서관(NLM)에서 검토합니다.

연구 주요 날짜

연구 시작 (실제)

2026년 7월 3일

기본 완료 (추정된)

2026년 7월 20일

연구 완료 (추정된)

2026년 7월 20일

연구 등록 날짜

최초 제출

2026년 6월 11일

QC 기준을 충족하는 최초 제출

2026년 6월 11일

처음 게시됨 (실제)

2026년 6월 16일

연구 기록 업데이트

마지막 업데이트 게시됨 (실제)

2026년 7월 7일

QC 기준을 충족하는 마지막 업데이트 제출

2026년 7월 3일

마지막으로 확인됨

2026년 7월 1일

추가 정보

이 연구와 관련된 용어

키워드

기타 연구 ID 번호

LAMP-1

개별 참가자 데이터(IPD) 계획

개별 참가자 데이터(IPD)를 공유할 계획입니까?

미정

약물 및 장치 정보, 연구 문서

미국 FDA 규제 의약품 연구

아니

미국 FDA 규제 기기 제품 연구

아니

이 정보는 변경 없이 clinicaltrials.gov 웹사이트에서 직접 가져온 것입니다. 귀하의 연구 세부 정보를 변경, 제거 또는 업데이트하도록 요청하는 경우 register@clinicaltrials.gov. 문의하십시오. 변경 사항이 clinicaltrials.gov에 구현되는 즉시 저희 웹사이트에도 자동으로 업데이트됩니다. .

ChatGPT에 대한 임상 시험

Charite University, Berlin, Germany
German Research Foundation; Max Planck Institute for Human Development

아직 모집하지 않음

난소암 선별 검사와 인공지능 (AI-OCS-Gyn)

부인과 의사의 난소암 검진 권장사항

독일
Chang Gung University of Science and Technology
National Science and Technology Council, Taiwan

아직 모집하지 않음

CHF를 가진 나이든 여성의 사회적 연약함에 대한 Chatgpt- 기반 개입 : 성별 차이

소셜커뮤니케이션 | CHF - 울혈성 심부전 | 65세 이상
National Taiwan University Hospital

초대로 등록

자폐 스펙트럼 장애가있는 성인을위한 채용 코치로 chatgpt 사용

자폐성 | 면접 성과

대만
Ankara University

아직 모집하지 않음

간호 학생들의 Chatgpt 지원 고혈압 지식

고혈압 | 인공 지능 | 간호학생
Hartford Hospital
Boston Scientific Corporation

모집하지 않고 적극적으로

비뇨부인과 환자의 인공지능 활용

하부 요로 증상 | 요실금 | 자궁질 탈출증

미국
Chang Gung University of Science and Technology

아직 모집하지 않음

대도시 및 농촌 환경에서 입원 한 노인 Chf에서 육종, 신체적, 심리적, 사회적 연약함 비교

소셜커뮤니케이션 | CHF - 울혈성 심부전 | 65세 이상 | 노인의 근감소증
Carleton University
Université de Montréal

완전한

보건과학계열 학생을 위한 인공지능(ChatGPT)의 효용성 및 사용성 평가 원문보기 KCI 원문보기 인용 (AIHSS)

교육 활동 | AI(인공 지능)

캐나다
Saglik Bilimleri Universitesi

완전한

ChatGPT 기반 간호과정 교육의 영향

간호사-환자 관계

칠면조
University of Manitoba

모병

사무실 기반 환경에서 정관 수술 전 상담에서 ChatGPT의 유용성

피임 | 정관수술

캐나다
Istituto Clinico Humanitas
Fondazione I.R.C.C.S. Istituto Neurologico Carlo Besta

완전한

실제 사례를 활용한 신경과 전문의와의 비교 분석: 복잡한 다발신경병증의 진단 및 관리에서의 ChatGPT (REASON)

다발신경병증

이탈리아

Improving the Reliability of LLMs as Medical Assistants for the General Public (LAMP-1)

Improving the Reliability of LLMs as Medical Assistants for the General Public: a Proof of Concept Simulation Trial

연구 개요

상태

정황

개입 / 치료

상세 설명

연구 유형

등록 (추정된)

단계

연락처 및 위치

연구 연락처

연구 연락처 백업

연구 장소

참여기준

자격 기준

공부할 수 있는 나이

건강한 자원 봉사자를 받아들입니다

설명

공부 계획

연구는 어떻게 설계됩니까?

디자인 세부사항

팔의 수

무기와 개입

참가자 그룹 / 팔

개입 / 치료

연구는 무엇을 측정합니까?

주요 결과 측정

결과 측정

측정값 설명

기간

2차 결과 측정

결과 측정

측정값 설명

기간

기타 결과 측정

결과 측정

측정값 설명

기간

공동 작업자 및 조사자

스폰서

협력자

연구 기록 날짜

연구 주요 날짜

연구 시작 (실제)

기본 완료 (추정된)

연구 완료 (추정된)

연구 등록 날짜

최초 제출

QC 기준을 충족하는 최초 제출

처음 게시됨 (실제)

연구 기록 업데이트

마지막 업데이트 게시됨 (실제)

QC 기준을 충족하는 마지막 업데이트 제출

마지막으로 확인됨

추가 정보

이 연구와 관련된 용어

키워드

기타 연구 ID 번호

개별 참가자 데이터(IPD) 계획

개별 참가자 데이터(IPD)를 공유할 계획입니까?

약물 및 장치 정보, 연구 문서

미국 FDA 규제 의약품 연구

미국 FDA 규제 기기 제품 연구

ChatGPT에 대한 임상 시험

유사한 임상시험 검색

스폰서 및 공동 작업자

건강 상태

약물 개입

CROs by country

CROs in Cambodia

정황

희귀 질병

약물 개입

식이 보충제

스폰서 / 협력자

위치