Эта страница была переведена автоматически, точность перевода не гарантируется. Пожалуйста, обратитесь к английской версии для исходного текста.

Preliminary Evaluation of a Large Language Model-Based Tool for Complex Surgical Decision Support in Lung Cancer

13 июня 2026 г. обновлено: XiuYuan Chen, Peking University People's Hospital

This study is an exploratory effect-size estimation study, with the following specific objectives: ① to estimate the point estimate and 95% confidence interval of the Win Ratio for the experimental group (GAPS-Agent) versus the control group (large language model) in blinded pairwise preference judgments by thoracic surgery expert adjudicators, to serve as a sample size planning parameter for subsequent multicenter confirmatory clinical trials; ② to preliminarily evaluate the value of GAPS-Agent within clinical workflows.The hypothesis of this study is as follows: compared with a general-purpose large language model without medical enhancement (control group), a structured agentic workflow optimized on the basis of the GAPS evaluation framework (GAPS-Agent, experimental group) can help junior resident physicians generate clinical decision plans for complex lung cancer cases that are more strongly preferred by senior thoracic surgery expert adjudicators.

Обзор исследования

Статус

Запись по приглашению

Условия

Вмешательство/лечение

Тип исследования

Интервенционный

Регистрация (Оцененный)

Фаза

Непригодный

Контакты и местонахождение

В этом разделе приведены контактные данные лиц, проводящих исследование, и информация о том, где проводится это исследование.

Места учебы

Китай
- Beijing Municipality
  - Beijing, Beijing Municipality, Китай, 100044
    - Peking University People's Hospital

Критерии участия

Исследователи ищут людей, которые соответствуют определенному описанию, называемому критериям приемлемости. Некоторыми примерами этих критериев являются общее состояние здоровья человека или предшествующее лечение.

Критерии приемлемости

Возраст, подходящий для обучения

Взрослый
Пожилой взрослый

Принимает здоровых добровольцев

Нет

Описание

Inclusion Criteria:

Resident Physician Subjects:
1. Holds a valid and legally effective Physician Practice License of the People's Republic of China;
2. Currently holds the rank of resident physician in a thoracic surgery department at a tertiary Class A (3A) hospital;
3. Agrees to complete all assessment tasks of the main study phase in accordance with the study protocol;
4. Can guarantee the time and effort required to complete all assessment tasks of the main study.
Study Cases:
1. The case was discussed at the Thoracic Oncology Multidisciplinary Team (MDT) conference of Peking University People's Hospital between January 2025 and May 2026;
2. The current version of the NCCN guidelines does not provide an explicit recommendation covering the management of the case;
3. Does not overlap with the GAPS evaluation set;
4. The case is presented in pure text in a structured format, with all direct and indirect identifiers removed and complete de-identification performed prior to inclusion;
5. From the pool of eligible cases, 12 cases will be randomly drawn using Python (numpy.random, with a fixed and archived seed) to serve as the main study cases. The cases will cover 6 themes (chest mass of undetermined diagnosis, early-stage lung cancer, locally advanced lung cancer, oligometastatic/oligoprogressive disease, special intraoperative situations, and tumor recurrence), with 2 cases per theme.
Adjudication Expert Panel:
1. Holds a valid and legally effective Physician Practice License of the People's Republic of China;
2. Currently holds the rank of attending physician or above in a thoracic surgery department at a tertiary Class A hospital;
3. Chairs or regularly participates in lung cancer multidisciplinary team (MDT) work in their department.

Exclusion Criteria:

Resident Physician Subjects:
1. Has previously participated in the construction of the GAPS evaluation set or the development of GAPS-Agent;
2. Unable to complete the tasks of the study phase.
Study Cases:
1. Key case information is missing, such as text-form data on pathology (including IHC/NGS), imaging, laboratory tests, prior medical history, comorbidities, or PS score;
2. Decision-making for the case is strictly dependent on non-text information.
Adjudication Expert Panel:
1. Participated in the construction of the GAPS evaluation set, the content validity verification, or the development of GAPS-Agent for this study;
2. Has a direct conflict of interest with any specific product among the two-arm tools of this study.

Учебный план

В этом разделе представлена подробная информация о плане исследования, в том числе о том, как планируется исследование и что оно измеряет.

Как устроено исследование?

Детали дизайна

Основная цель: Другой
Распределение: Рандомизированный
Интервенционная модель: Параллельное назначение
Маскировка: Одинокий

Количество рук

Оружие и интервенции

Группа участников / Армия	Вмешательство/лечение
Экспериментальный: test arm GAPS-Agent	Другой: GAPS-Agent The research group has previously developed the GAPS evaluation framework for complex clinical decision-making in lung cancer. In this framework, G (Grounding) characterizes the cognitive depth of decision-making (ranging from knowledge retrieval to decisions that go beyond clinical guidelines), A (Authority) corresponds to the grading of evidence strength, P (Perturbation) describes the identification and management of real-world clinical confounding factors, and S (Strength) corresponds to the calibration of recommendation strength. Within this framework, the research group has completed the construction of a 100-item complex lung cancer decision-making evaluation set along with its corresponding rubrics, and has invited multiple thoracic oncology experts to complete content validity validation. Based on this, the research group developed GAPS-Agent, which uses an open-source large language model as its foundation and integrates functional modules such as guideline and evidence retri
Активный компаратор: control arm LLM	Другой: LLM Open source large language model that is not specifically enhanced in medical field.

Группа участников / Армия

Вмешательство/лечение

Экспериментальный: test arm

GAPS-Agent

Другой: GAPS-Agent

The research group has previously developed the GAPS evaluation framework for complex clinical decision-making in lung cancer. In this framework, G (Grounding) characterizes the cognitive depth of decision-making (ranging from knowledge retrieval to decisions that go beyond clinical guidelines), A (Authority) corresponds to the grading of evidence strength, P (Perturbation) describes the identification and management of real-world clinical confounding factors, and S (Strength) corresponds to the calibration of recommendation strength. Within this framework, the research group has completed the construction of a 100-item complex lung cancer decision-making evaluation set along with its corresponding rubrics, and has invited multiple thoracic oncology experts to complete content validity validation. Based on this, the research group developed GAPS-Agent, which uses an open-source large language model as its foundation and integrates functional modules such as guideline and evidence retri

Активный компаратор: control arm

LLM

Другой: LLM

Open source large language model that is not specifically enhanced in medical field.

Что измеряет исследование?

Первичные показатели результатов

Мера результата	Мера Описание	Временное ограничение
Overall plan Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.

Вторичные показатели результатов

Мера результата	Мера Описание	Временное ограничение
Inter-rater agreement Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	For the ternary preference judgment results of 10 expert judges across 192 paired comparisons and 6 evaluation domains, Fleiss' kappa was used to assess inter-rater agreement. The kappa value and its 95% confidence interval are reported for each evaluation domain.	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Redundancy Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Evidence-based medicine adherence Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Actionability Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Completeness Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Safety Win Ratio Временное ограничение: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
GAPS automated rubric score Временное ограничение: Generated up to 3 weeks after residents finished their plan generation.	A third-party large language model, independent of the two study arms' base models, served as the judge model and automatically scored all 96 plans according to the GAPS rubric.	Generated up to 3 weeks after residents finished their plan generation.
Subject physician's self-confidence score Временное ограничение: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians self-rated their confidence in their own plan using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Tool satisfaction score Временное ограничение: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians rated their satisfaction with the tool using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Tool trustworthiness score Временное ограничение: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians rated the tool's credibility using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Decision-making time Временное ограничение: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	The time taken (in minutes) by each participating physician to complete the production of each case plan was automatically recorded by the evaluation platform. Differences between groups were analyzed using a linear mixed-effects model.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.

Соавторы и исследователи

Здесь вы найдете людей и организации, участвующие в этом исследовании.

Спонсор

Peking University People's Hospital

Даты записи исследования

Эти даты отслеживают ход отправки отчетов об исследованиях и сводных результатов на сайт ClinicalTrials.gov. Записи исследований и сообщаемые результаты проверяются Национальной медицинской библиотекой (NLM), чтобы убедиться, что они соответствуют определенным стандартам контроля качества, прежде чем публиковать их на общедоступном веб-сайте.

Изучение основных дат

Начало исследования (Действительный)

10 июня 2026 г.

Первичное завершение (Оцененный)

21 июня 2026 г.

Завершение исследования (Оцененный)

21 июня 2026 г.

Даты регистрации исследования

Первый отправленный

10 июня 2026 г.

Впервые представлено, что соответствует критериям контроля качества

13 июня 2026 г.

Первый опубликованный (Действительный)

17 июня 2026 г.

Обновления учебных записей

Последнее опубликованное обновление (Действительный)

17 июня 2026 г.

Последнее отправленное обновление, отвечающее критериям контроля качества

13 июня 2026 г.

Последняя проверка

1 июня 2026 г.

Дополнительная информация

Термины, связанные с этим исследованием

Ключевые слова

Дополнительные соответствующие термины MeSH

Другие идентификационные номера исследования

2026PHB458-001

Планирование данных отдельных участников (IPD)

Планируете делиться данными об отдельных участниках (IPD)?

НЕТ

Информация о лекарствах и устройствах, исследовательские документы

Изучает лекарственный продукт, регулируемый FDA США.

Нет

Изучает продукт устройства, регулируемый Управлением по санитарному надзору за качеством пищевых продуктов и медикаментов США.

Нет

Эта информация была получена непосредственно с веб-сайта clinicaltrials.gov без каких-либо изменений. Если у вас есть запросы на изменение, удаление или обновление сведений об исследовании, обращайтесь по адресу register@clinicaltrials.gov. Как только изменение будет реализовано на clinicaltrials.gov, оно будет автоматически обновлено и на нашем веб-сайте. .

Клинические исследования Рак легкого (НМРЛ)

Shanghai Chest Hospital
University Hospital, Rouen; The Affiliated Hospital of Qingdao University; Tianjin... и другие соавторы

Рекрутинг

Даунстейджинг и RATS после неоадъювантной химиоиммунотерапии: влияние на хирургические исходы при НМРЛ (DRAGON-NSCLC)

Неоадъювантная химиоиммунотерапия | Стадия IIB-III NSCLC | Роботизированная резекция легкого

Китай, Италия, Франция
Jianxing He
Innovent Biologics (Suzhou) Co. Ltd.

Рекрутинг

Фулзерасиб в последовательной терапии с синтилимабом и неоадъювантной платиносодержащей двойной терапией при резектабельном KRAS G12C-мутантном НМРЛ (K-NADIR)

Неоадъювантная терапия | Мутация KRAS G12C | Повторный NSCLC | Стадия IB-IIIA НМРЛ

Китай
Shanghai Chest Hospital
University Hospital, Rouen; The Affiliated Hospital of Qingdao University; Tianjin... и другие соавторы

Рекрутинг

Рукавная лобэктомия RATS после неоадъювантной химиоиммунотерапии при НМРЛ (RIDDLE-NSCLC)

Роботизированная хирургия | Неоадъювантная химиоиммунотерапия | Стадия IIB-III NSCLC | Рукавная лобэктомия

Китай, Италия, Франция
Intergroupe Francophone de Cancerologie Thoracique
Amgen

Активный, не рекрутирующий

RW Эффективность соторасиба при метастатическом НМРЛ с мутацией KRAS G12C (LungKG12Ci)

Nsclc | КРАС П.Г12С

Франция
West China Hospital

Еще не набирают

Клинические результаты тишлелизумаба в сочетании с химиотерапией в качестве индукционной терапии на стадии III (CTNM-IIIB/IIIC) NSCLC

NSCLC IIIB, NSCLC Stage IIIC
Wen-zhao ZHONG
Shanghai Junshi Bioscience Co., Ltd.

Рекрутинг

Toripalimab в сочетании с химиотерапией в качестве адъювантной терапии для полностью резецированной стадии IB-IIIB немногсяклеточный рак легких

Стадия IB-IIIB NSCLC | Стадия IIB-III NSCLC

Китай
xiaohua li

Рекрутинг

Чтобы изучить эффективность ситилимаба в сочетании с бевацизумабом при раке прямой кишки

Rcctal Cancer

Китай
Yonsei University

Еще не набирают

Открытое исследование с одноодельной, многоцентровой фазой II для оценки эффективности амивантамаба в сочетании с Folfiri в качестве лечения второй линии у пациентов с распространенным раком дикого типа RAS/BRAF, прогрессирующим на предварительном лечении на основе анти-EGFR.

RAS/BRAF DILE-TYPE Advanced Corelectal Cancer Pement

Корея, Республика
Regina Elena Cancer Institute
Azienda Ospedaliera Universitaria Policlinico Paolo Giaccone Palermo

Рекрутинг

Immune Metabolism Dysregulation and Efficacy to Anti-PD-1 PD-L1 Agents in Non Small Cell Lung Cancer Patients

Nsclc

Италия
Centre Oscar Lambret
University Hospital, Lille

Прекращено

Корреляция между тканевым и плазматическим EGFR при CBNPC с мутацией EGFR или прогностическим фактором мутации EGFR (CONCORDE)

Nsclc

Франция

Клинические исследования GAPS-Agent

ImmunityBio, Inc.

Отозван

Исследование применения Ногапендекина Альфа Инбакицепта и iNKT-клеток у тяжелобольных взрослых пациентов с тяжелой внебольничной пневмонией

Сепсис | Лимфопения | Острый респираторный дистресс-синдром (ОРДС) | Внебольничная пневмония (ВП) | Иммунопаралич
Darren Sigal, MD
Scripps Health

Еще не набирают

BAL/BOT/agenT-797 при КРР c pMMR с метастазами в печени

Метастатический колоректальный рак

Соединенные Штаты
Orchestra BioMed, Inc

Рекрутинг

[Испытание устройства, которое не одобрено или очищено в США FDA]

Ишемическая болезнь сердца

Соединенные Штаты
University of Maryland, Baltimore

Завершенный

Shigella CVD 30000: исследование реакции на вакцинацию против шигеллы

Шигелла

Соединенные Штаты
ImmunityBio, Inc.

Отозван

Ногапендекин Альфа-Инбакицепт и iNKT-клетки для критически больных взрослых с тяжелой внебольничной пневмонией (с сепсисом/ОРДС или без них)

Сепсис | Острый респираторный дистресс-синдром | Тяжелая внебольничная пневмония | Лимфопения / Иммунопаралич у тяжелобольных взрослых
Aydin Adnan Menderes University

Завершенный

Сравнительная оценка уровня бисфенола-А в слюне у пациентов с фиксированными лингвальными ретейнерами, ретейнерами Хоули и вакуумными ретейнерами

Ортодонтическая ретенция

Турция

Preliminary Evaluation of a Large Language Model-Based Tool for Complex Surgical Decision Support in Lung Cancer

Обзор исследования

Статус

Условия

Вмешательство/лечение

Тип исследования

Регистрация (Оцененный)

Фаза

Контакты и местонахождение

Места учебы

Критерии участия

Критерии приемлемости

Возраст, подходящий для обучения

Принимает здоровых добровольцев

Описание

Учебный план

Как устроено исследование?

Детали дизайна

Количество рук

Оружие и интервенции

Группа участников / Армия

Вмешательство/лечение

Что измеряет исследование?

Первичные показатели результатов

Мера результата

Мера Описание

Временное ограничение

Вторичные показатели результатов

Мера результата

Мера Описание

Временное ограничение

Соавторы и исследователи

Спонсор

Даты записи исследования

Изучение основных дат

Начало исследования (Действительный)

Первичное завершение (Оцененный)

Завершение исследования (Оцененный)

Даты регистрации исследования

Первый отправленный

Впервые представлено, что соответствует критериям контроля качества

Первый опубликованный (Действительный)

Обновления учебных записей

Последнее опубликованное обновление (Действительный)

Последнее отправленное обновление, отвечающее критериям контроля качества

Последняя проверка

Дополнительная информация

Термины, связанные с этим исследованием

Ключевые слова

Дополнительные соответствующие термины MeSH

Другие идентификационные номера исследования

Планирование данных отдельных участников (IPD)

Планируете делиться данными об отдельных участниках (IPD)?

Информация о лекарствах и устройствах, исследовательские документы

Изучает лекарственный продукт, регулируемый FDA США.

Изучает продукт устройства, регулируемый Управлением по санитарному надзору за качеством пищевых продуктов и медикаментов США.

Клинические исследования Рак легкого (НМРЛ)

Клинические исследования GAPS-Agent

Искать похожие исследования

Спонсоры и соавторы

Заболевания

Медикаментозные вмешательства

CROs by country

CROs in Burundi

Заболевания

Редкие заболевания

Медикаментозные вмешательства

Пищевые добавки

Спонсор / Соавторы

Места