Tämä sivu käännettiin automaattisesti, eikä käännösten tarkkuutta voida taata. Katso englanninkielinen versio lähdetekstiä varten.

Preliminary Evaluation of a Large Language Model-Based Tool for Complex Surgical Decision Support in Lung Cancer

lauantai 13. kesäkuuta 2026 päivittänyt: XiuYuan Chen, Peking University People's Hospital

This study is an exploratory effect-size estimation study, with the following specific objectives: ① to estimate the point estimate and 95% confidence interval of the Win Ratio for the experimental group (GAPS-Agent) versus the control group (large language model) in blinded pairwise preference judgments by thoracic surgery expert adjudicators, to serve as a sample size planning parameter for subsequent multicenter confirmatory clinical trials; ② to preliminarily evaluate the value of GAPS-Agent within clinical workflows.The hypothesis of this study is as follows: compared with a general-purpose large language model without medical enhancement (control group), a structured agentic workflow optimized on the basis of the GAPS evaluation framework (GAPS-Agent, experimental group) can help junior resident physicians generate clinical decision plans for complex lung cancer cases that are more strongly preferred by senior thoracic surgery expert adjudicators.

Tutkimuksen yleiskatsaus

Tila

Ilmoittautuminen kutsusta

Ehdot

Interventio / Hoito

Opintotyyppi

Interventio

Ilmoittautuminen (Arvioitu)

Vaihe

Ei sovellettavissa

Yhteystiedot ja paikat

Tässä osiossa on tutkimuksen suorittajien yhteystiedot ja tiedot siitä, missä tämä tutkimus suoritetaan.

Opiskelupaikat

Kiina
- Beijing Municipality
  - Beijing, Beijing Municipality, Kiina, 100044
    - Peking University People's Hospital

Osallistumiskriteerit

Tutkijat etsivät ihmisiä, jotka sopivat tiettyyn kuvaukseen, jota kutsutaan kelpoisuuskriteereiksi. Joitakin esimerkkejä näistä kriteereistä ovat henkilön yleinen terveydentila tai aiemmat hoidot.

Kelpoisuusvaatimukset

Opintokelpoiset iät

Aikuinen
Vanhempi Aikuinen

Hyväksyy terveitä vapaaehtoisia

Kuvaus

Inclusion Criteria:

Resident Physician Subjects:
1. Holds a valid and legally effective Physician Practice License of the People's Republic of China;
2. Currently holds the rank of resident physician in a thoracic surgery department at a tertiary Class A (3A) hospital;
3. Agrees to complete all assessment tasks of the main study phase in accordance with the study protocol;
4. Can guarantee the time and effort required to complete all assessment tasks of the main study.
Study Cases:
1. The case was discussed at the Thoracic Oncology Multidisciplinary Team (MDT) conference of Peking University People's Hospital between January 2025 and May 2026;
2. The current version of the NCCN guidelines does not provide an explicit recommendation covering the management of the case;
3. Does not overlap with the GAPS evaluation set;
4. The case is presented in pure text in a structured format, with all direct and indirect identifiers removed and complete de-identification performed prior to inclusion;
5. From the pool of eligible cases, 12 cases will be randomly drawn using Python (numpy.random, with a fixed and archived seed) to serve as the main study cases. The cases will cover 6 themes (chest mass of undetermined diagnosis, early-stage lung cancer, locally advanced lung cancer, oligometastatic/oligoprogressive disease, special intraoperative situations, and tumor recurrence), with 2 cases per theme.
Adjudication Expert Panel:
1. Holds a valid and legally effective Physician Practice License of the People's Republic of China;
2. Currently holds the rank of attending physician or above in a thoracic surgery department at a tertiary Class A hospital;
3. Chairs or regularly participates in lung cancer multidisciplinary team (MDT) work in their department.

Exclusion Criteria:

Resident Physician Subjects:
1. Has previously participated in the construction of the GAPS evaluation set or the development of GAPS-Agent;
2. Unable to complete the tasks of the study phase.
Study Cases:
1. Key case information is missing, such as text-form data on pathology (including IHC/NGS), imaging, laboratory tests, prior medical history, comorbidities, or PS score;
2. Decision-making for the case is strictly dependent on non-text information.
Adjudication Expert Panel:
1. Participated in the construction of the GAPS evaluation set, the content validity verification, or the development of GAPS-Agent for this study;
2. Has a direct conflict of interest with any specific product among the two-arm tools of this study.

Opintosuunnitelma

Tässä osiossa on tietoja tutkimussuunnitelmasta, mukaan lukien kuinka tutkimus on suunniteltu ja mitä tutkimuksella mitataan.

Miten tutkimus on suunniteltu?

Suunnittelun yksityiskohdat

Ensisijainen käyttötarkoitus: Muut
Jako: Satunnaistettu
Inventiomalli: Rinnakkaistehtävä
Naamiointi: Yksittäinen

Aseiden lukumäärä

Aseet ja interventiot

Osallistujaryhmä / Arm	Interventio / Hoito
Kokeellinen: test arm GAPS-Agent	Muut: GAPS-Agent The research group has previously developed the GAPS evaluation framework for complex clinical decision-making in lung cancer. In this framework, G (Grounding) characterizes the cognitive depth of decision-making (ranging from knowledge retrieval to decisions that go beyond clinical guidelines), A (Authority) corresponds to the grading of evidence strength, P (Perturbation) describes the identification and management of real-world clinical confounding factors, and S (Strength) corresponds to the calibration of recommendation strength. Within this framework, the research group has completed the construction of a 100-item complex lung cancer decision-making evaluation set along with its corresponding rubrics, and has invited multiple thoracic oncology experts to complete content validity validation. Based on this, the research group developed GAPS-Agent, which uses an open-source large language model as its foundation and integrates functional modules such as guideline and evidence retri
Active Comparator: control arm LLM	Muut: LLM Open source large language model that is not specifically enhanced in medical field.

Osallistujaryhmä / Arm

Interventio / Hoito

Kokeellinen: test arm

GAPS-Agent

Muut: GAPS-Agent

The research group has previously developed the GAPS evaluation framework for complex clinical decision-making in lung cancer. In this framework, G (Grounding) characterizes the cognitive depth of decision-making (ranging from knowledge retrieval to decisions that go beyond clinical guidelines), A (Authority) corresponds to the grading of evidence strength, P (Perturbation) describes the identification and management of real-world clinical confounding factors, and S (Strength) corresponds to the calibration of recommendation strength. Within this framework, the research group has completed the construction of a 100-item complex lung cancer decision-making evaluation set along with its corresponding rubrics, and has invited multiple thoracic oncology experts to complete content validity validation. Based on this, the research group developed GAPS-Agent, which uses an open-source large language model as its foundation and integrates functional modules such as guideline and evidence retri

Active Comparator: control arm

LLM

Muut: LLM

Open source large language model that is not specifically enhanced in medical field.

Mitä tutkimuksessa mitataan?

Ensisijaiset tulostoimenpiteet

Tulosmittaus	Toimenpiteen kuvaus	Aikaikkuna
Overall plan Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.

Toissijaiset tulostoimenpiteet

Tulosmittaus	Toimenpiteen kuvaus	Aikaikkuna
Inter-rater agreement Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	For the ternary preference judgment results of 10 expert judges across 192 paired comparisons and 6 evaluation domains, Fleiss' kappa was used to assess inter-rater agreement. The kappa value and its 95% confidence interval are reported for each evaluation domain.	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Redundancy Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Evidence-based medicine adherence Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Actionability Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Completeness Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
Safety Win Ratio Aikaikkuna: Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.	A total of 10 blinded expert judges made Win/Tie/Loss ternary preference judgments on 192 paired scheme comparisons in terms of overall scheme quality. The win ratio was calculated as Wins ÷ Losses, and the 95% confidence interval was estimated using a two-level (physician × case) cluster bootstrap resampling method (B = 10,000, quantile method on the log scale).	Measured at the time when experts completed their preference judgements. Calculated up to 3 weeks after the preference judgements.
GAPS automated rubric score Aikaikkuna: Generated up to 3 weeks after residents finished their plan generation.	A third-party large language model, independent of the two study arms' base models, served as the judge model and automatically scored all 96 plans according to the GAPS rubric.	Generated up to 3 weeks after residents finished their plan generation.
Subject physician's self-confidence score Aikaikkuna: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians self-rated their confidence in their own plan using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Tool satisfaction score Aikaikkuna: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians rated their satisfaction with the tool using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Tool trustworthiness score Aikaikkuna: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	After submitting each case plan, the participating physicians rated the tool's credibility using a 1-5 point Likert scale.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.
Decision-making time Aikaikkuna: Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.	The time taken (in minutes) by each participating physician to complete the production of each case plan was automatically recorded by the evaluation platform. Differences between groups were analyzed using a linear mixed-effects model.	Completed at the time when residents submitted their plans. Calculated up to 3 weeks after the submission.

Yhteistyökumppanit ja tutkijat

Täältä löydät tähän tutkimukseen osallistuvat ihmiset ja organisaatiot.

Sponsori

Peking University People's Hospital

Opintojen ennätyspäivät

Nämä päivämäärät seuraavat ClinicalTrials.gov-sivustolle lähetettyjen tutkimustietueiden ja yhteenvetojen edistymistä. National Library of Medicine (NLM) tarkistaa tutkimustiedot ja raportoidut tulokset varmistaakseen, että ne täyttävät tietyt laadunvalvontastandardit, ennen kuin ne julkaistaan julkisella verkkosivustolla.

Opi tärkeimmät päivämäärät

Opiskelun aloitus (Todellinen)

Keskiviikko 10. kesäkuuta 2026

Ensisijainen valmistuminen (Arvioitu)

Sunnuntai 21. kesäkuuta 2026

Opintojen valmistuminen (Arvioitu)

Sunnuntai 21. kesäkuuta 2026

Opintoihin ilmoittautumispäivät

Ensimmäinen lähetetty

Keskiviikko 10. kesäkuuta 2026

Ensimmäinen toimitettu, joka täytti QC-kriteerit

Lauantai 13. kesäkuuta 2026

Ensimmäinen Lähetetty (Todellinen)

Keskiviikko 17. kesäkuuta 2026

Tutkimustietojen päivitykset

Viimeisin päivitys julkaistu (Todellinen)

Keskiviikko 17. kesäkuuta 2026

Viimeisin lähetetty päivitys, joka täytti QC-kriteerit

Lauantai 13. kesäkuuta 2026

Viimeksi vahvistettu

Maanantai 1. kesäkuuta 2026

Lisää tietoa

Tähän tutkimukseen liittyvät termit

Avainsanat

Muita asiaankuuluvia MeSH-ehtoja

Muut tutkimustunnusnumerot

2026PHB458-001

Yksittäisten osallistujien tietojen suunnitelma (IPD)

Aiotko jakaa yksittäisten osallistujien tietoja (IPD)?

Lääke- ja laitetiedot, tutkimusasiakirjat

Tutkii yhdysvaltalaista FDA sääntelemää lääkevalmistetta

Tutkii yhdysvaltalaista FDA sääntelemää laitetuotetta

Nämä tiedot haettiin suoraan verkkosivustolta clinicaltrials.gov ilman muutoksia. Jos sinulla on pyyntöjä muuttaa, poistaa tai päivittää tutkimustietojasi, ota yhteyttä register@clinicaltrials.gov. Heti kun muutos on otettu käyttöön osoitteessa clinicaltrials.gov, se päivitetään automaattisesti myös verkkosivustollemme .

Kliiniset tutkimukset Keuhkosyöpä (NSCLC)

Everest Medicines (Beijing) Co., Ltd.

Ei vielä rekrytointia

Evaluate the Safety, Tolerability, and Preliminary Efficacy of EVM14 in Combination With Ivonescimab in Sq-NSCLC Patients

Squamous Non-Small Cell Lung Cancer sqNSCLC
Jonsson Comprehensive Cancer Center
Eli Lilly and Company; Genentech, Inc.

Aktiivinen, ei rekrytointi

Nesitumumabi ja trastutsumabi yhdistelmänä osimertinibin kanssa refraktaarisen epidermaalisen kasvutekijäreseptorin (EGFR) -mutaation IV-vaiheen ei-pienisoluisen keuhkosyövän hoitoon

Metastaattinen keuhkojen ei-pienisolusyöpä | Tulenkestävä keuhkojen ei-pienisolusyöpä | Stage IV Lung Cancer American Joint Committee on Cancer (AJCC) v8 | Stage IVA Lung Cancer AJCC v8 | Vaihe IVB keuhkosyöpä AJCC v8

Yhdysvallat
The First Affiliated Hospital of Guangzhou Medical...

Rekrytointi

LcProt: Proteomics Longitudinal Cohort Study on Lung Cancer

Syöpä | NSCLC | Keuhkosyöpä | Lung

Kiina
The First Affiliated Hospital of Guangzhou Medical...

Rekrytointi

Keuhkosyövän pitkittäissuuntaisen kohorttitutkimuksen luominen kudoksen ja perifeerisen veren metabolien avulla.

Keuhkosyöpä | Lung | Aineenvaihdunta | Keuhkosyöpä (NSCLC)

Kiina
Philipps University Marburg

Valmis

Viruskuormitusohjattu immunosuppressio keuhkosiirron jälkeen (VIGILung)

Transplantation Lung

Itävalta, Saksa
University of Southern California
National Cancer Institute (NCI); Genentech, Inc.

Aktiivinen, ei rekrytointi

Ihonalainen atetsolitsumabi ei-pienisoluisen keuhkosyövän hoitoon

Stage IVA Lung Cancer AJCC v8 | Vaihe IVB keuhkosyöpä AJCC v8 | Keuhkojen ei-pienisolukarsinooma | Vaiheen III keuhkosyöpä AJCC v8 | IV vaiheen keuhkosyöpä AJCC v8 | Vaiheen II keuhkosyöpä AJCC v8 | Stage IIA Lung Cancer AJCC v8 | Vaiheen IIB keuhkosyöpä AJCC v8 | Vaiheen IIIA keuhkosyöpä AJCC v8 | Vaiheen IIIB... ja muut ehdot

Yhdysvallat
The University of Hong Kong

Rekrytointi

Lyhyt SMART-harjoituksen pikaviestintätuki - pilottitutkimus

Syöpä | Lung

Hong Kong
Stanford University

Rekrytointi

Panitumumab-IRDye800 syövän havaitsemisessa potilailta, joilla on keuhkosyöpä leikkauksen aikana

Stage IVA Lung Cancer AJCC v8 | Vaihe IVB keuhkosyöpä AJCC v8 | IV vaiheen keuhkosyöpä AJCC v8 | Keuhkokarsinooma | Metastaattinen pahanlaatuinen kasvain keuhkoissa

Yhdysvallat
University of Sao Paulo General Hospital

Rekrytointi

Vertaileva tutkimus Sternalock® Bluen rintalastan sulkemisen ja teräslangan välillä, joka on jätetty kahdenväliseen rintalastan transternaaliseen torakotomiaan (simpukka) kahdenvälistä keuhkosiirtoa varten (LungTx-Lock)

Lung | Keuhkojen siirto

Brasilia
Jiangmen Central Hospital

Ei vielä rekrytointia

Low-dose Thoracic Radiotherapy Followed by Adebrelimab Plus Chemotherapy, and Then Sequential Maintenance Therapy With Adebrelimab for Extensive-stage Small Cell Lung Cancer

Lung | Sädehoito

Kliiniset tutkimukset GAPS-Agent

Boston University Charles River Campus
Mental Health Center of Denver; Connecticut State, Department of Mental... ja muut yhteistyökumppanit

Valmis

Yhteisön aukkojen kurominen Photovoice (BCGP)

Mielisairaus | Sosiaalinen eristäytyminen

Yhdysvallat
Vanderbilt University Medical Center
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK); National Institutes of Health (NIH)

Valmis

Potilasportaalin toiminnan vaikutukset diabeteksen hoitopuutteiden korjaamiseen

Diabetes mellitus

Yhdysvallat
Boston Medical Center
Boston University; National Institute of Nursing Research (NINR); Northeastern...

Valmis

Keskusteluhenkilöstö parantaa elämänlaatua palliatiivisessa hoidossa (ECA-PAL)

Palliatiivinen hoito

Yhdysvallat
Northeastern University
Boston University; Tufts Medical Center

Rekrytointi

ECA:n parannettu asiakirjaselitys RCT

Terve

Yhdysvallat
NYU Langone Health

Peruutettu

SPY-angiografia kyynärpään kyynärpään hermon siirtämiseen

Cubitaalitunnelin oireyhtymä

Yhdysvallat
Fox Chase Cancer Center

Lopetettu

VM110 mikroskooppisten kasvainten havaitsemisessa: vaiheen I tutkimus

Haimasyöpä | Munasarjasyöpä

Yhdysvallat
University of Aarhus
Eurostars

Tuntematon

Idiopaattisen keuhkofibroosin (IPF) etäkuntoutusohjelman toteutettavuus ja vaikutus (3-IPF)

Idiopaattinen keuhkofibroosi

Tanska
Vanderbilt University Medical Center
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK); National Institutes of Health (NIH)

Valmis

Potilasportaalin interventioiden arviointi diabeteksen hoitopuutteiden korjaamiseksi

Diabetes mellitus

Yhdysvallat
Orchestra BioMed, Inc

Rekrytointi

[Laitteen kokeilu, jota Yhdysvallat FDA ei ole hyväksynyt tai tyhjennä]

Sepelvaltimotauti

Yhdysvallat
Weill Medical College of Cornell University

Valmis

Sosiaalisten taitojen ryhmähoidon pilottikoe (Secret Agent Society -ohjelma)

Ahdistus | Tarkkailuvaje-hyperaktiivisuushäiriö (ADHD) | Autistinen spektrihäiriö (ASD)

Yhdysvallat

Preliminary Evaluation of a Large Language Model-Based Tool for Complex Surgical Decision Support in Lung Cancer

Tutkimuksen yleiskatsaus

Tila

Ehdot

Interventio / Hoito

Opintotyyppi

Ilmoittautuminen (Arvioitu)

Vaihe

Yhteystiedot ja paikat

Opiskelupaikat

Osallistumiskriteerit

Kelpoisuusvaatimukset

Opintokelpoiset iät

Hyväksyy terveitä vapaaehtoisia

Kuvaus

Opintosuunnitelma

Miten tutkimus on suunniteltu?

Suunnittelun yksityiskohdat

Aseiden lukumäärä

Aseet ja interventiot

Osallistujaryhmä / Arm

Interventio / Hoito

Mitä tutkimuksessa mitataan?

Ensisijaiset tulostoimenpiteet

Tulosmittaus

Toimenpiteen kuvaus

Aikaikkuna

Toissijaiset tulostoimenpiteet

Tulosmittaus

Toimenpiteen kuvaus

Aikaikkuna

Yhteistyökumppanit ja tutkijat

Sponsori

Opintojen ennätyspäivät

Opi tärkeimmät päivämäärät

Opiskelun aloitus (Todellinen)

Ensisijainen valmistuminen (Arvioitu)

Opintojen valmistuminen (Arvioitu)

Opintoihin ilmoittautumispäivät

Ensimmäinen lähetetty

Ensimmäinen toimitettu, joka täytti QC-kriteerit

Ensimmäinen Lähetetty (Todellinen)

Tutkimustietojen päivitykset

Viimeisin päivitys julkaistu (Todellinen)

Viimeisin lähetetty päivitys, joka täytti QC-kriteerit

Viimeksi vahvistettu

Lisää tietoa

Tähän tutkimukseen liittyvät termit

Avainsanat

Muita asiaankuuluvia MeSH-ehtoja

Muut tutkimustunnusnumerot

Yksittäisten osallistujien tietojen suunnitelma (IPD)

Aiotko jakaa yksittäisten osallistujien tietoja (IPD)?

Lääke- ja laitetiedot, tutkimusasiakirjat

Tutkii yhdysvaltalaista FDA sääntelemää lääkevalmistetta

Tutkii yhdysvaltalaista FDA sääntelemää laitetuotetta

Kliiniset tutkimukset Keuhkosyöpä (NSCLC)

Kliiniset tutkimukset GAPS-Agent

Hae vastaavia kokeiluja

Sponsorit ja yhteistyökumppanit

Sairaudet

Huumeiden interventiot

CROs by country

CROs in Mozambique

Ehdot

Harvinaiset sairaudet

Huumeiden interventiot

Ravintolisät

Sponsori / yhteistyökumppanit

Sijainnit