Research on Construction and Verification of Multimodal Medical Imaging Large Model

With the accumulation of multimodal clinical data such as medical imaging and electronic health records (EHRs), efficient utilization of multi-source information to achieve precise diagnosis and intelligent decision-making has become a core direction of medical artificial intelligence (AI). Although traditional unimodal algorithms have yielded outcomes in specific tasks, their inability to model the semantic correlations among imaging, textual, and laboratory data leads to insufficient stability and limited interpretability of diagnostic results, making it difficult to meet the needs of comprehensive decision-making in complex clinical scenarios.

In recent years, multimodal large models have demonstrated excellent cross-modal understanding and knowledge transfer capabilities in natural images and general vision-language tasks, providing a new paradigm for medical AI. However, direct application in medical scenarios still faces challenges: first, the medical semantic system differs significantly from general language models, hindering the accurate representation of disease characteristics and imaging details; second, the complex morphology of lesions and uneven sample distribution in medical data increase the difficulty of model generalization; third, clinical data involves privacy, so data security and ethical compliance serve as prerequisites for research.

The research on medical multimodal large models aims to integrate multi-source heterogeneous medical data, establish a unified semantic representation and reasoning mechanism, and realize full-process intelligent analysis including disease identification and lesion localization. This approach can not only improve the efficiency and accuracy of clinical diagnosis but also provide clinicians with interpretable and traceable auxiliary decision support, boasting broad application prospects.

Based on the hospital's clinical data resources and the research team's algorithmic foundation, this study intends to construct a multimodal large model system for medical imaging diagnosis, enabling closed-loop intelligent analysis from multimodal information fusion to diagnostic report generation. The research will strictly adhere to medical ethical standards, protect patients' right to information, right to privacy, and data security. Before the official launch of the project, ethical review must be passed, and relevant regulations shall be followed to ensure the unity of scientific research and ethics, laying a compliant foundation for subsequent clinical validation and promotion.

Study Overview

Detailed Description

With the continuous accumulation of medical imaging, electronic health records (EHRs), and multimodal clinical data, how to efficiently leverage multi-source medical information to achieve precise diagnosis and intelligent decision-making has become a core direction in the development of medical artificial intelligence (AI). Although traditional unimodal algorithms (e.g., models based solely on CT, MRI, or ultrasound images) have yielded certain results in specific tasks, their inability to model semantic correlations among imaging, textual, and laboratory data often leads to insufficient stability and limited interpretability of diagnostic outcomes, making it difficult to meet the comprehensive decision-making needs of complex clinical scenarios.

In recent years, multimodal large language models (MLLMs) have demonstrated remarkable cross-modal understanding and knowledge transfer capabilities in natural image processing and general vision-language tasks, providing a new technical paradigm for medical AI. However, the direct application of such models in medical scenarios still faces multiple challenges: first, there are significant discrepancies between the medical semantic system and general language models, hindering the accurate representation of disease characteristics and imaging details; second, the complex morphology of lesions and imbalanced sample distribution in medical data increase the difficulty of model generalization; third, clinical data involves privacy-sensitive information, making data security and ethical compliance a prerequisite for research.

Research on medical multimodal large models aims to comprehensively utilize multi-source heterogeneous data-such as medical imaging (e.g., CT, MRI, X-ray), EHRs, and laboratory reports-to establish a unified semantic representation and reasoning mechanism, enabling end-to-end intelligent analysis including disease identification, lesion localization, report generation, and disease progression prediction. This research direction not only helps improve the efficiency and accuracy of clinical diagnosis but also provides clinicians with interpretable and traceable auxiliary decision support, boasting broad prospects for clinical application.

Based on the hospital's abundant clinical data resources and the research team's algorithm development foundation, this study intends to construct a multimodal large model system for medical imaging diagnosis, realizing a closed-loop intelligent analysis pipeline from multimodal information fusion to diagnostic report generation.

During the research implementation, strict adherence to medical ethical standards will be followed to fully protect patients' right to informed consent, privacy, and data security. To ensure the scientificity and compliance of the research design, this project must pass ethical review prior to its official launch. In accordance with relevant regulations including the Declaration of Helsinki, International Ethical Guidelines for Health-related Research Involving Humans, and Ethical Review Measures for Life Science and Medical Research Involving Humans, we will achieve the organic integration of scientific research and ethical principles, laying a compliant foundation for subsequent clinical validation and application promotion.

Study Type

Observational

Enrollment (Estimated)

2000

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

  • Adult
  • Older Adult

Accepts Healthy Volunteers

No

Sampling Method

Probability Sample

Study Population

he study population was composed of adult patients who met the preset inclusion criteria and were free of any exclusion criteria.

In terms of inclusion requirements, eligible patients were aged 18 years or older and had received imaging examinations including computed tomography (CT), magnetic resonance imaging (MRI), or ultrasound at the hospital during the study period. The examination items were required to target the hepatobiliary and pancreatic system, which was the focus of the research. All enrolled patients needed to be equipped with at least complete imaging data and radiological diagnostic reports, with relevant medical record information serving as supplementary modalities. In addition, written informed consent must be obtained from the patients themselves or their legal representatives, who agreed that the de-identified data of the patients could be used for the validation of scientific research models. Finally, all case data of the selected patients had to pass strict qual

Description

Inclusion Criteria:

  • Adult patients aged ≥ 18 years.
  • Patients who underwent imaging examinations (CT, MRI, ultrasound, etc.) at this hospital during the study period.
  • The examination items are consistent with the disease types or systems focused on by the study (hepatobiliary and pancreatic system).
  • Possess at least complete imaging data, radiological diagnostic reports, with relevant medical record information as supplementary modalities.
  • Patients and their legal representatives have signed an informed consent form, agreeing to the use of their de-identified data for scientific research model validation.

Exclusion Criteria:

  • Patients who refuse to sign the informed consent form.
  • Cases with unassessable images due to severe motion artifacts, incomplete scanning, or equipment abnormalities.
  • Cases with severe deficiency of clinical data or failure to match with imaging data.
  • Cases with special pathological conditions or post-operative status (e.g., extensive resection, significant structural changes after radiotherapy) that affect the consistency of model analysis.
  • Data samples with privacy protection or legal risks.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Cohorts and Interventions

Group / Cohort
Group1:This study enrolled adult patients aged ≥ 18 years who underwent imaging examinations (includ

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Disease Diagnosis Task
Time Frame: From enrollment to the end of diagnosis at 3 days
The primary outcome is the accuracy and reliability of the multimodal imaging diagnostic model in identifying and classifying target diseases compared with the gold standard of clinical diagnosis by senior radiologists.
From enrollment to the end of diagnosis at 3 days
Lesion Localization and Segmentation Task
Time Frame: from enrollment to end of diagnosis up to 3 days
It includes the model's performance in precise localization, contour segmentation and quantitative measurement of lesions, assessed by Dice similarity coefficient, IoU and localization error.
from enrollment to end of diagnosis up to 3 days
Diagnostic Report Generation Task
Time Frame: from enrollment to end of diagnosis up to 3 days
It evaluates the clinical validity, completeness, consistency and readability of automatically generated radiology reports relative to manual reports.
from enrollment to end of diagnosis up to 3 days

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Estimated)

November 15, 2026

Primary Completion (Estimated)

November 15, 2027

Study Completion (Estimated)

November 15, 2027

Study Registration Dates

First Submitted

February 9, 2026

First Submitted That Met QC Criteria

February 26, 2026

First Posted (Actual)

March 3, 2026

Study Record Updates

Last Update Posted (Actual)

March 3, 2026

Last Update Submitted That Met QC Criteria

February 26, 2026

Last Verified

November 1, 2025

More Information

Terms related to this study

Other Study ID Numbers

  • 研2025-1442

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

NO

IPD Plan Description

The decision not to share individual participant data (IPD) from this study is primarily based on ethical, privacy protection, and legal compliance considerations, as well as the inherent characteristics of the research data involved.

First and foremost, the core data of this study includes de-identified imaging materials, radiological diagnostic reports, and associated medical records of patients, which are closely linked to personal health information. Despite the implementation of de-identification procedures during data collection and collation, there remains a potential risk of re-identifying individual participants if the data are shared without strict restrictions. Such a risk would violate the stipulations of relevant privacy protection laws and regulations, as well as the informed consent signed by the patients and their legal representatives. It should be emphasized that the informed consent clearly specifies that the de-identified data of participants is only used for the in

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Liver Diseases

Subscribe