Artificial Intelligence-driven Tuberculosis Landscape Analysis & Stratification Research (TB-ATLAS)

May 20, 2026 updated by: Wen-hong Zhang, Huashan Hospital

The goal of this observational study is to establish and validate a comprehensive AI-driven clinical decision support system (AI-CDSS) in whole-chain management for pulmonary tuberculosis (TB) patients. The main question it aims to answer is:

How is the predictive performance of this system in terms of multiple key links during TB diagnosis and treatment? Can real-world benefits be derived from this system? This AI framework supports clinicians in making smarter decisions, ultimately improving cure rates and ensuring that every patient receives the most effective, personalized care possible.

Study Overview

Status

Not yet recruiting

Detailed Description

This study establishes TB-ATLAS (Artificial Intelligence-driven Tuberculosis Landscape Analysis & Stratification Research), a modular framework for whole-chain TB management. The objective is to develop and validate an umbrella suite of AI-driven models to optimize clinical decision-making from initial diagnosis to post-treatment follow-up.

The core hypothesis is that multimodal patient data can stratify TB phenotypes and predict critical clinical events, enabling precision medicine. Beyond the primary focus on distinguishing Easy-to-Treat (ETT) from Hard-to-Treat (HTT) categories, the system incorporates satellite modules for pre-DST drug resistance risk, treatment adherence monitoring, adverse event (AE) early warning, and risk of post-TB lung disease (PTLD).

This study employs a retrospective-prospective cohort design. By utilizing retrospective IPD from clinical trials and real-world EHRs (>30,000 patients), the investigators apply advanced AI, including foundation models for feature representation and multi-task learning for modular development. Integration of structured clinical variables, microbiological profiles, radiomics, and host signatures ensures high-dimensional input. Model interpretability is prioritized via SHAP/LIME to ensure clinical trust. Then the performance will be evaluated using AUROC and calibration metrics. External validation will occur in a prospective cohort (n≥1,600) to assess the system's impact on predicting real-world outcomes compared to standardized care.

The expected output is the TB-ATLAS Clinical Decision Support System (AI-CDSS). By providing evidence-based guidance on regimen intensity, resistance risk, and relapse monitoring, this platform facilitates the transition from "one-size-fits-all" standardized care towards individualized precision management, significantly enhancing clinical decision-making across diverse healthcare settings.

Study Type

Observational

Enrollment (Estimated)

31600

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Study Locations

      • Shanghai, China, 210000
        • Huashan Hospital Affiliated to Fudan University
        • Contact:
    • Hunan
      • Changsha, Hunan, China
        • Hunan Chest Hospital
        • Contact:

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

  • Child
  • Adult
  • Older Adult

Accepts Healthy Volunteers

No

Sampling Method

Non-Probability Sample

Study Population

Pulmonary tuberculosis (TB) patients diagnosed and treated (or will treat) in several TB clinical centers from China

Description

Inclusion Criteria for Model Development Cohort:

  • Patient with clinically diagnosed or bacteriologically confirmed pulmonary tuberculosis (TB) who received TB treatment;
  • Initiation of TB treatment on or after January 1, 2021;
  • Complete key diagnosis and treatment data available in the electronic medical record system.

Inclusion Criteria for External Validation Cohort:

  • Patient with clinically diagnosed or bacteriologically confirmed pulmonary tuberculosis (TB) who is planning to start TB treatment;
  • Voluntary participation with signed informed consent form (for adults ≥18 years); parental / guardian consent and co-signed informed consent form are required for minors aged ≤ 18 years.

Exclusion Criteria:

  • Co-morbidity confounding: the presence of other active, life-threatening disease (e.g. late-stage malignancy, non-HIV severe immunodeficiency) for which the expected survival or priority of treatment may substantially interfere with the attribution of TB treatment outcomes;
  • Extremely poor treatment adherence: documented evidence indicating that the patient either never initiated treatment or was permanently lost to follow-up within the early treatment period (<2 weeks), precluding the collection of any valid outcome data.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Cohorts and Interventions

Group / Cohort
Model Development Cohort
Retrospective data used for model fitting and tuning. The training and validating sets are interchangeable due to 10-fold cross validation.
External Validation Cohort
Prospective collected data for external model validation and predictive performance measurement

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Predictive Performance of the "Easy-to-Treat" versus "Hard-to-Treat" stratification model for pulmonary tuberculosis (PTB)
Time Frame: from treatment initiation to 6 months post treatment

The Area Under the Receiver Operating Characteristic curve (AUROC) of the model for discriminating between PTB patients classified as "Easy-to-Treat" versus "Hard-to-Treat".

"Easy-to-Treat" patients are defined as patients with PTB who can achieve favorable outcome when treated with a short-course regimen (≤4 months for drug-sensitive TB, ≤6 months for rifampin-resistant TB).

"Hard-to-Treat" patients are defined as patients with PTB who will experience unfavorable outcome on short-course treatment (≤4 months for drug-sensitive TB, ≤6 months for rifampin-resistant TB).

from treatment initiation to 6 months post treatment

Secondary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Brier Score of the "Easy-to-treat" versus "Hard-to-treat" Model
Time Frame: 6 months post-treatment
The Brier score will be used to assess the overall prediction accuracy and reliability of the model. It measures the mean squared difference between the predicted probabilities and the actual observed outcomes. The score ranges from 0 to 1, where 0 represents perfect accuracy and 1 represents total inaccuracy. Lower scores mean better model performance, indicating a better predictive outcome.
6 months post-treatment
Calibration Slope of the "Easy-to-treat" versus "Hard-to-treat" Model
Time Frame: 6 months post-treatment
The calibration slope will be calculated to evaluate the agreement between the model's predicted probabilities and the actual observed outcomes. An ideal calibration slope value is 1. Values closer to 1 indicate better calibration performance, meaning the predicted probabilities perfectly reflect the true risk, which indicates a better predictive outcome.
6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Predictive Model
Time Frame: 6 months post-treatment
The AUROC curve will be used to evaluate the discrimination performance of the pre-DST (Drug Susceptibility Testing) drug resistance predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Predictive Model
Time Frame: 6 months post-treatment
The AUPRC will be used to evaluate the prediction performance of the pre-drug susceptibility testing (Pre-DST) drug resistance predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
6 months post-treatment
F1-score of the Secondary Decision Models for Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Prediction
Time Frame: 6 months post-treatment
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for pre-drug susceptibility testing (Pre-DST) drug resistance prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Adherence Forecasting Model
Time Frame: From treatment initiation until treatment completion, assessed up to 6 months
The AUROC curve will be used to evaluate the discrimination performance of the adherence forecasting model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
From treatment initiation until treatment completion, assessed up to 6 months
Area Under the Precision-Recall Curve (AUPRC) of the Adherence Forecasting Model
Time Frame: From treatment initiation until treatment completion, assessed up to 6 months
The AUPRC will be used to evaluate the prediction performance of the adherence forecasting model under data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
From treatment initiation until treatment completion, assessed up to 6 months
F1-score of the Secondary Decision Models for Adherence Forecasting
Time Frame: From treatment initiation until treatment completion, assessed up to 6 months
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for adherence forecasting. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
From treatment initiation until treatment completion, assessed up to 6 months
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Treatment Response Predictive Model
Time Frame: 6 months post-treatment
The AUROC curve will be used to evaluate the overall discrimination performance of the treatment response predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Treatment Response Predictive Model
Time Frame: 6 months post-treatment
The AUPRC will be used to evaluate the prediction performance of the treatment response predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
6 months post-treatment
F1-score of the Secondary Decision Models for Treatment Response Prediction
Time Frame: 6 months post-treatment
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for treatment response prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Adverse Event (AE) Predictive Model
Time Frame: 6 months post-treatment
The AUROC curve will be used to evaluate the overall discrimination performance of the adverse event predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Adverse Event (AE) Predictive Model
Time Frame: 6 months post-treatment
The AUPRC will be used to evaluate the prediction performance of the adverse event predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
6 months post-treatment
F1-score of the Secondary Decision Models for Adverse Event (AE) Prediction
Time Frame: 6 months post-treatment
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for adverse event prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Relapse Predictive Model
Time Frame: 6 months post-treatment
The AUROC curve will be used to evaluate the overall discrimination performance of the relapse predictive model. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Relapse Predictive Model
Time Frame: 6 months post-treatment
The AUPRC will be used to evaluate the prediction performance of the relapse predictive model, particularly under conditions of data imbalance. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
6 months post-treatment
F1-score of the Secondary Decision Models for Relapse Prediction
Time Frame: 6 months post-treatment
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for relapse prediction. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Post-Tuberculosis (TB) Lung Disease Predictive Model
Time Frame: 6 months post-treatment
The AUROC curve will be used to evaluate the overall discrimination performance of the post-tuberculosis (TB) lung disease predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.
6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Post-Tuberculosis (TB) Lung Disease Predictive Model
Time Frame: 6 months post-treatment
The AUPRC will be used to evaluate the prediction performance of the post-tuberculosis (TB) lung disease predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.
6 months post-treatment
F1-score of the Secondary Decision Models for Post-Tuberculosis (TB) Lung Disease Prediction
Time Frame: 6 months post-treatment
The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for post-tuberculosis (TB) lung disease prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.
6 months post-treatment

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Estimated)

June 1, 2026

Primary Completion (Estimated)

December 31, 2027

Study Completion (Estimated)

June 30, 2028

Study Registration Dates

First Submitted

April 23, 2026

First Submitted That Met QC Criteria

May 20, 2026

First Posted (Actual)

May 28, 2026

Study Record Updates

Last Update Posted (Actual)

May 28, 2026

Last Update Submitted That Met QC Criteria

May 20, 2026

Last Verified

April 1, 2026

More Information

Terms related to this study

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

UNDECIDED

IPD Plan Description

This study incorporates datasets from multiple previous studies and future cohort. The detailed IPD sharing plan will be discussed per types of data.

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Pulmonary Tuberculosis

Subscribe