Artificial Intelligence-driven Tuberculosis Landscape Analysis & Stratification Research (TB-ATLAS)

Updated May 20, 2026 by: Wen-hong Zhang, Huashan Hospital

The goal of this observational study is to establish and validate a comprehensive AI-driven clinical decision support system (AI-CDSS) for whole-chain management of patients with pulmonary tuberculosis (TB). The main questions it aims to answer are:

How well does the system predict outcomes at multiple key stages of TB diagnosis and treatment? Can real-world benefits be derived from this system? The AI framework is intended to support clinicians' decision-making, with the aim of improving cure rates and helping each patient receive effective, personalized care.

Study Overview

Status

Not yet recruiting

Conditions

Detailed Description

This study establishes TB-ATLAS (Artificial Intelligence-driven Tuberculosis Landscape Analysis & Stratification Research), a modular framework for whole-chain TB management. The objective is to develop and validate an umbrella suite of AI-driven models to optimize clinical decision-making from initial diagnosis to post-treatment follow-up.

The core hypothesis is that multimodal patient data can stratify TB phenotypes and predict critical clinical events, enabling precision medicine. Beyond the primary focus on distinguishing Easy-to-Treat (ETT) from Hard-to-Treat (HTT) categories, the system incorporates satellite modules for pre-DST drug resistance risk, treatment adherence monitoring, adverse event (AE) early warning, and risk of post-TB lung disease (PTLD).

This study employs a retrospective-prospective cohort design. Using retrospective individual participant data (IPD) from clinical trials and real-world electronic health records (EHRs) covering more than 30,000 patients, the investigators will apply advanced AI methods, including foundation models for feature representation and multi-task learning for modular development. Model inputs integrate structured clinical variables, microbiological profiles, radiomics, and host signatures. Interpretability is prioritized through SHAP and LIME to support clinical trust. Model performance will be evaluated using AUROC and calibration metrics. External validation will take place in a prospective cohort (n≥1,600) to assess the system's performance in predicting real-world outcomes relative to standardized care.

The expected output is the TB-ATLAS Clinical Decision Support System (AI-CDSS). By providing evidence-based guidance on regimen intensity, resistance risk, and relapse monitoring, this platform is intended to facilitate the transition from "one-size-fits-all" standardized care towards individualized precision management, with the aim of enhancing clinical decision-making across diverse healthcare settings.

Study Type

Observational

Enrollment (Estimated)

31600

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Contact

Name: Yang Li, MD
Phone Number: 021-52888123
Email: yang.li@nmcid.org.cn

Study Locations

China
- Shanghai
  - Shanghai, China, 210000
    - Huashan Hospital Affiliated to Fudan University
    - Contact:
      
      Yang Li
      
      Phone Number: +86 21-52888123
      
      Email: yang.li@nmcid.org.cn
- Hunan
  - Changsha, Hunan, China
    - Hunan Chest Hospital
    - Contact:
      
      Qi Wang
      
      Phone Number: +8613787010006
      
      Email: 158723909@qq.com

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

Child
Adult
Older Adult

Accepts Healthy Volunteers

Sampling Method

Non-Probability Sample

Study Population

Patients with pulmonary tuberculosis (TB) who have been, or will be, diagnosed and treated at several TB clinical centers in China

Description

Inclusion Criteria for Model Development Cohort:

Patient with clinically diagnosed or bacteriologically confirmed pulmonary tuberculosis (TB) who received TB treatment;
Initiation of TB treatment on or after January 1, 2021;
Complete key diagnosis and treatment data available in the electronic medical record system.

Inclusion Criteria for External Validation Cohort:

Patient with clinically diagnosed or bacteriologically confirmed pulmonary tuberculosis (TB) who is planning to start TB treatment;
Voluntary participation with a signed informed consent form (for adults ≥18 years); parental / guardian consent and a co-signed informed consent form are required for minors aged <18 years.

Exclusion Criteria:

Co-morbidity confounding: the presence of other active, life-threatening disease (e.g. late-stage malignancy, non-HIV severe immunodeficiency) for which the expected survival or priority of treatment may substantially interfere with the attribution of TB treatment outcomes;
Extremely poor treatment adherence: documented evidence indicating that the patient either never initiated treatment or was permanently lost to follow-up within the early treatment period (<2 weeks), precluding the collection of any valid outcome data.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

Number of groups / cohorts

Cohorts and Interventions

Group / Cohort
Model Development Cohort: Retrospective data used for model fitting and tuning. Under 10-fold cross-validation, each fold serves in turn as the validation set while the remaining folds are used for training.
External Validation Cohort: Prospectively collected data for external model validation and measurement of predictive performance.
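
The 10-fold cross-validation mentioned for the Model Development Cohort can be illustrated with a minimal sketch. This is not the investigators' pipeline: the function name `k_fold_indices`, the fixed seed, and the plain (non-stratified) shuffling are assumptions made only for illustration, and the record does not say whether stratified or patient-grouped folds will be used.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, val_idx) index lists for k-fold cross-validation.

    Hypothetical helper: plain shuffled folds, no stratification.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Spread samples as evenly as possible: the first (n_samples % k)
    # folds receive one extra sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size


# Each record is used for validation exactly once and for training
# in the remaining nine folds.
folds = list(k_fold_indices(n_samples=25, k=10))
```

Because every record rotates through both roles, the training and validation sets are not fixed subsets of the retrospective data.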

What is the study measuring?

Primary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Predictive Performance of the "Easy-to-Treat" versus "Hard-to-Treat" stratification model for pulmonary tuberculosis (PTB)	The area under the receiver operating characteristic curve (AUROC) of the model for discriminating between PTB patients classified as "Easy-to-Treat" versus "Hard-to-Treat". "Easy-to-Treat" patients are defined as patients with PTB who can achieve a favorable outcome when treated with a short-course regimen (≤4 months for drug-sensitive TB, ≤6 months for rifampin-resistant TB). "Hard-to-Treat" patients are defined as patients with PTB who will experience an unfavorable outcome on short-course treatment (≤4 months for drug-sensitive TB, ≤6 months for rifampin-resistant TB).	From treatment initiation to 6 months post-treatment
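
As a rough illustration of the primary outcome measure, AUROC can be computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The sketch below assumes label 1 means "Hard-to-Treat" and label 0 means "Easy-to-Treat"; that coding, the function name `auroc`, and the toy scores are assumptions for illustration, not values from the study.

```python
def auroc(y_true, y_score):
    """AUROC via pairwise comparison of positive and negative scores."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Toy example (made-up numbers): 1 = Hard-to-Treat, 0 = Easy-to-Treat.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2]
print(auroc(labels, scores))  # 8 of 9 positive/negative pairs are ordered correctly
```

The pairwise form is quadratic in the number of patients; it is shown for clarity and would normally be replaced by a rank-based library routine on a cohort of this size.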

Secondary Outcome Measures

Outcome Measure	Measure Description	Time Frame
Brier Score of the "Easy-to-treat" versus "Hard-to-treat" Model	The Brier score will be used to assess the overall accuracy of the model's predicted probabilities. It is the mean squared difference between the predicted probabilities and the observed binary outcomes. The score ranges from 0 to 1, where 0 represents perfect accuracy and 1 represents total inaccuracy. Lower scores indicate better model performance.	6 months post-treatment
Calibration Slope of the "Easy-to-treat" versus "Hard-to-treat" Model	The calibration slope will be calculated to evaluate the agreement between the model's predicted probabilities and the actual observed outcomes. The ideal calibration slope is 1. A slope below 1 suggests predictions that are too extreme, and a slope above 1 suggests predictions that are too moderate. Values closer to 1 indicate better calibration.	6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Predictive Model	The AUROC curve will be used to evaluate the discrimination performance of the pre-DST (Drug Susceptibility Testing) drug resistance predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Predictive Model	The AUPRC will be used to evaluate the prediction performance of the pre-drug susceptibility testing (Pre-DST) drug resistance predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	6 months post-treatment
F1-score of the Secondary Decision Models for Pre-Drug Susceptibility Testing (Pre-DST) Drug Resistance Prediction	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for pre-drug susceptibility testing (Pre-DST) drug resistance prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Adherence Forecasting Model	The AUROC curve will be used to evaluate the discrimination performance of the adherence forecasting model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	From treatment initiation until treatment completion, assessed up to 6 months
Area Under the Precision-Recall Curve (AUPRC) of the Adherence Forecasting Model	The AUPRC will be used to evaluate the prediction performance of the adherence forecasting model under data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	From treatment initiation until treatment completion, assessed up to 6 months
F1-score of the Secondary Decision Models for Adherence Forecasting	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for adherence forecasting. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	From treatment initiation until treatment completion, assessed up to 6 months
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Treatment Response Predictive Model	The AUROC curve will be used to evaluate the overall discrimination performance of the treatment response predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Treatment Response Predictive Model	The AUPRC will be used to evaluate the prediction performance of the treatment response predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	6 months post-treatment
F1-score of the Secondary Decision Models for Treatment Response Prediction	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for treatment response prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Adverse Event (AE) Predictive Model	The AUROC curve will be used to evaluate the overall discrimination performance of the adverse event predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Adverse Event (AE) Predictive Model	The AUPRC will be used to evaluate the prediction performance of the adverse event predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	6 months post-treatment
F1-score of the Secondary Decision Models for Adverse Event (AE) Prediction	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for adverse event prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Relapse Predictive Model	The AUROC curve will be used to evaluate the overall discrimination performance of the relapse predictive model. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Relapse Predictive Model	The AUPRC will be used to evaluate the prediction performance of the relapse predictive model, particularly under conditions of data imbalance. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	6 months post-treatment
F1-score of the Secondary Decision Models for Relapse Prediction	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for relapse prediction. Relapse is defined per the World Health Organization (WHO) standard. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Receiver Operating Characteristic (AUROC) Curve of the Post-Tuberculosis (TB) Lung Disease Predictive Model	The AUROC curve will be used to evaluate the overall discrimination performance of the post-tuberculosis (TB) lung disease predictive model. The score ranges from 0 to 1, where 0.5 indicates random guessing and 1 represents perfect discrimination. Higher scores mean better discrimination performance, indicating a better predictive outcome.	6 months post-treatment
Area Under the Precision-Recall Curve (AUPRC) of the Post-Tuberculosis (TB) Lung Disease Predictive Model	The AUPRC will be used to evaluate the prediction performance of the post-tuberculosis (TB) lung disease predictive model, particularly under conditions of data imbalance. The score ranges from 0 to 1. Higher scores mean better precision and recall performance, indicating a better predictive outcome.	6 months post-treatment
F1-score of the Secondary Decision Models for Post-Tuberculosis (TB) Lung Disease Prediction	The F1-score, calculated as the harmonic mean of precision and recall, will be used to evaluate the classification performance of the secondary decision models for post-tuberculosis (TB) lung disease prediction. The score ranges from 0 to 1. Higher scores mean better classification performance, indicating a better predictive outcome.	6 months post-treatment
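
The secondary measures in the table above (Brier score, AUPRC, F1-score) can be sketched in a few lines of plain Python. This is an illustrative sketch under stated assumptions, not the study's evaluation code: the function names, the 0.5 decision threshold, the use of average precision as the AUPRC estimate, and the toy data are all assumptions. The record does not specify thresholds or the AUPRC estimator.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for binary labels."""
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def average_precision(y_true, y_prob):
    """Step-wise AUPRC estimate: mean precision at each positive case,
    with cases ranked by descending predicted probability.
    Tied scores are not specially handled in this sketch."""
    n_pos = sum(y_true)
    if n_pos == 0:
        raise ValueError("average precision needs at least one positive")
    ranked = sorted(zip(y_prob, y_true), key=lambda t: -t[0])
    tp, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            tp += 1
            total += tp / rank  # precision at this recall step
    return total / n_pos


# Toy example (made-up numbers): 1 = event occurred, 0 = no event.
outcomes = [1, 0, 1, 0]
probs = [0.9, 0.6, 0.4, 0.1]
preds = [1 if p >= 0.5 else 0 for p in probs]  # assumed 0.5 threshold

brier = brier_score(outcomes, probs)
f1 = f1_score(outcomes, preds)
ap = average_precision(outcomes, probs)
```

Unlike AUROC, the F1-score depends on the chosen decision threshold, and AUPRC depends on the event prevalence, so both are most informative when reported alongside the class balance of the cohort.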

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Sponsor

Huashan Hospital

Collaborators

The Hong Kong Polytechnic University

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Estimated)

June 1, 2026

Primary Completion (Estimated)

December 31, 2027

Study Completion (Estimated)

June 30, 2028

Study Registration Dates

First Submitted

April 23, 2026

First Submitted That Met QC Criteria

May 20, 2026

First Posted (Actual)

May 28, 2026

Study Record Updates

Last Update Posted (Actual)

May 28, 2026

Last Update Submitted That Met QC Criteria

May 20, 2026

Last Verified

April 1, 2026

More Information

Terms related to this study

Keywords

Additional Relevant MeSH Terms

Other Study ID Numbers

KY2025-1517

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

UNDECIDED

IPD Plan Description

This study incorporates datasets from multiple previous studies and a future cohort. The detailed IPD sharing plan will be determined for each type of data.

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

Studies a U.S. FDA-regulated device product
