Development of PI-ML Algorithm for Prediction of the Real-time Risk for Developing Pre-diabetes (PRAESIIDIUM)

April 29, 2026 updated by: Jelizaveta Sokolovska

Physics Informed Machine Learning-based Prediction and Reversion of Impaired Fasting Glucose Management

In this prospective, non-randomized, monocentric study, data will be collected from otherwise healthy individuals with overweight/obese grade I to increase data availability in the pre-diabetes field (impaired glucose intolerance), and to validate the outputs of an algorithm for the "physics-informed machine learning (PIML)" designed to estimate the real-time risk of prediabetes. Each participant will take part in the study for 4 months, including 3 onsite visits.

During the screening visit, participants' eligibility will be determined by checking the inclusion and exclusion criteria after detailed information and obtaining informed consent by the investigator. Blood will be withdrawn for exclusion of existing prediabetes/diabetes at the fasted state. For women in reproductive age, a urinary pregnancy test will be performed. After getting the results of blood tests (glucose and HbA1c), participants will be asked to participate in study.

On the visit 1, eligible participants will arrive at the study centre in a fasting state. Blood samples will be collected and participants will get vials and instructions for collection of stool and urine samples. Anthropometric data, lifestyle habit (cigarette, alcohol consumption) and family history will be collected. A 6-minute walking test to determine VO2 max will then be performed. Lap counts and time will be manually recorded using a sports watch. The Polar H10 heart rate monitor chest strap will be used to record heart rate (HR) throughout the test. To measure resting HR and heart rate recovery (HRR), participants will be asked to sit still for 5 minutes before the walking test and for 2 minutes after the test.

Participants will receive a blinded Abbott Libre Pro glucose sensor, which they will wear for the next 14-days. Further, participants will be provided with a Fitbit Charge 5 health and fitness wristband. For validation purposes some part of study participants will be kindly asked to test newly develop wrist-worn device (EDIBit). With the help of 24-hour food recall, study subjects will be trained by medical staff on how to correctly enter their food intake in the Study app for completion of digital 3-day food diaries. They will be asked to fill in the diaries for 3 days after study visit1 and 3 days before study visit2. They will also receive a food frequency questionnaire during visit1.

The second study visit will run nearly identical to study visit1 (except for food frequency questionnaire which will be omitted). During this visit, participants will receive information sheets on physical activity and dietary recommendations.

The third and last visit will run nearly identically to the study visit2, except that no new glucose sensor will be inserted and also stool samples will not be collected.

Study Overview

Detailed Description

Noncommunicable diseases (NCDs) such as cancer, cardiovascular diseases, and diabetes represent 74% of the disease burden globally and are the major causes of preventable premature deaths. (1,2).

Diabetes is a chronic NCD characterized by elevated glucose blood levels. In 2021, the prevalence of diabetes in Europe was 1 in 11 adults (61 million), a figure projected to rise to 69 million by 2045. (3) The global management and treatment of diabetes cost 988 billion dollars in 2021. Despite these expenditures, diabetes remains the third leading cause of death, accounting for 6.7 million deaths, with 1.1 millions of these in Europe alone. Among the different types, type-2 diabetes (T2D) comprises 90% of total diabetes cases, primarily presenting in adulthood. (3,4) Risk factors for T2D include genetic predisposition, family history, metabolic syndrome, obesity, physical inactivity, age, and ethnicity. There are 541 million adults worldwide with Impaired Glucose Tolerance (IGT), a significant risk factor for T2D. (5,6) IGT and Impaired Fasting Glucose (IFG) represent intermediate conditions within the "healthy-to-T2D transition" and are symptoms of prediabetes. (7,8) Notably, prediabetes represents an early-stage condition that can be reversed. Studies show that T2D progression can be reduced by approximately 58% within three years through lifestyle modifications. Physical activity of 30-54 minutes at least 2-5 days per week is recommended, as well as a healthy diet. (9) Efforts have been made to develop non-invasive diabetes risk prediction models based on clinically available parameters. (10) The onset of T2D involves complex, multiscale mechanisms starting from molecular, tissue, and organ levels, leading to dysfunction in physiological processes. Chronic inflammatory biomarkers play a significant role in T2D pathogenesis. Recognizing this multi-level approach is a step towards personalized disease diagnosis. However, there remain challenges related to modelling the "healthy-to-prediabetes transition" from both case study and methodological perspectives. (5,13,14) The objective of this project is the development of a prototype tool, aimed at real-time prediction of prediabetic risk. This tool will incorporate a series of patient-specific mathematical models simulating metabolism, pancreas hormone production, microbiome metabolites, inflammatory processes, and immune system response. These models were initially developed during the FP7 MISSION-T2D project and further developed into the implementation of an integrated, multilevel, and patient-specific model, incorporating genetic, metabolic, and nutritional data for the simulation and prediction of metabolic and inflammatory processes in the onset and progression of T2D. (14-18) The prediction algorithm will utilize a "physics-informed machine learning" (PIML) approach, combining a comprehensive dataset from both existing and new clinical trials, with continuous data input through wearable sensors. (19) The final algorithm will be hosted on a web-based platform where both medical professionals and patients can input data from multiple sources.

A dedicated prospective observational study described in this application will be conducted in Latvia recruiting adult participants with metabolic risk factors - overweight and obesity grade I, for data collection purposes to validate the developed machine learning PIML algorithm for pre-diabetes real-time risk prediction.

Data collections has three main purposes: I. Input Data for the in-silico MT2D model:

The input of the simulations includes the following discrete starting parameters: gender (M/F); weight; height; number of sessions of physical activity (0, 1, 2, 3); duration of the bout of physical activity (30, 60, 90 min); intensity in terms of % VO2max (40, 60); 3 meals per day; in each meal are specified the carbohydrates (low, medium, high), proteins (low, medium, high) and fats (low, medium, high).

II. Validation of the MT2D outputs: Output numerical values of the simulations from the model include:

1) Inflammation markers (recorded every 8h): B-cells (B-1, B-2), PBL, TH (Th1, Th2, Th17, Treg), CTL, Treg, NK, MA, DC, EP, ADIP (number, volume), IgM, IgG (IgG1, IgG2), IC, IL-2, IL-12, IFN-g, IL-4, TNF-a, TGF-b, IL-10, IL-6, IL-18, IL-23, IFN-b, IL-1b, LPS, leptin;

a) Metabolic outcomes (recorded every minute): arterial concentrations (glu, pyr, lac, ala, gly, FFA, tgl, O2, CO2); organs (22 metabolites); hormones (insulin, glucagon, epinephrine);fasting glucose; rate of appearance (glucose, alanine, triglycerides);total daily energy balance; %VO2max; anthropometric measures: BW, BMI, fat mass, fat free mass.

III. Data for training/validation of the physics-informed machine learning (PIML) algorithm: demographic Data; health-related Data; lifestyle Data (e.g., food consumption data and physical activity data); continuous ingestion through wearable sensors (Continuous Glucose Monitoring (CGM and tracker of physical activity e.g., Fitbit Charge 5, EDIBit.) These devices, such as smartwatches and fitness/activity trackers (e.g., Fitbit, Fibion, Apple Watch), are equipped with sensors that can track a variety of health metrics, including physical activity, heart rate, sleep patterns and increasingly also for glucose monitoring (non-invasive continuous glucose monitoring are still under development). (20,21) This data can be used to identify patterns and trends in a person's health, which can help with the early detection of diabetes/prediabetes. (22) Machine learning (ML) models show potential in enhancing early detection by analysing various risk factors and predicting outcomes. However, before these models are integrated into healthcare systems and clinical practices, they must be rigorously evaluated. One of the most robust methods for such evaluation is through external validation using longitudinal cohorts. (23-26).

Timeline and probands: The study will run for 18 months. During this period, 75 individuals will be followed for 4 months. The recruitment will take place from January 2024 until June 2025, parallel to the study period.

Subject identification: For each participant who has signed the Participant Informed Consent Form, the investigator must allocate a unique two letter and three-digit Participant Identification Number.

All documents, forms and data (including bio-materials - urine, blood and stool samples) files will be tagged with this Participant ID. Each participant, eligible and not eligible, will be documented in the Screening and Enrolment Log.

Data management: The data of the participants will be entered in an eCRFs. An eCRF will be provided by the project consortium partners Spindox Labs and CheckHealth. The eCRF will be maintained by staff at the University of Latvia. All study data will be captured in the eCRF and monitored by the monitor. All processes will be handled in accordance with standard operating procedures (SOPs).

Collected data from wearable sensors and remote-assisted questionnaires will be managed by the LinkWatch cloud-based storage for the entire duration of the project. This includes regular back-up procedures and security provisions in accordance with the GDPR and the CHK security policy. The security policy is based on the framework recommended by the Swedish Contingency Agency (Reg. 2016/679/UE), following ISO/IEEE 27000.

Paper forms of the questionnaires will be used due to possible technical problems of the platform. Consent form will be stored in archive at University of Latvia.

Macronutrient food intake will be obtained through a three-day food diary. That will be provided in different ways - initializing the macronutrient database with finish Finelli database, using Open food facts for packaged food. Meanwhile participants will be asked to double information, writing manually food diary for three days (two working days, one holiday).

In visit 1 and visit 3 food frequency questionnaire and in each visit, 24h recall will be obtained. Physical activity will be tracked using a commercially available FitBit Charge 5 tracker.

Between visit 1 and visit 2, specific recommendations will not be provided. This phase will serve as a data collection period to accurately assess participants' current lifestyle factors.

To enhance data diversity for PIML model training and validation, during the second visit, each participant will receive dietary and physical activity recommendations to modify lifestyle habits for the next two months.

Physical activity recommendations will follow a structured plan, ensuring a gradual increase in activity level personalized for each participant. The time period between the first and second study visits will serve as a data collection phase to accurately assess participants' current activity levels. Baseline physical activity levels, which will be used to establish personalized physical activity targets for each participant, will be calculated as the average minutes of moderate to vigorous physical activity (MVPA) recorded over the four weeks preceding the second visit. The initial weeks after the first visit will not be used for baseline activity calculation to limit possible behavioural changes participants might exhibit simply because they have started participating in the health related study.

These prescriptions will align with the relevant WHO guidelines and summarized in hand-out materials.

Blood samples from enrolled patients at the three cohort visits (Visit 1, Visit 2 and Visit 3) will be collected and initially processed to obtain plasma samples and temporarily stored (at -80°C) by the recruiting site (see below blood samples processing before shipment).

Plasma samples will be shipped to the project partner Italian Liver Foundation (FIF) in Basovizza, Trieste (Italy). At FIF, plasma samples will be stored at the at a temperature (-80°C) and access-controlled freezer for 10 years.

At FIF, batched samples will be unfrozen and further processed to determine plasma abundances of interleukins (IL-2, IL-6, IL-10, and leptin).

Blood will be collected for determination of clinical blood tests on the study site and for sample transfer to FIF for further biomarker analysis and preservation at the University of Latvia.

Participants will collect their faecal samples at home (together two samples - in first and second visit, respectively). The primary goal of faecal samples collection is to create a long-lasting PRAESIIDIUM biobank. In a subsequent moment, faecal samples might be processed for microbiome analysis. After arrival at local hospital sample should be frozen at -80°C until processing. Further processing - upon availability of additional funds (Fondazione Italiana Fegato ONLUS, Italy).

Study Type

Observational

Enrollment (Actual)

77

Contacts and Locations

This section provides the contact details for those conducting the study, and information on where this study is being conducted.

Study Locations

      • Riga, Latvia
        • University of Latvia, Faculty of Medicine

Participation Criteria

Researchers look for people who fit a certain description, called eligibility criteria. Some examples of these criteria are a person's general health condition or prior treatments.

Eligibility Criteria

Ages Eligible for Study

  • Adult
  • Older Adult

Accepts Healthy Volunteers

Yes

Sampling Method

Non-Probability Sample

Study Population

Study population will be selected mainly in primary care clinic - endocrinologists, general practitioners will be informed about the study. Informative materials about the study will be prepared, which will contain the contact information (e-mail, phone number) using which the potential participant will be able to apply for the study. In addition, it is possible to invite participants to the research using public media (TV, radio, internet), social networks and the homepage of the University of Latvia. Initially, potential participants will be informed about the purpose, the progress of the study by phone or e-mail. If potential individuals willing to participate, more detailed information and signing of informed consent forms will take place during the screening visit.

Description

Inclusion Criteria:

  • Healthy adult volunteers (age ≥ 18 years old);
  • Overweight (BMI 25 - 29.9 kg/m2) and obese grade I individuals (with BMI 30 - 34.9 kg/m2);
  • Written consent of the participant after being informed;
  • Ownership of a smartphone running Android or iOS.

Exclusion Criteria:

  • Non-compliance;
  • Ongoing treatment with immunosuppressive and/or anti-inflammatory medications (NSAIDs, glucocorticoids, chemotherapy, biologicals);
  • Ongoing treatment with glucose lowering drugs, except if anti-diabetic medication has not been stopped - for metformin one month, for GLP-1 RA, tirzepatide - two months prior enrolment;
  • Presence of autoimmune and/or inflammatory disease (autoimmune thyroid disease, psoriasis, inflammatory bowel disease);
  • Skin conditions hindering application of continuous glucose monitoring systems;
  • Diabetes or prediabetes as diagnosed by ADA/WHO criteria according to fasting glucose and/or HbA1c;
  • High risk alcohol consumption - according to NIAAA - National Institute on Alcohol Abuse and Alcoholism (for men - more than 4 drinks on any day or more than 14 drinks per week; for women - more than 3 drinks on any day or more than 7 drinks per week);
  • Factors otherwise limiting the participation in the study according to the judgement of the investigator;
  • Pregnancy or intention to get pregnant during the study timeline.

Study Plan

This section provides details of the study plan, including how the study is designed and what the study is measuring.

How is the study designed?

Design Details

What is the study measuring?

Primary Outcome Measures

Outcome Measure
Measure Description
Time Frame
Validation of the Mission T2D (MT2D) algorithm outputs, that predicts the real time risk for developing pre-diabetes.
Time Frame: The study will run for 15 months. During this period, 75 individuals will be followed for 4 months, including screening visit and three onsite visits, if participants meet the predetermined inclusion criteria. Time frame between visits are 65 days (± 10

Data collections has three main purposes input data for the in-silico MT2D model (gender, weight, height, number of sessions of physical activity, duration of the bout of physical activity, intensity in terms of %VO2max, 3 meals per day (specified macronutrients).

Validation of the MT2D outputs include inflammation markers, metabolic outcomes. The third data for training/validation of the physics-informed machine learning (PIML) algorithm: demographic data; health-related data; lifestyle data (e.g., food consumption data and physical activity data); continuous ingestion through wearable sensors (Continuous Glucose Monitoring (CGM and tracker of physical activity e.g., Fitbit Charge 5, EDIBit.)

The study will run for 15 months. During this period, 75 individuals will be followed for 4 months, including screening visit and three onsite visits, if participants meet the predetermined inclusion criteria. Time frame between visits are 65 days (± 10

Collaborators and Investigators

This is where you will find people and organizations involved with this study.

Publications and helpful links

The person responsible for entering information about the study voluntarily provides these publications. These may be about anything related to the study.

General Publications

Study record dates

These dates track the progress of study record and summary results submissions to ClinicalTrials.gov. Study records and reported results are reviewed by the National Library of Medicine (NLM) to make sure they meet specific quality control standards before being posted on the public website.

Study Major Dates

Study Start (Actual)

January 29, 2024

Primary Completion (Actual)

September 30, 2025

Study Completion (Actual)

September 30, 2025

Study Registration Dates

First Submitted

December 22, 2023

First Submitted That Met QC Criteria

January 5, 2024

First Posted (Actual)

January 8, 2024

Study Record Updates

Last Update Posted (Actual)

May 6, 2026

Last Update Submitted That Met QC Criteria

April 29, 2026

Last Verified

April 1, 2026

More Information

Terms related to this study

Plan for Individual participant data (IPD)

Plan to Share Individual Participant Data (IPD)?

NO

IPD Plan Description

IPD could be shared under specific data transfer agreements between parts.

Drug and device information, study documents

Studies a U.S. FDA-regulated drug product

No

Studies a U.S. FDA-regulated device product

No

This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.

Clinical Trials on Body Weight

Subscribe