RNA profiles reveal signatures of future health and disease in pregnancy

Morten Rasmussen, Mitsu Reddy, Rory Nolan, Joan Camunas-Soler, Arkady Khodursky, Nikolai M Scheller, David E Cantonwine, Line Engelbrechtsen, Jia Dai Mi, Arup Dutta, Tiffany Brundage, Farooq Siddiqui, Mainou Thao, Elaine P S Gee, Johnny La, Courtney Baruch-Gravett, Mark K Santillan, Saikat Deb, Shaali M Ame, Said M Ali, Melanie Adkins, Mark A DePristo, Manfred Lee, Eugeni Namsaraev, Dorte Jensen Gybel-Brask, Lillian Skibsted, James A Litch, Donna A Santillan, Sunil Sazawal, Rachel M Tribe, James M Roberts, Maneesh Jain, Estrid Høgdall, Claudia Holzman, Stephen R Quake, Michal A Elovitz, Thomas F McElrath, Morten Rasmussen, Mitsu Reddy, Rory Nolan, Joan Camunas-Soler, Arkady Khodursky, Nikolai M Scheller, David E Cantonwine, Line Engelbrechtsen, Jia Dai Mi, Arup Dutta, Tiffany Brundage, Farooq Siddiqui, Mainou Thao, Elaine P S Gee, Johnny La, Courtney Baruch-Gravett, Mark K Santillan, Saikat Deb, Shaali M Ame, Said M Ali, Melanie Adkins, Mark A DePristo, Manfred Lee, Eugeni Namsaraev, Dorte Jensen Gybel-Brask, Lillian Skibsted, James A Litch, Donna A Santillan, Sunil Sazawal, Rachel M Tribe, James M Roberts, Maneesh Jain, Estrid Høgdall, Claudia Holzman, Stephen R Quake, Michal A Elovitz, Thomas F McElrath

Abstract

Maternal morbidity and mortality continue to rise, and pre-eclampsia is a major driver of this burden1. Yet the ability to assess underlying pathophysiology before clinical presentation to enable identification of pregnancies at risk remains elusive. Here we demonstrate the ability of plasma cell-free RNA (cfRNA) to reveal patterns of normal pregnancy progression and determine the risk of developing pre-eclampsia months before clinical presentation. Our results centre on comprehensive transcriptome data from eight independent prospectively collected cohorts comprising 1,840 racially diverse pregnancies and retrospective analysis of 2,539 banked plasma samples. The pre-eclampsia data include 524 samples (72 cases and 452 non-cases) from two diverse independent cohorts collected 14.5 weeks (s.d., 4.5 weeks) before delivery. We show that cfRNA signatures from a single blood draw can track pregnancy progression at the placental, maternal and fetal levels and can robustly predict pre-eclampsia, with a sensitivity of 75% and a positive predictive value of 32.3% (s.d., 3%), which is superior to the state-of-the-art method2. cfRNA signatures of normal pregnancy progression and pre-eclampsia are independent of clinical factors, such as maternal age, body mass index and race, which cumulatively account for less than 1% of model variance. Further, the cfRNA signature for pre-eclampsia contains gene features linked to biological processes implicated in the underlying pathophysiology of pre-eclampsia.

Conflict of interest statement

M. Rasmussen, M. Reddy, R.N., J.C.-S., A.K., T.B., F.S., M.T., E.P.S.G., J.L., M.L., E.N., M.J., M.A.E., M.D., S.R.Q. and T.M. have an equity interest in Mirvie. All cohort contributors were compensated for sample collection and/or shipping. T.M. serves on the scientific advisory board for Mirvie, NxPrenatal, Momenta Pharmaceuticals and Hoffmann–La Roche. M. Rasmussen, M. Reddy, R.N., J.C.-S., A.K., T.B., F.S., M.T., E.P.S.G., J.L., M.L., E.N., M.J., M.A.E., S.R.Q., M.K.S. and D.A.S. are inventors on patent applications (US20170145509A1, US9937182B2 and EP2954324A1) that cover the detection, diagnosis or treatment of pregnancy complications.

© 2022. The Author(s).

Figures

Fig. 1. Overview of plasma sampling and…
Fig. 1. Overview of plasma sampling and cohorts and gestational age modelling.
a, Cohorts are labelled A–H (Table 1). Circles represent plasma samples from liquid biopsies (n = 2,539). Colours represent the race of the maternal donor. b, Model predictions from the hold-out test (n = 474) using cfRNA transcript data in the Lasso linear model versus ultrasound-predicted gestational age. The dark grey zone represents 1 s.d., and the light grey zone represents 2 s.d. c, Variance explained from ANOVA.
Fig. 2. Temporal profiles of pregnancy pathways…
Fig. 2. Temporal profiles of pregnancy pathways for gene sets from the gestational age model and independently identified gene sets for placenta, developing fetal heart and collagen extracellular matrix known to be involved in uterus and cervix growth over gestation.
ad, Maternal plasma transcriptome fractions for gene sets averaged across all samples in each collection window. Gestational age model (a), placenta (b), developing heart (c) and collagen extracellular matrix (ECM) (d). Error bars correspond to the 95% confidence interval around the mean. CPM, counts per million. n = 93 for each time point and gene set. eh, Signal across all cohorts with longitudinal data: gestational age model (e), placenta (f), developing heart (g) and collagen ECM (h). Linear fits are shown of transcriptome fractions for all samples across corresponding gestational ages recorded at collection times. The band around the solid line corresponds to the 95% confidence interval. All slopes for the gestational age coefficients are distinct from 0 at a confidence level of 0.05. Cohort is indicated by colour.
Fig. 3. Features and model performance for…
Fig. 3. Features and model performance for prediction of pre-eclampsia.
a, Sample collection time (dashed lines) and delivery time (solid lines) for women with pre-eclampsia (purple and green) and controls (grey). Gradients illustrate timelines for developing pathophysiology and onset of clinical symptoms. b, Quantile–quantile plot of ranked Spearman P values for women with pre-eclampsia (cases) versus controls. P values were calculated from Spearman correlation on cohort-corrected data for each gene. The genes used in the model are labelled. The black dotted line represents the expectation. c, Receiver operating characteristic curve (mean and 95% confidence interval) for the logistic regression model for pre-eclampsia (n = 524). d, Kaplan–Meier curves of deliveries in test-positive and test-negative populations (n = 439), excluding spontaneous preterm deliveries.
Extended Data Fig. 1. Temporal profiles of…
Extended Data Fig. 1. Temporal profiles of pregnancy-related endocrine signatures during pregnancy.
Seven pregnancy-related gene ontology term signatures identified as highly significantly enriched (α=0.01) were profiled across collection times using cumulative CPM. Plasma transcriptome fractions for each gene set were averaged across all samples in each collection window with error bars corresponding to the 95% confidence interval around the mean. Panels correspond to different ranges of CPM, for the ease of comparison. CPM, counts per million. N=93 for each timepoint and gene set.
Extended Data Fig. 2. Temporal profiles of…
Extended Data Fig. 2. Temporal profiles of fetal gene sets from developing kidney and gastrointestinal tract.
a-c, Maternal plasma transcriptome fractions for gene sets averaged across all samples in a given collection window. Error bars correspond to the 95% confidence interval around the mean. CPM, counts per million. N=93 for each timepoint and gene set. d-f, signal across all cohorts with longitudinal data. Linear fits of transcriptome fractions for all samples across corresponding gestational ages recorded at the collection times. The band around the solid line corresponds to the 95% CI. All slopes for the gestational age coefficient are distinct from 0 at a confidence level of 0.05. Cohort is indicated by color.
Extended Data Fig. 3. Bootstrapping with and…
Extended Data Fig. 3. Bootstrapping with and without time-scrambling.
a-d, for each of the significantly enriched gene sets, the trends were evaluated by bootstrapping (B=1,000) the original data (blue lines) and time-scrambled data (grey lines) obtained by reshuffling collection times.
Extended Data Fig. 4. Gene set enrichment…
Extended Data Fig. 4. Gene set enrichment analysis of preeclampsia for gene ontology (GO) gene sets.
a, Top-20 significantly upregulated gene sets. b, Top-20 significantly downregulated gene sets. Color gradient for adjusted p-value. NES, absolute normalized enrichment score.
Extended Data Figure 5. Effect of correcting…
Extended Data Figure 5. Effect of correcting for total count and cohort.
Counts for ACTB as a function of total counts for the sample before (a) and after (b) correction. Counts for CAPN6 as a function of gestational age for all samples used in the gestational age model before (c) and after (d) cohort correction. Plot of first two principal components before (e) and after (f) cohort correction.
Extended Data Fig. 6. Learning curve for…
Extended Data Fig. 6. Learning curve for gestational age model.
Model for gestational age is trained with increasing sample size, error is plotted for both training set (Cross-validated, purple) and held-out test set (green). Error bars are 1 standard deviation.
Extended Data Fig. 7. Learning curve for…
Extended Data Fig. 7. Learning curve for preeclampsia model.
Model performance as a function of training set size. Error bars are 1 standard deviation.

References

    1. Rich-Edwards JW, Fraser A, Lawlor DA, Catov JM. Pregnancy characteristics and women’s future cardiovascular health: an underused opportunity to improve women’s health? Epidemiol. Rev. 2014;36:57–70. doi: 10.1093/epirev/mxt006.
    1. Tan MY, et al. Screening for pre-eclampsia by maternal factors and biomarkers at 11–13 weeks’ gestation: first-trimester PE screening. Ultrasound Obstet. Gynecol. 2018;52:186–195. doi: 10.1002/uog.19112.
    1. Marinić M, Lynch VJ. Relaxed constraint and functional divergence of the progesterone receptor (PGR) in the human stem-lineage. PLoS Genet. 2020;16:e1008666. doi: 10.1371/journal.pgen.1008666.
    1. Robillard P-Y, Dekker GA, Hulsey TC. Evolutionary adaptations to pre-eclampsia/eclampsia in humans: low fecundability rate, loss of oestrus, prohibitions of incest and systematic polyandry. Am. J. Reprod. Immunol. 2002;47:104–111. doi: 10.1034/j.1600-0897.2002.1o043.x.
    1. McCarthy FP, Kingdom JC, Kenny LC, Walsh SK. Animal models of preeclampsia; uses and limitations. Placenta. 2011;32:413–419. doi: 10.1016/j.placenta.2011.03.010.
    1. Chez RA. Nonhuman primate models of toxemia of pregnancy. Perspect. Nephrol. Hypertens. 1976;5:421–424.
    1. Malassiné A, Frendo JL, Evain-Brion D. A comparison of placental development and endocrine functions between the human and mouse model. Hum. Reprod. Update. 2003;9:531–539. doi: 10.1093/humupd/dmg043.
    1. Skupski DW, et al. Estimating gestational age from ultrasound fetal biometrics. Obstet Gynecol. 2017;130:433–441. doi: 10.1097/AOG.0000000000002137.
    1. Khosrotehrani K, Johnson KL, Cha DH, Salomon RN, Bianchi DW. Transfer of fetal cells with multilineage potential to maternal tissue. JAMA. 2004;292:75–80. doi: 10.1001/jama.292.1.75.
    1. Kahn DA, Baltimore D. Pregnancy induces a fetal antigen-specific maternal T regulatory cell response that contributes to tolerance. Proc. Natl Acad. Sci. USA. 2010;107:9299–9304. doi: 10.1073/pnas.1003909107.
    1. Ashburner M, et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556.
    1. Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102.
    1. Liberzon A, et al. Molecular Signatures Database (MSigDB) 3.0. Bioinformatics. 2011;27:1739–1740. doi: 10.1093/bioinformatics/btr260.
    1. Shi J-W, et al. Collagen at the maternal-fetal interface in human pregnancy. Int. J. Biol. Sci. 2020;16:2220–2234. doi: 10.7150/ijbs.45586.
    1. Menon R, et al. Single-cell analysis of progenitor cell dynamics and lineage specification in the human fetal kidney. Development. 2018;145:dev164038. doi: 10.1242/dev.164038.
    1. Ryan D, et al. Development of the human fetal kidney from mid to late gestation in male and female infants. EBioMedicine. 2018;27:275–283. doi: 10.1016/j.ebiom.2017.12.016.
    1. Gao S, et al. Tracing the temporal-spatial transcriptome landscapes of the human fetal digestive tract using single-cell RNA-sequencing. Nat. Cell Biol. 2018;20:721–734. doi: 10.1038/s41556-018-0105-4.
    1. Munchel S, et al. Circulating transcripts in maternal blood reflect a molecular signature of early-onset preeclampsia. Sci. Transl. Med. 2020;12:eaaz0131. doi: 10.1126/scitranslmed.aaz0131.
    1. Uhlén M, et al. Tissue-based map of the human proteome. Science. 2015;347:1260419. doi: 10.1126/science.1260419.
    1. Kramer AW, Lamale-Smith LM, Winn VD. Differential expression of human placental PAPP-A2 over gestation and in preeclampsia. Placenta. 2016;37:19–25. doi: 10.1016/j.placenta.2015.11.004.
    1. Chen X, et al. The potential role of pregnancy-associated plasma protein-A2 in angiogenesis and development of preeclampsia. Hypertens. Res. 2019;42:970–980. doi: 10.1038/s41440-019-0224-8.
    1. Poon CE, Madawala RJ, Day ML, Murphy CR. Claudin 7 is reduced in uterine epithelial cells during early pregnancy in the rat. Histochem. Cell Biol. 2013;139:583–593. doi: 10.1007/s00418-012-1052-y.
    1. Schumann S, Buck VU, Classen-Linke I, Wennemuth G, Grümmer R. Claudin-3, claudin-7, and claudin-10 show different distribution patterns during decidualization and trophoblast invasion in mouse and human. Histochem. Cell Biol. 2015;144:571–585. doi: 10.1007/s00418-015-1361-z.
    1. Alazami AM, et al. TLE6 mutation causes the earliest known human embryonic lethality. Genome Biol. 2015;16:240. doi: 10.1186/s13059-015-0792-0.
    1. Wang G, Bonkovsky HL, de Lemos A, Burczynski FJ. Recent insights into the biological functions of liver fatty acid binding protein 1. J. Lipid Res. 2020;56:2238–2247. doi: 10.1194/jlr.R056705.
    1. Cunningham P, McDermott L. Long chain PUFA transport in human term placenta. J. Nutr. 2009;139:636–639. doi: 10.3945/jn.108.098608.
    1. Ren, Z. et al. Distinct molecular processes in placentae involved in two major subtypes of preeclampsia. Preprint at bioRxiv10.1101/787796 (2019).
    1. Gormley M, et al. Preeclampsia: novel insights from global RNA profiling of trophoblast subpopulations. Am. J. Obstet. Gynecol. 2017;217:200.e1–200.e17. doi: 10.1016/j.ajog.2017.03.017.
    1. Redman CW, Sargent IL. Latest advances in understanding preeclampsia. Science. 2005;308:1592–1594. doi: 10.1126/science.1111726.
    1. Zeller T, et al. Transcriptome-wide analysis identifies novel associations with blood pressure. Hypertension. 2017;70:743–750. doi: 10.1161/HYPERTENSIONAHA.117.09458.
    1. Challis JR, et al. Inflammation and pregnancy. Reprod. Sci. 2009;16:206–215. doi: 10.1177/1933719108329095.
    1. Raghupathy R, Kalinka J. Cytokine imbalance in pregnancy complications and its modulation. Front. Biosci. 2008;13:985–994. doi: 10.2741/2737.
    1. Carbone IF, et al. Circulating nucleic acids in maternal plasma and serum in pregnancy complications: are they really useful in clinical practice? A systematic review. Mol. Diagn. Ther. 2020;24:409–431. doi: 10.1007/s40291-020-00468-5.
    1. Vyas DA, Eisenstein LG, Jones DS. Hidden in plain sight—reconsidering the use of race correction in clinical algorithms. N. Engl. J. Med. 2020;383:874–882. doi: 10.1056/NEJMms2004740.
    1. Delgado C, et al. Reassessing the inclusion of race in diagnosing kidney diseases: an interim report from the NKF-ASN Task Force. J. Am. Soc. Nephrol. 2021;32:1305–1317. doi: 10.1681/ASN.2021010039.
    1. Grobman, W. A. et al. Prediction of vaginal birth after cesarean delivery in term gestations: a calculator without race and ethnicity. Am. J. Obstet. Gynecol. 10.1016/j.ajog.2021.05.021 (2021).
    1. Ngo TTM, et al. Noninvasive blood tests for fetal development predict gestational age and preterm delivery. Science. 2018;360:1133–1136. doi: 10.1126/science.aar3819.
    1. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.
    1. Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635.
    1. Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638.
    1. Mootha VK, et al. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 2003;34:267–273. doi: 10.1038/ng1180.
    1. Korotkevich, G. et al. Fast gene set enrichment analysis. Preprint at bioRxiv10.1101/060012 (2016).
    1. Cre, A. S. Fast gene set enrichment analysis. 10.18129/B9.BIOC.FGSEA (Bioconductor, 2017).
    1. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
    1. Tal, R. & Taylor, H. S. Endocrinology of pregnancy. Endotext (, 2021).

Source: PubMed

3
Sottoscrivi