Gene Expression Signatures Can Aid Diagnosis of Sexually Transmitted Infection-Induced Endometritis in Women

Xiaojing Zheng, Catherine M O'Connell, Wujuan Zhong, Taylor B Poston, Harold C Wiesenfeld, Sharon L Hillier, Maria Trent, Charlotte Gaydos, George Tseng, Brandie D Taylor, Toni Darville, Xiaojing Zheng, Catherine M O'Connell, Wujuan Zhong, Taylor B Poston, Harold C Wiesenfeld, Sharon L Hillier, Maria Trent, Charlotte Gaydos, George Tseng, Brandie D Taylor, Toni Darville

Abstract

Sexually transmitted infection (STI) of the upper reproductive tract can result in inflammation and infertility. A biomarker of STI-induced upper tract inflammation would be significant as many women are asymptomatic and delayed treatment increases risk of sequelae. Blood mRNA from 111 women from three cohorts was profiled using microarray. Unsupervised analysis revealed a transcriptional profile that distinguished 9 cases of STI-induced endometritis from 18 with cervical STI or uninfected controls. Using a hybrid feature selection algorithm we identified 21 genes that yielded maximal classification accuracy within our training dataset. Predictive accuracy was evaluated using an independent testing dataset of 5 cases and 10 controls. Sensitivity was evaluated in a separate test set of 12 women with asymptomatic STI-induced endometritis in whom cervical burden was determined by PCR; and specificity in an additional test set of 15 uninfected women with pelvic pain due to unknown cause. Disease module preservation was assessed in 42 women with a clinical diagnosis of pelvic inflammatory disease (PID). We also tested the ability of the biomarker to discriminate STI-induced endometritis from other diseases. The biomarker was 86.7% (13/15) accurate in correctly distinguishing cases from controls in the testing dataset. Sensitivity was 83.3% (5/6) in women with high cervical Chlamydia trachomatis burden and asymptomatic endometritis, but 0% (0/6) in women with low burden. Specificity in patients with non-STI-induced pelvic pain was 86.7% (13/15). Disease modules were preserved in all 8 biomarker predicted cases. The 21-gene biomarker was highly discriminatory for systemic infections, lupus, and appendicitis, but wrongly predicted tuberculosis as STI-induced endometritis in 52.4%. A 21-gene biomarker can identify asymptomatic women with STI-induced endometritis that places them at risk for chronic disease development and discriminate STI-induced endometritis from non-STI pelvic pain and other diseases.

Keywords: Chlamydia; biomarker; gonorrhea; mRNA; pelvic inflammatory disease.

Figures

Figure 1
Figure 1
Diagram of study organization. Blood transcriptional mRNA profiles from cases and controls were analyzed in a training dataset. Identified classifier genes were subsequently validated in an independent testing dataset. The classifier genes were also evaluated in three additional independent patient datasets for sensitivity, specificity, and disease module preservation, respectively. †Cases: women with symptoms consistent with PID and with biopsy confirmed endometrial STI with N. gonorrhoeae and/or C. trachomatis and/or M. genitalium and endometritis. ‡‡Controls: asymptomatic women who were uninfected or infected at the cervix only, with no endometrial inflammation. *Participants recruited from TRAC cohort. **Participants recruited from ACE cohort. ***Participants recruited from TECH-N cohort comprised women with symptoms of PID who had cervical microbiology determined but lacked endometrial data.
Figure 2
Figure 2
Unsupervised hierarchical clustering and principal component analysis (PCA) revealed distinctive blood mRNA transcriptional profiles separating cases from controls. Unsupervised hierarchical clustering of blood transcriptional profiles of women in training (A) and testing (B) datasets using 4952 transcripts. Samples were ordered by hierarchical clustering (Spearman correlation with average linkage) creating a condition tree, upper horizontal edge of heat map; study groupings (clinical phenotypes) are the colored blocks on the top of each profile. Heat map rows are genes; columns are participants. Principle component analysis of the variance in mRNA expression of the subjects depicted in the heat maps in training (C) and testing (D) datasets, using 4,952 transcripts and the same color scheme with each colored square depicting one subject. The x axis represents the first principal component, PC1, which accounts for the largest variance of mRNA expression, and the y axis, PC2, explains the second largest variance.
Figure 3
Figure 3
Predicted probability of STI-induced endometritis in testing datasets using a 21-gene signature and SVM learning algorithm. Each dot represents one subject. The x-axis indicates the biopsy confirmed groups of subjects and y-axis indicates the predicted probability of STI-induced endometritis. The dotted line corresponds to a predicted probability cutoff of 0.5. If the predicted probability of PID is > 0.5, we consider the predicted response as high risk.
Figure 4
Figure 4
Predicted probability using 21-gene signature and SVM of STI-induced endometritis in a target dataset of asymptomatic subjects with biopsy proven subclinical CT/GC-induced endometritis. Each dot represents one subject. The x-axis indicates the log10 cervical C. trachomatis load and the y-axis indicates the predicted probability of STI-induced endometritis.
Figure 5
Figure 5
Predicted probability using 21-gene signature and SVM of STI-induced endometritis in a dataset of women who were symptomatic but were uninfected. Subsets of patients within the dataset were positive and negative for histologic endometritis. Each dot represents one subject. The x-axis indicates two groups of women with pelvic pain without STI. One group has normal histology, the other group has chronic endometritis; y-axis indicates the predicted probability of STI-induced endometritis.
Figure 6
Figure 6
Zsummary statistics reveal module preservation of blood profiles from women with biopsy-confirmed CT/GC-induced endometritis compared to women with biomarker-predicted STI-induced endometritis from independent TECH-N cohort. Colored circles correspond with highly correlated gene modules identified in biopsy-confirmed CT/GC-induced endometritis. The x-axis indicates the number of genes in each module; y axis is the preservation Zsummary value. The dotted lines with cutoff Zsummary of 2 and 10 indicate preservation and high level preservation respectively.
Figure 7
Figure 7
The 21-gene biomarker distinguishes STI-induced endometritis from other infectious and inflammatory conditions. (A) The number of samples within each disease and control group predicted as STI-induced endometritis is summarized in the table. (B) The predicted probability of STI-induced endometritis in each disease and control group using a 21-gene biomarker. Each dot represents one subject. The x-axis indicates the groups of subjects and y-axis indicates the predicted probability of STI-induced endometritis. The dotted line corresponds to a predicted probability cutoff of 0.5. (C) Unsupervised hierarchical clustering using 21-genes revealed distinctive blood mRNA transcriptional profiles separating STI-induced endometritis from other diseases and controls. Study groupings (clinical phenotypes) are the colored blocks on the top of each profile. Heat map rows are genes; columns are participants. Transformed expression levels are indicated by color scale, with red representing relative high expression and blue relative low expression.

References

    1. Berry M. P., Graham C. M., McNab F. W., Xu Z., Bloch S. A., Oni T., et al. . (2010). An interferon-inducible neutrophil-driven blood transcriptional signature in human tuberculosis. Nature 466, 973–977. 10.1038/nature09247
    1. Bjartling C., Osser S., Persson K. (2012). Mycoplasma genitalium in cervicitis and pelvic inflammatory disease among women at a gynecologic outpatient service. Am. J. Obstet. Gynecol. 206, 476. e1–476.e8. 10.1016/j.ajog.2012.02.036
    1. Centers for Disease Control and Prevention (2017). Sexually Transmitted Disease Surveillance 2016. Atlanta, GA: US Department of Health and Human Services.
    1. Chawla L. S., Toma I., Davison D., Vaziri K., Lee J., Lucas R., et al. . (2016). Acute appendicitis: transcript profiling of blood identifies promising biomarkers and potential underlying processes. BMC Med. Genomics 9:40. 10.1186/s12920-016-0200-y
    1. Duan K. B., Rajapakse J. C., Wang H., Azuaje F. (2005). Multiple SVM-RFE for gene selection in cancer classification with expression data. IEEE Trans. Nanobiosci. 4, 228–234. 10.1109/TNB.2005.853657
    1. Fuller T. F., Ghazalpour A., Aten J. E., Drake T. A., Lusis A. J., Horvath S. (2007). Weighted gene coexpression network analysis strategies applied to mouse weight. Mamm. Genome 18, 463–472. 10.1007/s00335-007-9043-3
    1. Guyon I., Weston J., Barnhill S., Vapnik V. (2002). Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 389–422. 10.1023/A:1012487302797
    1. Haggerty C. L., Totten P. A., Tang G., Astete S. G., Ferris M. J., Norori J., et al. . (2016). Identification of novel microbes associated with pelvic inflammatory disease and infertility. Sex. Transm. Infect. 92, 441–446. 10.1136/sextrans-2015-052285
    1. Huggins C. E., Domenighetti A. A., Ritchie M. E., Khalil N., Favaloro J. M., Proietto J., et al. (2008). Functional and metabolic remodeling in GLUT4-deficient hearts confers hyper-responsiveness to substrate intervention. J. Mol. Cell. Cardiol. 44, 270–280. 10.1016/j.yjmcc.2007.11.020
    1. Irizarry R. A., Hobbs B., Collin F., Beazer-Barclay Y. D., Antonellis K. J., Scherf U., et al. . (2003). Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249–264. 10.1093/biostatistics/4.2.249
    1. Johnson W. E., Li C., Rabinovic A. (2007). Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127. 10.1093/biostatistics/kxj037
    1. Kiviat N. B., Wølner-Hanssen P., Eschenbach D. A., Wasserheit J. N., Paavonen J. A., Bell T. A., et al. . (1990). Endometrial histopathology in patients with culture-proved upper genital tract infection and laparoscopically diagnosed acute salpingitis. Am. J. Surg. Pathol. 14, 167–175. 10.1097/00000478-199002000-00008
    1. Langfelder P., Luo R., Oldham M. C., Horvath S. (2011). Is my network module preserved and reproducible? PLoS Comput. Biol. 7:e1001057. 10.1371/journal.pcbi.1001057
    1. Luo W., Friedman M. S., Shedden K., Hankenson K. D., Woolf P. J. (2009). GAGE: generally applicable gene set enrichment for pathway analysis. BMC Bioinformatics 10:161 10.1186/1471-2105-10-161
    1. Masucci G. V., Cesano A., Hawtin R., Janetzki S., Zhang J., Kirsch I., et al. . (2016). Validation of biomarkers to predict response to immunotherapy in cancer: volume I - pre-analytical and analytical validation. J. Immunother. Cancer 4:76. 10.1186/s40425-016-0178-1
    1. McGowin C. L., Anderson-Smits C. (2011). Mycoplasma genitalium: an emerging cause of sexually transmitted disease in women. PLoS Pathog. 7:e1001324. 10.1371/journal.ppat.1001324
    1. Northcott P. A., Shih D. J., Remke M., Cho Y. J., Kool M., Hawkins C., et al. . (2012). Rapid, reliable, and reproducible molecular sub-grouping of clinical medulloblastoma samples. Acta Neuropathol. 123, 615–626. 10.1007/s00401-011-0899-7
    1. Poston T. B., Gottlieb S. L., Darville T. (2017). Status of vaccine research and development of vaccines for Chlamydia trachomatis infection. Vaccine. 10.1016/j.vaccine.2017.01.023. [Epub ahead of print].
    1. Price M. J., Ades A. E., De Angelis D., Welton N. J., Macleod J., Soldan K., et al. . (2013). Risk of pelvic inflammatory disease following Chlamydia trachomatis infection: analysis of prospective studies with a multistate model. Am. J. Epidemiol. 178, 484–492. 10.1093/aje/kws583
    1. Reese S. E., Archer K. J., Therneau T. M., Atkinson E. J., Vachon C. M., De Andrade M., et al. . (2013). A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis. Bioinformatics 29, 2877–2883. 10.1093/bioinformatics/btt480
    1. Rein D. B., Kassler W. J., Irwin K. L., Rabiee L. (2000). Direct medical cost of pelvic inflammatory disease and its sequelae: decreasing, but still substantial. Obstet. Gynecol. 95, 397–402. 10.1097/00006250-200003000-00016
    1. Russell A. N., Zheng X., O'Connell C. M., Taylor B. D., Wiesenfeld H. C., Hillier S. L., et al. . (2016). Analysis of factors driving incident and ascending infection and the role of serum antibody in Chlamydia trachomatis genital tract infection. J. Infect. Dis. 213, 523–531. 10.1093/infdis/jiv438
    1. Trent M., Chung S. E., Gaydos C., Frick K. D., Anders J., Huettner S., et al. . (2016). Recruitment of minority adolescents and young adults into randomised clinical trials: Testing the design of the Technology Enhanced Community Health Nursing (TECH-N) pelvic inflammatory disease trial. Eur. Med. J. Reprod. Health 2, 41–51. 10.1016/j.jadohealth.2016.10.041
    1. Veldman-Jones M. H., Brant R., Rooney C., Geh C., Emery H., Harbron C. G., et al. . (2015). Evaluating robustness and sensitivity of the NanoString Technologies nCounter Platform to enable multiplexed gene expression analysis of clinical samples. Cancer Res. 75, 2587–2593. 10.1158/0008-5472.CAN-15-0262
    1. Wiesenfeld H. C., Hillier S. L., Meyn L. A., Amortegui A. J., Sweet R. L. (2012). Subclinical pelvic inflammatory disease and infertility. Obstet. Gynecol. 120, 37–43. 10.1097/AOG.0b013e31825a6bc9
    1. Wiesenfeld H. C., Sweet R. L., Ness R. B., Krohn M. A., Amortegui A. J., Hillier S. L. (2005). Comparison of acute and subclinical pelvic inflammatory disease. Sex. Transm. Dis. 32, 400–405. 10.1097/01.olq.0000154508.26532.6a
    1. Workowski K. A., Bolan G. A., Centers for Disease Control Prevention (2015). Sexually transmitted diseases treatment guidelines 2015. MMWR Recomm. Rep. 64, 1–137. 10.1093/cid/civ771
    1. Wurmbach E., Yuen T., Ebersole B. J., Sealfon S. C. (2001). Gonadotropin-releasing hormone receptor-coupled gene network organization. J. Biol. Chem. 276, 47195–47201. 10.1074/jbc.M108716200
    1. Zheng X., O'Connell C. M., Zhong W., Nagarajan U. M., Tripathy M., Lee D., et al. (2018). Discovery of blood transcriptional endotypes in women with pelvic inflammatory disease. J. Immunol. 2018, 2941–2956. 10.4049/jimmunol.1701658
    1. Zhou Y., Cras-Méneur C., Ohsugi M., Stormo G. D., Permutt M. A. (2007). A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments. Bioinformatics 23, 2073–2079. 10.1093/bioinformatics/btm292

Source: PubMed

3
購読する