A relational database to identify differentially expressed genes in the endometrium and endometriosis lesions

Michael Gabriel, Vidal Fey, Taija Heinosalo, Prem Adhikari, Kalle Rytkönen, Tuomo Komulainen, Kaisa Huhtinen, Teemu Daniel Laajala, Harri Siitari, Arho Virkki, Pia Suvitie, Harry Kujari, Tero Aittokallio, Antti Perheentupa, Matti Poutanen, Michael Gabriel, Vidal Fey, Taija Heinosalo, Prem Adhikari, Kalle Rytkönen, Tuomo Komulainen, Kaisa Huhtinen, Teemu Daniel Laajala, Harri Siitari, Arho Virkki, Pia Suvitie, Harry Kujari, Tero Aittokallio, Antti Perheentupa, Matti Poutanen

Abstract

Endometriosis is a common inflammatory estrogen-dependent gynecological disorder, associated with pelvic pain and reduced fertility in women. Several aspects of this disorder and its cellular and molecular etiology remain unresolved. We have analyzed the global gene expression patterns in the endometrium, peritoneum and in endometriosis lesions of endometriosis patients and in the endometrium and peritoneum of healthy women. In this report, we present the EndometDB, an interactive web-based user interface for browsing the gene expression database of collected samples without the need for computational skills. The EndometDB incorporates the expression data from 115 patients and 53 controls, with over 24000 genes and clinical features, such as their age, disease stages, hormonal medication, menstrual cycle phase, and the different endometriosis lesion types. Using the web-tool, the end-user can easily generate various plot outputs and projections, including boxplots, and heatmaps and the generated outputs can be downloaded in pdf-format.Availability and implementationThe web-based user interface is implemented using HTML5, JavaScript, CSS, Plotly and R. It is freely available from https://endometdb.utu.fi/ .

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
a) Schematic overview of the EndometDB used for transcriptomic analysis. The questionnaire data is collected through the Webropol survey system and imported into EndometDB via an automated service layer API. The numerical results of the biomedical samples analyzed are uploaded into the system together with the sample information. The uploading is done through the web user interface. Analytical functions are available through the analysis engine and can be used to query the whole data set. (b) EndometDB web graphical user interface (GUI) analysis process. The GUI, through a client, communicates with the analysis engine through an API layer implemented with PHP. The analysis engine analyzes the data using the R programming language and Plotly graphing library with the ggplot2 R package is used to generate the plots that are transferred back to the GUI.
Fig. 2
Fig. 2
EndometDB GUI. Screenshot of the EndometDB user interface showing tabbed browsing functions (on the left) and an example boxplot (HSD17B2) as an output of a gene analyzed (on right). The browsing functions include controls for interacting with the filters such as the clinical data (age, menstrual cycle phase, hormonal medication, disease stage), sample data (tissue type; endometrium, peritoneum, endometriosis lesions) and plots and projections. The different color boxplot represents the different tissues and lesions for both the healthy women and patients. The colors make it easier to distinguish between the different tissues and lesions as well as between healthy controls and patients.
Fig. 3
Fig. 3
Example output of unsupervised hierarchical clustering analysis generated via the EndometDB GUI. Example of unsupervised hierarchical clustering analysis of mRNA expression of the differentially expressed WNT pathway genes (Online-only Table 1) in all the sample groups. The different clinical features of the samples (lesion/tissue type, age of the patients with pre-selected grouping, hormonal stage, and disease stage) are attached to the heatmap. Canberra distance metric with Ward’s clustering method was applied showing clusters corresponding to lesions and tissue types. The dendrogram on the x-axis shows the hierarchical relationship between the tissues and lesion as well as the cycle phase and disease stage. While the dendrogram on the y-axis shows the measure of similarities in the activation levels of the WNT signaling pathway genes. The colors represent different tissues and lesions from both healthy controls and patients.
Fig. 4
Fig. 4
Example of correlation heatmap from the EndometDB generated via the GUI. Correlation heatmap after hierarchical clustering of WNT pathway genes (Online-only Table 1) with Pearson correlation method and Ward’s clustering method. Clustering dendrogram on both axes show the measure of the relationship between the genes.
Fig. 5
Fig. 5
Example of projection outputs from the EndometDB generated via the GUI. (a) PCA analysis of mRNA expression of the differentially expressed WNT pathway genes in all the sample groups. Samples are colored by tissue types, and the confidence ellipses with 95% confidence level for the expression in various tissue types are generated using the EndometDB GUI. The PCA separates control and patient endometrium from the three subtypes of endometriosis lesions. (b) LFDA analysis of WNT pathway genes with raw eigenvectors metric colored by tissues. In addition to separating endometriosis lesions from endometrium, LFDA separates ovarian endometriosis from peritoneal and deep endometriosis. The list of WNT pathway genes used in these analyses are listed in Online-only Table 1.
Fig. 6
Fig. 6
Validation of microarray result with qRT-PCR. Examples of the mRNA expression for steroid receptors (ESR1, PGR), steroid metabolizing enzymes (HSD17B2, HSD17B6), and WNT- pathway genes (SFRP2, and MPZL2) analyzed by RT-qPCR (a1, b1, c1, d1, e1, f1) and with the EndometDB (a2, b2, c2, d2, e2, f2).

References

    1. Giudice LC. Clinical practice. Endometriosis. N. Engl. J. Med. 2010;362:2389–98. doi: 10.1056/NEJMcp1000274.
    1. Montgomery GW, Giudice LC. New Lessons about Endometriosis — Somatic Mutations and Disease Heterogeneity. N. Engl. J. Med. 2017;376:1881–1882. doi: 10.1056/NEJMe1701700.
    1. Bulun SE, et al. Endometriosis. Endocr. Rev. 2019;40:1048–1079. doi: 10.1210/er.2018-00242.
    1. Nisolle M, Donnez J. Peritoneal endometriosis, ovarian endometriosis, and adenomyotic nodules of the rectovaginal septum are three different entities. Fertil. Steril. 1997;68:585–596. doi: 10.1016/S0015-0282(97)00191-X.
    1. Vercellini P, Viganò P, Somigliana E, Fedele L. Endometriosis: pathogenesis and treatment. Nat. Rev. Endocrinol. 2014;10:261–275. doi: 10.1038/nrendo.2013.255.
    1. American Society for Reproductive Medicine Revised American Society for Reproductive Medicine classification of endometriosis: 1996. Fertil. Steril. 1997;67:817–821. doi: 10.1016/S0015-0282(97)81391-X.
    1. Gibson, D. A., Simitsidellis, I., Collins, F. & Saunders, P. T. K. Endometrial Intracrinology: Oestrogens, Androgens and Endometrial Disorders. Int. J. Mol. Sci. 19, (2018).
    1. Trevino V, Falciani F, Barrera-Saldaña HA. DNA microarrays: a powerful genomic tool for biomedical and clinical research. Mol. Med. 2007;13:527–41. doi: 10.2119/2006-00107.Trevino.
    1. Borghese B, et al. Research Resource: Gene Expression Profile for Ectopic Versus Eutopic Endometrium Provides New Insights into Endometriosis Oncogenic Potential. Mol. Endocrinol. 2008;22:2557–2562. doi: 10.1210/me.2008-0322.
    1. Hever A, et al. Human endometriosis is associated with plasma cells and overexpression of B lymphocyte stimulator. Proc. Natl. Acad. Sci. U. S. A. 2007;104:12451–6. doi: 10.1073/pnas.0703451104.
    1. Hull ML, et al. Endometrial-peritoneal interactions during endometriotic lesion establishment. Am. J. Pathol. 2008;173:700–15. doi: 10.2353/ajpath.2008.071128.
    1. Ahn SH, et al. Immune-inflammation gene signatures in endometriosis patients. Fertil. Steril. 2016;106:1420–1431.e7. doi: 10.1016/j.fertnstert.2016.07.005.
    1. Rekker K, et al. High-throughput mRNA sequencing of stromal cells from endometriomas and endometrium. Reproduction. 2017;154:93–100. doi: 10.1530/REP-17-0092.
    1. Burney RO, et al. Gene Expression Analysis of Endometrium Reveals Progesterone Resistance and Candidate Susceptibility Genes in Women with Endometriosis. Endocrinology. 2007;148:3814–3826. doi: 10.1210/en.2006-1692.
    1. Tamaresis JS, et al. Molecular classification of endometriosis and disease stage using high-dimensional genomic data. Endocrinology. 2014;155:4986–99. doi: 10.1210/en.2014-1490.
    1. Zhao L, et al. Identification of global transcriptome abnormalities and potential biomarkers in eutopic endometria of women with endometriosis: A preliminary study. Biomed. Reports. 2017;6:654. doi: 10.3892/br.2017.902.
    1. Khan, M. A., Sengupta, J., Mittal, S. & Ghosh, D. Genome-wide expressions in autologous eutopic and ectopic endometrium of fertile women with endometriosis. Reprod. Biol. Endocrinol. 10, 84 (2012).
    1. Coutinho, L. M., Ferreira, M. C., Rocha, A. L. L., Carneiro, M. M. & Reis, F. M. In Advances in Clinical Chemistry 1st edn, vol. 89 (ed. Gregory S. Makowski) Ch. 2 (Academic Press Inc., 2019).
    1. Heinosalo T, et al. Secreted frizzled-related protein 2 (SFRP2) expression promotes lesion proliferation via canonical WNT signaling and indicates lesion borders in extraovarian endometriosis. Hum. Reprod. 2018;33:817–831. doi: 10.1093/humrep/dey026.
    1. Johnson NP, Miller LM. EPHect - the Endometriosis Phenome (and Biobanking) Harmonisation Project - may be very helpful for clinicians and the women they are treating. F1000Research. 2017;6:14. doi: 10.12688/f1000research.9850.1.
    1. Becker CM, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonisation Project: I. Surgical phenotype data collection in endometriosis research. Fertil. Steril. 2014;102:1213–1222. doi: 10.1016/j.fertnstert.2014.07.709.
    1. Vitonis AF, et al. World Endometriosis Research Foundation Endometriosis Phenome and biobanking harmonization project: II. Clinical and covariate phenotype data collection in endometriosis research. Fertil. Steril. 2014;102:1223–1232. doi: 10.1016/j.fertnstert.2014.07.1244.
    1. Fassbender A, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonisation Project: IV. Tissue collection, processing, and storage in endometriosis research. Fertil. Steril. 2014;102:1244–1253. doi: 10.1016/j.fertnstert.2014.07.1209.
    1. Rahmioglu N, et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonization Project: III. Fluid biospecimen collection, processing, and storage in endometriosis research. Fertil. Steril. 2014;102:1233–1243. doi: 10.1016/j.fertnstert.2014.07.1208.
    1. Villanueva RAM, Chen ZJ. Measurement: Interdisciplinary Research and Perspectives. 2nd ed. New York: Springer-Verlag; 2019. ggplot2: Elegant Graphics for Data Analysis.
    1. Gabriel M, 2020. A relational database to identify differentially expressed genes in the endometrium and endometriosis lesions. figshare.
    1. Dunning MJ, Smith ML, Ritchie ME, Tavare S. beadarray: R classes and methods for Illumina bead-based data. Bioinformatics. 2007;23:2183–2184. doi: 10.1093/bioinformatics/btm311.
    1. Müller C, et al. Removing Batch Effects from Longitudinal Gene Expression - Quantile Normalization Plus ComBat as Best Approach for Microarray Transcriptome Data. PLoS One. 2016;11:e0156594. doi: 10.1371/journal.pone.0156594.
    1. Gentleman RC, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5:R80. doi: 10.1186/gb-2004-5-10-r80.
    1. Team., R. C. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. (2019).
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2.
    1. Allen JD, et al. Probe mapping across multiple microarray platforms. Brief. Bioinform. 2012;13:547–554. doi: 10.1093/bib/bbr076.
    1. Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/ Bioconductor package biomaRt. Nat. Protoc. 2009;4:1184–1191. doi: 10.1038/nprot.2009.97.
    1. Smedley D, et al. The BioMart community portal: an innovative alternative to large, centralized data repositories. Nucleic Acids Res. 2015;43:W589–W598. doi: 10.1093/nar/gkv350.
    1. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–127. doi: 10.1093/biostatistics/kxj037.
    1. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–883. doi: 10.1093/bioinformatics/bts034.
    1. 2011. Gene Expression Omnibus. GPL13376
    1. 2010. Gene Expression Omnibus. GPL10558
    1. Gabriel M, Poutanen M, 2019. Transcriptome analysis of differential gene expression of endometrium, peritoneum and endometriosis lesions. Gene Expression Omnibus. GSE141549
    1. Kauffmann A, Gentleman R, Huber W. arrayQualityMetrics—a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25:415–416. doi: 10.1093/bioinformatics/btn647.
    1. Green J, Nusse R, van Amerongen R. The role of Ryk and Ror receptor tyrosine kinases in wnt signal transduction. Cold Spring Harb. Perspect. Biol. 2014;6:a009175. doi: 10.1101/cshperspect.a009175.
    1. Niehrs C. The complex world of WNT receptor signalling. Nature Reviews Molecular Cell Biology. 2012;13:767–779. doi: 10.1038/nrm3470.
    1. Clevers H, Loh KM, Nusse R. An integral program for tissue renewal and regeneration: Wnt signaling and stem cell control. Science. 2014;346:1248012. doi: 10.1126/science.1248012.
    1. Pfaffl MW. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001;29:e45. doi: 10.1093/nar/29.9.e45.

Source: PubMed

3
Abonneren