Choice of surrogate tissue influences neonatal EWAS findings

Xinyi Lin, Ai Ling Teh, Li Chen, Ives Yubin Lim, Pei Fang Tan, Julia L MacIsaac, Alexander M Morin, Fabian Yap, Kok Hian Tan, Seang Mei Saw, Yung Seng Lee, Joanna D Holbrook, Keith M Godfrey, Michael J Meaney, Michael S Kobor, Yap Seng Chong, Peter D Gluckman, Neerja Karnani, Xinyi Lin, Ai Ling Teh, Li Chen, Ives Yubin Lim, Pei Fang Tan, Julia L MacIsaac, Alexander M Morin, Fabian Yap, Kok Hian Tan, Seang Mei Saw, Yung Seng Lee, Joanna D Holbrook, Keith M Godfrey, Michael J Meaney, Michael S Kobor, Yap Seng Chong, Peter D Gluckman, Neerja Karnani

Abstract

Background: Epigenomes are tissue specific and thus the choice of surrogate tissue can play a critical role in interpreting neonatal epigenome-wide association studies (EWAS) and in their extrapolation to target tissue. To develop a better understanding of the link between tissue specificity and neonatal EWAS, and the contributions of genotype and prenatal factors, we compared genome-wide DNA methylation of cord tissue and cord blood, two of the most accessible surrogate tissues at birth.

Methods: In 295 neonates, DNA methylation was profiled using Infinium HumanMethylation450 beadchip arrays. Sites of inter-individual variability in DNA methylation were mapped and compared across the two surrogate tissues at birth, i.e., cord tissue and cord blood. To ascertain the similarity to target tissues, DNA methylation profiles of surrogate tissues were compared to 25 primary tissues/cell types mapped under the Epigenome Roadmap project. Tissue-specific influences of genotype on the variable CpGs were also analyzed. Finally, to interrogate the impact of the in utero environment, EWAS on 45 prenatal factors were performed and compared across the surrogate tissues.

Results: Neonatal EWAS results were tissue specific. In comparison to cord blood, cord tissue showed higher inter-individual variability in the epigenome, with a lower proportion of CpGs influenced by genotype. Both neonatal tissues were good surrogates for target tissues of mesodermal origin. They also showed distinct phenotypic associations, with effect sizes of the overlapping CpGs being in the same order of magnitude.

Conclusions: The inter-relationship between genetics, prenatal factors and epigenetics is tissue specific, and requires careful consideration in designing and interpreting future neonatal EWAS.

Trial registration: This birth cohort is a prospective observational study, designed to study the developmental origins of health and disease, and was retrospectively registered on 1 July 2010 under the identifier NCT01174875 .

Keywords: DNA methylation; Epigenome-wide association study; Genotype; Neonate; Prenatal factors; Tissue-specificity.

Conflict of interest statement

Consent for publication

Not applicable.

Competing interests

YSC and KMG have received reimbursement for speaking at conferences sponsored by companies selling nutritional products. They are part of an academic consortium that has received research funding from Abbott Nutrition, Nestec, and Danone. The other authors declare no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Infant cord tissue showed more inter-individual variation than infant cord blood: proportion of CpGs that showed inter-individual variation and interquartile range (IQR) in DNA methylation. a Pie chart shows the proportion of CpGs for four distinct categories: (1) CpGs which showed inter-individual variation in both tissues, (2) CpGs which showed inter-individual variation only in infant cord blood, (3) CpGs which showed inter-individual variation only in infant cord tissue, and (4) CpGs which did not show inter-individual variation in either tissue. A total of 239,560 CpGs passed quality control in both datasets. b Plot of proportion of CpGs (vertical axis) in each tissue (out of 239,560 CpGs) with DNA methylation IQR greater than or equal to the value specified on the horizontal axis. c Boxplots show the distribution of the DNA methylation IQR, for CpGs in infant cord tissue (bright orange) and infant cord blood (bright blue), respectively, for each of the four categories. Outliers are not shown in the boxplots. A CpG was defined to show inter-individual variation if the DNA methylation range (maximum–minimum, excluding outliers) was greater than 10% and DNA methylation 99th percentile–1st percentile was greater than 5%
Fig. 2
Fig. 2
Infant cord tissue is a better surrogate for primary tissues of mesenchymal stem cell (MSC)-derived mesodermic germinal origins, while infant cord blood is a better surrogate for primary tissues of hematopoietic stem cell (HSC)-derived mesodermic germinal origins: hierarchical clustering of GUSTO tissues (cord tissue, cord blood) with 25 primary tissues/cells profiled using reduced representation bisulfite sequencing in the Epigenome Roadmap project. Infant cord tissue clustered more closely with primary tissues of MSC-derived mesodermic germinal origins, while infant cord blood clustered more closely with primary tissues of HSC-derived mesodermic germinal origins. Left panel shows heatmap of DNA methylation values, with each row representing each tissue type and each column representing each CpG. Color changes from yellow to blue as DNA methylation changes from 0% to 100%. Right panel of plot shows dendrogram, with tissue types of ectodermic, endodermic, HSC-derived mesodermic, and MSC-derived mesodermic germinal origins represented in light pink, light purple, light turquoise, and light orange, respectively; GUSTO cord tissue and cord blood are represented in bright orange and bright blue, respectively. DNA methylation values from GUSTO tissues were generated using Infinium 450 K array (for each CpG and tissue type, the median value across all samples was used). For tissues/cells profiled by the Epigenome Roadmap project, only DNA methylation sites with a minimum reads coverage of 30X were retained and reads from both strands were combined. Hierarchical clustering was performed using only CpG sites that passed quality control filtering in GUSTO tissues, were non-missing in at least 10 out of the 25 Epigenome Roadmap samples, and had interquartile range greater than 10% across different Epigenome Roadmap tissues/cells
Fig. 3
Fig. 3
SNPs explained a greater proportion of inter-individual variation in DNA methylation in infant cord blood (CB) than in infant cord tissue (CT): SNP-associated CpGs detected in each infant tissue. a Pie charts show the percentage of CpGs in each infant tissue whose inter-individual variation could be explained by SNPs (out of all CpGs which showed inter-individual variation in the infant tissue). A CpG whose inter-individual variation could be explained by SNPs (SNP-associated) was defined to be one where the most significant association between the CpG and cis-SNPs (all SNPs on the same chromosome as CpG) attained a P value < 5 × 10–8, the commonly used Bonferroni threshold for genome-wide association studies (corresponding to testing for 106 independent SNPs across the genome at a family-wise Type 1 error rate of 0.05). b Overlap between SNP-associated, non-SNP-associated (but variable), and non-variable CpGs in the two tissues. Only CpGs which showed inter-individual variation in at least one tissue were included (N = 98,124). Examining each tissue separately, each of these 98,124 CpGs can either be SNP-associated, not SNP-associated, or not variable in each tissue. The number of CpGs in each of these three sets in each tissue is shown in the bottom left bar chart (for each tissue the number of CpGs from the three sets will sum to 98,124). Collectively, the 98,124 CpGs can be grouped into eight categories. The bottom right panel identifies each of these eight categories, with the solid black dots representing the sets being considered. For example, the extreme right column identifies the group of CpGs that are SNP-associated in both tissues. The top bar chart shows the number of CpGs in each of these eight categories. For example, 7822 CpGs were SNP-associated in both tissues
Fig. 4
Fig. 4
Prenatal factors (PFs) explained a similar proportion of inter-individual variation in infant cord blood (CB) and infant cord tissue (CT): CpGs where the inter-individual variation in DNA methylation were explained by PFs. a Heatmap shows the pairwise Spearman correlation (absolute value) between 45 PFs. Each row/column represents each PF. Color changes from white to blue as correlation changes from zero to one. b Pie charts show the percentage of CpGs in each infant tissue whose inter-individual variation could be explained by PFs (out of all CpGs, which showed inter-individual variation in the infant tissue). A CpG whose inter-individual variation could be explained by PFs was defined to be one where the most significant association between the CpG and all 45 PFs attained a P value < 1 × 10–3, the Bonferroni threshold for testing 45 PFs at a family-wise Type 1 error rate of 0.05. c Overlap between PF-associated, non-PF-associated (but variable), and non-variable CpGs in the two tissues. Only CpGs which showed inter-individual variation in at least one tissue were included (N = 98,124). Examining each tissue separately, each of these 98,124 CpGs can either be PF-associated, non-PF-associated, or not variable in each tissue. The number of CpGs in each of these three sets in each tissue is shown in the bottom left bar chart. Collectively, the 98,124 CpGs can be grouped into eight categories. The bottom right panel identifies each of these eight categories, with the solid black dots representing the sets being considered. The top bar chart shows the number of CpGs in each of these eight categories

References

    1. Smith AK, Kilaru V, Kocak M, Almli LM, Mercer KB, Ressler KJ, Tylavsky FA, Conneely KN. Methylation quantitative trait loci (meQTLs) are consistently detected across ancestry, developmental stage, and tissue type. BMC Genomics. 2014;15:145. doi: 10.1186/1471-2164-15-145.
    1. Gaunt TR, Shihab HA, Hemani G, Min JL, Woodward G, Lyttleton O, Zheng J, Duggirala A, McArdle WL, Ho K, et al. Systematic identification of genetic influences on methylation across the human life course. Genome Biol. 2016;17:61. doi: 10.1186/s13059-016-0926-z.
    1. Green BB, Marsit CJ. Select prenatal environmental exposures and subsequent alterations of gene-specific and repetitive element DNA methylation in fetal tissues. Curr Environmen Health Rep. 2015;2:126–36. doi: 10.1007/s40572-015-0045-0.
    1. Gordon L, Joo JE, Powell JE, Ollikainen M, Novakovic B, Li X, Andronikos R, Cruickshank MN, Conneely KN, Smith AK, et al. Neonatal DNA methylation profile in human twins is specified by a complex interplay between intrauterine environmental and genetic factors, subject to tissue-specific influence. Genome Res. 2012;22:1395–406. doi: 10.1101/gr.136598.111.
    1. Roadmap Epigenomics Consortium. Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–30. doi: 10.1038/nature14248.
    1. Armstrong DA, Lesseur C, Conradt E, Lester BM, Marsit CJ. Global and gene-specific DNA methylation across multiple tissues in early infancy: implications for children’s health research. FASEB J. 2014;28:2088–97. doi: 10.1096/fj.13-238402.
    1. Lesseur C, Armstrong DA, Paquette AG, Koestler DC, Padbury JF, Marsit CJ. Tissue-specific Leptin promoter DNA methylation is associated with maternal and infant perinatal factors. Mol Cell Endocrinol. 2013;381:160–7. doi: 10.1016/j.mce.2013.07.024.
    1. Novakovic B, Ryan J, Pereira N, Boughton B, Craig JM, Saffery R. Postnatal stability, tissue, and time specific effects of AHRR methylation change in response to maternal smoking in pregnancy. Epigenetics. 2014;9:377–86. doi: 10.4161/epi.27248.
    1. Nomura Y, Lambertini L, Rialdi A, Lee M, Mystal EY, Grabie M, Manaster I, Huynh N, Finik J, Davey M, et al. Global methylation in the placenta and umbilical cord blood from pregnancies with maternal gestational diabetes, preeclampsia, and obesity. Reprod Sci. 2014;21:131–7. doi: 10.1177/1933719113492206.
    1. Ruchat SM, Houde AA, Voisin G, St-Pierre J, Perron P, Baillargeon JP, Gaudet D, Hivert MF, Brisson D, Bouchard L. Gestational diabetes mellitus epigenetically affects genes predominantly involved in metabolic diseases. Epigenetics. 2013;8:935–43. doi: 10.4161/epi.25578.
    1. Soh S-E, Tint MT, Gluckman PD, Godfrey KM, Rifkin-Graboi A, Chan YH, Stünkel W, Holbrook JD, Kwek K, Chong Y-S, et al. Cohort profile: Growing Up in Singapore Towards healthy Outcomes (GUSTO) birth cohort study. Int J Epidemiol. 2014;43:1401–9. doi: 10.1093/ije/dyt125.
    1. Pan H, Chen L, Dogra S, Teh AL, Tan JH, Lim YI, Lim YC, Jin S, Lee YK, Ng PY, et al. Measuring the methylome in clinical samples: improved processing of the Infinium Human Methylation450 BeadChip Array. Epigenetics. 2012;7:1173–87. doi: 10.4161/epi.22102.
    1. Johnson WE, Rabinovic A, Li C. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27. doi: 10.1093/biostatistics/kxj037.
    1. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–3. doi: 10.1093/bioinformatics/bts034.
    1. Leek JT, Scharpf RB, Bravo HC, Simcha D, Langmead B, Johnson WE, Geman D, Baggerly K, Irizarry RA. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet. 2010;11:733–9. doi: 10.1038/nrg2825.
    1. Chen YA, Lemire M, Choufani S, Butcher DT, Grafodatskaya D, Zanke BW, Gallinger S, Hudson TJ, Weksberg R. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203–9. doi: 10.4161/epi.23470.
    1. Price ME, Cotton AM, Lam LL, Farre P, Emberly E, Brown CJ, Robinson WP, Kobor MS. Additional annotation enhances potential for biologically-relevant analysis of the Illumina Infinium HumanMethylation450 BeadChip array. Epigenetics Chromatin. 2013;6:4. doi: 10.1186/1756-8935-6-4.
    1. Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, Wiencke JK, Kelsey KT. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinf. 2012;13:86. doi: 10.1186/1471-2105-13-86.
    1. Gutierrez-Arcelus M, Lappalainen T, Montgomery SB, Buil A, Ongen H, Yurovsky A, Bryois J, Giger T, Romano L, Planchon A, et al. Passive and active DNA methylation and the interplay with genetic variation in gene regulation. Elife. 2013;2:e00523.
    1. Bakulski KM, Feinberg JI, Andrews SV, Yang J, Brown S, LM S, Witter F, Walston J, Feinberg AP, Fallin MD. DNA methylation of cord blood cell types: applications for mixed cell birth studies. Epigenetics. 2016;11:354–62. doi: 10.1080/15592294.2016.1161875.
    1. Siggens L, Ekwall K. Epigenetics, chromatin and genome organization: recent advances from the ENCODE project. J Intern Med. 2014;276:201–14. doi: 10.1111/joim.12231.
    1. Lizio M, Harshbarger J, Shimoji H, Severin J, Kasukawa T, Sahin S, Abugessaisa I, Fukuda S, Hori F, Ishikawa-Kato S, et al. Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol. 2015;16:22. doi: 10.1186/s13059-014-0560-6.
    1. Meng H, Joyce AR, Adkins DE, Basu P, Jia Y, Li G, Sengupta TK, Zedler BK, Murrelle EL, van den Oord EJ. A statistical method for excluding non-variable CpG sites in high-throughput DNA methylation profiling. BMC Bioinf. 2010;11:227. doi: 10.1186/1471-2105-11-227.
    1. Chen J, Just AC, Schwartz J, Hou L, Jafari N, Sun Z, Kocher JP, Baccarelli A, Lin X. CpGFilter: model-based CpG probe filtering with replicates for epigenome-wide association studies. Bioinformatics. 2016;32:469–71. doi: 10.1093/bioinformatics/btv577.
    1. Rakyan VK, Down TA, Balding DJ, Beck S. Epigenome-wide association studies for common human diseases. Nat Rev Genet. 2011;12:529–41. doi: 10.1038/nrg3000.
    1. Chadwick LH, Sawa A, Yang IV, Baccarelli A, Breakefield XO, Deng H-W, Dolinoy DC, Fallin MD, Holland NT, Houseman EA, et al. New insights and updated guidelines for epigenome-wide association studies. Neuroepigenetics. 2015;1:14–9. doi: 10.1016/j.nepig.2014.10.004.
    1. Hayward CJ, Fradette J, Galbraith T, Remy M, Guignard R, Gauvin R, Germain L, Auger FA. Harvesting the potential of the human umbilical cord: isolation and characterisation of four cell types for tissue engineering applications. Cells Tissues Organs. 2013;197:37–54. doi: 10.1159/000341254.
    1. Lowe R, Gemma C, Beyan H, Hawa MI, Bazeos A, Leslie RD, Montpetit A, Rakyan VK, Ramagopalan SV. Buccals are likely to be a more informative surrogate tissue than blood for epigenome-wide association studies. Epigenetics. 2013;8:445–54. doi: 10.4161/epi.24362.

Source: PubMed

3
Abonnieren