Bacterial colonization reprograms the neonatal gut metabolome

Kyle Bittinger, Chunyu Zhao, Yun Li, Eileen Ford, Elliot S Friedman, Josephine Ni, Chiraag V Kulkarni, Jingwei Cai, Yuan Tian, Qing Liu, Andrew D Patterson, Debolina Sarkar, Siu H J Chan, Costas Maranas, Anumita Saha-Shah, Peder Lund, Benjamin A Garcia, Lisa M Mattei, Jeffrey S Gerber, Michal A Elovitz, Andrea Kelly, Patricia DeRusso, Dorothy Kim, Casey E Hofstaedter, Mark Goulian, Hongzhe Li, Frederic D Bushman, Babette S Zemel, Gary D Wu

Abstract

Initial microbial colonization and later succession in the gut of human infants are linked to health and disease later in life. The timing of the appearance of the first gut microbiome, and the consequences for the early life metabolome, are just starting to be defined. Here, we evaluated the gut microbiome, proteome and metabolome in 88 African-American newborns using faecal samples collected in the first few days of life. Gut bacteria became detectable using molecular methods by 16 h after birth. Detailed analysis of the three most common species, Escherichia coli, Enterococcus faecalis and Bacteroides vulgatus, did not suggest a genomic signature for neonatal gut colonization. The appearance of bacteria was associated with reduced abundance of approximately 50 human proteins, decreased levels of free amino acids and an increase in products of bacterial fermentation, including acetate and succinate. Using flux balance modelling and in vitro experiments, we provide evidence that fermentation of amino acids provides a mechanism for the initial growth of E. coli, the most common early colonizer, under anaerobic conditions. These results provide a deep characterization of the first microbes in the human gut and show how the biochemical environment is altered by their appearance.

Conflict of interest statement

The authors declare no competing interests.

Figures

Extended Data Fig. 1. Microbiota differences between birth and 1 month.
(a) The number of bacterial species increased in the 1 month samples (P = 8×10⁻¹⁶, two-sided Wilcoxon signed-rank test, n = 88 per group). Boxes indicate the median and interquartile distance, whiskers indicate maximum and minimum data points within 1.5 times the interquartile range, points represent values outside this range. (b) The identity of bacterial species was different in samples at 1 month, as quantified by Jaccard distance (R² = 0.09, P = 0.001, PERMANOVA test with restricted permutations, n1 = 81 samples from birth, n2 = 88 samples from 1 month, 7 birth samples excluded due to no taxonomic assignments). (c) Heatmap of taxa detected in samples collected at 1 month. Taxa were included if the relative abundance was greater than 10% in any sample. (d) Prevalence of bacterial taxa in samples collected at birth and 1 month. Taxa shown were determined to be differentially present or absent by Fisher’s exact test, P < 0.05 after correction for false discovery rate (n = 88 per group, 482 taxa tested, two-sided test).
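The Jaccard distance used in panel (b) compares samples by which taxa are present, ignoring abundance. A minimal sketch follows, assuming each sample is reduced to a set of detected taxa; the function name and the example taxa sets are illustrative and are not taken from the study data. A sample with no taxonomic assignments has no defined distance to another empty sample, which is consistent with such samples being excluded in the caption.

```python
def jaccard_distance(taxa_a, taxa_b):
    """Jaccard distance between two samples given as collections of detected taxa.

    Distance = 1 - |intersection| / |union|. Two empty samples have no defined
    distance, so this sketch raises instead of inventing a value.
    """
    a, b = set(taxa_a), set(taxa_b)
    union = a | b
    if not union:
        raise ValueError("Jaccard distance is undefined when both samples are empty")
    return 1.0 - len(a & b) / len(union)


# Hypothetical presence/absence profiles (illustrative only).
birth_sample = {"Escherichia coli", "Enterococcus faecalis"}
month_sample = {"Escherichia coli", "Bacteroides vulgatus", "Bifidobacterium longum"}

distance = jaccard_distance(birth_sample, month_sample)
print(distance)  # one shared taxon out of four in the union
```

A PERMANOVA test then asks whether distances between groups are larger than distances within groups; that step is not sketched here.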
Extended Data Fig. 2. Abundance of bacterial gene orthologs at birth and 1 month.
(a) The total number of KEGG gene orthologs per sample was higher at 1 month relative to birth (P = 9×10⁻¹⁶, two-sided Wilcoxon signed-rank test, n = 88 per group). (b) Genes increasing in abundance at 1 month relative to birth (top 100 shown, P < 0.001 after correction for false discovery rate, two-sided Wilcoxon signed-rank test, n = 88 per group). Points show the median value, error bars show the interquartile range. (c) The number of glycoside hydrolase gene types per sample (P = 9×10⁻¹⁶) and total abundance of glycoside hydrolase genes (P = 7×10⁻¹³) in each sample increased from birth to 1 month (two-sided Wilcoxon signed-rank test, n = 88 per group). Boxes indicate the median and interquartile distance, whiskers indicate maximum and minimum data points within 1.5 times the interquartile range, points represent values outside this range.
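Several captions report P values "after correction for false discovery rate" without naming the procedure. The sketch below assumes the widely used Benjamini-Hochberg step-up adjustment; that choice, the function name, and the example P values are assumptions made for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, so each adjusted value is the
    # minimum of p * m / rank over all ranks at or above the current one.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values for five tested features (illustrative only).
raw = [0.001, 0.008, 0.039, 0.041, 0.20]
adjusted = benjamini_hochberg(raw)
significant = [p < 0.05 for p in adjusted]
print(adjusted, significant)
```

In this toy input, the third and fourth features have raw P values below 0.05 but do not survive the adjustment, which is the situation described for some taxa in the delivery-mode comparison.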
Extended Data Fig. 3. Correlation of microbiota with mode of delivery.
(a) The mode of delivery was not associated with differences in the number of bacterial species per sample at birth or 1 month (two-sided Mann-Whitney test). (b) The mode of delivery had a small effect on the composition of bacteria present at 1 month, as measured by Jaccard distance (R² = 0.02, PERMANOVA test), but no effect at birth. (c) Several taxa differed in prevalence according to mode of delivery at 1 month, but were not statistically significant after correction for multiple comparisons (two-sided Fisher’s exact test). No taxa differed in abundance at either time point (two-sided Mann-Whitney test). (d) KEGG gene orthologs associated with mode of delivery in 1 month samples (two-sided Mann-Whitney test, P < 0.05 after correction for false discovery rate). Points with error bars in (d) indicate the median and interquartile range. Boxes in (a) and (c) indicate the median and interquartile distance, whiskers indicate maximum and minimum data points within 1.5 times the interquartile range, points represent values outside this range. Sample size in all tests was n1 = 64 vaginal birth, n2 = 24 c-section.
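The box-and-whisker convention repeated in these captions (whiskers reach the most extreme data points within 1.5 times the interquartile range, and points beyond that are drawn individually) can be sketched as follows. The quartile convention is not stated in the captions, so the "inclusive" method of Python's `statistics.quantiles` is an assumption here, as are the function name and the example values.

```python
import statistics


def box_summary(values):
    """Median, quartiles, whisker ends, and outlying points for a box plot."""
    data = sorted(values)
    # Quartile convention is an assumption; plotting libraries differ slightly.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    outliers = [v for v in data if v < low_fence or v > high_fence]
    return {
        "median": median,
        "q1": q1,
        "q3": q3,
        "whiskers": (inside[0], inside[-1]),  # most extreme points inside the fences
        "outliers": outliers,
    }


# Hypothetical species counts per sample (illustrative only).
summary = box_summary([1, 2, 3, 4, 5, 6, 7, 8, 9, 30])
print(summary)
```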
Extended Data Fig. 4. Association of breastfeeding with bacterial taxa and gene function.
(a) The number of bacterial species decreased with breastfeeding at 1 month, but not at birth (two-sided Mann-Whitney test). Boxes indicate the median and interquartile distance, whiskers indicate maximum and minimum data points within 1.5 times the interquartile range, points represent values outside this range. (b) Breastfeeding altered the composition of bacterial species present at 1 month but not at birth (PERMANOVA test). (c) The abundance of Bifidobacterium increased with breastfeeding at birth and 1 month (one-sided Mann-Whitney test). (d) Other genera found to differ in abundance with breastfeeding at 1 month (two-sided Mann-Whitney test, corrected for false discovery rate). (e) KEGG gene orthologs differing in abundance with breastfeeding (two-sided Mann-Whitney test, corrected for false discovery rate). Corrected p-values are shown for statistically significant differences. Points with error bars in (e) indicate the median and interquartile range. Sample size at birth was n1 = 19 formula, n2 = 61 breastfed; sample size at 1 month was n1 = 36 formula, n2 = 52 breastfed.
Extended Data Fig. 5. Negative control samples used in metagenomic DNA sequencing.
(a) Bacterial species abundance in negative control samples. (b) Jaccard distance between negative control samples and meconium samples (n1 = 81 meconium samples, n2 = 15 negative control samples, 7 meconium samples excluded due to no taxonomic assignments). (c) Jaccard distance to centroid of negative control samples. The 95% quantile for distance of negative control samples to their own centroid is indicated with a dashed line; 32 meconium samples fell within this distance. (d) Prevalence of species commonly detected in negative controls. For all but E. coli, the species were more prevalent in negative controls than in meconium samples. (e) Stacked bar charts showing prominent taxa in negative controls, birth, and 1 month samples.
Extended Data Fig. 6. Estimation of bacterial-to-human DNA ratio by qPCR.
(a) Absolute quantification of bacterial DNA by 16S qPCR in meconium and negative control samples. (b) Negative correlation of 16S copy number and human DNA percentage in metagenomic sequencing (two-sided test of Spearman correlation, ρ = −0.6, P = 2×10⁻⁹, n = 88). (c) Positive correlation between beta-actin copy number and human DNA percentage (two-sided test of Spearman correlation, ρ = 0.4, P = 3×10⁻⁴, n = 88). (d) Negative correlation between estimated bacterial-to-human DNA ratio and human DNA percentage (two-sided test of Spearman correlation, ρ = −0.8, P = 2×10⁻¹⁶, n = 48, samples were excluded if either measurement was below the limit of detection). The linear regression estimate is indicated with a solid black line and the 95% confidence interval is indicated by the grey area.
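The Spearman coefficient ρ reported in these panels is the Pearson correlation of the ranked values, so it measures monotonic rather than linear association. A minimal standard-library sketch is shown below; the helper names and the example measurements are hypothetical, and the sketch computes ρ only, not the associated P value.

```python
def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    sd_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical 16S copy numbers and human DNA percentages (illustrative only).
copies_16s = [1e3, 5e4, 2e6, 8e7, 3e9]
human_dna_pct = [99.0, 97.5, 80.0, 40.0, 5.0]
rho = spearman_rho(copies_16s, human_dna_pct)
print(rho)
```

Because ranks are used, the several orders of magnitude spanned by qPCR copy numbers do not dominate the coefficient.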
Extended Data Fig. 7. Bacterial-to-human DNA ratio associated with time since birth.
(a) Bacterial 16S copy number per gram feces increased with time since birth (two-sided test of Spearman correlation, ρ = 0.5, P = 6×10⁻⁶, n = 85, 3 samples excluded due to no data on time since birth). (b) Bacterial 16S copy number per μL extracted DNA increases with time since birth (two-sided test of Spearman correlation, ρ = 0.5, P = 7×10⁻⁶, n = 85). (c) The bacterial-to-human DNA ratio is higher in samples collected after 16 hours with low human DNA relative to others (two-sided Mann-Whitney test, P = 4×10⁻¹¹, n1 = 32 samples collected after 16 hours with low human DNA, n2 = 53 others). Samples with a bacterial-to-human DNA ratio above unity are labeled with the subject ID. (d) The bacterial-to-human DNA ratio is higher in samples collected
Extended Data Fig. 8. Acetate concentration in meconium samples.
(a) The acetate concentration was higher in samples obtained after 16 hours with low human DNA than in the other groups, and did not differ between samples collected before 16 hours and samples collected after 16 hours with high human DNA (two-sided Mann-Whitney test, p-values indicated above bars, n1 = 30 collected before 16 hours, n2 = 21 after 16 hours with human DNA > 75%, n3 = 30 after 16 hours with human DNA < 75%). Boxes indicate the median and interquartile distance, whiskers indicate maximum and minimum data points within 1.5 times the interquartile range, points represent values outside this range. (b) Acetate concentration increased with 16S copy number per gram feces (two-sided test of Spearman correlation, ρ = 0.33, P = 0.002, n = 84). The blue line indicates the linear regression estimate, and the grey area indicates the 95% confidence interval. The dashed vertical line indicates the lower limit of detection for 16S qPCR measurements. Samples with high acetate concentration are labeled. (c) Acetate concentration increased with time since birth (two-sided test of Spearman correlation, ρ = 0.27, P = 0.02, n = 81). The dashed vertical line indicates 16 hours after birth.
Extended Data Fig. 9. Products of aerobic and anaerobic amino acid metabolism in E. coli.
Simulated metabolic flux in E. coli under aerobic and anaerobic conditions. The arrow thickness for a reaction is proportional to the flux flowing through it, with red being the maximum and grey the minimum (equivalent to zero flux).
Extended Data Fig. 10. Summary of data presented for meconium samples and negative controls.
Samples are ordered from top to bottom by time of collection. An empty set symbol (∅) indicates samples that were not submitted for proteomic and metabolomic analysis owing to limited specimen availability. The dashed horizontal line indicates 16 hours after birth.
Figure 1. Bacterial and human DNA in meconium samples.
(a) Heatmap of bacterial taxa identified in meconium samples, ordered by time since birth. The gap is positioned at 16 hours. The percentage of human DNA in each fecal sample is indicated at the top. (b) Human DNA percentage as a function of time since birth, showing samples with low levels of human DNA appearing after 16 hours. Grey lines represent the logistic regression estimate on either side of the break point at 16 hours, indicated with a vertical dashed line. (c) Estimation of bacterial-to-human DNA ratio by qPCR.
Figure 2. Assembly of E. coli metagenomes from meconium samples.
(a) Pan-genome of E. coli detected in meconium, plotted alongside E. coli reference genomes. Each genome is represented as a ring; black areas represent genes present, grey areas represent genes absent. The purple region indicates genes used in the phylogenetic analysis. (b) Phylogenetic tree of E. coli assembled from meconium, showing placement in multiple clades. (c) Principal coordinates ordination of gene content from E. coli pan-genome. (d) Genes found to be more abundant in assemblies from meconium samples. All are of unassigned function. Sample size: n1=17 genomes from this study, n2=33 reference genomes.
Figure 3. Bacterial strains in meconium and their retention at 1 month.
(a) The number of bacterial strains in each sample, as determined by analysis of single-copy core genes in metagenomic assembly results. (b) The time of earliest detection and prevalence of bacterial species in meconium samples. (c) Correlation of bacterial strain number with absolute quantity of bacterial DNA by 16S qPCR. (d) Retention of meconium strains at 1 month.
Figure 4. Proteomics of meconium samples.
(a) The relative abundance of bacterial proteins increases with time to collection and bacterial-to-human DNA ratio. (b) Principal components analysis of protein concentrations in meconium samples (n1 = 26 before 16h, n2 = 12 after 16h, human DNA > 75%, n3 = 24 after 16h, human DNA < 75%). Boxes above the ordination show the median and interquartile range of groups along the first principal component; whiskers above the ordination extend to the full range of data points. (c) Proteins differing in abundance between the three groups. Grey arrows indicate samples collected before 16 hours with low levels of host DNA. (d) STRING network of differentially abundant proteins. Circles represent proteins. Colored lines connecting proteins indicate interactions from curated databases (cyan), experimentally verified interactions (magenta), and associations based on text mining (yellow), co-expression (black), and homology (slate blue).
Figure 5. Metabolomics of meconium samples and E. coli amino acid utilization.
(a) Heatmap of metabolites found to differ in abundance between the three groups. Grey arrows at top indicate samples collected before 16 hours with low levels of host DNA. The top part of the chart shows metabolites increased in samples collected after 16 hours with low levels of host DNA, such as succinate and pyruvate. The bottom part of the chart shows metabolites decreasing in abundance, such as serine and threonine. (b) Predicted ATP production and metabolite product flux for substrates identified in meconium samples. (c) Predicted amino acid utilization by E. coli at various concentrations of acetate and succinate. (d and e) Amino acid utilization and acetate/succinate production of E. coli grown under (d) anaerobic and (e) aerobic conditions.

Source: PubMed