Comparison between 16S rRNA and shotgun sequencing data for the taxonomic characterization of the gut microbiota

Francesco Durazzi, Claudia Sala, Gastone Castellani, Gerardo Manfreda, Daniel Remondini, Alessandra De Cesare, Francesco Durazzi, Claudia Sala, Gastone Castellani, Gerardo Manfreda, Daniel Remondini, Alessandra De Cesare

Abstract

In this paper we compared taxonomic results obtained by metataxonomics (16S rRNA gene sequencing) and metagenomics (whole shotgun metagenomic sequencing) to investigate their reliability for bacteria profiling, studying the chicken gut as a model system. The experimental conditions included two compartments of gastrointestinal tracts and two sampling times. We compared the relative abundance distributions obtained with the two sequencing strategies and then tested their capability to distinguish the experimental conditions. The results showed that 16S rRNA gene sequencing detects only part of the gut microbiota community revealed by shotgun sequencing. Specifically, when a sufficient number of reads is available, Shotgun sequencing has more power to identify less abundant taxa than 16S sequencing. Finally, we showed that the less abundant genera detected only by shotgun sequencing are biologically meaningful, being able to discriminate between the experimental conditions as much as the more abundant genera detected by both sequencing strategies.

Figures

Figure 1
Figure 1
RSA histograms in logarithmic scale (Preston plots ) of bacterial abundances in one sample selected as anexample (caeca25): (a) genera sampled by shotgun sequencing, (b) genera sampled by 16S rRNA sequencing, (c) phyla sampled by shotgun sequencing and (d) phyla sampled by 16S sequencing.
Figure 2
Figure 2
Box plot of the RSA skewness of bacterial communities at (a) phylum level and (b) genus level. Bacterial communities are sampled with (left) shotgun sequencing and (right) 16S sequencing.
Figure 3
Figure 3
Fold changes between caeca and crop in genera identified by both strategies. Some fold changes are shrunk toward zero by the DESeq2 algorithm (see “Methods” section). Points with a statistically significant change for both strategies are represented in blue, for shotgun only in green, for 16S only in orange and without a significant change in cyan (adjusted P > 0.05 with DESeq2). Point size is the log10 of average number of reads from shotgun strategy mapping to each genus. Pearson’s correlation coefficient r and regression line are computed only on points with statistically significant fold changes according to both strategies (“Both” group in figure legend and in Table 1).
Figure 4
Figure 4
Scatter plot of 16S and shotgun genera abundances of one sample selected as example (caeca25). Histograms display stacked bars, where every column is divided in a part corresponding to the abundance of genera detected by both sequencing strategies (blue) and the other part is relative to genera detected exclusively by only one strategy (red for 16S and green for shotgun). Pearson's correlation coefficient is computed only for the common genera. Logarithmic (log2) scale helps to recognize that less abundant genera identified by shotgun sequencing are almost not detected by 16S sequencing.
Figure 5
Figure 5
Average number of (a) phyla and (b) genera found within caeca and crop samples. The length of the error bars is equal to the standard deviation computed on all the samples.
Figure 6
Figure 6
Intercepts of shotgun vs 16S abundance linear regressions of caeca samples against the total number of reads in each set, representing the number of shotgun sequences corresponding to one 16S sequence. Error bars correspond to the confidence interval for the parameter provided by the fit.
Figure 7
Figure 7
PCoA based on the beta-diversities between samples (Bray–Curtis metric), computed on genera abundances of caeca samples normalized by DESeq2, labelled by sampling time: 14th day (gold), 35th day (cyan). (a, b) with all genera detected respectively by shotgun (a) and 16S (b); with genera found exclusively by shotgun (c) or 16S (d).

References

    1. Maccaferri S, Biagi E, Brigidi P. Metagenomics: key to human gut microbiota. Dig. Dis. 2011;29:525–530. doi: 10.1159/000332966.
    1. Cornejo-Pareja, I., Muñoz-Garach, A., Clemente-Postigo, M. & Tinahones, F. J. Importance of gut microbiota in obesity. Eur. J. Clin. Nutr.72, (2018).
    1. Stanley D, Hughes RJ, Moore RJ. Microbiota of the chicken gastrointestinal tract: influence on health, productivity and disease. Appl. Microbiol. Biotechnol. 2014;98:4301–4310. doi: 10.1007/s00253-014-5646-2.
    1. De Cesare, A. et al. Effect of dietary supplementation with Lactobacillus acidophilus D2/CSL (CECT 4529) on caecum microbioma and productive performance in broiler chickens. PLoS One12, (2017).
    1. ten Hoopen, P. et al. The metagenomic data life-cycle: Standards and best practices. Gigascience6, (2017).
    1. Scholz, M. B., Lo, C.-C. & Chain, P. S. Next generation sequencing and bioinformatic bottlenecks: the current state of metagenomic data analysis. Curr. Opin. Biotechnol.23, (2012).
    1. Chistoserdova L. Functional metagenomics: recent advances and future challenges. Biotechnol. Genet. Eng. Rev. 2009;26:335–352. doi: 10.5661/bger-26-335.
    1. Choi KY, Lee TK, Sul WJ. Metagenomic analysis of chicken gut microbiota for improving metabolism and health of chickens: a review. Asian-Australasian J. Anim. Sci. 2015;28:1217–1225. doi: 10.5713/ajas.15.0026.
    1. Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the miseq illumina sequencing platform. Appl. Environ. Microbiol. 2013;79:5112–5120. doi: 10.1128/AEM.01043-13.
    1. 16S Metagenomic sequencing library preparation. in 1–28 (Illumina MiSeq System, 2013).
    1. Cole, J. R. et al. Ribosomal database project: data and tools for high throughput rRNA analysis. Nucleic Acids Res.42, (2014).
    1. Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. 2017;11:2639–2643. doi: 10.1038/ismej.2017.119.
    1. Colwell, R. K. Biodiversity: concepts, patterns, and measurement. in The Princeton Guide to Ecology 257–263 (2009).
    1. Sala, C. et al. Stochastic neutral modelling of the Gut Microbiota’s relative species abundance from next generation sequencing data. BMC Bioinformatics17, (2016).
    1. Breitwieser, F. P., Lu, J. & Salzberg, S. L. A review of methods and databases for metagenomic classification and assembly. Brief. Bioinform.20, (2017).
    1. Laudadio I, et al. Quantitative assessment of shotgun metagenomics and 16S rDNA amplicon sequencing in the study of human gut microbiome. Omi. A J. Integr. Biol. 2018;22:248–254. doi: 10.1089/omi.2018.0013.
    1. Campanaro, S., Treu, L., Kougias, P. G., Zhu, X. & Angelidaki, I. Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies. Sci. Rep.8, (2018).
    1. Fouhy F, Clooney AG, Stanton C, Claesson MJ, Cotter PD. 16S rRNA gene sequencing of mock microbial populations-impact of DNA extraction method, primer choice and sequencing platform. BMC Microbiol. 2016;16:123. doi: 10.1186/s12866-016-0738-z.
    1. Weinstock GM. Genomic approaches to studying the human microbiota. Nature. 2012;489:250–256. doi: 10.1038/nature11553.
    1. De Cesare, A. et al. Effect of Lactobacillus acidophilus D2/CSL (CECT 4529) Supplementation in Drinking Water on Chicken Crop and Caeca Microbiome. PLoS One 1–18 (2020) 10.1371/journal.pone.0228338.
    1. Preston FW. The commonness, and rarity, of species. Ecology. 1948;29:254–283. doi: 10.2307/1930989.
    1. Hubbell, S. P. The Unified Neutral Theory of Biodiversity and Biogeography. (Princeton University Press, 2001).
    1. Tessler, M. et al. Large-scale differences in microbial biodiversity discovery between 16S amplicon and shotgun sequencing. Sci. Rep.7, (2017).
    1. Shah, N., Tang, H., Doak, T. G. & Ye, Y. Comparing bacterial communities inferred from 16S rRNA gene sequencing and shotgun metagenomics. Pac. Symp. Biocomput. 165–76 (2011) 10.1142/9789814335058_0018.
    1. De Cesare A, et al. Impact of a probiotic-based cleaning product on the microbiological profile of broiler litters and chicken caeca microbiota. Poult. Sci. 2019;98:3602–3610. doi: 10.3382/ps/pez148.
    1. MiSeq Reagent Kits v2. .
    1. Eisenhofer R, et al. Contamination in Low Microbial Biomass Microbiome Studies : Issues and Recommendations. Trends Microbiol. 2018;27:105–117. doi: 10.1016/j.tim.2018.11.003.
    1. Chiu CY, Miller SA. Clinical metagenomics. Nat. Rev. Genet. 2019;20:341–355. doi: 10.1038/s41576-019-0113-7.
    1. Wilke, A. et al. MG-RAST Manual for version 4, revision 3. (2017).
    1. Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res.41, (2013).
    1. O’leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res.44, 733–745 (2015).
    1. Magurran, A. E. Measuring Biological Diversity. (Wiley-Blackwell, 2003).
    1. Mariadassou, M. Phyloseq-extended. .
    1. McMurdie PJ, Holmes S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLoS ONE. 2013;8:e61217. doi: 10.1371/journal.pone.0061217.
    1. Oksanen, J. et al. vegan: Community Ecology Package. (2019).
    1. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8.
    1. Faith DP, Minchin PR, Belbin L. Compositional dissimilarity as a robust measure of ecological distance. 1987;69:57–68.
    1. Hendriksen RS, et al. Global monitoring of antimicrobial resistance based on metagenomics analyses of urban sewage. Nat. Commun. 2019;10:1124. doi: 10.1038/s41467-019-08853-3.
    1. Wickelmaier F. An introduction to MDS. Ind. Commer. Train. 2003;10:11–18.

Source: PubMed

3
Abonneren