Deep Impact of Random Amplification and Library Construction Methods on Viral Metagenomics Results

Béatrice Regnault, Thomas Bigot, Laurence Ma, Philippe Pérot, Sarah Temmam, Marc Eloit, Béatrice Regnault, Thomas Bigot, Laurence Ma, Philippe Pérot, Sarah Temmam, Marc Eloit

Abstract

Clinical metagenomics is a broad-range agnostic detection method of pathogens, including novel microorganisms. A major limit is the low pathogen load compared to the high background of host nucleic acids. To overcome this issue, several solutions exist, such as applying a very high depth of sequencing, or performing a relative enrichment of viral genomes associated with capsids. At the end, the quantity of total nucleic acids is often below the concentrations recommended by the manufacturers of library kits, which necessitates to random amplify nucleic acids. Using a pool of 26 viruses representative of viral diversity, we observed a deep impact of the nature of sample (total nucleic acids versus RNA only), the reverse transcription, the random amplification and library construction method on virus recovery. We further optimized the two most promising methods and assessed their performance with fully characterized reference virus stocks. Good genome coverage and limit of detection lower than 100 or 1000 genome copies per mL of plasma, depending on the genome viral type, were obtained from a three million reads dataset. Our study reveals that optimized random amplification is a technique of choice when insufficient amounts of nucleic acid are available for direct libraries constructions.

Keywords: random amplification; sensitivity; viral metagenomics.

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Experimental design with Virus Multiplex Reference Panel (VMRP). Six protocols including reverse transcription step, random pre-amplification step and subsequent library constructions were compared to a protocol without pre-amplification (NoAmp). A first experiment was carried out directly on a mix of 26 viruses representative of the diversity of viruses (VMRP) and the second experiment on VMRP diluted in a plasma matrix (ratio: 1/10). Both fractions, total nucleic acid fraction (NA) and RNA only, were used.
Figure 2
Figure 2
Experimental design with WHO reference virus stocks (WRVS). The experiment was designed to quantify the viral detection level of two MALBAC-based methods compared to the protocol without pre-amplification (NoAmp). The NA fraction of WRVS (mix of quantified reference virus stocks) was used at a final concentration of 104, 103 or 102 viral gc/mL of plasma matrix.
Figure 3
Figure 3
Proportion of spiked viruses (upper panels (A,C)) and viral genomic groups (lower panels (B,D)) in the VMRP based on the weighted contigs and singletons (WNCS). (A,B) In the NA fraction; (C,D) in the RNA fraction. Comparison of viral genomic groups is shown for target viruses (top) and for all viruses (bottom).
Figure 3
Figure 3
Proportion of spiked viruses (upper panels (A,C)) and viral genomic groups (lower panels (B,D)) in the VMRP based on the weighted contigs and singletons (WNCS). (A,B) In the NA fraction; (C,D) in the RNA fraction. Comparison of viral genomic groups is shown for target viruses (top) and for all viruses (bottom).
Figure 4
Figure 4
Heatmap of whole genome coverage of viruses of VMRP from seven methods in NA and RNA fractions. The genome fraction for each virus is in row and methods for both NA and RNA fractions are in column. Viruses are color-coded according to their genomic group (dsDNA, ssDNA, dsRNA, ss(+) RNA and ss(-) RNA). An asterisk * indicates the non-enveloped viruses. Norovirus GI detected by no method was removed. Ranges of Ct value previously determined [31] were reported per viral cluster.
Figure 5
Figure 5
Combined results of virus detection genome fraction of viruses detected in crude VMRP and in VMRP spiked in plasma sample with seven methods and viral detection in VMRP spiked in plasma sample. (A) in NA fraction; (B) in RNA fraction. Stacked histogram represents the genome fraction (%) in crude VMRP (blue) and in spiked plasma matrix (orange). DOPlify1 and DOPlify2 referred at different number of amplification cycles. Note that MATQ was not assessed in plasma sample.
Figure 5
Figure 5
Combined results of virus detection genome fraction of viruses detected in crude VMRP and in VMRP spiked in plasma sample with seven methods and viral detection in VMRP spiked in plasma sample. (A) in NA fraction; (B) in RNA fraction. Stacked histogram represents the genome fraction (%) in crude VMRP (blue) and in spiked plasma matrix (orange). DOPlify1 and DOPlify2 referred at different number of amplification cycles. Note that MATQ was not assessed in plasma sample.
Figure 6
Figure 6
Effect of denaturation temperature of dsRNA virus prior to reverse-transcription on the treated sample with or without benzonase. Each viral stock (HHV-4, REO L1, RSV, FeLV) was diluted to 5 × 106 genome-copies per mL for the NA isolation.
Figure 7
Figure 7
Genome coverage of reference virus stocks (HHV-4, PCV1, REO1, HRSV, FeLV, plus SMRV) spiked in plasma sample pool at 102, 103 or 104 genome-copies per mL (gc/mL) for NoAmp, MALBAC and MALBAC_V2 methods.

References

    1. Elbehery A.H.A., Feichtmayer J., Singh D., Griebler C., Deng L. The Human Virome Protein Cluster Database (HVPC): A Human Viral Metagenomic Database for Diversity and Function Annotation. Front. Microbiol. 2018;9:1110. doi: 10.3389/fmicb.2018.01110.
    1. Chiu C.Y. Viral pathogen discovery. Curr. Opin. Microbiol. 2013;16:468–478. doi: 10.1016/j.mib.2013.05.001.
    1. Greninger A.L. A decade of RNA virus metagenomics is (not) enough. Virus Res. 2018;244:218–229. doi: 10.1016/j.virusres.2017.10.014.
    1. Cheval J., Sauvage V., Frangeul L., Dacheux L., Guigon G., Dumey N., Pariente K., Rousseaux C., Dorange F., Berthet N., et al. Evaluation of High-Throughput Sequencing for Identifying Known and Unknown Viruses in Biological Samples. J. Clin. Microbiol. 2011;49:3268–3275. doi: 10.1128/JCM.00850-11.
    1. Asplund M., Kjartansdóttir K.R., Mollerup S., Vinner L., Fridholm H., Herrera J.A.R., Friis-Nielsen J., Hansen T.A., Jensen R.H., Nielsen I.B., et al. Contaminating viral sequences in high-throughput sequencing viromics: A linkage study of 700 sequencing libraries. Clin. Microbiol. Infect. 2019;25:1277–1285. doi: 10.1016/j.cmi.2019.04.028.
    1. Bal A., Pichon M., Picard C., Casalegno J.S., Valette M., Schuffenecker I., Billard L., Vallet S., Vilchez G., Cheynet V., et al. Quality control implementation for universal characterization of DNA and RNA viruses in clinical respiratory samples using single metagenomic next-generation sequencing workflow. BMC Infect. Dis. 2018;18:537. doi: 10.1186/s12879-018-3446-5.
    1. Miller S., Naccache S.N., Samayoa E., Messacar K., Arevalo S., Federman S., Stryke D., Pham E., Fung B., Bolosky W.J., et al. Laboratory validation of a clinical metagenomic sequencing assay for pathogen detection in cerebrospinal fluid. Genome Res. 2019;29:831–842. doi: 10.1101/gr.238170.118.
    1. Shi M., Lin X.-D., Tian J.-H., Chen L.-J., Chen X., Li C.-X., Qin X.-C., Li J., Cao J.-P., Eden J.-S., et al. Redefining the invertebrate RNA virosphere. Nature. 2016;540:539–543. doi: 10.1038/nature20167.
    1. Greninger A.L. The challenge of diagnostic metagenomics. Exp. Rev. Mol. Diagn. 2018;18:605–615. doi: 10.1080/14737159.2018.1487292.
    1. Milo R., Phillips R., Orme N. Cell Biology by the Numbers. Garland Science; New York, NY, USA: 2016.
    1. Gu W., Miller S., Chiu C.Y. Clinical Metagenomic Next-Generation Sequencing for Pathogen Detection. Annu. Rev. Pathol. Mech. Dis. 2019;14:319–338. doi: 10.1146/annurev-pathmechdis-012418-012751.
    1. Schlaberg R., Chiu C.Y., Miller S., Procop G.W., Weinstock G., The Professional Practice Committee and Committee on Laboratory Practices of the American Society for Microbiology. The Microbiology Resource Committee of the College of American Pathologists Validation of metagenomic next-generation sequencing tests for universal pathogen detection. Arch. Pathol. Lab. Med. 2017;141:776–786. doi: 10.5858/arpa.2016-0539-RA.
    1. Hall R.J., Wang J., Todd A.K., Bissielo A.B., Yen S., Strydom H., Moore N.E., Ren X., Huang Q.S., Carter P.E., et al. Evaluation of rapid and simple techniques for the enrichment of viruses prior to metagenomic virus discovery. J. Virol. Methods. 2014;195:194–204. doi: 10.1016/j.jviromet.2013.08.035.
    1. Sauvage V., Laperche S., Cheval J., Muth E., Dubois M., Boizeau L., Hébert C., Lionnet F., Lefrère J.-J., Eloit M. Viral metagenomics applied to blood donors and recipients at high risk for blood-borne infections. Blood Transf. 2016;14:400. doi: 10.2450/2016.0160-15.
    1. Froussard P. A random-POR method (rPCR) to construct whole cDNA library from low amounts of RNA. Nucleic Acids Res. 1992;20:2900. doi: 10.1093/nar/20.11.2900.
    1. Djikeng A., Halpin R., Kuzmickas R., DePasse J., Feldblyum J., Sengamalay N., Afonso C., Zhang X., Anderson N.G., Ghedin E., et al. Viral genome sequencing by random priming methods. BMC Genom. 2008;9:5. doi: 10.1186/1471-2164-9-5.
    1. Rosseel T., Van Borm S., Vandenbussche F., Hoffmann B., van den Berg T., Beer M., Höper D. The Origin of Biased Sequence Depth in Sequence-Independent Nucleic Acid Amplification and Optimization for Efficient Massive Parallel Sequencing. PLoS ONE. 2013;8:e76144. doi: 10.1371/journal.pone.0076144.
    1. Monteil-Bouchard S., Temmam S., Desnues C. Protocol for Generating Infectious RNA Viromes from Complex Biological Samples. In: Moya A., Pérez Brocal V., editors. The Human Virome. Volume 1838. Springer; New York, NY, USA: 2018. pp. 25–36.
    1. Allander T., Emerson S.U., Engle R.E., Purcell R.H., Bukh J. A Virus Discovery Method Incorporating DNase Treatment and Its Application to the Identification of Two Bovine Parvovirus Species. Proc. Natl. Acad. Sci. USA. 2001;98:11609–11614. doi: 10.1073/pnas.211424698.
    1. Kallies R., Hölzer M., Brizola Toscan R., Nunes da Rocha U., Anders J., Marz M., Chatzinotas A. Evaluation of Sequencing Library Preparation Protocols for Viral Metagenomic Analysis from Pristine Aquifer Groundwaters. Viruses. 2019;11:484. doi: 10.3390/v11060484.
    1. Nanda S., Jayan G., Voulgaropoulou F., Sierra-Honigmann A.M., Uhlenhaut C., McWatters B.J.P., Patel A., Krause P.R. Universal virus detection by degenerate-oligonucleotide primed polymerase chain reaction of purified viral nucleic acids. J. Virol. Methods. 2008;152:18–24. doi: 10.1016/j.jviromet.2008.06.007.
    1. Huang L., Ma F., Chapman A., Lu S., Xie X.S. Single-Cell Whole-Genome Amplification and Sequencing: Methodology and Applications. Ann. Rev. Genom. Hum. Genet. 2015;16:79–102. doi: 10.1146/annurev-genom-090413-025352.
    1. Goya S., Valinotto L.E., Tittarelli E., Rojo G.L., Nabaes Jodar M.S., Greninger A.L., Zaiat J.J., Marti M.A., Mistchenko A.S., Viegas M. An optimized methodology for whole genome sequencing of RNA respiratory viruses from nasopharyngeal aspirates. PLoS ONE. 2018;13:e0199714. doi: 10.1371/journal.pone.0199714.
    1. Parras-Moltó M., Rodríguez-Galet A., Suárez-Rodríguez P., López-Bueno A. Evaluation of bias induced by viral enrichment and random amplification protocols in metagenomic surveys of saliva DNA viruses. Microbiome. 2018;6 doi: 10.1186/s40168-018-0507-3.
    1. Telenius H., Carter N.P., Bebb C.E., Nordenskjo¨ld M., Ponder B.A.J., Tunnacliffe A. Degenerate oligonucleotide-primed PCR: General amplification of target DNA by a single degenerate primer. Genomics. 1992;13:718–725. doi: 10.1016/0888-7543(92)90147-K.
    1. Sheng K., Cao W., Niu Y., Deng Q., Zong C. Effective detection of variation in single-cell transcriptomes using MATQ-seq. Nat. Methods. 2017;14:267–270. doi: 10.1038/nmeth.4145.
    1. Sheng K., Zong C. Single-Cell RNA-Seq by Multiple Annealing and Tailing-Based Quantitative Single-Cell RNA-Seq (MATQ-Seq) In: Proserpio V., editor. Single Cell Methods. Volume 1979. Springer; New York, NY, USA: 2019. pp. 57–71. Methods in Molecular Biology.
    1. Birnberg L., Temmam S., Aranda C., Correa-Fiz F., Talavera S., Bigot T., Eloit M., Busquets N. Viromics on Honey-Baited FTA Cards as a New Tool for the Detection of Circulating Viruses in Mosquitoes. Viruses. 2020;12:274. doi: 10.3390/v12030274.
    1. Bigot T., Temmam S., Pérot P., Eloit M. RVDB-prot, a reference viral protein database and its HMM profiles. F1000Research. 2019;8:530. doi: 10.12688/f1000research.18776.1.
    1. Ngoi C.N., Siqueira J., Li L., Deng X., Mugo P., Graham S.M., Price M.A., Sanders E.J., Delwart E. The Plasma Virome of Febrile Adult Kenyans Shows Frequent Parvovirus B19 Infections and a Novel Arbovirus (Kadipiro Virus) J. Gen. Virol. 2016;97:3359–3367. doi: 10.1099/jgv.0.000644.
    1. Li L., Deng X., Mee E.T., Collot-Teixeira S., Anderson R., Schepelmann S., Minor P.D., Delwart E. Comparing viral metagenomics methods using a highly multiplexed human viral pathogens reagent. J. Virol. Methods. 2015;213:139–146. doi: 10.1016/j.jviromet.2014.12.002.
    1. Sun R., Grogan E., Shedd D., Bykovsky A., Kushnaryov V., Grossberg S., Miller G. Transmissible Retrovirus in Epstein-Burr Virus-Producer B95-8 Cells. Virology. 1995;209:374–383. doi: 10.1006/viro.1995.1269.
    1. Mee E.T., Preston M.D., Minor P.D., Schepelmann S., Huang X., Nguyen J., Wall D., Hargrove S., Fu T., Xu G., et al. Development of a Candidate Reference Material for Adventitious Virus Detection in Vaccine and Biologicals Manufacturing by Deep Sequencing. Vaccine. 2016;34:2035–2043. doi: 10.1016/j.vaccine.2015.12.020.
    1. Lewandowska D.W., Zagordi O., Geissberger F.-D., Kufner V., Schmutz S., Böni J., Metzner K.J., Trkola A., Huber M. Optimization and Validation of Sample Preparation for Metagenomic Sequencing of Viruses in Clinical Samples. Microbiome. 2017;5 doi: 10.1186/s40168-017-0317-z.
    1. Ellegaard K.M., Klasson L., Andersson S.G.E. Testing the Reproducibility of Multiple Displacement Amplification on Genomes of Clonal Endosymbiont Populations. PLoS ONE. 2013;8:e82319. doi: 10.1371/journal.pone.0082319.
    1. Picher Á.J., Budeus B., Wafzig O., Krüger C., García-Gómez S., Martínez-Jiménez M.I., Díaz-Talavera A., Weber D., Blanco L., Schneider A. TruePrime is a novel method for whole-genome amplification from single cells based on TthPrimPol. Nat. Commun. 2016;7:13296. doi: 10.1038/ncomms13296.
    1. Wilcox A.H., Delwart E., Diaz-Munoz S.L. Next-Generation Sequencing of DsRNA Is Greatly Improved by Treatment with the Inexpensive Denaturing Reagent DMSO. Microb. Genom. 2019;5:11. doi: 10.1099/mgen.0.000315.
    1. Zong C., Lu S., Chapman A.R., Xie X.S. Genome-Wide Detection of Single-Nucleotide and Copy-Number Variations of a Single Human Cell. Science. 2012;338:1622–1626. doi: 10.1126/science.1229164.
    1. Chapman A.R., He Z., Lu S., Yong J., Tan L., Tang F., Xie X.S. Single Cell Transcriptome Amplification with MALBAC. PLoS ONE. 2015;10:e0120889. doi: 10.1371/journal.pone.0120889.
    1. Lasken R.S. Single-cell sequencing in its prime. Nat. Biotechnol. 2013;31:211–212. doi: 10.1038/nbt.2523.

Source: PubMed

3
Subscribe