Trends in Next-Generation Sequencing and a New Era for Whole Genome Sequencing

Sang Tae Park, Jayoung Kim, Sang Tae Park, Jayoung Kim

Abstract

This article is a mini-review that provides a general overview for next-generation sequencing (NGS) and introduces one of the most popular NGS applications, whole genome sequencing (WGS), developed from the expansion of human genomics. NGS technology has brought massively high throughput sequencing data to bear on research questions, enabling a new era of genomic research. Development of bioinformatic software for NGS has provided more opportunities for researchers to use various applications in genomic fields. De novo genome assembly and large scale DNA resequencing to understand genomic variations are popular genomic research tools for processing a tremendous amount of data at low cost. Studies on transcriptomes are now available, from previous-hybridization based microarray methods. Epigenetic studies are also available with NGS applications such as whole genome methylation sequencing and chromatin immunoprecipitation followed by sequencing. Human genetics has faced a new paradigm of research and medical genomics by sequencing technologies since the Human Genome Project. The trend of NGS technologies in human genomics has brought a new era of WGS by enabling the building of human genomes databases and providing appropriate human reference genomes, which is a necessary component of personalized medicine and precision medicine.

Keywords: Epigenomics; Genomics; High-Throughput Nucleotide Sequencing; Human Genome Project; Sequence Analysis, RNA.

Conflict of interest statement

No potential conflict of interest relevant to this article was reported.

Figures

Fig. 1.
Fig. 1.
Comparison of Sanger sequencing and next-generation sequencing (NGS). (A) Shotgun Sanger sequencing workflow. (B) Shotgun based NGS workflow with cyclic-array method. This figure shows the basic process of the 2 technologies and also shows 3 major improvements in NGS from Sanger sequencing described in that review. Adapted from Shendure J, et al. Nat Biotechnol 2008;26:1135-45 [4].
Fig. 2.
Fig. 2.
Graph of “Cost per Genome.” This graph illustrates the nature of the reductions in sequencing costs and also hypothetical data reflecting Moore’s Law. Adapted from National Human Genome Research Institute (NHGRI) [9].
Fig. 3.
Fig. 3.
(A) Geographic map of 5 sequenced genomes. MT type represents the mitochondrial haplogroup. Illumina GA, ABI 3730, and GS FLX represent the sequencing platform used for these sequencing projects. (B) Diagram of single nucleotide polymorphism overlapping results between 5 genomes. Adapted from Kim JI, et al. Nature 2009;460:1011-5 [28].
Fig. 4.
Fig. 4.
Whole genome sequencing application workflow based on HiSeq X system. This figure is provided from a genomic sequencing service provider with HiSeq X system. Actual workflow will vary by institutions and companies. BCL, basecall file; BAM, the binary version of a SAM (sequence alignment/map); mark-dup, marks duplicate reads; SNP, single nucleotide polymorphism; INDEL, insertion or deletion of bases; SV, structural variation; VCF, variant call format; CNV, copy number variation.

References

    1. Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977;74:5463–7.
    1. Maxam AM, Gilbert W. A new method for sequencing DNA. Proc Natl Acad Sci U S A. 1977;74:560–4.
    1. van Dijk EL, Auger H, Jaszczyszyn Y, Thermes C. Ten years of next-generation sequencing technology. Trends Genet. 2014;30:418–26.
    1. Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008;26:1135–45.
    1. Miller JR, Koren S, Sutton G. Assembly algorithms for next-generation sequencing data. Genomics. 2010;95:315–27.
    1. GS FLX+ System [Internet] Branford (CT): Roche Diagnostics Co; c1996-2016. [cited 2016 Oct 1]. Available from:
    1. Ion Proton System Specifications [Internet] Waltham (MA): Thermo Fisher Scientific Inc; c2016. [cited 2016 Oct 1]. Available from: .
    1. San Diego (CA): Illumina Inc; c2016. HiSeq X System Specifications [Internet] [cited 2016 Oct 1]. Available from: .
    1. DNA Sequencing Costs: Data [Internet] Bethesda (MD): National Human Genome Research Institute; 2016. [cited 2016 Oct 1]. Available from:
    1. Menlo Park (CA): Pacific Biosciences; c2015-2016. The original long-read sequencer [Internet] [cited 2016 Oct 1]. Available from:
    1. Pendleton M, Sebra R, Pang AW, Ummat A, Franzen O, Rausch T, et al. Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nat Methods. 2015;12:780–6.
    1. Shi L, Guo Y, Dong C, Huddleston J, Yang H, Han X, et al. Long-read sequencing and de novo assembly of a Chinese genome. Nat Commun. 2016;7:12065.
    1. Baker M. De novo genome assembly: what every biologist should know. Nat Methods. 2012;9:333–7.
    1. Ng PC, Kirkness EF. Whole genome sequencing. Methods Mol Biol. 2010;628:215–26.
    1. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10:57–63.
    1. Niu W, Lu ZJ, Zhong M, Sarov M, Murray JI, Brdlik CM, et al. Diverse transcription factor binding features revealed by genomewide ChIP-seq in C. elegans. Genome Res. 2011;21:245–54.
    1. Galagan JE, Minch K, Peterson M, Lyubetskaya A, Azizi E, Sweet L, et al. The Mycobacterium tuberculosis regulatory network and hypoxia. Nature. 2013;499:178–83.
    1. Settings M. Metagenomics versus Moore’s law. Nat Methods. 2009;6:623.
    1. Schmutz J, Wheeler J, Grimwood J, Dickson M, Yang J, Caoile C, et al. Quality assessment of the human genome sequence. Nature. 2004;429:365–8.
    1. GenBank Overview [Internet] Bethesda (MD): National Center for Biotechnology Information, U.S. National Library of Medicine; 2016. [updated 2016 Mar 11; cited 2016 Oct 1]. Available from:
    1. e!Ensembl [Internet] Cambridgeshire (UK): Ensembl, EMBL-EBI; 2016. [cited 2016 Oct 1]. Available from: .
    1. Genome Browser Gateway [Internet] Cambridgeshire (UK): Ensembl, EMBL-EBI; 2016. [cited 2016 Oct 1]. Available from: .
    1. Human Genome Overview [Internet] Bethesda (MD): National Center for Biotechnology Information, U.S. National Library of Medicine; 2016. [cited 2016 Oct 1]. Available from:
    1. Levy S, Sutton G, Ng PC, Feuk L, Halpern AL, Walenz BP, et al. The diploid genome sequence of an individual human. PLoS Biol. 2007;5:e254.
    1. Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, McGuire A, et al. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–6.
    1. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456:53–9.
    1. Wang J, Wang W, Li R, Li Y, Tian G, Goodman L, et al. The diploid genome sequence of an Asian individual. Nature. 2008;456:60–5.
    1. Kim JI, Ju YS, Park H, Kim S, Lee S, Yi JH, et al. A highly annotated whole-genome sequence of a Korean individual. Nature. 2009;460:1011–5.
    1. Chambers JC, Abbott J, Zhang W, Turro E, Scott WR, Tan ST, et al. The South Asian genome. PLoS One. 2014;9:e102645.
    1. Luikart G, England PR, Tallmon D, Jordan S, Taberlet P. The power and promise of population genomics: from genotyping to genome typing. Nat Rev Genet. 2003;4:981–94.
    1. Black WC, 4th, Baer CF, Antolin MF, DuTeau NM. Population genomics: genome-wide sampling of insect populations. Annu Rev Entomol. 2001;46:441–69.
    1. Cirulli ET, Goldstein DB. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet. 2010;11:415–25.
    1. 1000 Genomes Project Consortium. Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526:68–74.
    1. UK10K Consortium. Walter K, Min JL, Huang J, Crooks L, Memari Y, et al. The UK10K project identifies rare variants in health and disease. Nature. 2015;526:82–90.
    1. 1000 Genomes Project Consortium. Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–73.
    1. Trans-Omics for Precision Medicine (TOPMed) Program [Internet] Bethesda (MD): NHLBI Health Information Center; [updated 2015 Oct; cited 2016 Oct 1]. Available from: .
    1. Boeke JD, Church G, Hessel A, Kelley NJ, Arkin A, Cai Y, et al. GENOME ENGINEERING. The Genome Project-Write. Science. 2016;353:126–7.
    1. Akst J. “Human Genome Project-Write” Unveiled [Internet] The Scientist; 2016. Jun 2, [cited 2016 Oct 1]. Available from:
    1. The 100,000 Genomes Project [Internet] London (UK): Genomics England; [cited 2016 Oct 1]. Available from:
    1. GenomeAisa 100K; GenomeAisa 100K [Internet] [cited 2016 Oct 1]. Available from:
    1. White House announces efforts to accelerate precision medicine initiative [Internet] New York (NY): GenomeWeb; c2016. [cited 2016 Oct 1]. Available from: .
    1. Case study: Assembling high-quality human genomes–Beyond the $1,000 genome. PacBio literature [Internet] Menlo Park (CA): Pacific Biosciences; Pacific Biosciences of California Inc; c2015-2016. [cited 2016 Oct 1]. Available from:

Source: PubMed

3
Se inscrever