Molecular Evolution of Human Coronavirus Genomes
Diego Forni, Rachele Cagliani, Mario Clerici, Manuela Sironi, Diego Forni, Rachele Cagliani, Mario Clerici, Manuela Sironi
Abstract
Human coronaviruses (HCoVs), including SARS-CoV and MERS-CoV, are zoonotic pathogens that originated in wild animals. HCoVs have large genomes that encode a fixed array of structural and nonstructural components, as well as a variety of accessory proteins that differ in number and sequence even among closely related CoVs. Thus, in addition to recombination and mutation, HCoV genomes evolve through gene gains and losses. In this review we summarize recent findings on the molecular evolution of HCoV genomes, with special attention to recombination and adaptive events that generated new viral species and contributed to host shifts and to HCoV emergence. VIDEO ABSTRACT.
Keywords: gene gain/loss; host shift; human coronavirus; molecular evolution; positive selection; recombination.
Copyright © 2016 Elsevier Ltd. All rights reserved.
Figures
References
- Graham R.L. A decade after SARS: strategies for controlling emerging coronaviruses. Nat. Rev. Microbiol. 2013;11:836–848.
- Lau S.K. Discovery of a novel coronavirus, China Rattus coronavirus HKU24, from Norway rats supports the murine origin of Betacoronavirus 1 and has implications for the ancestor of Betacoronavirus lineage A. J. Virol. 2015;89:3076–3092.
- Chan J.F. Interspecies transmission and emergence of novel viruses: lessons from bats and birds. Trends Microbiol. 2013;21:544–555.
- Drexler J.F. Ecology, evolution and classification of bat coronaviruses in the aftermath of SARS. Antiviral Res. 2014;101:45–56.
- Su S. Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends Microbiol. 2016;24:490–502.
- Eckerle L.D. High fidelity of murine hepatitis virus replication is decreased in nsp14 exoribonuclease mutants. J. Virol. 2007;81:12135–12144.
- Eckerle L.D. Infidelity of SARS-CoV Nsp14-exonuclease mutant virus replication is revealed by complete genome sequencing. PLoS Pathog. 2010;6:e1000896.
- Vega V.B. Mutational dynamics of the SARS coronavirus in cell culture and human populations isolated in 2003. BMC Infect. Dis. 2004;4:32.
- Lauber C. The footprint of genome architecture in the largest genome expansion in RNA viruses. PLoS Pathog. 2013;9:e1003500.
- Subissi L. One severe acute respiratory syndrome coronavirus protein complex integrates processive RNA polymerase and exonuclease activities. Proc. Natl. Acad. Sci. U. S. A. 2014;111:E3900–E3909.
- Menachery V.D. SARS-like WIV1-CoV poised for human emergence. Proc. Natl. Acad. Sci. U. S. A. . 2016;113:3048–3053.
- Ge X.Y. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature. 2013;503:535–538.
- Yang X.L. Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. J. Virol. 2015;90:3253–3256.
- Lau S.K. Severe acute respiratory syndrome (SARS) coronavirus ORF8 protein is acquired from sars-related coronavirus from greater horseshoe bats through recombination. J. Virol. 2015;89:10532–10547.
- Wu Z. ORF8-related genetic evidence for Chinese horseshoe bats as the source of human severe acute respiratory syndrome coronavirus. J. Infect. Dis. 2016;213:579–583.
- Chinese SARS Molecular Epidemiology Consortium Molecular evolution of the SARS coronavirus during the course of the SARS epidemic in China. Science. 2004;303:1666–1669.
- Scheffler K. Robust inference of positive selection from recombining coding sequences. Bioinformatics. 2006;22:2493–2499.
- Wertheim J.O. RELAX: detecting relaxed selection in a phylogenetic framework. Mol. Biol. Evol. 2015;32:820–832.
- Corman V.M. Evidence for an ancestral association of human coronavirus 229E with bats. J. Virol. 2015;89:11858–11870.
- Crossley B.M. Identification and characterization of a novel alpaca respiratory coronavirus most closely related to the human coronavirus 229E. Viruses. 2012;4:3689–3700.
- Crossley B.M. Identification of a novel coronavirus possibly associated with acute respiratory syndrome in alpacas (Vicugna pacos) in California, 2007. J. Vet. Diagn. Invest. 2010;22:94–97.
- Sabir J.S. Co-circulation of three camel coronavirus species and recombination of MERS-CoVs in Saudi Arabia. Science. 2016;351:81–84.
- Zhao L. Antagonism of the interferon-induced OAS-RNase L pathway by murine coronavirus ns2 protein is required for virus replication and liver pathology. Cell. Host Microbe. 2012;11:607–616.
- Zhang R. Homologous 2′,5′-phosphodiesterases from disparate RNA viruses antagonize antiviral innate immunity. Proc. Natl. Acad. Sci. U. S. A. 2013;110:13114–13119.
- Gusho E. Murine AKAP7 has a 2′,5′-phosphodiesterase domain that can complement an inactive murine coronavirus ns2 gene. mBio. 2014;5:e01312–e1314.
- Thornbrough J.M. Middle East respiratory syndrome coronavirus NS4b protein inhibits host RNase L activation. mBio. 2016 doi: 10.1128/mBio.00258-16. Published online March 29, 2016.
- Chen L., Li F. Structural analysis of the evolutionary origins of influenza virus hemagglutinin and other viral lectins. J. Virol. 2013;87:4118–4120.
- Peng G. Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor. Proc. Natl. Acad. Sci. U. S. A. 2011;108:10696–10701.
- Huang X. Human coronavirus HKU1 spike protein uses o-acetylated sialic acid as an attachment receptor determinant and employs hemagglutinin-esterase protein as a receptor-destroying enzyme. J. Virol. 2015;89:7202–7213.
- Desforges M. The acetyl-esterase activity of the hemagglutinin-esterase protein of human coronavirus OC43 strongly enhances the production of infectious virus. J. Virol. 2013;87:3097–3107.
- Dijkman R. Human coronavirus 229E encodes a single ORF4 protein between the spike and the envelope genes. Virol. J. 2006;3:106.
- Farsani S.M. The first complete genome sequences of clinical isolates of human coronavirus 229E. Virus Genes. 2012;45:433–439.
- Zhang R. The ORF4a protein of human coronavirus 229E functions as a viroporin that regulates viral production. Biochim. Biophys. Acta. 2014;1838:1088–1095.
- Zhang R. The ns12.9 accessory protein of human coronavirus OC43 is a viroporin involved in virion morphogenesis and pathogenesis. J. Virol. 2015;89:11383–11395.
- Lu W. Severe acute respiratory syndrome-associated coronavirus 3a protein forms an ion channel and modulates virus release. Proc. Natl. Acad. Sci. U. S. A. 2006;103:12540–12545.
- Koetzner C.A. Accessory protein 5a is a major antagonist of the antiviral action of interferon against murine coronavirus. J. Virol. 2010;84:8262–8274.
- Zhao G.P. SARS molecular epidemiology: a Chinese fairy tale of controlling an emerging zoonotic disease in the genomics era. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 2007;362:1063–1081.
- Graham R.L., Baric R.S. Recombination, reservoirs, and the modular spike: mechanisms of coronavirus cross-species transmission. J. Virol. 2010;84:3134–3146.
- Lu G. Bat-to-human: spike features determining ‘host jump’ of coronaviruses SARS-CoV, MERS-CoV, and beyond. Trends Microbiol. 2015;23:468–478.
- Corman V.M. Rooting the phylogenetic tree of middle East respiratory syndrome coronavirus by characterization of a conspecific virus from an African bat. J. Virol. 2014;88:11297–11303.
- Kim J.I. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination. Sci. Rep. 2016;6:18825.
- Forni D. The heptad repeat region is a major selection target in MERS-CoV and related coronaviruses. Sci. Rep. 2015;5:14480.
- Cotten M. Spread, circulation, and evolution of the Middle East respiratory syndrome coronavirus. mBio. 2014 doi: 10.1128/mBio.01062-13. Published online February 18, 2014.
- Yamada Y. Acquisition of cell-cell fusion activity by amino acid substitutions in spike protein determines the infectivity of a coronavirus in cultured cells. PLoS One. 2009;4:e6130.
- Navas-Martin S. Murine coronavirus evolution in vivo: functional compensation of a detrimental amino acid substitution in the receptor binding domain of the spike glycoprotein. J. Virol. 2005;79:7629–7640.
- McRoy W.C., Baric R.S. Amino acid substitutions in the S2 subunit of mouse hepatitis virus variant V51 encode determinants of host range expansion. J. Virol. 2008;82:1414–1424.
- Kim Y. Spread of mutant Middle East respiratory syndrome coronavirus with reduced affinity to human CD26 during the South Korean outbreak. mBio. 2016 doi: 10.1128/mBio.00019-16. Published online March 1, 2016.
- Forni D. Extensive positive selection drives the evolution of nonstructural proteins in lineage C betacoronaviruses. J. Virol. 2016;90:3627–3639.
- Baez-Santos Y.M. The SARS-coronavirus papain-like protease: structure, function and inhibition by designed antiviral compounds. Antiviral Res. 2015;115:21–38.
- Rasschaert D. Porcine respiratory coronavirus differs from transmissible gastroenteritis virus by a few genomic deletions. J. Gen. Virol. 1990;71:2599–2607.
- Sanchez C.M. Targeted recombination demonstrates that the spike gene of transmissible gastroenteritis coronavirus is a determinant of its enteric tropism and virulence. J. Virol. 1999;73:7607–7618.
- Pyrc K. Mosaic structure of human coronavirus NL63, one thousand years of evolution. J. Mol. Biol. 2006;364:964–973.
- Wu K. Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor. Proc. Natl. Acad. Sci. U. S. A. 2009;106:19970–19974.
- Reguera J. Structural bases of coronavirus attachment to host aminopeptidase N and its inhibition by neutralizing antibodies. PLoS Pathog. 2012;8:e1002859.
- Chen L. Structural basis for multifunctional roles of mammalian aminopeptidase N. Proc. Natl. Acad. Sci. U. S. A. 2012;109:17966–17971.
- Milewska A. Human coronavirus NL63 utilizes heparan sulfate proteoglycans for attachment to target cells. J. Virol. 2014;88:13221–13230.
- de Haan C.A. Murine coronavirus with an extended host range uses heparan sulfate as an entry receptor. J. Virol. 2005;79:14451–14456.
- Lau S.K. Molecular epidemiology of human coronavirus OC43 reveals evolution of different genotypes over time and recent emergence of a novel genotype due to natural recombination. J. Virol. 2011;85:11325–11337.
- Zhang Y. Genotype shift in human coronavirus OC43 and emergence of a novel genotype by natural recombination. J. Infect. 2015;70:641–650.
- Al-Khannaq M.N. Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper respiratory tract infections in Kuala Lumpur, Malaysia. Virol. J. 2016;13 33-016-0488-4.
- Ren L. Genetic drift of human coronavirus OC43 spike gene during adaptive evolution. Sci. Rep. 2015;5:11451.
- Qian Z. Identification of the receptor-binding domain of the spike glycoprotein of human betacoronavirus HKU1. J. Virol. 2015;89:8816–8827.
- Kirchdoerfer R.N. Pre-fusion structure of a human coronavirus spike protein. Nature. 2016;531:118–121.
- Dominguez S.R. Isolation, propagation, genome analysis and epidemiology of HKU1 betacoronaviruses. J. Gen. Virol. 2014;95:836–848.
- Pickett B.E. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012;40:D593–D598.
- Anisimova M. Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics. 2003;164:1229–1236.
- Guindon S. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 2010;59:307–321.
- Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313.
- Ronquist F., Huelsenbeck J.P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–1574.
- Kosakovsky Pond S.L. Automated phylogenetic detection of recombination using a genetic algorithm. Mol. Biol. Evol. 2006;23:1891–1901.
- Maydt J., Lengauer T. Recco: recombination analysis using cost optimization. Bioinformatics. 2006;22:1064–1071.
- Martin D., Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000;16:562–563.
- Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007;24:1586–1591.
- Zhang J. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol. Biol. Evol. 2005;22:2472–2479.
- Murrell B. Gene-wide identification of episodic selection. Mol. Biol. Evol. 2015;32:1365–1371.
- Kosakovsky Pond S.L. A random effects branch-site model for detecting episodic diversifying selection. Mol. Biol. Evol. 2011;28:3033–3043.
- McDonald J.H., Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351:652–654.
- Gharib W.H., Robinson-Rechavi M. The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC. Mol. Biol. Evol. 2013;30:1675–1686.
- Xia X. An index of substitution saturation and its application. Mol. Phylogenet. Evol. 2003;26:1–7.
- Sealfon R.S. FRESCo: finding regions of excess synonymous constraint in diverse viruses. Genome Biol. 2015;16 38-015-0603-7.
- de Groot R.J. Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group. J. Virol. 2013;87:7790–7792.
- Wertheim J.O. A case for the ancient origin of coronaviruses. J. Virol. 2013;87:7039–7045.
- Woo P.C. Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus. J. Virol. 2012;86:3995–4008.
- Blair J.E., Hedges S.B. Molecular phylogeny and divergence times of deuterostome animals. Mol. Biol. Evol. 2005;22:2275–2284.
- Huynh J. Evidence supporting a zoonotic origin of human coronavirus strain NL63. J. Virol. 2012;86:12816–12825.
- Pfefferle S. Distant relatives of severe acute respiratory syndrome coronavirus and close relatives of human coronavirus 229E in bats, Ghana. Emerg. Infect. Dis. 2009;15:1377–1384.
- Vijgen L. Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event. J. Virol. 2005;79:1595–1604.
- Zhang Z. Evolutionary dynamics of MERS-CoV: potential recombination, positive selection and transmission. Sci. Rep. 2016;6:25049.
- Gralinski L.E., Baric R.S. Molecular pathology of emerging coronavirus infections. J. Pathol. 2015;235:185–195.
- Krogh A. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 2001;305:567–580.
Source: PubMed