Molecular Evolution of Human Coronavirus Genomes

Diego Forni, Rachele Cagliani, Mario Clerici, Manuela Sironi, Diego Forni, Rachele Cagliani, Mario Clerici, Manuela Sironi

Abstract

Human coronaviruses (HCoVs), including SARS-CoV and MERS-CoV, are zoonotic pathogens that originated in wild animals. HCoVs have large genomes that encode a fixed array of structural and nonstructural components, as well as a variety of accessory proteins that differ in number and sequence even among closely related CoVs. Thus, in addition to recombination and mutation, HCoV genomes evolve through gene gains and losses. In this review we summarize recent findings on the molecular evolution of HCoV genomes, with special attention to recombination and adaptive events that generated new viral species and contributed to host shifts and to HCoV emergence. VIDEO ABSTRACT.

Keywords: gene gain/loss; host shift; human coronavirus; molecular evolution; positive selection; recombination.

Copyright © 2016 Elsevier Ltd. All rights reserved.

Figures

Figure 1
Figure 1
Key Figure: Phylogenetic Relationships and Genome Organization of Human and Animal Coronaviruses (CoVs) CoVs that infect nonhuman mammals are included only if they are mentioned in the text for comparative purposes. (A) The phylogenetic tree of complete genome sequences of HCoVs and selected mammalian CoVs was obtained with RAxML 8.2.4 . Numbers indicate bootstrap support. CoVs are colored according to genus and lineage. Information about origin, intermediate host, and clinical presentation is reported for the six HCoVs , , , , , . Data about case fatality rate were derived from the World Health Organization website (http://www.who.int/mediacentre/factsheets/mers-cov/;http://www.who.int/csr/sars/country/table2004_04_21/en/). (B) CoV genome organization is schematically reported together with information on receptor/coreceptor usage. Virus names are colored according to their genus or lineage, as in (A). Only ORFs mentioned in the text are colored or shaded. Empty boxes represent accessory ORFs that are not described in the text.
Figure I
Figure I
Timeline for the Emergence of HCoVs.
Figure 2
Figure 2
Evolution of Human Coronavirus (HCoV) Accessory Proteins.(A) Test for relaxation of selective strength for SARS-CoV and SARSr-BatCoVs ORF8. Branches are colored according to the selection intensity parameter k. RELAX evaluates if selection on the test branches (bold) is relaxed (k < 1) or intensified (k > 1) compared to background branches. In the evolutionary analysis table the number of sequences differs from that in the tree because RELAX removes identical sequences. Evidence of positive selection was searched for using the M7/M8 ‘site models’ from PAML (see Box 2) and with PARRIS. M7 and M8 represent the null and the positive selection models, respectively. A likelihood ratio test (with 2 degrees of freedom) was applied. (B) An amino acid alignment of rodent AKAP7 and four viral phosphodiesterases (PDEs) is shown. Amino acids are colored red if they are identical, orange if they have very similar properties. PDEs belonging to the 2H family are characterized by two H-Φ-[S/T]-Φ motifs (blue boxes), where Φ is a hydrophobic residue. The structure of rat AKAP7 (gray, PDB ID: 2VFK) is superimposed on MERS-CoV NS4b (green, model generated from 2VFK), MHV NS2a (cyan, PDB ID: 4Z5V), and Rotavirus A VP3 (yellow, PDB ID: 5AF2). Catalytic histidines are shown in red. (C) Sequence and membrane topology comparison of HCoV viroporins. Transmembrane regions (TM1-3) predicted by the TMHMM algorithm are boxed in blue. The corresponding topology model for SARS-CoV ORF3A, HCoV-229E ORF4a (from the Inf-1 strain), and HCoV-NL63 ORF3 is shown. The topology model of HCoV-OC43 OFR5 was derived from recent data .
Figure 3
Figure 3
Evolution at the Coronavirus (CoV)–Host Interaction Surface.(A) Schematic representation of MERS-CoV spike protein domains. Positively selected sites in MERS-CoV and other lineage C beta-CoVs are shown in red, RBD mutations emerged in the South Korean outbreak are in magenta (see text). A detail of the interaction surface between the MERS-CoV RBD and human DPP4 (PDB ID: 4F5C) is also reported. (B) Ribbon diagram of the interaction surface of human ACE2 with the spike protein of SARS-CoV (PDB ID: 2AJF) and HCoV-NL63 (PDB ID: 3KBH). The binding surface of porcine ANPEP with the TGEV spike protein (PDB ID: 4F5C) is also shown. The location of the HCoV-299E binding site on ANPEP is circled. Red denotes protein regions involved in binding.

References

    1. Graham R.L. A decade after SARS: strategies for controlling emerging coronaviruses. Nat. Rev. Microbiol. 2013;11:836–848.
    1. Lau S.K. Discovery of a novel coronavirus, China Rattus coronavirus HKU24, from Norway rats supports the murine origin of Betacoronavirus 1 and has implications for the ancestor of Betacoronavirus lineage A. J. Virol. 2015;89:3076–3092.
    1. Chan J.F. Interspecies transmission and emergence of novel viruses: lessons from bats and birds. Trends Microbiol. 2013;21:544–555.
    1. Drexler J.F. Ecology, evolution and classification of bat coronaviruses in the aftermath of SARS. Antiviral Res. 2014;101:45–56.
    1. Su S. Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends Microbiol. 2016;24:490–502.
    1. Eckerle L.D. High fidelity of murine hepatitis virus replication is decreased in nsp14 exoribonuclease mutants. J. Virol. 2007;81:12135–12144.
    1. Eckerle L.D. Infidelity of SARS-CoV Nsp14-exonuclease mutant virus replication is revealed by complete genome sequencing. PLoS Pathog. 2010;6:e1000896.
    1. Vega V.B. Mutational dynamics of the SARS coronavirus in cell culture and human populations isolated in 2003. BMC Infect. Dis. 2004;4:32.
    1. Lauber C. The footprint of genome architecture in the largest genome expansion in RNA viruses. PLoS Pathog. 2013;9:e1003500.
    1. Subissi L. One severe acute respiratory syndrome coronavirus protein complex integrates processive RNA polymerase and exonuclease activities. Proc. Natl. Acad. Sci. U. S. A. 2014;111:E3900–E3909.
    1. Menachery V.D. SARS-like WIV1-CoV poised for human emergence. Proc. Natl. Acad. Sci. U. S. A. ​. 2016;113:3048–3053.
    1. Ge X.Y. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature. 2013;503:535–538.
    1. Yang X.L. Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. J. Virol. 2015;90:3253–3256.
    1. Lau S.K. Severe acute respiratory syndrome (SARS) coronavirus ORF8 protein is acquired from sars-related coronavirus from greater horseshoe bats through recombination. J. Virol. 2015;89:10532–10547.
    1. Wu Z. ORF8-related genetic evidence for Chinese horseshoe bats as the source of human severe acute respiratory syndrome coronavirus. J. Infect. Dis. 2016;213:579–583.
    1. Chinese SARS Molecular Epidemiology Consortium Molecular evolution of the SARS coronavirus during the course of the SARS epidemic in China. Science. 2004;303:1666–1669.
    1. Scheffler K. Robust inference of positive selection from recombining coding sequences. Bioinformatics. 2006;22:2493–2499.
    1. Wertheim J.O. RELAX: detecting relaxed selection in a phylogenetic framework. Mol. Biol. Evol. 2015;32:820–832.
    1. Corman V.M. Evidence for an ancestral association of human coronavirus 229E with bats. J. Virol. 2015;89:11858–11870.
    1. Crossley B.M. Identification and characterization of a novel alpaca respiratory coronavirus most closely related to the human coronavirus 229E. Viruses. 2012;4:3689–3700.
    1. Crossley B.M. Identification of a novel coronavirus possibly associated with acute respiratory syndrome in alpacas (Vicugna pacos) in California, 2007. J. Vet. Diagn. Invest. 2010;22:94–97.
    1. Sabir J.S. Co-circulation of three camel coronavirus species and recombination of MERS-CoVs in Saudi Arabia. Science. 2016;351:81–84.
    1. Zhao L. Antagonism of the interferon-induced OAS-RNase L pathway by murine coronavirus ns2 protein is required for virus replication and liver pathology. Cell. Host Microbe. 2012;11:607–616.
    1. Zhang R. Homologous 2′,5′-phosphodiesterases from disparate RNA viruses antagonize antiviral innate immunity. Proc. Natl. Acad. Sci. U. S. A. 2013;110:13114–13119.
    1. Gusho E. Murine AKAP7 has a 2′,5′-phosphodiesterase domain that can complement an inactive murine coronavirus ns2 gene. mBio. 2014;5:e01312–e1314.
    1. Thornbrough J.M. Middle East respiratory syndrome coronavirus NS4b protein inhibits host RNase L activation. mBio. 2016 doi: 10.1128/mBio.00258-16. Published online March 29, 2016.
    1. Chen L., Li F. Structural analysis of the evolutionary origins of influenza virus hemagglutinin and other viral lectins. J. Virol. 2013;87:4118–4120.
    1. Peng G. Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor. Proc. Natl. Acad. Sci. U. S. A. 2011;108:10696–10701.
    1. Huang X. Human coronavirus HKU1 spike protein uses o-acetylated sialic acid as an attachment receptor determinant and employs hemagglutinin-esterase protein as a receptor-destroying enzyme. J. Virol. 2015;89:7202–7213.
    1. Desforges M. The acetyl-esterase activity of the hemagglutinin-esterase protein of human coronavirus OC43 strongly enhances the production of infectious virus. J. Virol. 2013;87:3097–3107.
    1. Dijkman R. Human coronavirus 229E encodes a single ORF4 protein between the spike and the envelope genes. Virol. J. 2006;3:106.
    1. Farsani S.M. The first complete genome sequences of clinical isolates of human coronavirus 229E. Virus Genes. 2012;45:433–439.
    1. Zhang R. The ORF4a protein of human coronavirus 229E functions as a viroporin that regulates viral production. Biochim. Biophys. Acta. 2014;1838:1088–1095.
    1. Zhang R. The ns12.9 accessory protein of human coronavirus OC43 is a viroporin involved in virion morphogenesis and pathogenesis. J. Virol. 2015;89:11383–11395.
    1. Lu W. Severe acute respiratory syndrome-associated coronavirus 3a protein forms an ion channel and modulates virus release. Proc. Natl. Acad. Sci. U. S. A. 2006;103:12540–12545.
    1. Koetzner C.A. Accessory protein 5a is a major antagonist of the antiviral action of interferon against murine coronavirus. J. Virol. 2010;84:8262–8274.
    1. Zhao G.P. SARS molecular epidemiology: a Chinese fairy tale of controlling an emerging zoonotic disease in the genomics era. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 2007;362:1063–1081.
    1. Graham R.L., Baric R.S. Recombination, reservoirs, and the modular spike: mechanisms of coronavirus cross-species transmission. J. Virol. 2010;84:3134–3146.
    1. Lu G. Bat-to-human: spike features determining ‘host jump’ of coronaviruses SARS-CoV, MERS-CoV, and beyond. Trends Microbiol. 2015;23:468–478.
    1. Corman V.M. Rooting the phylogenetic tree of middle East respiratory syndrome coronavirus by characterization of a conspecific virus from an African bat. J. Virol. 2014;88:11297–11303.
    1. Kim J.I. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination. Sci. Rep. 2016;6:18825.
    1. Forni D. The heptad repeat region is a major selection target in MERS-CoV and related coronaviruses. Sci. Rep. 2015;5:14480.
    1. Cotten M. Spread, circulation, and evolution of the Middle East respiratory syndrome coronavirus. mBio. 2014 doi: 10.1128/mBio.01062-13. Published online February 18, 2014.
    1. Yamada Y. Acquisition of cell-cell fusion activity by amino acid substitutions in spike protein determines the infectivity of a coronavirus in cultured cells. PLoS One. 2009;4:e6130.
    1. Navas-Martin S. Murine coronavirus evolution in vivo: functional compensation of a detrimental amino acid substitution in the receptor binding domain of the spike glycoprotein. J. Virol. 2005;79:7629–7640.
    1. McRoy W.C., Baric R.S. Amino acid substitutions in the S2 subunit of mouse hepatitis virus variant V51 encode determinants of host range expansion. J. Virol. 2008;82:1414–1424.
    1. Kim Y. Spread of mutant Middle East respiratory syndrome coronavirus with reduced affinity to human CD26 during the South Korean outbreak. mBio. 2016 doi: 10.1128/mBio.00019-16. Published online March 1, 2016.
    1. Forni D. Extensive positive selection drives the evolution of nonstructural proteins in lineage C betacoronaviruses. J. Virol. 2016;90:3627–3639.
    1. Baez-Santos Y.M. The SARS-coronavirus papain-like protease: structure, function and inhibition by designed antiviral compounds. Antiviral Res. 2015;115:21–38.
    1. Rasschaert D. Porcine respiratory coronavirus differs from transmissible gastroenteritis virus by a few genomic deletions. J. Gen. Virol. 1990;71:2599–2607.
    1. Sanchez C.M. Targeted recombination demonstrates that the spike gene of transmissible gastroenteritis coronavirus is a determinant of its enteric tropism and virulence. J. Virol. 1999;73:7607–7618.
    1. Pyrc K. Mosaic structure of human coronavirus NL63, one thousand years of evolution. J. Mol. Biol. 2006;364:964–973.
    1. Wu K. Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor. Proc. Natl. Acad. Sci. U. S. A. 2009;106:19970–19974.
    1. Reguera J. Structural bases of coronavirus attachment to host aminopeptidase N and its inhibition by neutralizing antibodies. PLoS Pathog. 2012;8:e1002859.
    1. Chen L. Structural basis for multifunctional roles of mammalian aminopeptidase N. Proc. Natl. Acad. Sci. U. S. A. 2012;109:17966–17971.
    1. Milewska A. Human coronavirus NL63 utilizes heparan sulfate proteoglycans for attachment to target cells. J. Virol. 2014;88:13221–13230.
    1. de Haan C.A. Murine coronavirus with an extended host range uses heparan sulfate as an entry receptor. J. Virol. 2005;79:14451–14456.
    1. Lau S.K. Molecular epidemiology of human coronavirus OC43 reveals evolution of different genotypes over time and recent emergence of a novel genotype due to natural recombination. J. Virol. 2011;85:11325–11337.
    1. Zhang Y. Genotype shift in human coronavirus OC43 and emergence of a novel genotype by natural recombination. J. Infect. 2015;70:641–650.
    1. Al-Khannaq M.N. Molecular epidemiology and evolutionary histories of human coronavirus OC43 and HKU1 among patients with upper respiratory tract infections in Kuala Lumpur, Malaysia. Virol. J. 2016;13 33-016-0488-4.
    1. Ren L. Genetic drift of human coronavirus OC43 spike gene during adaptive evolution. Sci. Rep. 2015;5:11451.
    1. Qian Z. Identification of the receptor-binding domain of the spike glycoprotein of human betacoronavirus HKU1. J. Virol. 2015;89:8816–8827.
    1. Kirchdoerfer R.N. Pre-fusion structure of a human coronavirus spike protein. Nature. 2016;531:118–121.
    1. Dominguez S.R. Isolation, propagation, genome analysis and epidemiology of HKU1 betacoronaviruses. J. Gen. Virol. 2014;95:836–848.
    1. Pickett B.E. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012;40:D593–D598.
    1. Anisimova M. Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. Genetics. 2003;164:1229–1236.
    1. Guindon S. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 2010;59:307–321.
    1. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313.
    1. Ronquist F., Huelsenbeck J.P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–1574.
    1. Kosakovsky Pond S.L. Automated phylogenetic detection of recombination using a genetic algorithm. Mol. Biol. Evol. 2006;23:1891–1901.
    1. Maydt J., Lengauer T. Recco: recombination analysis using cost optimization. Bioinformatics. 2006;22:1064–1071.
    1. Martin D., Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000;16:562–563.
    1. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007;24:1586–1591.
    1. Zhang J. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol. Biol. Evol. 2005;22:2472–2479.
    1. Murrell B. Gene-wide identification of episodic selection. Mol. Biol. Evol. 2015;32:1365–1371.
    1. Kosakovsky Pond S.L. A random effects branch-site model for detecting episodic diversifying selection. Mol. Biol. Evol. 2011;28:3033–3043.
    1. McDonald J.H., Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351:652–654.
    1. Gharib W.H., Robinson-Rechavi M. The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC. Mol. Biol. Evol. 2013;30:1675–1686.
    1. Xia X. An index of substitution saturation and its application. Mol. Phylogenet. Evol. 2003;26:1–7.
    1. Sealfon R.S. FRESCo: finding regions of excess synonymous constraint in diverse viruses. Genome Biol. 2015;16 38-015-0603-7.
    1. de Groot R.J. Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group. J. Virol. 2013;87:7790–7792.
    1. Wertheim J.O. A case for the ancient origin of coronaviruses. J. Virol. 2013;87:7039–7045.
    1. Woo P.C. Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus. J. Virol. 2012;86:3995–4008.
    1. Blair J.E., Hedges S.B. Molecular phylogeny and divergence times of deuterostome animals. Mol. Biol. Evol. 2005;22:2275–2284.
    1. Huynh J. Evidence supporting a zoonotic origin of human coronavirus strain NL63. J. Virol. 2012;86:12816–12825.
    1. Pfefferle S. Distant relatives of severe acute respiratory syndrome coronavirus and close relatives of human coronavirus 229E in bats, Ghana. Emerg. Infect. Dis. 2009;15:1377–1384.
    1. Vijgen L. Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event. J. Virol. 2005;79:1595–1604.
    1. Zhang Z. Evolutionary dynamics of MERS-CoV: potential recombination, positive selection and transmission. Sci. Rep. 2016;6:25049.
    1. Gralinski L.E., Baric R.S. Molecular pathology of emerging coronavirus infections. J. Pathol. 2015;235:185–195.
    1. Krogh A. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 2001;305:567–580.

Source: PubMed

3
Iratkozz fel