A new coronavirus associated with human respiratory disease in China
Fan Wu, Su Zhao, Bin Yu, Yan-Mei Chen, Wen Wang, Zhi-Gang Song, Yi Hu, Zhao-Wu Tao, Jun-Hua Tian, Yuan-Yuan Pei, Ming-Li Yuan, Yu-Ling Zhang, Fa-Hui Dai, Yi Liu, Qi-Min Wang, Jiao-Jiao Zheng, Lin Xu, Edward C Holmes, Yong-Zhen Zhang
Abstract
Emerging infectious diseases, such as severe acute respiratory syndrome (SARS) and Zika virus disease, present a major threat to public health [1-3]. Despite intense research efforts, how, when and where new diseases appear are still a source of considerable uncertainty. A severe respiratory disease was recently reported in Wuhan, Hubei province, China. As of 25 January 2020, at least 1,975 cases had been reported since the first patient was hospitalized on 12 December 2019. Epidemiological investigations have suggested that the outbreak was associated with a seafood market in Wuhan. Here we study a single patient who was a worker at the market and who was admitted to the Central Hospital of Wuhan on 26 December 2019 while experiencing a severe respiratory syndrome that included fever, dizziness and a cough. Metagenomic RNA sequencing [4] of a sample of bronchoalveolar lavage fluid from the patient identified a new RNA virus strain from the family Coronaviridae, which is designated here 'WH-Human 1' coronavirus (and has also been referred to as '2019-nCoV'). Phylogenetic analysis of the complete viral genome (29,903 nucleotides) revealed that the virus was most closely related (89.1% nucleotide similarity) to a group of SARS-like coronaviruses (genus Betacoronavirus, subgenus Sarbecovirus) that had previously been found in bats in China [5]. This outbreak highlights the ongoing ability of viral spill-over from animals to cause severe disease in humans.
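As an illustration of what a figure such as "89.1% nucleotide similarity" measures, the following is a minimal sketch of a pairwise identity calculation over two pre-aligned sequences. It is not the authors' pipeline: the function name `percent_identity`, the choice to skip gap columns, and the toy sequences are all assumptions made for this example, and published similarity values depend on the alignment method and on how gaps are treated.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of ungapped alignment columns where both nucleotides match.

    Assumes seq_a and seq_b are already aligned (equal length, '-' for gaps).
    Columns containing a gap in either sequence are skipped (an assumption;
    other conventions count gaps as mismatches).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gap columns
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical toy fragments, not real viral sequence data.
    query = "ATGTTTGTTTTTCTTG-TTT"
    reference = "ATGTTTATTTTCTTAGCTTT"
    print(f"{percent_identity(query, reference):.1f}% identity")
```

For whole genomes, the aligned input would typically come from a multiple sequence alignment tool rather than being written by hand.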
Conflict of interest statement
The authors declare no competing interests.
References
- Drosten C, et al. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 2003;348:1967–1976. doi: 10.1056/NEJMoa030747.
- Wolfe ND, Dunavan CP, Diamond J. Origins of major human infectious diseases. Nature. 2007;447:279–283. doi: 10.1038/nature05775.
- Ventura CV, Maia M, Bravo-Filho V, Góis AL, Belfort R., Jr. Zika virus in Brazil and macular atrophy in a child with microcephaly. Lancet. 2016;387:228. doi: 10.1016/S0140-6736(16)00006-4.
- Shi M, et al. Redefining the invertebrate RNA virosphere. Nature. 2016;540:539–543. doi: 10.1038/nature20167.
- Hu D, et al. Genomic characterization and infectivity of a novel SARS-like coronavirus in Chinese bat. Emerg. Microbes Infect. 2018;7:1–10. doi: 10.1038/s41426-018-0155-5.
- Shi M, et al. The evolutionary history of vertebrate RNA viruses. Nature. 2018;556:197–202. doi: 10.1038/s41586-018-0012-7.
- Yadav PD, et al. Nipah virus sequences from humans and bats during Nipah outbreak, Kerala, India, 2018. Emerg. Infect. Dis. 2019;25:1003–1006. doi: 10.3201/eid2505.181076.
- McMullan LK, et al. Characterisation of infectious Ebola virus from the ongoing outbreak to guide response activities in the Democratic Republic of the Congo: a phylogenetic and in vitro analysis. Lancet Infect. Dis. 2019;19:1023–1032. doi: 10.1016/S1473-3099(19)30291-9.
- Li D, Liu CM, Luo R, Sadakane K, Lam TW. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31:1674–1676. doi: 10.1093/bioinformatics/btv033.
- Wang W, et al. Discovery, diversity and evolution of novel coronaviruses sampled from rodents in China. Virology. 2015;474:19–27. doi: 10.1016/j.virol.2014.10.017.
- Hu B, et al. Discovery of a rich gene pool of bat SARS-related coronaviruses provides new insights into the origin of SARS coronavirus. PLoS Pathog. 2017;13:e1006698. doi: 10.1371/journal.ppat.1006698.
- Lin X-D, et al. Extensive diversity of coronaviruses in bats from China. Virology. 2017;507:1–10. doi: 10.1016/j.virol.2017.03.019.
- Xu L, et al. Detection and characterization of diverse alpha- and betacoronaviruses from bats in China. Virol. Sin. 2016;31:69–77. doi: 10.1007/s12250-016-3727-3.
- Ren W, et al. Difference in receptor usage between severe acute respiratory syndrome (SARS) coronavirus and SARS-like coronavirus of bat origin. J. Virol. 2008;82:1899–1907. doi: 10.1128/JVI.01085-07.
- Li F, Li W, Farzan M, Harrison SC. Structure of SARS coronavirus spike receptor-binding domain complexed with receptor. Science. 2005;309:1864–1868. doi: 10.1126/science.1116480.
- Hulswit RJG, et al. Human coronaviruses OC43 and HKU1 bind to 9-O-acetylated sialic acids via a conserved receptor-binding site in spike protein domain A. Proc. Natl Acad. Sci. USA. 2019;116:2681–2690. doi: 10.1073/pnas.1809667116.
- Ge XY, et al. Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature. 2013;503:535–538. doi: 10.1038/nature12711.
- Yang XL, et al. Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. J. Virol. 2016;90:3253–3256. doi: 10.1128/JVI.02582-15.
- Martin DP, et al. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26:2462–2463. doi: 10.1093/bioinformatics/btq467.
- Menachery VD, et al. A SARS-like cluster of circulating bat coronaviruses shows potential for human emergence. Nat. Med. 2015;21:1508–1513. doi: 10.1038/nm.3985.
- Bermingham A, et al. Severe respiratory illness caused by a novel coronavirus, in a patient transferred to the United Kingdom from the Middle East, September 2012. Euro Surveill. 2012;17:20290. doi: 10.2807/ese.17.40.20290-en.
- Hamre D, Procknow JJ. A new virus isolated from the human respiratory tract. Proc. Soc. Exp. Biol. Med. 1966;121:190–193. doi: 10.3181/00379727-121-30734.
- McIntosh K, Becker WB, Chanock RM. Growth in suckling-mouse brain of “IBV-like” viruses from patients with upper respiratory tract disease. Proc. Natl Acad. Sci. USA. 1967;58:2268–2273. doi: 10.1073/pnas.58.6.2268.
- van der Hoek L, et al. Identification of a new human coronavirus. Nat. Med. 2004;10:368–373. doi: 10.1038/nm1024.
- Woo PC, et al. Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia. J. Virol. 2005;79:884–895. doi: 10.1128/JVI.79.2.884-895.2005.
- Li W, et al. Bats are natural reservoirs of SARS-like coronaviruses. Science. 2005;310:676–679. doi: 10.1126/science.1118391.
- Lau SK, et al. Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats. Proc. Natl Acad. Sci. USA. 2005;102:14040–14045. doi: 10.1073/pnas.0506735102.
- Wang W, et al. Discovery of a highly divergent coronavirus in the Asian house shrew from China illuminates the origin of the Alphacoronaviruses. J. Virol. 2017;91:e00764–17.
- Zhou P, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020. doi: 10.1038/s41586-020-2012-7.
- Gorbalenya AE. Severe acute respiratory syndrome-related coronavirus: the species and its viruses, a statement of the Coronavirus Study Group. Preprint at bioRxiv. 2020. doi: 10.1101/2020.02.07.93786.
- WHO. WHO Director-General’s remarks at the media briefing on 2019-nCoV on 11 February 2020. (WHO, 11 February 2020).
- Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170.
- Grabherr MG, et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883.
- Li B, Ruotti V, Stewart RM, Thomson JA, Dewey CN. RNA-seq gene expression estimation with read mapping uncertainty. Bioinformatics. 2010;26:493–500. doi: 10.1093/bioinformatics/btp692.
- Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923.
- Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352.
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010.
- Guindon S, et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010.
- Tamura K, et al. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011;28:2731–2739. doi: 10.1093/molbev/msr121.
- Lole KS, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J. Virol. 1999;73:152–160. doi: 10.1128/JVI.73.1.152-160.1999.
- Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340.
- Waterhouse A, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–W303. doi: 10.1093/nar/gky427.
- Hwang WC, et al. Structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, 80R. J. Biol. Chem. 2006;281:34610–34616. doi: 10.1074/jbc.M603275200.
Source: PubMed