Literature DB >> 32004758

Full-genome evolutionary analysis of the novel corona virus (2019-nCoV) rejects the hypothesis of emergence as a result of a recent recombination event.

D Paraskevis1, E G Kostaki2, G Magiorkinis2, G Panayiotakopoulos3, G Sourvinos4, S Tsiodras5.   

Abstract

BACKGROUND: A novel coronavirus (2019-nCoV) associated with human to human transmission and severe human infection has been recently reported from the city of Wuhan in China. Our objectives were to characterize the genetic relationships of the 2019-nCoV and to search for putative recombination within the subgenus of sarbecovirus.
METHODS: Putative recombination was investigated by RDP4 and Simplot v3.5.1 and discordant phylogenetic clustering in individual genomic fragments was confirmed by phylogenetic analysis using maximum likelihood and Bayesian methods.
RESULTS: Our analysis suggests that the 2019-nCoV although closely related to BatCoV RaTG13 sequence throughout the genome (sequence similarity 96.3%), shows discordant clustering with the Bat_SARS-like coronavirus sequences. Specifically, in the 5'-part spanning the first 11,498 nucleotides and the last 3'-part spanning 24,341-30,696 positions, 2019-nCoV and RaTG13 formed a single cluster with Bat_SARS-like coronavirus sequences, whereas in the middle region spanning the 3'-end of ORF1a, the ORF1b and almost half of the spike regions, 2019-nCoV and RaTG13 grouped in a separate distant lineage within the sarbecovirus branch.
CONCLUSIONS: The levels of genetic similarity between the 2019-nCoV and RaTG13 suggest that the latter does not provide the exact variant that caused the outbreak in humans, but the hypothesis that 2019-nCoV has originated from bats is very likely. We show evidence that the novel coronavirus (2019-nCov) is not-mosaic consisting in almost half of its genome of a distinct lineage within the betacoronavirus. These genomic features and their potential association with virus characteristics and virulence in humans need further attention.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Genomic sequence analysis; Molecular epidemiology; Novel coronavirus; Origin; Phylogenetic analysis; Recombination

Mesh:

Year:  2020        PMID: 32004758      PMCID: PMC7106301          DOI: 10.1016/j.meegid.2020.104212

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


The family Coronaviridae includes a large number of viruses that in nature are found in birds and mammals (Kahn and McIntosh, 2005; Fehr and Perlman, 2015). Human coronaviruses, first characterized in the 1960s, are associated with a large percentage of respiratory infections both in children and adults (Kahn and McIntosh, 2005; Paules et al., 2020). Scientific interest in Coronaviruses exponentially increased after the emergence of SARS-Coronavirus (SARS-CoV) in Southern China (Drosten et al., 2003; Ksiazek et al., 2003; Peiris et al., 2003). Its rapid spread led to the global appearance of more than 8000 human cases and 774 deaths (Kahn and McIntosh, 2005). The virus was initially detected in Himalayan palm civets (Guan et al., 2003) that may have served as an amplification host; the civet virus contained a 29-nucleotide sequence not found in most human isolates that were related to the global epidemic (Guan et al., 2003). It has been speculated that the function of the affected open reading frame (ORF 10) might have played a role in the trans-species jump (Kahn and McIntosh, 2005). A similar virus was found later in horseshoe bats (Lau et al., 2005; Li et al., 2005a). A 29-bp insertion in ORF 8 of bat-SARS-CoV genome, not found in most human SARS-CoV genomes, was suggestive of a common ancestor with civet SARS-CoV (Lau et al., 2005). After the SARS epidemic, bats have been considered as a potential reservoir species that could be implicated in future coronavirus-related human pandemics (Cui et al., 2019). During 2012 Middle East Respiratory coronavirus (MERS-CoV) emerged in Saudi Arabia (Zaki et al., 2012; Hajjar et al., 2013) and has since claimed the lives of 919 out of 2521 (35%) people affected (ECDC, 2020). A main role in the transmission of the virus to humans has been attributed to dromedary camels (Alagaili et al., 2014) and its origin has been again traced to bats (Ithete et al., 2013). Ever since both SARS and MERS-CoV (due to their high case fatality rates) are prioritized together with “highly pathogenic coronaviral diseases other than MERS and SARS” under the Research and Development Blueprint published by the WHO (World Health Organization, 2018). A novel coronavirus (2019-nCoV) associated with human to human transmission and severe human infection has been recently reported from the city of Wuhan in Hubei province in China (World Health Organization, 2020; Hui et al., 2020). A total of 1,320 confirmed and 1,965 suspect cases were reported up to 25 January 2020; of the confirmed cases 237 were severely ill and 41 had died (World Health Organization, 2020). Most of the original cases had close contact with a local fresh seafood and an animal market (Zhu et al., 2020; Perlman, 2020). Full-genome sequence analysis of 2019-nCoV revealed that belongs to betacoronavirus, but it is divergent from SARS-CoV and MERS-CoV that caused epidemics in the past (Zhu et al., 2020). The 2019-nCoV along with the Bat_SARS-like coronavirus forms a distinct lineage within the subgenus of the sarbecovirus (Zhu et al., 2020). Our objectives were to characterize the genetic relationships of the 2019-nCoV and to search for putative recombination within the subgenus of sarbecovirus. Viral sequences were downloaded from NCBI nucleotide sequence database (http://www.ncbi.nlm.nih.gov). The BatCoV RaTG13 sequence was downloaded from the GISAID BetaCov 2019–2020 repository (http://www.GISAID.org). The sequence was reported in Zhou et al. (2020). Full-genomic sequence alignment was performed using MAFFT v7.4.2. (Katoh and Standley, 2013) and manually edited using MEGA v1.0 (Stecher et al., 2020) according to the encoded reading frame. Putative recombination was investigated by RDP4 (Martin, 2015) and Simplot v3.5.1 (Lole et al., 1999) and discordant phylogenetic clustering in individual genomic fragments was confirmed by phylogenetic analysis using maximum likelihood (ML) and Bayesian methods. ML trees were reconstructed using Neighbor-Joining (NJ) with ML distances or after heuristic ML search (TBR) with GTR + G as nucleotide substitution model as implemented in PAUP* 4.0 beta (Swofford, 2003). The GTR + G was used in Bayesian analysis as implemented in MrBayes v3.2.7 (Huelsenbeck and Ronquist, 2001). Phylogenetic trees were viewed using FigTree v1.4 (http://tree.bio.ed.ac.uk/software/figtree/). A similarity plot was performed using a sliding window of 450 nts moving in steps of 50 nts, between the query sequence (2019-nCoV) and different sequences grouped according to their clustering pattern. The similarity plot (Fig. 1A,B) suggested that the RaTG13 was the most closely related sequence to the 2019-nCoV throughout the genome. The genetic similarity between the 2019-nCoV and RaTG13 was 96.3% (p-distance: 0.0369). On the other hand, a discordant relationship was detected between the query and the sequences of the Bat_SARS-like coronavirus (MG772934 and MG772933) (Fig. 1C). Specifically in in the 5′-part of the genome spanning the first 10,901 nts of the alignment that correspond to the 11,498 nucleotides of the prototype strain (NC_045512) and the last 3′-part spanning 22,831–27,933 positions (24,341–30,696 nucleotides in the NC_045512), 2019-nCoV and RaTG13 formed a single cluster with Bat_SARS-like coronavirus sequences (Fig. 1C). In the middle region spanning the 3′-end of ORF1a, the ORF1b and almost half of the spike regions (10,901–22,830 nts in the alignment or 11,499–24,340 of the NC_045512), 2019-nCoV and RaTG13 grouped in a separate distant lineage within the sarbecovirus branch (Fig. 1B, C). In this region the 2019-nCoV and RaTG13 is distantly related to the Bat_SARS-like coronavirus sequences. Phylogenetic analyses using different methods confirmed these findings. A BLAST search of 2019-nCoV middle fragment revealed no considerable similarity with any of the previously characterized corona viruses (Fig. 2 ).
Fig. 1

A. Genomic organization of the novel coronavirus (2019-nCoV) according to the positions in the edited alignment. B. Simplot of 2019-nCoV (NC_045512_Wuhan_Hu-1) against sequences within the subgenus sarbecovirus. Different colours correspond to the nucleotide similarity between the 2019-nCoV and different groups. The regions with discordant phylogenetic clustering of the 2019-nCoV with Bats_SARS-like sequences are shown in different colours. C. Maximum likelihood (ML) phylogenetic trees inferred in different genomic regions as indicated by the Simplot analysis. The genomic regions are shown in numbers at the top or at the left of the trees. The 2019-nCoV sequence is shown in red and stars indicate important nodes received 100% bootstrap and 1 posterior probability support. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 2

Neighbor joining distance tree of the BLAST search results of the 2019-nCoV (NC_0455_Wuhan_Hu_1) sequence in the genomic region 10,901–22,830 of the alignment.

A. Genomic organization of the novel coronavirus (2019-nCoV) according to the positions in the edited alignment. B. Simplot of 2019-nCoV (NC_045512_Wuhan_Hu-1) against sequences within the subgenus sarbecovirus. Different colours correspond to the nucleotide similarity between the 2019-nCoV and different groups. The regions with discordant phylogenetic clustering of the 2019-nCoV with Bats_SARS-like sequences are shown in different colours. C. Maximum likelihood (ML) phylogenetic trees inferred in different genomic regions as indicated by the Simplot analysis. The genomic regions are shown in numbers at the top or at the left of the trees. The 2019-nCoV sequence is shown in red and stars indicate important nodes received 100% bootstrap and 1 posterior probability support. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Neighbor joining distance tree of the BLAST search results of the 2019-nCoV (NC_0455_Wuhan_Hu_1) sequence in the genomic region 10,901–22,830 of the alignment. Our study suggests that the new corona virus (2019-nCoV) is not a mosaic and it is most closely related with the BatCoV RaTG13 detected in bats from Yunnan Province (Zhou et al., 2020). The levels of genetic similarity between the 2019-nCoV and RaTG13 suggest that the latter does not provide the exact variant that caused the outbreak in humans, but the hypothesis that 2019-nCoV has originated from bats is very likely. On the other hand, there is evidence for discordant phylogenetic relationships between 2019-nCoV and RaTG13 clade with their closest partners, the Bat_SARS-like coronavirus sequences. In accordance with previous analysis (http://virological.org/t/ncovs-relationship-to-bat-coronaviruses-recombination-signals-no-snakes/331), Bat_SARS-like coronavirus sequences cluster in different positions in the tree, suggesting that they are recombinants, and thus that the 2019-nCoV and RaTG13 are not (Ji et al., 2020; Magiorkinis et al., 2004). One previous study based on codon usage analyses suggested that the spike protein of 2019-nCoV might have originated from one yet-unknown unsampled coronavirus through recombination (Ji et al., 2020). Codon usage analyses can resolve the origin of proteins with deep ancestry and insufficient phylogenetic signal or invented de novo. The recently-published bat coronavirus sequence however provides strong phylogenetic information to resolve the origin of the Spike protein, as well as the rest of the genome, suggesting a uniform ancestry across the genome. We have previously shown that phylogenetic discordance in deep relationships of coronaviruses is common and can be explained either by ancient recombination event or altered evolutionary rates in different lineages, or a combination of both (Magiorkinis et al., 2004). Our study rejects the hypothesis of emergence as a result of a recent recombination event. Notably, the new coronavirus provides a new lineage for almost half of its genome, with no close genetic relationships to other viruses within the subgenus of sarbecovirus. This genomic part comprises half of the spike region encoding a multifunctional protein responsible also for virus entry into host cells (Babcock et al., 2004; Li et al., 2005b). The unique genetic features of 2019-nCoV and their potential association with virus characteristics and virulence in humans remain to be elucidated.

Declaration of Competing Interest

All authors report no conflict of interest related to the submitted work.
  26 in total

1.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

2.  Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia.

Authors:  Ali M Zaki; Sander van Boheemen; Theo M Bestebroer; Albert D M E Osterhaus; Ron A M Fouchier
Journal:  N Engl J Med       Date:  2012-10-17       Impact factor: 91.245

3.  Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination.

Authors:  K S Lole; R C Bollinger; R S Paranjape; D Gadkari; S S Kulkarni; N G Novak; R Ingersoll; H W Sheppard; S C Ray
Journal:  J Virol       Date:  1999-01       Impact factor: 5.103

4.  A novel coronavirus associated with severe acute respiratory syndrome.

Authors:  Thomas G Ksiazek; Dean Erdman; Cynthia S Goldsmith; Sherif R Zaki; Teresa Peret; Shannon Emery; Suxiang Tong; Carlo Urbani; James A Comer; Wilina Lim; Pierre E Rollin; Scott F Dowell; Ai-Ee Ling; Charles D Humphrey; Wun-Ju Shieh; Jeannette Guarner; Christopher D Paddock; Paul Rota; Barry Fields; Joseph DeRisi; Jyh-Yuan Yang; Nancy Cox; James M Hughes; James W LeDuc; William J Bellini; Larry J Anderson
Journal:  N Engl J Med       Date:  2003-04-10       Impact factor: 91.245

5.  Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2.

Authors:  Wenhui Li; Chengsheng Zhang; Jianhua Sui; Jens H Kuhn; Michael J Moore; Shiwen Luo; Swee-Kee Wong; I-Chueh Huang; Keming Xu; Natalya Vasilieva; Akikazu Murakami; Yaqing He; Wayne A Marasco; Yi Guan; Hyeryun Choe; Michael Farzan
Journal:  EMBO J       Date:  2005-03-24       Impact factor: 11.598

Review 6.  Coronaviruses: an overview of their replication and pathogenesis.

Authors:  Anthony R Fehr; Stanley Perlman
Journal:  Methods Mol Biol       Date:  2015

7.  Middle East respiratory syndrome coronavirus infection in dromedary camels in Saudi Arabia.

Authors:  Abdulaziz N Alagaili; Thomas Briese; Nischay Mishra; Vishal Kapoor; Stephen C Sameroff; Peter D Burbelo; Emmie de Wit; Vincent J Munster; Lisa E Hensley; Iyad S Zalmout; Amit Kapoor; Jonathan H Epstein; William B Karesh; Peter Daszak; Osama B Mohammed; W Ian Lipkin
Journal:  mBio       Date:  2014-02-25       Impact factor: 7.867

Review 8.  Middle East Respiratory Syndrome Coronavirus (MERS-CoV): a perpetual challenge.

Authors:  Sami Al Hajjar; Ziad A Memish; Kenneth McIntosh
Journal:  Ann Saudi Med       Date:  2013 Sep-Oct       Impact factor: 1.526

9.  Phylogenetic analysis of the full-length SARS-CoV sequences: evidence for phylogenetic discordance in three genomic regions.

Authors:  G Magiorkinis; E Magiorkinis; D Paraskevis; A M Vandamme; M Van Ranst; V Moulton; A Hatzakis
Journal:  J Med Virol       Date:  2004-11       Impact factor: 2.327

10.  Response to comments on "Cross-species Transmission of the Newly Identified Coronavirus 2019-nCoV" and "Codon bias analysis may be insufficient for identifying host(s) of a novel virus".

Authors:  Wei Ji; Xingguang Li
Journal:  J Med Virol       Date:  2020-07-14       Impact factor: 20.693

View more
  208 in total

1.  Bi- and Multi-directional Gene Transfer in the Natural Populations of Polyvalent Bacteriophages, and Their Host Species Spectrum Representing Foodborne Versus Other Human and/or Animal Pathogens.

Authors:  Ekaterine Gabashvili; Saba Kobakhidze; Stylianos Koulouris; Tobin Robinson; Mamuka Kotetishvili
Journal:  Food Environ Virol       Date:  2021-01-23       Impact factor: 2.778

2.  Genomic evolution of severe acute respiratory syndrome Coronavirus 2 in India and vaccine impact.

Authors:  Jobin John Jacob; Karthick Vasudevan; Balaji Veeraraghavan; Ramya Iyadurai; Karthik Gunasekaran
Journal:  Indian J Med Microbiol       Date:  2020 Apr-Jun       Impact factor: 0.985

3.  A study of differential circRNA and lncRNA expressions in COVID-19-infected peripheral blood.

Authors:  Yingping Wu; Tiejun Zhao; Riqiang Deng; Xiaoping Xia; Bin Li; Xunzhang Wang
Journal:  Sci Rep       Date:  2021-04-12       Impact factor: 4.379

4.  What We Need to Consider During and After the SARS-CoV-2 Pandemic.

Authors:  Willy A Valdivia-Granda; Jürgen A Richt
Journal:  Vector Borne Zoonotic Dis       Date:  2020-05-29       Impact factor: 2.133

Review 5.  Genetic Aspects and Immune Responses in Covid-19: Important Organ Involvement.

Authors:  Zari Naderi Ghale-Noie; Arash Salmaninejad; Robert Bergquist; Samaneh Mollazadeh; Benyamin Hoseini; Amirhossein Sahebkar
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

6.  Clinical orthodontic management during the COVID-19 pandemic.

Authors:  Sunjay Suri; Yona R Vandersluis; Anuraj S Kochhar; Ritasha Bhasin; Mohamed-Nur Abdallah
Journal:  Angle Orthod       Date:  2020-07-01       Impact factor: 2.079

7.  Molecular Detection of SARS-CoV-2 in Formalin Fixed Paraffin Embedded Specimens.

Authors:  Jun Liu; April M Babka; Brian J Kearney; Sheli R Radoshitzky; Jens H Kuhn; Xiankun Zeng
Journal:  bioRxiv       Date:  2020-04-21

8.  Molecular detection of SARS-CoV-2 in formalin-fixed, paraffin-embedded specimens.

Authors:  Jun Liu; April M Babka; Brian J Kearney; Sheli R Radoshitzky; Jens H Kuhn; Xiankun Zeng
Journal:  JCI Insight       Date:  2020-06-18

9.  Global transmission network of SARS-CoV-2: from outbreak to pandemic.

Authors:  Pavel Skums; Alexander Kirpich; Pelin Icer Baykal; Alex Zelikovsky; Gerardo Chowell
Journal:  medRxiv       Date:  2020-03-27

10.  The use of SARS-CoV-2-related coronaviruses from bats and pangolins to polarize mutations in SARS-Cov-2.

Authors:  Tao Li; Xiaolu Tang; Changcheng Wu; Xinmin Yao; Yirong Wang; Xuemei Lu; Jian Lu
Journal:  Sci China Life Sci       Date:  2020-07-01       Impact factor: 6.038

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.