Literature DB >> 31994738

The 2019-new coronavirus epidemic: Evidence for virus evolution.

Domenico Benvenuto1, Marta Giovanetti2, Alessandra Ciccozzi1, Silvia Spoto3, Silvia Angeletti4, Massimo Ciccozzi2.   

Abstract

There is a worldwide concern about the new coronavirus 2019-nCoV as a global public health threat. In this article, we provide a preliminary evolutionary and molecular epidemiological analysis of this new virus. A phylogenetic tree has been built using the 15 available whole genome sequences of 2019-nCoV, 12 whole genome sequences of 2019-nCoV, and 12 highly similar whole genome sequences available in gene bank (five from the severe acute respiratory syndrome, two from Middle East respiratory syndrome, and five from bat SARS-like coronavirus). Fast unconstrained Bayesian approximation analysis shows that the nucleocapsid and the spike glycoprotein have some sites under positive pressure, whereas homology modeling revealed some molecular and structural differences between the viruses. The phylogenetic tree showed that 2019-nCoV significantly clustered with bat SARS-like coronavirus sequence isolated in 2015, whereas structural analysis revealed mutation in Spike Glycoprotein and nucleocapsid protein. From these results, the new 2019-nCoV is distinct from SARS virus, probably trasmitted from bats after mutation conferring ability to infect humans.
© 2020 Wiley Periodicals, Inc.

Entities:  

Keywords:  SARS coronavirus; coronavirus; epidemiology; macromolecular design

Mesh:

Substances:

Year:  2020        PMID: 31994738      PMCID: PMC7166400          DOI: 10.1002/jmv.25688

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   2.327


INTRODUCTION

The family Coronaviridae comprises a group of large, single, plus‐stranded RNA viruses isolated from several species, and it is previously known to cause the common cold and diarrheal illnesses in humans. , In 2003, a new coronavirus (severe acute respiratory syndrome coronavirus [SARS‐CoV]) was associated with the SARS outbreak. , Recently, a new coronavirus (2019‐nCoV) has emerged in the region of Wuhan (China) as a cause of severe respiratory infection in humans. Since December 2019, different cases of pneumonia of unknown origin associated with permanence at the Wuhan market in China have been reported. , A new coronavirus, named 2019‐nCoV, belonging to the Orthocoronavirinae subfamily, distinct from MERS‐CoV and SARS‐CoV, was described. To date, a total of 1975 pneumonia cases have been confirmed in China (the State Council Information Office in Beijing, capital of China, 26 January 2020). , Animal to human transmission is considered the origin of epidemics, as many patients declared to have visited a local fish and wild animal market in Wuhan in November. Quite recently, evidence has been gathered for the animal to the human and interhuman transmission of the virus. , Although prompt diagnosis and patient isolation are the hallmarks for initial control of this new epidemic, molecular epidemiology, evolutionary models, and phylogenetic analysis can help estimate genetic variability and the evolutionary rate, which in turn have important implications for disease progression as well as for drug and vaccine development. In this short report, we provide a phylogenetic tree of the 2019‐nCoV and identify sites of positive or negative selection pressure in distinct regions of the virus.

MATERIAL AND METHODS

The complete genomes of 15 2019‐nCoV sequences have been downloaded from GISAID (https://www.gisaid.org/) and GenBank (http://www.ncbi.nlm.nih.gov/genbank/). A dataset has been built using five highly similar sequences for SARS, two sequences for the Middle East respiratory syndrome (MERS), and five highly similar sequences for bat SARS‐like coronavirus. The percentage of similarity has been identified using a basic local alignment search tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi); eventually duplicated sequences have been excluded from the datasets. The dataset including 27 sequences has been aligned using multiple sequence alignment online tool and manually edited using BioEdit program v7.0.5. Maximum likelihood (ML) methods were employed for the analyses because they allow for testing different phylogenetic hypotheses by calculating the probability of a given model of evolution generating the observed data and by comparing the probabilities of nested models by the likelihood ratio test. The best‐fitting nucleotide substitution model was chosen by jModeltest software. ML tree was reconstructed using generalized time‐reversible plus gamma distribution and invariant sites (+G+I) as an evolutionary model using MEGA‐X. The adaptive evolution server (http://www.datamonkey.org/) was used to find eventual sites of positive or negative selection. For this purpose, the following test has been used: fast unconstrained Bayesian approximation (FUBAR). This test allowed us to infer the site‐specific pervasive selection, the episodic diversifying selection across the region of interest, and to identify episodic selection at individual sites.  The statistically significant positive or negative selection was based on P value less than .05. Homology models have been built relying on the website SwissModel. Structural templates have been searched and validated using the software available within the SwissModel environment and HHPred. Homology models have been validated using the QMEAN tool. Three‐dimensional structures have been analyzed and displayed using PyMOL. To map the structural variability of the N, E, S, and M regions of the virus and their sites under selection pressure, homology modeling has been applied to the sequence of 2019‐nCoV.

RESULTS

The ML phylogenetic tree, performed on whole genome sequences, is represented in Figure 1. In the tree, MERS virus sequences formed a distinct clade (clade I) from Bat SARS‐like coronavirus, SARS virus, and the 2019‐nCoV clustering together in clade II. This clade includes two different clusters: cluster IIa with Bat SARS‐like coronavirus and the 2019‐nCoV sequences, and cluster IIb with the bat SARS‐like coronavirus and the SARS virus sequences. The 2019‐nCoV is significantly and closely related only to the specific bat SARS‐like coronavirus isolated from Rhinolophus sinicus in 2015 in China (MG772934.1) (Figure 1).
Figure 1

Maximum likelihood tree of 2019‐nCoV. Along the branches, an asterisk (*) indicates a statistical value from bootstrap (>97%)

Maximum likelihood tree of 2019‐nCoV. Along the branches, an asterisk (*) indicates a statistical value from bootstrap (>97%) Regarding the FUBAR analysis performed on the N Region, significant (P < .05) pervasive episodic selection was found in two sites (380th and 410th nucleotide position using the reference sequence; Wuhan seafood market pneumonia virus isolate labeled Wuhan‐Hu‐1 MN908947.3). On the 380th aminoacidic position in the Wuhan coronavirus sequence, there is a glutamine residue instead of an asparagine residue, whereas on the 410th aminoacidic position in Wuhan coronavirus sequence, there is a threonine residue instead of an alanine residue. A significant (P < .05) pervasive negative selection in six sites (14%) has been evidenced and confirmed by FUBAR analysis. Concerning the sequences in clade II, on the 409th aminoacidic position in the Wuhan coronavirus sequence, there is glutamine instead of an asparagine residue, whereas, on the 380th aminoacidic position in Wuhan coronavirus sequence, there is a threonine residue instead of an alanine. A significant (P < .05) pervasive negative selection in six sites (14%) has been evidenced and confirmed by FUBAR analysis. On the S region, a significant (P < .05) pervasive episodic selection was found in two different sites (536th and 644th nucleotide position using the reference sequence; the Wuhan seafood market pneumonia virus isolate labeled Wuhan‐Hu‐1 MN908947.3). For the sequences in clade II, on the 536th aminoacidic position in the Wuhan coronavirus sequence, there is an asparagine residue instead of an aspartic acid residue, whereas, on the 644th aminoacidic position in Wuhan coronavirus sequence, there is a threonine residue instead of an alanine residue. A significant (P < .05) pervasive negative selection in 1065 sites (87%) has been evidenced and confirmed by FUBAR analysis, suggesting that the S region could be highly conserved. No sites under positive selection sites have been found in the E and M regions. The N region of the 2019‐nCoV homology model has been built using a SARS coronavirus nucleocapsid protein structure (2jw8.1) as the statistical test has shown that this is the most stable and similar model among the other possible structures (Ramachandran favored 90.52%, GMQE 0.17, QMEAN −2.43), whereas, for the S region, the 2019‐nCoV homology model has been built using a SARS coronavirus spike glycoprotein (6acc.1) for the same reason (Ramachandran favored 87.87%, GMQE 0.83, QMEAN −3.14) (Figure 3).
Figure 3

Cartoon model of the structural superposition between the homology model of the 2019‐nCoV in blue and the spike glycoprotein of SARS coronavirus (PDB code 6acc.1) in orange. The red circle highlights the presence of a variable region on the 2019‐nCoV at the beginning of the protein, whereas the blue square highlights the presence of two beta‐sheets on the 2019‐nCoV (401:KYR and 440:LND) that are not present on the SARS‐CoV structure; SARS, severe acute respiratory syndrome

Cartoon model of the structural superposition between the homology model of the 2019‐nCoV in blue and the nucleocapsid protein of SARS coronavirus (PDB code 2jw8.1) in orange. The red circle highlights the presence of an alpha‐helix on the SARS‐CoV and the absence of the 2019‐nCoV structure, and e positional difference of the beta‐sheets. SARS, severe acute respiratory syndrome Cartoon model of the structural superposition between the homology model of the 2019‐nCoV in blue and the spike glycoprotein of SARS coronavirus (PDB code 6acc.1) in orange. The red circle highlights the presence of a variable region on the 2019‐nCoV at the beginning of the protein, whereas the blue square highlights the presence of two beta‐sheets on the 2019‐nCoV (401:KYR and 440:LND) that are not present on the SARS‐CoV structure; SARS, severe acute respiratory syndrome The further structural and molecular analysis of the Nucleocapsid region of the 2019‐nCoV (MN908947.3) has highlighted that 2019‐nCoV and the bat SARS‐like coronavirus (MG772934.) share the same aminoacidic sequence near the 309th position (SKQLQQ,) whereas the SARS reference genome has a different aminoacidic sequence (SRQLQN). The same results have been found in the 380th aminoacidic position (KADET for 2019‐nCoV and bat SARS‐like coronavirus and KTDEA for the SARS reference genome). In particular, in this case, the 2019‐nCoV has a polar amino acid, whereas the SARS has a nonpolar amino acid (Figure 2).
Figure 2

Cartoon model of the structural superposition between the homology model of the 2019‐nCoV in blue and the nucleocapsid protein of SARS coronavirus (PDB code 2jw8.1) in orange. The red circle highlights the presence of an alpha‐helix on the SARS‐CoV and the absence of the 2019‐nCoV structure, and e positional difference of the beta‐sheets. SARS, severe acute respiratory syndrome

Regarding the sites under positive selective pressure found on the spike glycoprotein, the results have shown that the 536th aminoacidic position in 2019‐nCoV has an asparagine residue, whereas the bat SARS‐like coronavirus has a glutamine residue. The SARS virus, instead, has an aspartic acid residue. On the 644th aminoacidic position in 2019‐nCoV sequence, there is a threonine residue, whereas the bat SARS‐like virus has a serine residue. The SARS virus, however, has an alanine residue.

DISCUSSION

The data reported above show that the new 2019‐nCoV significantly clustered with a sequence from the bat SARS‐like coronavirus isolated in 2015. Moreover, in the phylogenetic tree, these two sequences are separated from the other bat SARS‐like coronavirus sequences, suggesting that this bat SARS‐like coronavirus is homologous and genetically more similar to the 2019‐nCoV than to the other sequences of Bat SARS‐like coronavirus. This supports the hypothesis that the transmission chain began from the bat and reached the human. All other genomic sequences represented in the phylogenetic tree, also including SARS and MERS coronavirus, clustered separately, thus excluding the fact that the virus involved in the actual epidemic could belong to these subgenuses. The structural analysis of two important viral proteins, the nucleocapsid and the spike‐like nucleoprotein (protein S), confirmed the significant similarity of the new coronavirus with the bat‐like SARS coronavirus and its difference from SARS coronavirus. From the selective pressure and structural analysis, mutations of surface proteins, as the spike protein S, and of nucleocapsid N protein conferring stability to the viral particle have been shown. The viral spike protein is responsible for virus entry into the cell after by binding to a cell receptor and membrane fusion, two key steps in viral infection and pathogenesis. The N protein is a structural protein involved in virion assembly, playing a pivotal role in virus transcription and assembly efficiency. Mutation of these proteins could determine two important characteristics of the coronavirus isolated during the 2019‐nCoV epidemic: a higher ability to infect and enhanced pathogenicity than the bat‐like SARS coronavirus but lower pathogenicity than SARS coronavirus. These features can explain the 2019‐nCoV zoonotic transmission and its initial lower severity than SARS epidemic. These results do not exclude the fact that further mutation due to positive selective pressure, led by the epidemic evolution, could favor an enhancement of pathogenicity and transmission of this novel virus. Recently, Ji et al described homologous recombination within the spike glycoprotein of 2019‐nCoV favoring cross‐species transmission and suggested snake as probable virus reservoir for human infection because its resampling similarity codon usage bias is more similar to Bungarus multicinctus snake compared with other animals and humans. In a previous article, it has been proven that compositional properties, mutation pressure, natural selection, gene expression, and dinucleotides, affect the codon usage bias of Bungarus species. These data, along with analysis from our study, enforcing bat origin of infection, could clarify the transmission dynamics of the 2019‐nCoV, supporting infection control policy during the ongoing epidemic.
  17 in total

1.  jModelTest 2: more models, new heuristics and parallel computing.

Authors:  Diego Darriba; Guillermo L Taboada; Ramón Doallo; David Posada
Journal:  Nat Methods       Date:  2012-07-30       Impact factor: 28.547

2.  FUBAR: a fast, unconstrained bayesian approximation for inferring selection.

Authors:  Ben Murrell; Sasha Moola; Amandla Mabona; Thomas Weighill; Daniel Sheward; Sergei L Kosakovsky Pond; Konrad Scheffler
Journal:  Mol Biol Evol       Date:  2013-02-18       Impact factor: 16.240

3.  A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core.

Authors:  Lukas Zimmermann; Andrew Stephens; Seung-Zin Nam; David Rau; Jonas Kübler; Marko Lozajic; Felix Gabler; Johannes Söding; Andrei N Lupas; Vikram Alva
Journal:  J Mol Biol       Date:  2017-12-16       Impact factor: 5.469

4.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

5.  QMEAN server for protein model quality estimation.

Authors:  Pascal Benkert; Michael Künzli; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2009-05-08       Impact factor: 16.971

6.  A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster.

Authors:  Jasper Fuk-Woo Chan; Shuofeng Yuan; Kin-Hang Kok; Kelvin Kai-Wang To; Hin Chu; Jin Yang; Fanfan Xing; Jieling Liu; Cyril Chik-Yan Yip; Rosana Wing-Shan Poon; Hoi-Wah Tsoi; Simon Kam-Fai Lo; Kwok-Hung Chan; Vincent Kwok-Man Poon; Wan-Mui Chan; Jonathan Daniel Ip; Jian-Piao Cai; Vincent Chi-Chung Cheng; Honglin Chen; Christopher Kim-Ming Hui; Kwok-Yung Yuen
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

7.  The 2019-new coronavirus epidemic: Evidence for virus evolution.

Authors:  Domenico Benvenuto; Marta Giovanetti; Alessandra Ciccozzi; Silvia Spoto; Silvia Angeletti; Massimo Ciccozzi
Journal:  J Med Virol       Date:  2020-02-07       Impact factor: 2.327

8.  MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.

Authors:  Kazutaka Katoh; John Rozewicki; Kazunori D Yamada
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

9.  The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health - The latest 2019 novel coronavirus outbreak in Wuhan, China.

Authors:  David S Hui; Esam I Azhar; Tariq A Madani; Francine Ntoumi; Richard Kock; Osman Dar; Giuseppe Ippolito; Timothy D Mchugh; Ziad A Memish; Christian Drosten; Alimuddin Zumla; Eskild Petersen
Journal:  Int J Infect Dis       Date:  2020-01-14       Impact factor: 3.623

Review 10.  Emerging coronaviruses: Genome structure, replication, and pathogenesis.

Authors:  Yu Chen; Qianyun Liu; Deyin Guo
Journal:  J Med Virol       Date:  2020-02-07       Impact factor: 2.327

View more
  185 in total

1.  CHALLENGES IN THE FIGHT AGAINST THE COVID-19 PANDEMIC IN UNIVERSITY HOSPITALS.

Authors:  Eduardo Alexandrino Servolo Medeiros
Journal:  Rev Paul Pediatr       Date:  2020-04-22

2.  Total Knee Replacement: The Inpatient-Only List and the Two Midnight Rule, Patient Impact, Length of Stay, Compliance Solutions, Audits, and Economic Consequences.

Authors:  Richard Iorio; C Lowry Barnes; Matthew P Vitale; James I Huddleston; Derek A Haas
Journal:  J Arthroplasty       Date:  2020-01-15       Impact factor: 4.757

3.  Plasmonic Fiberoptic Absorbance Biosensor (P-FAB) for Rapid Detection of SARS-CoV-2 Nucleocapsid Protein.

Authors:  M Divagar; R Gayathri; Rahiel Rasool; J Kuzhandai Shamlee; Himanshu Bhatia; Jitendra Satija; V V R Sai
Journal:  IEEE Sens J       Date:  2021-08-24       Impact factor: 3.301

4.  Generation of restriction endonucleases barcode map to trace SARS-CoV-2 origin and evolution.

Authors:  Federico Colombo; Elisa Corsiero; Myles J Lewis; Costantino Pitzalis
Journal:  Sci Rep       Date:  2021-06-03       Impact factor: 4.379

5.  Global transmission network of SARS-CoV-2: from outbreak to pandemic.

Authors:  Pavel Skums; Alexander Kirpich; Pelin Icer Baykal; Alex Zelikovsky; Gerardo Chowell
Journal:  medRxiv       Date:  2020-03-27

Review 6.  [Replication and transmission mechanisms of highly pathogenic human coronavirus].

Authors:  Yeyan He; Chanying Zheng
Journal:  Zhejiang Da Xue Xue Bao Yi Xue Ban       Date:  2020-05-25

Review 7.  COVID-19 and central nervous system interplay: A big picture beyond clinical manifestation.

Authors:  Sutapa Som Chaudhury; Koel Sinha; Rabindranath Majumder; Atanu Biswas; Chitrangada DAS Mukhopadhyay
Journal:  J Biosci       Date:  2021       Impact factor: 1.826

Review 8.  SARS-CoV-2-Morphology, Transmission and Diagnosis during Pandemic, Review with Element of Meta-Analysis.

Authors:  Katarzyna Grudlewska-Buda; Natalia Wiktorczyk-Kapischke; Ewa Wałecka-Zacharska; Joanna Kwiecińska-Piróg; Katarzyna Buszko; Kamil Leis; Klaudia Juszczuk; Eugenia Gospodarek-Komkowska; Krzysztof Skowron
Journal:  J Clin Med       Date:  2021-05-03       Impact factor: 4.241

Review 9.  COVID-19 impact on the liver.

Authors:  Liliana Baroiu; Caterina Dumitru; Alina Iancu; Ana-Cristina Leșe; Miruna Drăgănescu; Nicușor Baroiu; Lucreția Anghel
Journal:  World J Clin Cases       Date:  2021-06-06       Impact factor: 1.337

Review 10.  SHERLOCK and DETECTR: CRISPR-Cas Systems as Potential Rapid Diagnostic Tools for Emerging Infectious Diseases.

Authors:  Mujahed I Mustafa; Abdelrafie M Makhawi
Journal:  J Clin Microbiol       Date:  2021-02-18       Impact factor: 5.948

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.