Literature DB >> 32492183

Evidence for mutations in SARS-CoV-2 Italian isolates potentially affecting virus transmission.

Domenico Benvenuto1, Ayse Banu Demir2, Marta Giovanetti3, Martina Bianchi4, Silvia Angeletti5, Stefano Pascarella4, Roberto Cauda6,7, Massimo Ciccozzi1, Antonio Cassone8.   

Abstract

Italy is the first western country suffering heavy severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission and disease impact after coronavirus disease-2019 pandemia started in China. Even though the presence of mutations on spike glycoprotein and nucleocapsid in Italian isolates has been reported, the potential impact of these mutations on viral transmission has not been evaluated. We have compared SARS-CoV-2 genome sequences from Italian patients with virus sequences from Chinese patients. We focussed upon three nonsynonymous mutations of genes coding for S(one) and N (two) viral proteins present in Italian isolates and absent in Chinese ones, using various bioinformatics tools. Amino acid analysis and changes in three-dimensional protein structure suggests the mutations reduce protein stability and, particularly for S1 mutation, the enhanced torsional ability of the molecule could favor virus binding to cell receptor(s). This theoretical interpretation awaits experimental and clinical confirmation.
© 2020 Wiley Periodicals LLC.

Entities:  

Keywords:  COVID-19; SARS coronavirus; bioinformatics; molecular evolution; mutation

Mesh:

Substances:

Year:  2020        PMID: 32492183      PMCID: PMC7300971          DOI: 10.1002/jmv.26104

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   20.693


INTRODUCTION

Coronavirus disease‐2019 (COVID‐19), which is caused by a novel coronavirus termed severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2), is a respiratory disease that officially started in the Chinese city of Wuhan Hubei Province during December 2019 and has since spread globally as a pandemic. As of 13 May 2020, the total COVID‐19 world cases have surpassed four million with more than 250 000 deaths. Italy was swept by the virus soon after China and has long ranked second in the dreadful list of most affected countries with high rate of contagiousness and SARS‐CoV‐2‐ attributable mortality rate (https://www.worldometers.info/coronavirus/) SARS‐CoV‐2 is a beta‐coronavirus similar to those causing severe acute respiratory syndrome (SARS‐CoV) and Middle East respiratory syndrome (MERS‐CoV). Most of the protein structures of coronaviruses, including SARS‐CoV, MERS‐CoV, HKU, MHV, and NL63, share high similarity except for their receptor binding domain (RBD). SARS‐CoV‐2 genome was found to be about 80% similar to SARS‐CoV. The most variable residues among bat‐CoV, SARS‐CoV, and SARS‐CoV‐2 have been found to reside within the S1 subunit of the S protein that exposes RBD. A flexible RBD was observed on S glycoproteins of MERS‐CoV and SARS‐CoV and is thought to be important for pathogenesis. Searching for mutations while the virus continues to spread within the country can offer opportunities for a better understanding of virus evolution, biopathology, and transmission. The earliest nucleotide mutations of three Italian isolates of SARS‐CoV‐2 isolates were reported on 20 March 2020. Recently, Lorusso et al reported on mutations in SARS‐2‐CoV isolates in the region of Abruzzo, Central Italy. In this study, we have examined all available SARS‐2‐CoV genome sequences of isolate from different Italian regions and compared them with the first original sequences isolated in Wuhan. Here, we report on identification, biochemical, and biophysical properties of three mutations one in the S1 spike component and two in the nucleocapsid proteins, shared by all Italian isolates and absent in the Chinese ones. The S1 mutation D614G is here particularly discussed in light of a potential influence on SARS‐2‐CoV transmission.

MATERIALS AND METHODS

All 79 COVID‐19 whole genome sequences isolated from Italian patients from 29 January 2020 to 27 April 2020 (53, 11, 6, 6, 2, and 1 isolates from regions Abruzzo, Lazio, Friuli Venezia Giulia, Lombardy, Veneto, and Marche, respectively) (Table 1) and 48 COVID‐19 sequences from 24 December 2019 to 17 January 2020 in China have been downloaded from GISAID (https://www.gisaid.org/) database. For Chinese isolates, the above time interval has been selected to rule out sequences from virus isolated from possible return infections, so allowing comparison of Italian sequences with the truly original Chinese ones. The data set has been aligned using multiple sequence alignment (MAFFT) online tool and manually edited using Bioedit program v7.0.5. Sequence alignments and analyses were obtained through the Jalview editor and structural models have been built relying on the website I‐Tasser and HHPred. CUPSAT and Dynamut online server has been used to estimate the stability of potential mutations found using the selective pressure analysis. three‐dimensional structures have been analyzed and displayed using PyMOL.
Table 1

Table reporting GISAID accession number and region of isolation of the sequences with a mutation on the spike glycoprotein

Note: The sequences showing a mutation in the nucleocapsid region have been highlighted in light gray.

Table reporting GISAID accession number and region of isolation of the sequences with a mutation on the spike glycoprotein Note: The sequences showing a mutation in the nucleocapsid region have been highlighted in light gray.

RESULTS

The analysis of the alignment has revealed the presence of two mutations on spike and nucleocapsid (N) proteins. Regarding the spike glycoprotein, the transition from an adenosine to guanine occurring on the 1901st nucleotide has led to nonsynonymous mutation from an aspartate to a glycine residue in the 614 amino acidic position (found in all the Italian isolates). This mutation, which was first reported in italian isolates by Zehender et al, is characteristic of all the Italian SARS‐CoV‐2 sequences besides those of the numerous viral isolates in Abruzzo. Aspartate is a chiral and polar amino acid which frequently occurs at the N‐termini of alpha helices while glycine is nonpolar and the only achiral amino acid. This mutation is located on the SD2 region of the RBD‐containing S1 subunit. Using the crystallographic three‐dimensional structure of the S protein, the implications of this mutation has been analyzed using CUPSAT and Dynamut servers. The results of these analysis have shown that the mutation from aspartate to glycine reduces the stability of the protein (ΔΔG [kcal/mol] −1.51) favouring the torsion potential. The Δ vibrational entropy energy between wild‐type and mutant spike protein has been calculated to be: ΔΔSVib ENCoM: 0.065 kcal mol−1 K−1 (Figure 1).
Figure 1

A, A model of spike glycoprotein monomer displaying the amino acids colored according to the vibrational entropy change upon mutation, red regions are those gaining in flexibility, The amino acidic mutation is blue circled; (B) the top image shows the molecular interaction between the side chain of the wild‐type amino acid and the side chains of the surrounding amino acid; the bottom image shows the molecular interaction between the side chain of the mutated amino acid and the side chains of the surrounding amino acid

A, A model of spike glycoprotein monomer displaying the amino acids colored according to the vibrational entropy change upon mutation, red regions are those gaining in flexibility, The amino acidic mutation is blue circled; (B) the top image shows the molecular interaction between the side chain of the wild‐type amino acid and the side chains of the surrounding amino acid; the bottom image shows the molecular interaction between the side chain of the mutated amino acid and the side chains of the surrounding amino acid Regarding the nucleocapsid protein, the two contiguous mutations (found in 56% of Italian isolates), from AGG to AAA occurring on the 649 to 651 nucleotides and GGA to CGA occurring on the 652 to 655 nucleotides both lead to nonsynonymous mutation from an arginine to a lysine and from glycine to arginine residue in the 203 and 204 amino acidic position, respectively. This latter mutation is characteristic of the SARS‐CoV‐2 sequences isolated in Abruzzo. Arginine is a chiral and polar amino acid which frequently occurs at the N‐termini of alpha helices while lysine is a chiral and polar amino acid (for glycine, see above). Using the three‐dimensional structure available on I‐tasser server, the implication of these mutations has been analyzed by CUPSAT server. The results point out both mutations reduce protein stability while favouring torsion (ΔΔG [kcal/mol] −1.92 and −2.94 for arginine to lysine and glycine to arginine, respectively). The homology modeling analysis performed using HHpred server has shown structural similarity of the subdomains 22 to 184 and 261 to 378 amino acidic regions of the SARS‐CoV‐2 nucleocapsid with the RNA binding domain of nucleocapsid protein of MERS‐CoV and the nucleocapsid oligomerization domain of SARS‐CoV, respectively. For the sequence 184 to 261 amino acids no statistically significant homologous model has been found (Figure 2).
Figure 2

Cartoon model of the nucleocapsid of the SARS‐CoV‐2 where the mutated amino acids have been shown in purple

Cartoon model of the nucleocapsid of the SARS‐CoV‐2 where the mutated amino acids have been shown in purple

DISCUSSION

Italy is the European country that has been first and heavily hit by SARS‐CoV‐2 epidemic started from China. In fact, recent evidence strongly suggests that the virus was circulating unrecognized in Lombardy since at least mid‐January, while the first official COVID‐19 Italian case was notified in Codogno (Lombardy) only the 21st of February. This happened when other European countries and United States were not yet, or minimally, affected. For these reasons, the Italian isolates of SARS‐CoV‐2 represent an interesting and useful case for investigating early virus mutations in a comparison with Chinese isolates. In this paper, we have examined all available genome sequences of the virus isolates from different Italian regions, in comparison with 48 Chinese sequences, in an attempt to unveil changes, if any, prospecting their potential biological relevance for virus infection and transmission. While confirming previous ones, , , we have now focussed on biostructural features on three nonsynonymous mutations, one on the 614 amino acidic position of S1 region within the spike S glycoprotein (from an aspartate to a glycine) and two consecutive mutations on the N protein (an arginine to a lysine and glycine to arginine residue in the 203 and 204 amino acidic positions, respectively. All three mutations, none of which were present in the Chinese isolates examined, affect biologically relevant, structural components of the virus and appear to confer enhanced torsional ability to the encoded proteins. S1 spike is a major SARS‐CoV‐2 protein allowing, through a defined binding domain (RBD), virus entry into human cells expressing angiotensin‐converting enzyme 2 (ACE‐2) receptor. The RBD engages ACE‐2 receptor through a conformational movement that exposes the relevant binding moieties in a “up” position. For this and the additional reason that many efforts to generate a safe and efficacious COVID‐19 vaccine focus upon the spike component, changes in its composition and structure are particularly relevant. The S1 mutation in the genome of Italian isolates, first reported by Zehender et al is in a position (AA 614; SD2 region) close to the S1 junction with the S2 component of the spike protein. The mutation appears to reduce protein stability through enhancement of torsional flexibility that, in theory, could favor energetically acquiring the” up” conformation of the spike glycoprotein, thus enhancing receptor binding capacity. Regarding the mutations affecting N protein (known as LKR in SARS‐CoV) no structural information is available, possibly due to its high positive charge and flexible nature. However, previous studies suggested some evidence in support of the functional importance of this intrinsically disordered region, in the modulation of a number of virus properties, including cell signaling. Recently, it has been shown that SARS‐CoV‐2 enters cells through endocytosis after binding of S protein to ACE‐2 receptor and phosphatidylinositol 3‐phosphate 5‐kinase (PIKfyve), two pore channel subtype 2 (TPC2), and cathepsin L are critical for this entry. Nucleocapsid protein was also proposed to be important in COVID‐19 infectivity. In conclusion, we have here focussed on mutations present on critical structural components of Italian SARS‐CoV‐2 isolates, that appear to be absent in early Chinese isolates of the virus. These mutations, in particular the one present in the S1 protein, are here examined for their potential to affect virus evolution and biopathological impact on the epidemic. However, our interpretation remains purely theoretical and should be taken cautiously in the absence of experimental and clinical investigations addressing the infectious and transmission capacity of SARS‐CoV‐2 bearing the above mutations. Definitely, more work is required in this area.

CONFLICT OF INTERESTS

The authors declare that there are no conflict of interests.

AUTHOR CONTRIBUTIONS

DB, MC, and AC conceived and designed the study. DB, MB, and ABD collected data and prepared the data sets. DB and SP participated to bioinformatic analyses. DB, MG, RC, and AC wrote the first draft of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version. Supporting information Click here for additional data file.
  19 in total

1.  A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core.

Authors:  Lukas Zimmermann; Andrew Stephens; Seung-Zin Nam; David Rau; Jonas Kübler; Marko Lozajic; Felix Gabler; Johannes Söding; Andrei N Lupas; Vikram Alva
Journal:  J Mol Biol       Date:  2017-12-16       Impact factor: 5.469

Review 2.  Intrinsically unstructured proteins and their functions.

Authors:  H Jane Dyson; Peter E Wright
Journal:  Nat Rev Mol Cell Biol       Date:  2005-03       Impact factor: 94.444

3.  Jalview Version 2--a multiple sequence alignment editor and analysis workbench.

Authors:  Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton
Journal:  Bioinformatics       Date:  2009-01-16       Impact factor: 6.937

4.  CUPSAT: prediction of protein stability upon point mutations.

Authors:  Vijaya Parthiban; M Michael Gromiha; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

5.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation.

Authors:  Daniel Wrapp; Nianshuang Wang; Kizzmekia S Corbett; Jory A Goldsmith; Ching-Lin Hsieh; Olubukola Abiona; Barney S Graham; Jason S McLellan
Journal:  Science       Date:  2020-02-19       Impact factor: 47.728

6.  A "One-Health" approach for diagnosis and molecular characterization of SARS-CoV-2 in Italy.

Authors:  Alessio Lorusso; Paolo Calistri; Maria Teresa Mercante; Federica Monaco; Ottavio Portanti; Maurilia Marcacci; Cesare Cammà; Antonio Rinaldi; Iolanda Mangone; Adriano Di Pasquale; Marino Iommarini; Maria Mattucci; Paolo Fazii; Pierluigi Tarquini; Rinalda Mariani; Alessandro Grimaldi; Daniela Morelli; Giacomo Migliorati; Giovanni Savini; Silvio Borrello; Nicola D'Alterio
Journal:  One Health       Date:  2020-04-19

7.  Evidence for mutations in SARS-CoV-2 Italian isolates potentially affecting virus transmission.

Authors:  Domenico Benvenuto; Ayse Banu Demir; Marta Giovanetti; Martina Bianchi; Silvia Angeletti; Stefano Pascarella; Roberto Cauda; Massimo Ciccozzi; Antonio Cassone
Journal:  J Med Virol       Date:  2020-06-19       Impact factor: 20.693

8.  DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability.

Authors:  Carlos Hm Rodrigues; Douglas Ev Pires; David B Ascher
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

9.  Rigidity of the Outer Shell Predicted by a Protein Intrinsic Disorder Model Sheds Light on the COVID-19 (Wuhan-2019-nCoV) Infectivity.

Authors:  Gerard Kian-Meng Goh; A Keith Dunker; James A Foster; Vladimir N Uversky
Journal:  Biomolecules       Date:  2020-02-19

10.  Potential short-term outcome of an uncontrolled COVID-19 epidemic in Lombardy, Italy, February to March 2020.

Authors:  Giorgio Guzzetta; Piero Poletti; Marco Ajelli; Filippo Trentini; Valentina Marziano; Danilo Cereda; Marcello Tirani; Giulio Diurno; Annalisa Bodina; Antonio Barone; Lucia Crottogini; Maria Gramegna; Alessia Melegaro; Stefano Merler
Journal:  Euro Surveill       Date:  2020-03
View more
  11 in total

1.  Possible effects of air temperature on COVID-19 disease severity and transmission rates.

Authors:  Dominique Kang; Clifford Ellgen; Erik Kulstad
Journal:  J Med Virol       Date:  2021-05-03       Impact factor: 20.693

2.  Analysis of Three Mutations in Italian Strains of SARS-CoV-2: Implications for Pathogenesis.

Authors:  Domenico Benvenuto; Francesca Benedetti; Ayse Banu Demir; Massimo Ciccozzi; Davide Zella
Journal:  Chemotherapy       Date:  2021-03-18       Impact factor: 2.544

3.  Comparison of the Diagnostic Value of Immunochromatography Kits in Corona Virus Disease 2019 Patients: A Prospective Pilot Study.

Authors:  Toshiya Mitsunaga; Yutaka Seki; Masakata Yoshioka; Ippei Suzuki; Kumi Akita; Syunsuke Mashiko; Masahiko Uzura; Satoshi Takeda; Akihiro Sekine; Kunihiro Mashiko
Journal:  JMA J       Date:  2021-01-14

4.  Bioinformatic analysis of the whole genome sequences of SARS-CoV-2 from Indonesia.

Authors:  Maria Ulfah; Is Helianti
Journal:  Iran J Microbiol       Date:  2021-04

5.  Real-time quantification of the transmission advantage associated with a single mutation in pathogen genomes: a case study on the D614G substitution of SARS-CoV-2.

Authors:  Shi Zhao; Jingzhi Lou; Lirong Cao; Hong Zheng; Marc K C Chong; Zigui Chen; Renee W Y Chan; Benny C Y Zee; Paul K S Chan; Maggie H Wang
Journal:  BMC Infect Dis       Date:  2021-10-07       Impact factor: 3.090

6.  Indian Ethnomedicinal Phytochemicals as Promising Inhibitors of RNA-Binding Domain of SARS-CoV-2 Nucleocapsid Phosphoprotein: An In Silico Study.

Authors:  Sankar Muthumanickam; Arumugam Kamaladevi; Pandi Boomi; Shanmugaraj Gowrishankar; Shunmugiah Karutha Pandian
Journal:  Front Mol Biosci       Date:  2021-07-02

7.  Analysis of SARS-CoV-2 nucleocapsid phosphoprotein N variations in the binding site to human 14-3-3 proteins.

Authors:  Samanta Del Veliz; Lautaro Rivera; Diego M Bustos; Marina Uhart
Journal:  Biochem Biophys Res Commun       Date:  2021-07-02       Impact factor: 3.575

8.  Evidence for mutations in SARS-CoV-2 Italian isolates potentially affecting virus transmission.

Authors:  Domenico Benvenuto; Ayse Banu Demir; Marta Giovanetti; Martina Bianchi; Silvia Angeletti; Stefano Pascarella; Roberto Cauda; Massimo Ciccozzi; Antonio Cassone
Journal:  J Med Virol       Date:  2020-06-19       Impact factor: 20.693

Review 9.  The variants question: What is the problem?

Authors:  Davide Zella; Marta Giovanetti; Francesca Benedetti; Francesco Unali; Silvia Spoto; Michele Guarino; Silvia Angeletti; Massimo Ciccozzi
Journal:  J Med Virol       Date:  2021-07-28       Impact factor: 20.693

Review 10.  An Overview of the Crystallized Structures of the SARS-CoV-2.

Authors:  Mihaela Ileana Ionescu
Journal:  Protein J       Date:  2020-10-24       Impact factor: 4.000

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.