Literature DB >> 32283146

Evolutionary analysis of SARS-CoV-2: how mutation of Non-Structural Protein 6 (NSP6) could affect viral autophagy.

Domenico Benvenuto1, Silvia Angeletti2, Marta Giovanetti3, Martina Bianchi4, Stefano Pascarella4, Roberto Cauda5, Massimo Ciccozzi6, Antonio Cassone7.   

Abstract

BACKGROUND: SARS-CoV-2 is a new coronavirus that has spread globally, infecting more than 150000 people, and being declared pandemic by the WHO. We provide here bio-informatic, evolutionary analysis of 351 available sequences of its genome with the aim of mapping genome structural variations and the patterns of selection.
METHODS: A Maximum likelihood tree has been built and selective pressure has been investigated in order to find any mutation developed during the SARS-CoV-2 epidemic that could potentially affect clinical evolution of the infection. FINDING: We have found in more recent isolates the presence of two mutations affecting the Non-Structural Protein 6 (NSP6) and the Open Reding Frame10 (ORF 10) adjacent regions. Amino acidic change stability analysis suggests both mutations could confer lower stability of the protein structures.
INTERPRETATION: One of the two mutations, likely developed within the genome during virus spread, could affect virus intracellular survival. Genome follow-up of SARS-CoV-2 spread is urgently needed in order to identify mutations that could significantly modify virus pathogenicity.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Autophagy; Bio-informatic; COVID-19; Coronavirus; Molecular evolution; SARS-CoV-2

Mesh:

Substances:

Year:  2020        PMID: 32283146      PMCID: PMC7195303          DOI: 10.1016/j.jinf.2020.03.058

Source DB:  PubMed          Journal:  J Infect        ISSN: 0163-4453            Impact factor:   6.072


Introduction

SARS-CoV-2 is the agent of Covid-19, a new coronavirus infection, recently declared pandemic by the WHO, which causes severe pneumonia and acute respiratory distress syndrome (ARDS). As of March 16th, more than 150,000 cases of Covid-19 have been notified. While most cases have occurred in mainland China and other Asiatic countries, the virus has also spread to Europe, particularly Italy, where it has caused more than thousand deaths and is overstressing the national health system. The SARS-CoV-2 genome has been intensely investigated for diagnostics and pathogenicity insights into this virus, as well as to trace its evolution. Presently, more than 350 sequences of virus isolated from several countries are shared in GISAID database. Studies have highlighted the basic structure of the RNA genome, its probable source from a bat coronavirus at Wuhan food and wild animal market (with or without a still unidentified secondary animal host) and the rather close similarity in viral sequences of isolates from different patients. However, interpretation of genome-driven virus evolution has remained difficult because the published data do still refer to a relatively low number of viral isolates, most of which from China, and few ones from other countries. In particular, there is little information about the evolutionary impact of the few mutations that have been reported by various Authors. We have here examined all available SARS-CoV-2 sequences with the aim of mapping structural variations of this new coronavirus genome and the patterns of selection, if any, of viral protein genes. We describe the presence of two mutations affecting the Non-Structural Protein 6 (NSP6) and the Open Reading Frame10 (ORF 10) adjacent aminoacidic regions of SARS-CoV-2 and discuss their potential relevance for virus-host interaction, particularly virus-induced cellular autophagy.

Material and Methods

All the 351 sequences available of COVID-19 isolated from humans have been downloaded from GISAID (https://www.gisaid.org/) databank. A dataset has been built including sequences from human and excluding sequences from animals (like bat or pangolin). The Dataset has been aligned using multiple sequence alignment (MAFFT) online tool and manually edited using Bioedit program v7.0.5. The complete dataset was assessed for presence of phylogenetic signal by applying the likelihood mapping analysis implemented in the IQ-TREE 1.6.8 software (http://www.iqtree.org). A maximum likelihood (ML) phylogeny was reconstructed using IQ-TREE 1.6.8 software under the HKY nucleotide substitution model with four gamma categories (HKY+G4), which was inferred in jModelTest (https://github.com/ddarriba/jmodeltest2) as the best fitting model. Adaptive Evolution Server (http://www.datamonkey.org/) was used to find possible sites of positive or negative selection. To this purpose the following tests has been used: Fixed Effects Likelihood (FEL), Mixed Effects Model of Evolution (MEME) and Bayesian Graphical Models for co-evolving sites (BGM). These tests allowed to infer the site-specific pervasive selection, the episodic diversifying selection across the region of interest, to identify episodic selection at individual sites and to verify the presence of some co-evolving sites. Statistically significant positive or negative selection was based on p value < 0.05 . Protein homology modelling has been attempted using the websites SwissModel and HHPred. I-Tasser has also been used as an alternative source of SARS-CoV-2 protein structure models. I-Mutant2.0 online server has been used to predict the effect of the mutations found under selective pressure on protein stability. Secondary structure and trans-membrane predictions have been carried out with Jpred, TMHMM and Protter services. Three-dimensional structures have been analyzed and displayed using PyMOL.

Role of the funding source

No specific funding source has been received

Results

A Maximum Likelihood tree using HKY+G4 model has been built and results have been compared with epidemiological information. Sequences from several different countries have been found in the same clusters while sequences from the same countries have not been found in the same cluster. No separated clade is evident, but all the sequences are part of the same clade. The mutation on the amino acid position 3691 does not appear to be associated within the same cluster with sequences with a leucine on the residue position 9659. Moreover 3 sequences have been found to have a mutation on both the 3691 and the 9659 amino acidic positions. Sequences with a histidine on the position 9659 have been found to belong to distinct clusters. At any rate, clustering of sequence presenting amino acidic mutations did not indicate geographical/epidemiological link with the patients from whom SARS-CoV-2 was isolated (Supplementary Fig. 1). No reliable homology model could be built using SwissModel and HHpred servers. For this reason, the three-dimensional model of NSP6 has been downloaded from I-Tasser website. (https://zhanglab.ccmb.med.umich.edu/C-I-TASSER/2019-nCov/QHD43415_6.pdb.gz). The structural analysis performed using TMHMM and Protter servers have shown that NSP6 protein has 7 putative trans-membrane helices like in other coronaviruses. The MEME analysis has shown evidence for episodic synonymous mutations mostly concerning the 3rd codon and not impacting on the overall proteomic asset of the virus. Regarding the FEL analysis, the presence of potential sites under positive selective pressure have been found on 2 sites, on the amino acidic positions 3691 and 9659. These mutations fall on NSP6 and on a region near the Open Reading Frame 10 (ORF 10), respectively. The amino acidic change stability (ACS) analysis has shown that both mutations lead to a lower stability of the protein structures. Namely, at amino acid position 3691 (corresponding to NSP6 position 37), most of the SARS-CoV-2 sequences have a leucine residue while some more recent sequences from Asia, America, Oceania and Europe isolates show phenylalanine (Table 1 ). Both amino acids are non-polar, but phenylalanine has a benzoic ring in the side chain which may stiffen the secondary structure by means of aromatic-aromatic, hydrophobic or stacking interactions. The ACS analysis has shown that this mutation lead to a lower stability of the protein structure (Fig. 1). The mutant position is predicted to be at the C-terminal side of the first transmembrane helix corresponding to the first outer membrane site, close to a sequence region rich of phenylalanine residues (from NSP6 residue position 32 to 40: SLFFFFYENA) of SARS-CoV-2 (Fig. 1 b). According to the structural model, the mutant position is part of a constellation of aromatic residues which includes, in addition to the sequentially contiguous residues, Trp31, Phe42 and Phe45 (Fig. 2). Jpred attributes a helical conformation also to the cytosolic portion of the segment connecting the first to the second transmembrane helix which may facilitate hydrophobic interactions among these aromatic residues.
Table 1

Table reporting GISAID accession number and country of isolation of the sequences with a mutation on the 3691 aminoacidic position.

GISAID Accession NumberCountry of isolation
408480Yunnan
408481Chongqing
407988Singapore
406223USA - Arizona
410984France
412974Italy
413016Brazil
411218France
413490New Zeland
412975Australia
408430France
410546Italy
413214Australia
413213Australia
413597Australia
413598Australia
413597Australia
413600Australia
410546Italy
413595Australia
412030Hong Kong
412968Japan
412969Japan
413459Japan
408482Shandong
412981Hubei
413019Switzerland
413025USA - Washington
413603Finland
413605Finland
413588Netherlands
413589Netherlands
413585Netherlands
Fig. 1

I-TASSER model of NSP6. Residue under positive selective pressure with a p< 0.05 is shown as a sphere. Residues found in the structure proximity are shown in sticks. All residues are marked by the corresponding labels.

Fig. 2

Results obtained with Protter and TMHMM are shown in panel A and B, respectively.

In the panel A, the residue under positive pressure with p< 0.05 is marked by the red arrow (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.).

Table reporting GISAID accession number and country of isolation of the sequences with a mutation on the 3691 aminoacidic position. I-TASSER model of NSP6. Residue under positive selective pressure with a p< 0.05 is shown as a sphere. Residues found in the structure proximity are shown in sticks. All residues are marked by the corresponding labels. Results obtained with Protter and TMHMM are shown in panel A and B, respectively. In the panel A, the residue under positive pressure with p< 0.05 is marked by the red arrow (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.). At the amino acidic position 9659 (corresponding to ORF 10 position 3 o 4), most of the SARS-CoV-2 sequences have an arginine residue while some sequences from Australia and America isolates have a histidine residue. The BGM analysis has highlighted the presence of co-evolution between the amino acidic position 9375 and the position 9659 (Table 2 ). Both amino acids are polar, but histidine has an imidazole side chain that suggests a more rigid secondary structure. In fact, ACS analysis has shown that this mutation leads to a lower stability of the protein structure. On the same position, other sequences of SARS-CoV-2 isolates from Australia and New Zealand have shown the presence of a non-polar (leucine) aminoacidic residue (Table 3 ). Also, in this case, the mutation leads to a lower stability of the protein structure, as indicated by ACS analysis.
Table 2

Table reporting GISAID accession number and country of isolation of the sequences with a histidine on the 9659 aminoacidic position.

GISAID Accession NumberCountry of isolation
412965Canada
413490New Zeland
412975Australia
413214Australia
413213Australia
Table 3

Table reporting GISAID accession number and country of isolation of the sequences with a Leucine on the 9659 aminoacidic position.

GISAID Accession NumberCountry of isolation
411954USA - California
410717Australia
410718Australia
407896Australia
407894Australia
Table reporting GISAID accession number and country of isolation of the sequences with a histidine on the 9659 aminoacidic position. Table reporting GISAID accession number and country of isolation of the sequences with a Leucine on the 9659 aminoacidic position.

Discussion

In this paper, we have examined all available genome sequences (352) of the recently emerged, new coronavirus SARS-CoV-2 which causes a dreadful pneumonia pandemic termed Covid-19 (19). This virus has infected so far more than 120,000 subjects worldwide, with several thousand casualties. Almost all countries have been affected and some of them are now experiencing a rampant rise of disease cases with severe consequences on the stability of health systems. Since disease probable emergence in a wet market of Wuhan, city in the Hubei region of China and recognition of its causative agent, a number of studies on SARS-CoV-2 genome have been published and showed its close similarities (and differences) with the genomes of other coronaviruses isolated from bat, snake, pangolin and SARS CoV.20, 21, 22 Now the attention of most investigators is focused on the potential capability of the viral genome to evolve through mutation, recombination and gene gain and losses, as verified in other human coronaviruses. Despite contrary expectations, the selective pressure analysis reported here points out that the genome of SARS-CoV-2 has so far undergone very few mutations, which mostly affect the 3rd codon, and are synonymous, meaning are not going to influence the general molecular structure of this new virus. In addition, it remains difficult to prove the biological relevance of these mutations by pure bioinformatic approach, in the absence of experimental correlates. To somewhat overcome these difficulties, we have here joined bioinformatic and phylogenetic with structural analysis of SARS-CoV-2 protein encoded by mutated genes, in an attempt to obtain some insights into the biological significance and plausibility of the noted mutations. We posit that some of these mutations can provide the virus with useful adaptations in its fight to persist and multiply within humans. We have particularly assessed two SARS-CoV-2 mutations of non-structural viral proteins, NSP6 and an aminoacidic region near ORF 10, with particular interest into the former protein. NSP6, a common component of both α and β-coronaviruses, locates to the endoplasmic reticulum (ER) and generates autophagosomes. We notice that the presence of multiple phenylalanine residues in the outer membrane region of NSP6 should favor the affinity between this region and the ER membrane inducing a more stable binding of the protein to ER. It has been shown that this binding may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation. Thus, its role would be to limit autophagosome expansion, directly or indirectly by starvation or chemical inhibition of mTOR signaling. Nonetheless, the role of autophagy in viral infection is a double-edge sword and we don't have direct evidence that NSP6 mutation does in fact favor viral replication and evasion from cellular immunity or the opposite. In this context, it should be noted that mutational protein analysis speaks for a lower stability of NSP6 upon changing phenylalanine from leucine, but it should be considered that ACS analysis doesn't consider trans-membrane position and other protein interactions. Regarding the aminoacidic region near the ORF 10, previous studies performed on the SARS-CoV reported a 29 nucleotides deletion segment disrupting ORF 9 and, simultaneously, eliminating ORFs 10 and 11. The clinical significance of this deletion is unclear also because it has been found to co-exist with the non-deleted variant in the same host and same clinical specimen (25). A comparison of data from evolutionary and phylogenetic analysis leads us to hypothesize that the mutations are probably unrelated to a strain or a sub-family of the COVID-19 but are due to independent converging evolution of the virus that promote these changes in the viral genome. In conclusion, the analysis of a relatively wide database of SARS-CoV-2 genomes of worldwide isolates representative of Covid-19, from the start of epidemic in China up to the recent virus spread to European countries, has revealed only two synonymous mutations. Nonetheless, we here speculate that one of these two mutations, i.e the NSP6, could bring to some appreciable change in the expression of SARS-CoV-2 relationship with its host, particularly concerning a critical host anti-viral defense, such as the autophagic lysosomal machinery. Changes in these viral regions should be constantly monitored as they could significantly modify SARS-CoV-2 pathogenicity.

Funding

No specific funding source has been received

Contributors

DB and AC designed the study. DB, MB and MG did the experiments. DB, MB, SP and MC analysed data and DB, AC, SA and RC wrote the article.

Data sharing

Data are available on different websites

Declaration of Competing Interest

We declare no competing interests.
  11 in total

1.  Evaluation of methods for the prediction of membrane spanning regions.

Authors:  S Möller; M D Croning; R Apweiler
Journal:  Bioinformatics       Date:  2001-07       Impact factor: 6.937

2.  Protter: interactive protein feature visualization and integration with experimental proteomic data.

Authors:  Ulrich Omasits; Christian H Ahrens; Sebastian Müller; Bernd Wollscheid
Journal:  Bioinformatics       Date:  2013-10-24       Impact factor: 6.937

3.  Topology and membrane anchoring of the coronavirus replication complex: not all hydrophobic domains of nsp3 and nsp6 are membrane spanning.

Authors:  Monique Oostra; Marne C Hagemeijer; Michiel van Gent; Cornelis P J Bekker; Eddie G te Lintelo; Peter J M Rottier; Cornelis A M de Haan
Journal:  J Virol       Date:  2008-10-08       Impact factor: 5.103

4.  Spidermonkey: rapid detection of co-evolving sites using Bayesian graphical models.

Authors:  Art F Y Poon; Fraser I Lewis; Simon D W Frost; Sergei L Kosakovsky Pond
Journal:  Bioinformatics       Date:  2008-06-18       Impact factor: 6.937

5.  The I-TASSER Suite: protein structure and function prediction.

Authors:  Jianyi Yang; Renxiang Yan; Ambrish Roy; Dong Xu; Jonathan Poisson; Yang Zhang
Journal:  Nat Methods       Date:  2015-01       Impact factor: 28.547

6.  A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core.

Authors:  Lukas Zimmermann; Andrew Stephens; Seung-Zin Nam; David Rau; Jonas Kübler; Marko Lozajic; Felix Gabler; Johannes Söding; Andrei N Lupas; Vikram Alva
Journal:  J Mol Biol       Date:  2017-12-16       Impact factor: 5.469

7.  Autophagy postpones apoptotic cell death in PRRSV infection through Bad-Beclin1 interaction.

Authors:  Ao Zhou; Shuaifeng Li; Faheem Ahmed Khan; Shujun Zhang
Journal:  Virulence       Date:  2015-12-15       Impact factor: 5.882

8.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure.

Authors:  Emidio Capriotti; Piero Fariselli; Rita Casadio
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

9.  IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors:  Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

Review 10.  Molecular Evolution of Human Coronavirus Genomes.

Authors:  Diego Forni; Rachele Cagliani; Mario Clerici; Manuela Sironi
Journal:  Trends Microbiol       Date:  2016-10-19       Impact factor: 17.079

View more
  89 in total

Review 1.  Decoding Asymptomatic COVID-19 Infection and Transmission.

Authors:  Rui Wang; Jiahui Chen; Yuta Hozumi; Changchuan Yin; Guo-Wei Wei
Journal:  J Phys Chem Lett       Date:  2020-11-12       Impact factor: 6.475

2.  Global analysis of more than 50,000 SARS-CoV-2 genomes reveals epistasis between eight viral genes.

Authors:  Hong-Li Zeng; Vito Dichio; Edwin Rodríguez Horta; Kaisa Thorell; Erik Aurell
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-17       Impact factor: 11.205

3.  Could dantrolene be explored as a repurposed drug to treat COVID-19 patients by restoring intracellular calcium homeostasis?

Authors:  B Jiang; S Liang; G Liang; H Wei
Journal:  Eur Rev Med Pharmacol Sci       Date:  2020-10       Impact factor: 3.507

4.  Genomic Variations in SARS-CoV-2 Genomes From Gujarat: Underlying Role of Variants in Disease Epidemiology.

Authors:  Madhvi Joshi; Apurvasinh Puvar; Dinesh Kumar; Afzal Ansari; Maharshi Pandya; Janvi Raval; Zarna Patel; Pinal Trivedi; Monika Gandhi; Labdhi Pandya; Komal Patel; Nitin Savaliya; Snehal Bagatharia; Sachin Kumar; Chaitanya Joshi
Journal:  Front Genet       Date:  2021-03-19       Impact factor: 4.599

Review 5.  SARS-CoV-2 mutations: the biological trackway towards viral fitness.

Authors:  Parinita Majumdar; Sougata Niyogi
Journal:  Epidemiol Infect       Date:  2021-04-30       Impact factor: 2.451

6.  Q493K and Q498H substitutions in Spike promote adaptation of SARS-CoV-2 in mice.

Authors:  Kun Huang; Yufei Zhang; Xianfeng Hui; Ya Zhao; Wenxiao Gong; Ting Wang; Shaoran Zhang; Yong Yang; Fei Deng; Qiang Zhang; Xi Chen; Ying Yang; Xiaomei Sun; Huanchun Chen; Yizhi J Tao; Zhong Zou; Meilin Jin
Journal:  EBioMedicine       Date:  2021-05-13       Impact factor: 8.143

Review 7.  MicroRNAs and SARS-CoV-2 life cycle, pathogenesis, and mutations: biomarkers or therapeutic agents?

Authors:  Farshad Abedi; Ramin Rezaee; A Wallace Hayes; Somayyeh Nasiripour; Gholamreza Karimi
Journal:  Cell Cycle       Date:  2020-12-31       Impact factor: 4.534

8.  E484K as an innovative phylogenetic event for viral evolution: Genomic analysis of the E484K spike mutation in SARS-CoV-2 lineages from Brazil.

Authors:  Patrícia Aline Gröhs Ferrareze; Vinícius Bonetti Franceschi; Amanda de Menezes Mayer; Gabriel Dickin Caldana; Ricardo Ariel Zimerman; Claudia Elizabeth Thompson
Journal:  Infect Genet Evol       Date:  2021-05-25       Impact factor: 4.393

9.  Screening of FDA-approved compound library identifies potential small-molecule inhibitors of SARS-CoV-2 non-structural proteins NSP1, NSP4, NSP6 and NSP13: molecular modeling and molecular dynamics studies.

Authors:  Shobana Sundar; Lokesh Thangamani; Shanmughavel Piramanayagam; Chandrasekar Narayanan Rahul; Natarajan Aiswarya; Kanagaraj Sekar; Jeyakumar Natarajan
Journal:  J Proteins Proteom       Date:  2021-06-09

Review 10.  Evolution, Ecology, and Zoonotic Transmission of Betacoronaviruses: A Review.

Authors:  Herbert F Jelinek; Mira Mousa; Eman Alefishat; Wael Osman; Ian Spence; Dengpan Bu; Samuel F Feng; Jason Byrd; Paola A Magni; Shafi Sahibzada; Guan K Tay; Habiba S Alsafar
Journal:  Front Vet Sci       Date:  2021-05-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.