Literature DB >> 32891874

Improved binding of SARS-CoV-2 Envelope protein to tight junction-associated PALS1 could play a key role in COVID-19 pathogenesis.

Flavio De Maio1, Ettore Lo Cascio2, Gabriele Babini3, Michela Sali1, Stefano Della Longa4, Bruno Tilocca5, Paola Roncada5, Alessandro Arcovito6, Maurizio Sanguinetti1, Giovanni Scambia7, Andrea Urbani8.   

Abstract

The Envelope (E) protein of n class="Species">SARS-CoV-2 is the most enigmatic protein among the four structural ones. Most of its current knowledge is based on the direct comparison to the SARS E protein, initially mistakenly undervalued and subsequently proved to be a key factor in the ER-Golgi localization and in tight junction disruption. We compared the genomic sequences of E protein of SARS-CoV-2, SARS-CoV and the closely related genomes of bats and pangolins obtained from the GISAID and GenBank databases. When compared to the known SARS E protein, we observed a significant difference in amino acid sequence in the C-terminal end of SARS-CoV-2 E protein. Subsequently, in silico modelling analyses of E proteins conformation and docking provide evidences of a strengthened binding of SARS-CoV-2 E protein with the tight junction-associated PALS1 protein. Based on our computational evidences and on data related to SARS-CoV, we believe that SARS-CoV-2 E protein interferes more stably with PALS1 leading to an enhanced epithelial barrier disruption, amplifying the inflammatory processes, and promoting tissue remodelling. These findings raise a warning on the underestimated role of the E protein in the pathogenic mechanism and open the route to detailed experimental investigations.
Copyright © 2020 The Authors. Published by Elsevier Masson SAS.. All rights reserved.

Entities:  

Keywords:  COVID-19; Envelope protein; PALS1; SARS-CoV-2; Tight junctions

Mesh:

Substances:

Year:  2020        PMID: 32891874      PMCID: PMC7473260          DOI: 10.1016/j.micinf.2020.08.006

Source DB:  PubMed          Journal:  Microbes Infect        ISSN: 1286-4579            Impact factor:   2.700


Severe acute respiratory syndrome 2, known as n class="Disease">Coronavirus disease 2019 (Covid-19) is caused by a Betacoronavirus named SARS-CoV-2 virus. To date, Covid-19 associated pneumonia accounts (as of 14th July 2020, source: www.who.int, “COVID-19 Situation report - 176”) almost 13 Million confirmed cases and more than 550′000 deaths since its first reported discovery in the Hubei province in China in December 2019 [1]. Bat and Malayan pangolin n class="Species">coronavirus genomes show a high identity percentage to SARS-CoV-2, suggesting these animal species as possible reservoir hosts for SARS-CoV-2 related viruses before adaptation to humans [2]. This hypothesis is well supported by recent findings in genome analysis. In this sense, Spike (S) protein investigation has highlighted a common pattern with the bat and pangolin orthologues, as well as a peculiar motif likely acquired during human adaptation of the SARS-CoV-2 [3]. Coronavirus infection starts with inhalation of droplets containing virus that invades the epithelial cells by using n class="Gene">angiotensin converting enzyme 2 (ACE2) [4], or other cell components like integrins, as targets of the SARS-CoV2 S protein [5,6]. Viral replication in Type II alveolar epithelial cells leads to severe modifications of the innate immune response [7]. Lungs are rapidly compromised following direct damage of the pulmonary tissue mainly through dysregulation of the immune mediators that enhance the influx of monocytes and neutrophils in the infected tissue [4]. Furthermore, pro-inflammatory cytokine storm affects virus replication and increases its diffusion to nearby cells. SARS-CoV-2 virus shows a single straene">nd, positive-sense RNA genon class="Gene">me of slightly less than 30 kb in length where at 3’ end are located the genes coding for the four structural proteins: spike (S), membrane (M), envelope (E), and nucleocapsid (N) [1]. As an enveloped virus, S, M and E proteins encounter cellular membrane at the initiation of infection and during the replication cycle and are involved in the budding of the mature virions [8]. Whereas Spike protein has been immediately investigated [3,[9], [10], [11], [12]], less information has been collected on the other surface proteins. Indeed, as occurred with the previously studied Betacoronavirus, the role of E protein and its involvement with the host adverse effects was wrongly underestimated. In contrast to S, E protein of the SARS-CoV-2 has not been thoroughly studied yet. During the initial outbreak, efforts were put into the identification of epitopes with potential cross-protective role [13] while most of the information about its structure and function derive from SARS-CoV experimental studies. While E protein is not represented in the mature virions up to levels of S or M proteins [8,14], it is abundantly expressed inside the infected cell and actively involved in the pathogenic viral mechanisms [15]. SARS-CoV E protein is the sn class="Gene">mallest among structural proteins (76 amino acids), organized in three main domains: a short (approx. 8–10 amino acids) luminal oriented N-terminal domain, a long α-helical transmembrane domain composed of ≈22 amino acid residues and a cytoplasmically oriented C-terminal domain [16,17]. Homologous assembling of the E protein contributes to create a pentameric channel with its transmembrane domain that directly alters virus replication [18]. Conversely, monon class="Gene">meric E protein affects the host’s intracellular activities through C-terminal end domain, which is predicted to have a β-coil-β structure, leading to its localization in the endoplasmic reticulum, Golgi and the ER-Golgi intermediate compartment [[17], [18], [19]]. Additional targeting information at the N-terminal domain of the E protein would ensure the maintaining of the Golgi complex targeting [19]. Interestingly, last four hydrophobic aminoacidic residues of the C-terminal domain (DLLV) have been indicated to be a PDZ (PSD-95, Dlg, ZO-1 homology) binding motif, as previously reported by Teoh et al. [20]. Particularly, DLLV motif has been demonstrated to compete with Crumbs cell polarity complex component 1 (CRB1) for binding to PALS1 PDZ domain [20]. PALS1 is a cellular protein involved in maintaining tight junctions between epithelial cells also via the interaction with PALS1-associated tight junction (PATJ) protein. The Crumbs-PALS1-PATJ complex is fundamental for the development and maintenance of apical-basal polarity of epithelial cells [21]. Therefore, interactions between the SARS E protein and PALS1 induced relocation of PALS1 to the virus assembly site and disrupted tight junctions promoting virus spread. Little information has been collected yet on SARS-CoV-2 E protein and mainly focused on the sequences conserved from SARS, suggesting its potential interaction with bromodomain proteins [22]. Conversely, aim of this work is to identify the potential implications of sequence dissimilarities between the previous SARS-CoV and SARS-CoV-2.

Materials and methods

Data collection and multiple sequence alignment

Reference coronavirus genon class="Gene">mes were obtained either from NCBI GenBank (https://www.ncbi.nlm.nih.gov/genbank/) or from GISAID (https://www.gisaid.org/) databases as suggested from the recent discoveries on the proximal origins of SARS-CoV-2 [2,3] (Detailed accession numbers in Supplementary Table 1). Pangolin genomes were pooled in two groups, as shown by Lam et al. phylogenetic analyses [2] and only the consensus genome for each one of the two groups was considered. The Envelope proteins (E) from all genomes were extracted and multiple sequence alignment performed with the MAFFT algorithm (v7.450) in Geneious Prime (version 11.0.4) [23,24]. Pairwise sequence identities were also calculated using Geneious Prime.

3D homology modeling of SARS-CoV-2 E

The amino acid sequence of the n class="Species">SARS-CoV-2 Envelope protein was extracted, and the NMR structure of the homologous protein of SARS-CoV (PDB code: 2MM4) was used as a template. The starting 3D model was then built using the Homology Modeling protocol Prime of the Schrodinger Suite [25]. According to the Ramachandran plot analysis for 58 residues, 93.1% lie in the most favored regions, 6.9% in the allowed regions, and none in the disallowed regions. The missing C and N terminal residues, not present in the chosen template, were finally added.

Molecular dynamic simulations of E protein

In order to get a more realistic n class="Gene">model of the SARS-CoV-2 Envelope protein, the starting 3D homology model was further optimized using molecular dynamics simulations prepared via the CHARMM-GUI [26] server and performed using standard GROMACS tools [27]. At first, the protein was shortly simulated (10 ns, 300 K) in neutralized solution. It allowed to get conformations in which the transmembrane (TM), the C-terminus and the N-terminus domains were well suited for only embedding of the TM domain into a phospholipidic double layer membrane. The chosen conformation of the protein obtained this way was then oriented and inserted into a small POPC double layer (60 molecules per layer) surrounded by water (29,472 atoms). A 100 ns long MD simulation of the obtained system was performed as proof of stability.

In silico docking of SARS-CoV and SARS-CoV-2 Envelope C-terminus to PALS1

PALS1 structure for the in silico docking of the E protein C-tern class="Gene">minus octapeptides, was obtained by the reported PALS1-CRB1 complex (PDB code: 4UU5). It was prepared for docking with the Maestro protein preparation wizard, using default parameters [28]. Neither side-chain atoms nor residues were missing in the protein in the neighbourhood of CRB1 peptide at 5 Å of the protein. A search grid was generated with Glide5 by selecting the 8 C-terminal residues of CRB1 (PPAMERLI) to define the binding pocket, thus including the entire binding site of the peptide–protein complex. Then, the 8 C-terminal residues of each E protein were built, i.e. EGVPDLLV aene">nd SRVPDLLV for n class="Species">SARS-CoV and SARS-CoV-2, respectively, whose protonation state was assigned with PROPKA. Using the peptide-protein docking protocol of Glide [29] multiple conformers of the peptide were generated, docked on the protein and post-processed using MM-GBSA.

Results

Identification of genomic differences between SARS-CoV-2 and SARS-CoV E proteins

Multiple sequence alignn class="Gene">ments showed a quasi-perfect identity between all genomes of bats, pangolin, SARS-CoV and SARS-CoV-2 in the N-terminal and transmembrane regions of the E protein: only few synonymous mutations were identified in these two regions (Supplementary Table 2). A different outcome is highlighted in the C-tern class="Gene">minal region of the E protein sequence (Fig. 1 A) where two different mutation sites show a clear difference between SARS-CoV-2 and SARS-CoV, confirming what has been found from the whole genome phylogenetic analyses [2].
Fig. 1

Multiple sequence alignment and homology modelling of E proteins. A) Envelope Protein multiple sequence alignment of C-terminal domain in human, bat and pangolin SARS-like coronaviruses highlights the identity among SARS-CoV-2, 2 bat CoV strains (RaTG13 and CoVZC45/CoVZX21) and pangolins E proteins. The comparison also points out two mutation sites where they differ from SARS-CoV and Bat SARS-like Cov Rs3367. B) E protein structure model for SARS and C) SARS-CoV-2. A closer look at C-terminal domain of E protein in D) SARS and E) SARS-CoV-2 in which are highlighted: in red, the motifs regulating the transport to Golgi apparatus; in blue, the PDZ-binding domain; in yellow, the two mutation sites identified in SARS-CoV-2 with respect to SARS.

Multiple sequence alignn class="Gene">ment and homology modelling of E proteins. A) Envelope Protein multiple sequence alignment of C-terminal domain in human, bat and pangolin SARS-like coronaviruses highlights the identity among SARS-CoV-2, 2 bat CoV strains (RaTG13 and CoVZC45/CoVZX21) and pangolins E proteins. The comparison also points out two mutation sites where they differ from SARS-CoV and Bat SARS-like Cov Rs3367. B) E protein structure model for SARS and C) SARS-CoV-2. A closer look at C-terminal domain of E protein in D) SARS and E) SARS-CoV-2 in which are highlighted: in red, the motifs regulating the transport to Golgi apparatus; in blue, the PDZ-binding domain; in yellow, the two mutation sites identified in SARS-CoV-2 with respect to SARS. Specifically, the mutation in the inner part of the C-tern class="Gene">minal region consists in 4 nucleotide changes that lead to substitution of Thr55-Val56 for SARS-CoV with Ser55-Phe56 in SARS-CoV-2. The second mutation regards the deletion of Glu69-Gly70 and substitution with an Arg69, as also described by Hassan et al. [30].

Homology modelling of E proteins

In Fig. 1B, C, the two predicted monon class="Gene">meric E full length protein structure models have been constructed and show N-terminal (blue), transmembrane (green), C-terminal domains (red) as well as the amino acid variants (yellow). As expected, transmembrane domains of both proteins presented the highest accuracy with a total confidence score of more than 90% on ∼80% of the full-length proteins. The full-length domains of the SARS-CoV variants have been further characterised, posing them in a membrane bilayer and determining their stability on a short MD simulation of 100 ns (Supplementary Figure S1). In both models, as expected, the trans-membrane helix is perfectly fitted in the hydrophobic core of the membrane bilayer. The end of the C-terminal, accouene">nting 11 an class="Gene">mino acid residues, and the beginning of the N-terminal end did not reach previously indicated accuracy. Moreover, deletion of two amino acid residues and arginine substitution at C-terminus could affect the protein structure altering the spatial disposition of the β-coil-β (Fig. 1D, E). The structure of this subunit appeared highly mobile but remained substantially unaltered along the short MD simulations for both E variants.

E protein C-terminals binding to tight junction-associated PALS1

In order to verify the potential implications of the altered an class="Gene">mino acid sequence, the binding pose of the two C-terminus octapeptides belonging to SARS-CoV and SARS-CoV-2 were determined and compared with the crystallographic structure of the complex PALS1-CRB1 [31]. The poses with the lowest ΔG, calculated via MM-GBSA by using default parameters, are shown in Fig. 2 A. Accordingly, the Free Energy of Binding for SARS-CoV and SARS-CoV-2 Envelope C-terminals amounts to −63.62 and −97.10 kcal/mol, respectively. This value must be compared to the value of −92.5 kcal/mol obtained performing the same analysis on the complex PALS1-CRB1, where the endogenous peptide was shortened to 8 amino acids in order to compare its in silico affinity with the SARS-CoVs variants. Interestingly, the SARS-CoV-2 peptide is able to bind PALS1 with a significantly higher affinity compared to SARS-CoV variant, reaching and slightly ameliorating the affinity value of the endogenous ligand, even though the two octapeptides differ for only two out of 8 of the selected amino acids. In particular, the last four residues of both E C-terminals are the same (Asp, Leu, Leu, Val) and bind PALS1 similarly to what observed for the endogenous CRB1, even if the short sequence of the CRB1 peptide is slightly different (Glu, Arg, Leu, Ile). As shown in the interaction maps described in Fig. 2C, D, the side-chain of the last residue of the E proteins, which is a valine, interacts with Leu267, Leu321 and Phe330 of PALS1; its free terminal carboxyl group, instead, makes a salt bridge with Lys261 and H-bond interactions with amide hydrogens of Leu267, Gly268 and Ala269. The two following leucine residues of the E C-terminals make van der Waals contacts, in particular the second one makes interaction with Phe318 (while CRB1 interacts with this residue via cation-π through its Arg). Aspartate is the last common residue inside the binding pocket and its sidechain residue makes a salt bridge with Arg282. Immediately after this negatively charged amino acid the two C-terminal residues of SARS-CoV and SARS-CoV-2 differ significantly. The positively charged Arg69 of the SARS-CoV-2 octapeptide well suits a negatively charged pocket (zoomed region in Fig. 2B), being able to contemporarily create a salt bridge with Asp313 and several H bonds with Asp313 and with a carbonyl oxygen of Met275 backbone. In the same position in SARS-CoV, the small size side chain of the Gly residue cannot be involved in any interaction with this pocket thus reducing the interaction strength. Finally, the Ser alcohol moiety of SARS-CoV-2 makes a hydrogen bond with Asp299 backbone oxygen, while the Glu residue in SARS-CoV could interact with Arg272 with a salt bridge but this condition is never realized among all the poses.
Fig. 2

SARS-CoV and SARS-CoV-2 octapeptides lowest ΔG poses on PALS1 binding site, representation of H bond in yellow dashed lines, Salt bridge in purple dashed lines (red structure and label: SARS-CoV; green structure and label: SARS-CoV-2). B) Magnification of SARS-CoV-2 Arg69 inside PALS1 negatively charged pocket showing the interesting salt bridge with ASP313 and H bonds with MET275. C) Ligand Interaction diagram of SARS-CoV octapeptide. D) Ligand Interaction diagram of SARS-CoV-2 octapeptide.

SARS-CoV aene">nd n class="Species">SARS-CoV-2 octapeptides lowest ΔG poses on PALS1 binding site, representation of H bond in yellow dashed lines, Salt bridge in purple dashed lines (red structure and label: SARS-CoV; green structure and label: SARS-CoV-2). B) Magnification of SARS-CoV-2 Arg69 inside PALS1 negatively charged pocket showing the interesting salt bridge with ASP313 and H bonds with MET275. C) Ligand Interaction diagram of SARS-CoV octapeptide. D) Ligand Interaction diagram of SARS-CoV-2 octapeptide.

Discussion

In the present work, we compared the genon class="Gene">mic sequences of SARS-CoV-2 E protein with the E protein of SARS-CoV, and the corresponding bats and pangolin orthologues, in order to identify the implications of dissimilarities in such an enigmatic protein. Given the very small sequence length of the E protein, full genon class="Gene">me multiple sequence alignment might affect the overall precision on this region, likely the reason for erroneously aligned E proteins in previous works, leading to misinterpret these amino acids deletion and substitution in the C-terminal end [32,33]. SARS-CoV matches to orthologue E proteins of bat-CoV Rs3367, while SARS-CoV-2 E protein is identical to bat RaTG13 and, except for synonymous mutations, to bat CoVs and the recently identified pangolin coronaviruses. Interestingly, SARS-CoV and SARS-CoV-2 E proteins confirm the two phylogenetic clades observed with the full genome [2]. The subsequent analysis aimed to identify potentially beneficial or detrin class="Gene">mental effects of SARS-CoV-2 E protein variant with respect to the previously studied SARS E protein. Docking results of the tight junction complex PALS1-CRB1, used as a reference from the previous study by Ivanova and colleagues [31], were compared to the docking values of PALS1 with C-terminal ends of both E proteins. Our findings support the hypothesis that characteristic virulence of SARS-CoV-2 virus could depend on the strengthened interaction between SARS-CoV-2 E protein and PALS1 prompting a strong alteration of the tight junctions. The enhanced binding to PALS1 represents only the first step of the immunopathogenic process associated to SARS-CoV-2 infection. Moreover, PALS1 – E binding alters E-cadherin intracellular traffic with change in cell polarization [20] endorsing a severe dysregulation of the Th2 mediated response due to enhanced exposition to environmental allergens [34]. Epithelial mesenchymal transition occurs with a significant modification of the epithelial tissue and a meaningfully production of chemokines and cytokines in the infection site [35], promoting infection of nearby cells and potentially enhancing transmission to other individuals. Remarkably, chaene">nges in epithelial tissue structures, aene">nd consequent loss of fuene">nctions, are age-dependent aene">nd associated with defects in tight juene">nctions aene">nd cadherin – catenin con class="Gene">mplex [36]. Although little information is available on lung epithelium ageing-dependent mechanisms, we hypothesize that E protein significantly increase this dysfunction, especially in elderly people as evidenced by COVID-19 epidemiological data. Conversely, SARS-CoV infection resulted in a major fatality rate compared with the recent SARS-CoV-2 infection. Further studies will be needed to elucidate the role of this genomic variant to promote novel interactions with PDZ domains in other host cell components. Importaene">nce of our findings are corroborated by recent observations of the low rate of non-synonyn class="Gene">mous mutations in the genomic sequence codifying the E protein [30]. Indeed, these mutations have been detected only in ≈0.4% of the analysed genomes over the transmembrane and C-terminus domains. Interestingly, changes in the C-terminus motif observed in the QJR88103 (DLLV to DFLV) protein did not result in any alteration of the hydrophobicity, whereas in the QKI36831 protein the C-terminus YLLV showed a significant perturbation of the motif suggesting a diverse interaction with PALS1 [30]. Conservation of the E protein, and particularly of the sequence studied in this work, give therefore a key role to the C-terminal domain during infection. The genomic variaene">nt highlighted in this work raise concerns on the uene">nderestin class="Gene">mated role of the E protein of SARS-CoV-2 in the host adverse responses. Indeed, these computational results shed the lights on the most enigmatic protein among the structural proteins of coronaviruses, the E protein, laying the foundations for a fundamental detailed “wet” experimental investigation of its interaction with host components and comparison with E protein of SARS-CoV in standardized infection model.

Declaration of competing interest

The authors declare no competing interests.
  25 in total

1.  Impact of Clinical and Genomic Factors on COVID-19 Disease Severity.

Authors:  Sanjoy Dey; Aritra Bose; Subrata Saha; Prithwish Chakraborty; Mohamed Ghalwash; Aldo Guzm X E N-Sáenz; Filippo Utro; Kenney Ng; Jianying Hu; Laxmi Parida; Daby Sow
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

Review 2.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

3.  Evolution of SARS-CoV-2 in Spain during the First Two Years of the Pandemic: Circulating Variants, Amino Acid Conservation, and Genetic Variability in Structural, Non-Structural, and Accessory Proteins.

Authors:  Paloma Troyano-Hernáez; Roberto Reinosa; África Holguín
Journal:  Int J Mol Sci       Date:  2022-06-07       Impact factor: 6.208

4.  Time-Dependent Increase in Susceptibility and Severity of Secondary Bacterial Infections During SARS-CoV-2.

Authors:  Amanda P Smith; Evan P Williams; Taylor R Plunkett; Muneeswaran Selvaraj; Lindey C Lane; Lillian Zalduondo; Yi Xue; Peter Vogel; Rudragouda Channappanavar; Colleen B Jonsson; Amber M Smith
Journal:  Front Immunol       Date:  2022-05-12       Impact factor: 8.786

5.  SARS-CoV-2 Spike mutations modify the interaction between virus Spike and human ACE2 receptors.

Authors:  Pushpendra Mani Mishra; Farhan Anjum; Vladimir N Uversky; Chayan Kanti Nandi
Journal:  Biochem Biophys Res Commun       Date:  2022-06-23       Impact factor: 3.322

6.  SARS-CoV-2 Envelope (E) protein interacts with PDZ-domain-2 of host tight junction protein ZO1.

Authors:  Ariel Shepley-McTaggart; Cari A Sagum; Isabela Oliva; Elizabeth Rybakovsky; Katie DiGuilio; Jingjing Liang; Mark T Bedford; Joel Cassel; Marius Sudol; James M Mullin; Ronald N Harty
Journal:  PLoS One       Date:  2021-06-09       Impact factor: 3.240

Review 7.  Virus-associated disruption of mucosal epithelial tight junctions and its role in viral transmission and spread.

Authors:  Sharof Tugizov
Journal:  Tissue Barriers       Date:  2021-07-09

Review 8.  Insights into the virologic and immunologic features of SARS-COV-2.

Authors:  Ceylan Polat; Koray Ergunay
Journal:  World J Clin Cases       Date:  2021-07-06       Impact factor: 1.337

Review 9.  The Role of Dysbiosis in Critically Ill Patients With COVID-19 and Acute Respiratory Distress Syndrome.

Authors:  Denise Battaglini; Chiara Robba; Andrea Fedele; Sebastian Trancǎ; Samir Giuseppe Sukkar; Vincenzo Di Pilato; Matteo Bassetti; Daniele Roberto Giacobbe; Antonio Vena; Nicolò Patroniti; Lorenzo Ball; Iole Brunetti; Antoni Torres Martí; Patricia Rieken Macedo Rocco; Paolo Pelosi
Journal:  Front Med (Lausanne)       Date:  2021-06-04

Review 10.  The blood-gas barrier in COVID-19: an overview of the effects of SARS-CoV-2 infection on the alveolar epithelial and endothelial cells of the lung.

Authors:  Milad Shirvaliloo
Journal:  Tissue Barriers       Date:  2021-07-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.