| Literature DB >> 32898469 |
Gerard Duart1, Mª Jesús García-Murria1, Brayan Grau1, José M Acosta-Cáceres1, Luis Martínez-Gil1, Ismael Mingarro1.
Abstract
Coronavirus E protein is a small membrane protein found in the virus envelope. Different coronavirus E proteins share striking biochemical and functional similarities, but sequence conservation is limited. In this report, we studied the E protein topology from the new SARS-CoV-2 virus both in microsomal membranes and in mammalian cells. Experimental data reveal that E protein is a single-spanning membrane protein with the N-terminus being translocated across the membrane, while the C-terminus is exposed to the cytoplasmic side (Ntlum/Ctcyt). The defined membrane protein topology of SARS-CoV-2 E protein may provide a useful framework to understand its interaction with other viral and host components and contribute to establish the basis to tackle the pathogenesis of SARS-CoV-2.Entities:
Keywords: SARS-CoV-2; coronavirus; envelope protein; membrane insertion; topology
Mesh:
Substances:
Year: 2020 PMID: 32898469 PMCID: PMC7536074 DOI: 10.1098/rsob.200209
Source DB: PubMed Journal: Open Biol ISSN: 2046-2441 Impact factor: 6.411
Figure 1.(a) Multi-alignment of amino acid sequences of the E protein of SARS-CoV-2 and the other six human coronavirus. SARS-CoV severe acute respiratory syndrome coronavirus (UniProt P59637), MERS-CoV Middle East respiratory syndrome coronavirus (UniProt K9N5R3), HCoV-HKU1 (UniProt Q0ZJ83), HCoC-OC43 (UniProt Q4VID3), HCoC-229E (UniProt P19741) and HCoV-NL63 (UniProt Q5SBN7). Predicted TM segments at UniProt are highlighted in a grey box. Native predicted glycosylation acceptor sites in SARS-CoV-2 are shown in bold and charged residues highlighted with + or – symbols on top. Conserved residues are shown in orange. Differences between SARS-CoV-2 and SARS-CoV are highlighted as yellow boxes. (b) Phylogenetic data and (c) tree obtained with Clustal Omega (EMBL-EBI) using the default parameters.
Computer analysis of the SARS-CoV-2 E protein amino acid sequence topology. n.p., non-predicted.
| algorithm | Nt | Ct | TMDs (start-end) |
|---|---|---|---|
| ΔG predictor | n.p. | n.p. | 1 (17–39) |
| TMHMM | cytosol | lumen | 1 (12–34) |
| MEMSAT-SVM | lumen | lumen | 2 (10–39) (43–58) |
| TMpred | lumen | cytosol | 1 (17–34) |
| HMMTop | lumen | cytosol | 1 (11–35) |
| Phobius | cytosol | lumen | 1 (12–37) |
| TOPCONS | lumen | cytosol | 1 (16–36) |
Figure 2.Translocon-mediated insertion of E protein variants into microsomal membranes. (a) Schematic representation of E protein constructs. Glycosylation acceptor Asn residues are indicated. (b) In vitro translation in the presence of microsomes of the different E protein constructs. Construct containing inserted asparagine and threonine residues at positions 3 and 5 (NST; lanes 1–2) or glutamine and threonine at positions 3 and 5 (lane 3), and wild-type variants (lane 4) were translated in the presence of microsomes. NST variant was split and half of the sample was Endo H treated (lane 1). Bands of non-glycosylated and glycosylated proteins are indicated by white and black dots, respectively. (c) E protein (harbouring an engineered glycosylation site at the N-terminus, positions 3–5) was translated in either the absence (lanes 1 and 2) or the presence (lanes 3 and 4) of microsomal membranes. In lanes 1 and 2, microsomal membranes were added posttranslationally (after 1 h, post-) and incubation was continued for another 1 h. Samples in lanes 1 and 3 were treated later with EndoH. The gels are representative of at least three independent experiments.
Figure 3.E protein topology in mammalian cells. To determine the topology in vivo HEK-293T cells were transfected with C-terminal tagged (c-myc) E protein variants. (a) Constructs encoding wild-type (Wt; lanes 1 and 2), inserted asparagine and threonine at positions 3 and 5 (NST; lanes 3 and 4) or glutamine and threonine at positions 3 and 5 (QST; lanes 5 and 6) were Endo H (+) or mock (−) treated. Filled and empty Y-shaped symbols denoted acceptor (NST) and non-acceptor (QST) glycosylation sites, respectively. (b) Additionally, we included constructs containing similar Wt (lanes 1 and 2), replaced glutamic acids at positions 7 and 8 by lysine residues (EE > KK; lanes 3 and 4) or NST (lanes 5 and 6) variants with an extra glycosylation site inserted at the Ct end of the protein. Once again, to confirm the glycosylated nature of the higher molecular weight bands, samples were either Endo H (+) or mock (−) treated. Designed glycosylation sites and tags are shown in black, while native E protein features are shown in grey.