| Literature DB >> 34222663 |
Md Mehedi Hassan1,2, Shirina Sharmin1, Jinny Hong3, Hoi-Seon Lee4, Hyeon-Jin Kim2,3, Seong-Tshool Hong1.
Abstract
Since its emergence in late 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been spreading remarkably fast worldwide. Effective countermeasures require the rapid development of data and tools to monitor its spread and better understand immunogenic profile. However, limited information is available about the tools and target of the immune responses to SARS-CoV-2. In this study, we excogitated a new approach for analyzing phylogenetic relationships by using the whole prototype proteome sequences. Phylogenetic analysis on the whole prototype proteome sequences showed that SARS-CoV-2 was a direct descendant of Bat-CoV and was closely related to Pangolin-CoV, Bat-SL-CoV, and SARS-CoV. The pairwise comparison of SARS-CoV-2 with Bat-CoV showed an unusual replacement of the motif consisting of seven amino acids (NNLDSKV) within the spike protein of SARS-CoV-2. The replaced motif in the spike protein of SARS-CoV-2 was found in 12 other species, including a conserved surface protein of a malaria-causing pathogen, Plasmodium malariae. We further identified the T and B cell epitope sequence homology of SARS-CoV-2 spike protein with conserved surface protein of P. malariae using the Immune Epitope Database and Analysis Resource (IEDB). The shared immunodominant epitopes may provide immunity against SARS-CoV-2 infection to those previously infected with P. malariae.Entities:
Keywords: SARS-CoV-2; malaria; phylogenetic analysis; proteome; prototype; spike glycoprotein
Year: 2021 PMID: 34222663 PMCID: PMC8231468 DOI: 10.1515/biol-2021-0062
Source DB: PubMed Journal: Open Life Sci ISSN: 2391-5412 Impact factor: 0.938
Figure 1Sequence alignment and genome organization of Bat-CoV, SARS-CoV-2, Pan-CoV, Bat-SL-CoV, and SARS-CoVs. The gene ORF1ab encodes the pp1ab protein that contains 15 predicted nonstructural proteins (nsps). The structural proteins are encoded by Spike (S), Envelope (E), and Nucleocapsid (N) genes. The protein-encoding genes of CoVs genome were predicted by GeneMarks and ORFfinder online server with a manual check.
Figure 2Comparison of SARS-CoV-2 and Bat-CoVs. (a) The phylogeny of SARS-CoV-2 based on the 14 ORF and receptor-binding domain (RBD) sequences of Bat-CoV, Pan-CoV, Bat-SL-CoV, and SARS-CoV. Phylogenies were estimated by the neighbor-joining method using Unipro UGENE bioinformatics toolkits. (b) Organization of genes in SARS-CoV-2 and Bat-CoVs. The distribution of mutated amino acids in SARS-CoV-2 compare to Bat-CoV are represented with red lines. (c) Homotrimer 3D model of Bat-CoV and SARS-CoV-2 spike protein constructed using the SWISS-MODEL protein homology-modeling server. Circle showing the altered surface structure in the spike region of SARS-CoV-2 for antigen binding.
Lists of proteins having the NNLDSKV motif in nature
| Organism name | Protein name | Position | |
|---|---|---|---|
| 1 | SARS-CoV-2 | Surface glycoprotein | 439–445 |
| 2 |
| Conserved surface protein | 449–455 |
| 3 |
| GRIP and coiled-coil domain-containing protein 2 | 1,284–1,290 |
| 4 |
| Sperm-associated antigen 5 isoform X1 | 478–484 |
| 5 |
| BspA family leucine-rich repeat surface protein | 1,700–1,706 |
| 6 |
| Hypothetical protein APR63_07190 | 580–586 |
| 7 |
| Retention module-containing protein | 1,574–1,580 |
| 8 |
| Nonribosomal peptide synthetase | 658–664 |
| 9 |
| Hypothetical protein BB561_003344 | 1,059–1,065 |
| 10 |
| S8 family serine peptidase | 510–516 |
| 11 |
| Transient receptor potential cation channel subfamily M member 3-like | 1,380–1,386 |
| 12 |
| Bacillopeptidase F precursor | 510–516 |
| 13 |
| Hypothetical protein DICPUDRAFT_46686 | 241–247 |
Figure 3The SARS-CoV-2 spike glycoprotein showed NNLDSKV motif identity with conserved surface protein of P. malariae. The 3D model for the NNLDSKV motif of SARS-CoV-2 and P. malariae surface protein was built by the SWISS-MODEL protein homology-modeling server, and the PDB sequence was analyzed by UCSF CHIMERA software. The orientation of 7-amino acids in the NNLDSKV motif is shown in the box with different colors.
Figure 4T cell immunodominant regions based on the conserved surface protein of P. malariae. (a) Specific T cell epitope mapping response frequency score (RF) for each epitope position from P. malariae conserved surface protein. (b) Mapping of T cell epitopes of the conserved surface protein of P. malariae against SARS-CoV-2 spike glycoprotein. (c and d) Cluster analysis of epitopes of SARS-CoV-2 spike glycoprotein and surface protein of P. malariae for identification sequence homology. (d) Mapping of B cell epitopes from a conserved surface protein of P. malariae.
Experimental T cell immunodominant epitopes from a conserved surface protein of P. malariae sharing homology with T cell epitopes of SARS-CoV-2 spike protein
Note: CSP, conserved surface protein; MHC, major histocompatibility complex.