| Literature DB >> 32206694 |
Swatantra Kumar1, Shailendra K Saxena1, Vimal K Maurya1, Anil K Prasad1, Madan L B Bhatt1.
Abstract
The emergence of 2019 novel coronavirus (2019-nCoV) is of global concern and might have emerged from RNA recombination among existing coronaviruses. CoV spike (S) protein which is crucial for receptor binding, membrane fusion via conformational changes, internalization of the virus, host tissue tropism and comprises crucial targets for vaccine development, remain largely uncharacterized. Therefore, the present study has been planned to determine the sequence variation, structural and antigenic divergence of S glycoprotein which may be helpful for the management of 2019-nCoV infection. The sequences of spike glycoprotein of 2019-nCoV and SARS coronavirus (SARS-CoV) were used for the comparison. The sequence variations were determined using EMBOSS Needle pairwise sequence alignment tools. The variation in glycosylation sites was predicted by NetNGlyc 1.0 and validated by N-GlyDE server. Antigenicity was predicted by NetCTL 1.2 and validated by IEDB Analysis Resource server. The structural divergence was determined by using SuperPose Version 1.0 based on cryo-EM structure of the SARS coronavirus spike glycoprotein. Our data suggests that 2019-nCoV is newly spilled coronavirus into humans in China is closely related to SARS-CoV, which has only 12.8% of difference with SARS-CoV in S protein and has 83.9% similarity in minimal receptor-binding domain with SARS-CoV. Addition of a novel glycosylation sites were observed in 2019-nCoV. In addition, antigenic analysis proposes that great antigenic differences exist between both the viral strains, but some of the epitopes were found to be similar between both the S proteins. In spite of the variation in S protein amino acid composition, we found no significant difference in their structures. Collectively, for the first time our results exhibit the emergence of human 2019-nCoV is closely related to predecessor SARS-CoV and provide the evidence that 2019-nCoV uses various novel glycosylation sites as SARS-CoV and may have a potential to become pandemic owing its antigenic discrepancy. Further, demonstration of novel Cytotoxic T lymphocyte epitopes may impart opportunities for the development of peptide based vaccine for the prevention of 2019-nCoV. © Indian Virological Society 2020.Entities:
Keywords: 2019-nCoV; Antigenicity; COVID-19; Coronavirus; Glycosylation; S glycoprotein; SARS-CoV; Structural divergence
Year: 2020 PMID: 32206694 PMCID: PMC7085496 DOI: 10.1007/s13337-020-00571-5
Source DB: PubMed Journal: Virusdisease ISSN: 2347-3584
Fig. 1Phylogeny of 2019-nCoV. The phylogenetic tree was constructed by molecular evolutionary genetic analysis (MEGA) software based on the spike gene sequences, showing the evolutionary relationship of 2019-nCoV with predecessors strains of SARS-coronaviruses. The phylogenetic analysis showing that 2019-nCoV is closely related with Bat SARS-like coronavirus. However, both 2019-nCoV and Bat SARS-like coronavirus emerged from the SARS coronavirus. The accession number of the sequences used for the phylogenetic analysis is represented at the tip of the branches, where Japanese encephalitis virus has been used as the outgroup
Fig. 2Sequence variation of spike glycoprotein. a The complete amino acid sequences of spike glycoprotein of 2019-nCoV and SARS-CoV are shown as described earlier. Standard single-letter abbreviations for the amino acids were used. The collinear sequences were aligned by online use of Clustal Omega. Amino acid alignment exhibits non-conservative substitutions (“.”), conservative substitutions (“:”) and semi-conservative substitutions (“.”). Conserved regions are represented as (“”). There are 76.2% identity, 87.2% similarity and 2% Gaps in 1273 positions. b The tertiary structure of minimal RBD (residues 270–510)
Comparison of N-glycosylation sites between spike glycoproteins of 2019-nCoV and SARS-CoV strains
| N-glycosylation | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2019-nCoV | SARS-CoV | ||||||||||
| Position (NetNGlyc 1.0) agreement result | Position (N-GlyDE) | Sites | Potential | Jury agreement | N-Glyc Result | Position (NetNGlyc 1.0) agreement result | Position (N-GlyDE) | Sites | Potential | Jury agreement | N-Glyc Result |
| – | – | – | – | – | – | 29 | 29 | 0.7751 | (9/9) | +++ | |
| 61 | 61 | 0.7820 | (9/9) | +++ | 65 | 65 | 0.8091 | (9/9) | +++ | ||
| 234 | 234 | 0.7613 | (9/9) | +++ | 227 | 227 | NITN | 0.7518 | (9/9) | +++ | |
| 74 | 74 | 0.7192 | (9/9) | ++ | 119 | 119 | 0.7039 | (9/9) | ++ | ||
| 282 | 282 | NGTI | 0.7378 | (9/9) | ++ | 269 | 269 | NGTI | 0.6910 | (9/9) | ++ |
| 616 | 616 | 0.7163 | (9/9) | ++ | 318 | 318 | NITN | 0.6413 | (9/9) | ++ | |
| 717 | 717 | 0.6426 | (9/9) | ++ | 602 | 602 | 0.6916 | (9/9) | ++ | ||
| 1194 | 1194 | NESL | 0.6791 | (9/9) | ++ | 783 | 783 | NFSQ | 0.6260 | (9/9) | ++ |
| 17 | 17 | 0.6606 | (8/9) | + | 1176 | 1176 | NESL | 0.6794 | (9/9) | ++ | |
| 122 | 122 | 0.6781 | (8/9) | + | 73 | 73 | 0.5303 | (4/9) | + | ||
| 149 | 149 | 0.6318 | (7/9) | + | 109 | 109 | 0.6080 | (7/9) | + | ||
| 165 | 165 | NCTF | 0.6220 | (8/9) | + | 158 | 158 | NCTF | 0.5808 | (7/9) | + |
| 331 | 331 | NITN | 0.5970 | (7/9) | + | 330 | 330 | 0.6063 | (8/9) | + | |
| 343 | 343 | 0.5671 | (8/9) | + | 357 | 357 | 0.6836 | (8/9) | + | ||
| 603 | 603 | 0.5783 | (6/9) | + | 589 | 589 | 0.5777 | (6/9) | + | ||
| 801 | 801 | NFSQ | 0.6146 | (8/9) | + | 699 | 699 | 0.5356 | (7/9) | + | |
| 1098 | 1098 | 0.5496 | (5/9) | + | 1080 | 1080 | 0.5806 | (7/9) | + | ||
| 1134 | 1134 | NNTV | 0.5800 | (6/9) | + | 1116 | 1116 | NNTV | 0.5107 | (5/9) | + |
The table shows a comparison of predicted N-glycosylation sites in Spike glycoprotein of Wuhan-Hu-1–2019 novel coronavirus (2019-nCoV) and SARS coronavirus-2003 (SARS-CoV) strains. N-glycosylation potential (0.5) was taken as cutoff. The N-glycosylation sites were determined by NetNGlyc 1.0 [20] and validated by N-GlyDE server [21]
*Italics indicates the differences between N-glycosylation sites between the two Spike glycoproteins
Comparison of antigenicity between spike glycoproteins of 2019-nCoV and SARS-CoV strains
| Antigenicity | |||||
|---|---|---|---|---|---|
| 2019-nCoV | SARS-CoV | ||||
| Peptides (IEDB) | Peptides (NetCTL 1.2) | Position (start) | Peptides (IEDB) | Peptides (NetCTL 1.2) | Position (start) |
| NSFTRGVYY | NSFTRGVYY | 30 | FDDVQAPNY | FDDVQAPNY | 22 |
| STQDLFLPF | STQDLFLPF | 50 | HTSSMRGVY | HTSSMRGVY | 33 |
| VLPFNDGVY | VLPFNDGVY | 83 | TSSMRGVYY | TSSMRGVYY | 34 |
| CNDPFLGVY | CNDPFLGVY | 136 | EIFRSDTLY | EIFRSDTLY | 45 |
| WMESEFRVY | WMESEFRVY | 152 | RSDTLYLTQ | RSDTLYLTQ | 48 |
| YSSANNCTF | YSSANNCTF | 160 | LTQDLFLPF | LTQDLFLPF | 54 |
| SANNCTFEY | SANNCTFEY | 162 | TQDLFLPFY | TQDLFLPFY | 55 |
| FVFKNIDGY | FVFKNIDGY | 192 | VIPFKDGIY | VIPFKDGIY | 80 |
| NIDGYFKIY | NIDGYFKIY | 196 | HTMIFDNAF | HTMIFDNAF | 149 |
| WTAGAAAYY | WTAGAAAYY | 258 | NAFNCTFEY | NAFNCTFEY | 155 |
| GAAAYYVGY | GAAAYYVGY | 261 | ISDAFSLDV | ISDAFSLDV | 164 |
| ITDAVDCAL | ITDAVDCAL | 285 | FKNKDGFLY | FKNKDGFLY | 187 |
| LSETKCTLK | LSETKCTLK | 296 | NKDGFLYVY | NKDGFLYVY | 189 |
| NATRFASVY | NATRFASVY | 343 | GTSAAAYFV | GTSAAAYFV | 246 |
| 357 | SAAAYFVGY | SAAAYFVGY | 248 | ||
| 361 | NATKFPSVY | NATKFPSVY | 330 | ||
| NSASFSTFK | NSASFSTFK | 370 | 344 | ||
| ASFSTFKCY | ASFSTFKCY | 372 | 348 | ||
| FTNVYADSF | FTNVYADSF | 392 | TSFSTFKCY | TSFSTFKCY | 359 |
| VGGNYNYLY | VGGNYNYLY | 445 | FSNVYADSF | FSNVYADSF | 379 |
| ERDISTEIY | ERDISTEIY | 465 | ATSTGNYNY | ATSTGNYNY | 430 |
| TSNQVAVLY | TSNQVAVLY | 604 | STGNYNYKY | STGNYNYKY | 432 |
| YQDVNCTEV | YQDVNCTEV | 612 | CTPPAPNCY | CTPPAPNCY | 467 |
| QLTPTWRVY | QLTPTWRVY | 628 | FYTTSGIGY | FYTTSGIGY | 483 |
| AEHVNNSY | AEHVNNSY | 653 | TSGIGYQPY | TSGIGYQPY | 486 |
| VASQSIIAY | VASQSIIAY | 687 | FTDSVRDPK | FTDSVRDPK | 558 |
| 733 | ASSEVAVLY | ASSEVAVLY | 590 | ||
| 746 | SSEVAVLYQ | SSEVAVLYQ | 591 | ||
| 748 | CTDVSTLIH | CTDVSTLIH | 603 | ||
| 815 | QLTPAWRIY | QLTPAWRIY | 614 | ||
| 865 | GAEHVDTSY | GAEHVDTSY | 638 | ||
| GTITSGWTF | GTITSGWTF | 880 | TSQKSIVAY | TSQKSIVAY | 669 |
| 1039 | LGADSSIAY | LGADSSIAY | 681 | ||
| FVSNGTHWF | FVSNGTHWF | 1095 | 715 | ||
| VSNGTHWFV | VSNGTHWFV | 1096 | 728 | ||
| 1237 | 730 | ||||
| 1264 | 797 | ||||
| – | – | – | 847 | ||
| – | – | – | GTATAGWTF | GTATAGWTF | 862 |
| – | – | – | TTSTALGKL | TTSTALGKL | 922 |
| – | – | – | 1021 | ||
| – | – | – | 1219 | ||
| – | – | – | 1246 | ||
The table shows a comparison of predicted CTL epitopes in Spike glycoprotein of Wuhan-Hu-1-2019 novel coronavirus (2019-nCoV) and SARS coronavirus-2003(SARS-CoV) strains. Epitopes were generated by NetCTL 1.2 [22] and validated by IEDB Analysis Resource server [23] where scores > 1.25 shows highest sensitivity and specificity towards MHC class I. Italics indicates the identical CTL epitopes
*Italics showing common epitopes with one amino acid change between the two spike glycoproteins
Fig. 3Structural divergence of spike glycoprotein. The PDB structures of spike glycoprotein of 2019-nCoV (PDBA) and SARS-CoV (PDBB) were based on cryo-EM structure of the SARS coronavirus spike glycoprotein (PDB ID 6ACC). Models were superimposed using SuperPose which was visualized by Chimera where green color showing the S glycoprotein of SARS coronavirus and cyan color represents the S glycoprotein of 2019-nCoV. The analysis suggests that both the structures exhibits insignificant divergence with 1.39 A° deviation