| Literature DB >> 33301503 |
Elisa Guo1, Hailong Guo2.
Abstract
The ongoing pandemic of SARS-CoV-2 has brought tremendous crisis on global health care systems and industrial operations that dramatically affect the economic and social life of numerous individuals worldwide. Understanding anti-SARS-CoV-2 immune responses in population with different genetic backgrounds and tracking the viral evolution are crucial for successful vaccine design. In this study, we reported the generation of CD8 T cell epitopes by a total of 80 alleles of three major class I HLAs using NetMHC 4.0 algorithm for the SARS-CoV-2 spike protein, which can be targeted by both B cells and T cells. We found diverse capacities of S protein specific epitope presentation by different HLA alleles with very limited number of predicted epitopes for HLA-B*2705, HLA-B*4402 and HLA-B*4403 and as high as 132 epitopes for HLA-A*6601. Our analysis of 1000 S protein sequences from field isolates collected globally over the past few months identified three recurrent point mutations including L5F, D614G and G1124V. Differential effects of these mutations on CD8 T cell epitope generation by corresponding HLA alleles were observed. Finally, our multiple alignment analysis indicated the absence of seasonal CoV induced cross-reactive CD8 T cells to drive these mutations. Our findings suggested that individuals with certain HLA alleles, such as B*44 are more prone to SARS-CoV-2 infection. Studying anti-S protein specific CD8 T cell immunity in diverse genetic background is critical for better control and prevention of the SARS-CoV-2 pandemic.Entities:
Year: 2020 PMID: 33301503 PMCID: PMC7728258 DOI: 10.1371/journal.pone.0239566
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the total number of predicted SARS-CoV-2 S protein specific CD8 T cell epitopes for each HLA alleles.
| HLA-A | HLA-B | HLA-C | |||
|---|---|---|---|---|---|
| Alleles | Total | Alleles | Total | Alleles | Total |
| HLA-A*0101 | 43 | HLA-B*0702 | 25 | HLA-C*0303 | 84 |
| HLA-A*0201 | 30 | HLA-B*0801 | 27 | HLA-C*0401 | 90 |
| HLA-A*0202 | 42 | HLA-B*0802 | 90 | HLA-C*0501 | 75 |
| HLA-A*0203 | 41 | HLA-B*0803 | 79 | HLA-C*0602 | 79 |
| HLA-A*0205 | 32 | HLA-B*1402 | 88 | HLA-C*0701 | 101 |
| HLA-A*0206 | 60 | HLA-B*1501 | 50 | HLA-C*0702 | 86 |
| HLA-A*0207 | 25 | HLA-B*1502 | 93 | HLA-C*0802 | 89 |
| HLA-A*0211 | 94 | HLA-B*1503 | 118 | HLA-C*1203 | 121 |
| HLA-A*0212 | 71 | HLA-B*1509 | 85 | HLA-C*1402 | 104 |
| HLA-A*0216 | 96 | HLA-B*1517 | 94 | HLA-C*1502 | 117 |
| HLA-A*0217 | 36 | HLA-B*1801 | 36 | ||
| HLA-A*0219 | 77 | HLA-B*2705 | 16 | ||
| HLA-A*0250 | 99 | HLA-B*2720 | 69 | ||
| HLA-A*0301 | 31 | HLA-B*3501 | 75 | ||
| HLA-A*1101 | 41 | HLA-B*3503 | 43 | ||
| HLA-A*2301 | 56 | HLA-B*3801 | 31 | ||
| HLA-A*2402 | 43 | HLA-B*3901 | 80 | ||
| HLA-A*2403 | 78 | HLA-B*4001 | 24 | ||
| HLA-A*2501 | 77 | HLA-B*4002 | 24 | ||
| HLA-A*2601 | 44 | HLA-B*4013 | 77 | ||
| HLA-A*2602 | 70 | HLA-B*4201 | 46 | ||
| HLA-A*2603 | 88 | HLA-B*4402 | 17 | ||
| HLA-A*2902 | 51 | HLA-B*4403 | 19 | ||
| HLA-A*3001 | 58 | HLA-B*4501 | 22 | ||
| HLA-A*3002 | 43 | HLA-B*4601 | 42 | ||
| HLA-A*3101 | 30 | HLA-B*4801 | 71 | ||
| HLA-A*3201 | 72 | HLA-B*5101 | 43 | ||
| HLA-A*3207 | 109 | HLA-B*5301 | 46 | ||
| HLA-A*3215 | 101 | HLA-B*5401 | 34 | ||
| HLA-A*3301 | 32 | HLA-B*5701 | 24 | ||
| HLA-A*6601 | 132 | HLA-B*5801 | 47 | ||
| HLA-A*6801 | 45 | HLA-B*5802 | 26 | ||
| HLA-A*6802 | 62 | HLA-B*7301 | 58 | ||
| HLA-A*6823 | 107 | HLA-B*8301 | 97 | ||
| HLA-A*6901 | 96 | ||||
| HLA-A*8001 | 70 | ||||
Fig 1Distribution of mutations over the entire reference S protein.
Multiple alignment analysis of 1000 S protein sequences of SARS-CoV-2 with the reference S protein was performed as described. The observed amino acids with mutations were highlighted in red and underlined on the reference S protein. Mutation of the boxed amino acid at position 5 (L to F) was observed 16 times, at position 614 (D to G) 632 times and at position 1124 (G to V) 13 times, respectively. All other mutations occurred less than 10 times each, typically 1–2 times.
Effect of L5F mutation on the generation of SARS-CoV-2 S protein specific CD8 T cell epitope FVFFVLLPL by different HLA alleles.
| Effect | HLA alleles |
|---|---|
| Affinity increased | HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0211, HLA-A*0212, HLA-A*0216, HLA-A*0250, HLA-A*2601, HLA-A2*603, HLA-A*3201, HLA-A*3207, HLA-A*3215, HLA-A*6601, HLA-A*6802, HLA-A*6823, HLA-A*6901, HLA-A*8001, HLA-B*1402, HLA-B*1517, HLA-B*2720, HLA-B*3501, HLA-B*4013, HLA-B*4201, HLA-B*4601, HLA-B*4801, HLA-B*5301, HLA-B*7301, HLA-B*8301, HLA-C*0303, HLA-C*0401, HLA-C*0602, HLA-C*0702, HLA-C*0802, HLA-C*1203, HLA-C*1402, HLA-C*1502 |
| Affinity decreased | HLA-A*0207, HLA-A*0217, HLA-A*0219, HLA-A*2501, HLA-B*0801, HLA-B*0803, HLA-B*1509, HLA-B*3503, HLA-B*3901, HLA-C*0701 |
| Epitope gained | HLA-A*2301, HLA-A*2602, HLA-A*2902, HLA-B*1502, HLA-B*5101 |
| Epitope lost | None |
Effect of G1124V mutation on the generation of SARS-CoV-2 S protein specific CD8 T cell epitope FVSVNCDVV by different HLA alleles.
| Effect | HLA alleles |
|---|---|
| Affinity increased | None |
| Affinity decreased | HLA-A*0202, HLA-A*0206, HLA-A*0212, HLA-A*0216, HLA-A*0219, HLA-A*0250, HLA-A*6802, HLA-C*0303, HLA-C*0501, HLA-C*0602, HLA-C*0802, HLA-C*1203, HLA-C*1502 |
| Epitope gained | None |
| Epitope lost | HLA-A*0201, HLA-A*0203, HLA-A*0207, HLA-A*0211 HLA-A*6901, HLA-C*0701 |
List of CD8 T cell epitopes containing the amino acid D614 on the reference S protein of SARS-CoV-2.
| HLA alleles | Epitope | |
|---|---|---|
| Position | Sequence | |
| HLA-A*0101 | 612 | YQDVNCTEV |
| HLA-A*0201 | 612 | YQDVNCTEV |
| HLA-A*0206 | 612 | YQDVNCTEV |
| HLA-A*0207 | 612 | YQDVNCTEV |
| HLA-A*0211 | 610 | VLYQDVNCT |
| 612 | YQDVNCTEV | |
| 614 | DVNCTEVPV | |
| HLA-A*0212 | 610 | VLYQDVNCT |
| 612 | YQDVNCTEV | |
| 614 | DVNCTEVPV | |
| HLA-A*0216 | 610 | VLYQDVNCT |
| 612 | YQDVNCTEV | |
| 614 | DVNCTEVPV | |
| HLA-A*0219 | 610 | VLYQDVNCT |
| 612 | YQDVNCTEV | |
| 614 | DVNCTEVPV | |
| HLA-A*0250 | 607 | QVAVLYQDV |
| 610 | VLYQDVNCT | |
| 612 | YQDVNCTEV | |
| HLA-A*2603 | 614 | DVNCTEVPV |
| HLA-A*6601 | 612 | YQDVNCTEV |
| HLA-A*6802 | 607 | QVAVLYQDV |
| 614 | DVNCTEVPV | |
| HLA-A*6901 | 607 | QVAVLYQDV |
| 614 | DVNCTEVPV | |
| HLA-B*1509 | 612 | YQDVNCTEV |
| HLA-B*2720 | 612 | YQDVNCTEV |
| HLA-B*3901 | 612 | YQDVNCTEV |
| HLA-B*4801 | 606 | NQVAVLYQD |
| HLA-C*0501 | 612 | YQDVNCTEV |
| HLA-C*0602 | 612 | YQDVNCTEV |
| HLA-C*0802 | 612 | YQDVNCTEV |
| HLA-C*1203 | 612 | YQDVNCTEV |
| HLA-C*1402 | 611 | LYQDVNCTE |
Effect of D614G mutation on the generation of SARS-CoV-2 S protein specific CD8 T cell epitopes by different HLA alleles.
| Alleles | Epitopes gained | Epitopes lost | Affinity decreased | Affinity increased | ||||
|---|---|---|---|---|---|---|---|---|
| Pos | Sequence | Pos | Sequence | Pos | Sequence | Pos | Sequence | |
| HLA-A*0101 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-A*0201 | None | None | None | 612 | YQGVNCTEV | |||
| HLA-A*0203 | 612 | YQGVNCTEV | None | None | None | |||
| HLA-A*0205 | 607 | QVAVLYQGV | None | None | None | |||
| 612 | YQGVNCTEV | |||||||
| HLA-A*0206 | 607 | QVAVLYQGV | None | None | 612 | YQGVNCTEV | ||
| 614 | GVNCTEVPV | |||||||
| HLA-A*0207 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-A*0211 | None | None | 610 | VLYQGVNCT | 612 | YQGVNCTEV | ||
| 614 | GVNCTEVPV | |||||||
| HLA-A*0212 | None | None | 610 | VLYQGVNCT | 612 | YQGVNCTEV | ||
| 614 | GVNCTEVPV | |||||||
| HLA-A*0216 | None | None | 610 | VLYQGVNCT | 612 | YQGVNCTEV | ||
| 614 | GVNCTEVPV | |||||||
| HLA-A*0219 | None | None | 610 | VLYQGVNCT | None | |||
| 612 | YQGVNCTEV | |||||||
| 614 | GVNCTEVPV | |||||||
| HLA-A*0250 | None | None | 607 | QVAVLYQGV | None | |||
| 610 | VLYQGVNCT | |||||||
| 612 | YQGVNCTEV | |||||||
| HLA-A*2403 | 611 | LYQGVNCTE | None | None | None | |||
| HLA-A*2501 | 607 | QVAVLYQGV | None | None | None | |||
| HLA-A*2601 | 607 | QVAVLYQGV | None | None | None | |||
| HLA-A*2603 | None | None | None | None | ||||
| HLA-A*6601 | None | None | 612 | YQGVNCTEV | None | |||
| HLA-A*6802 | None | 614 | DVNCTEVPV | 607 | QVAVLYQGV | None | ||
| HLA-A*6901 | None | 614 | DVNCTEVPV | 607 | QVAVLYQGV | None | ||
| HLA-B*1509 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-B*2720 | None | None | 612 | YQGVNCTEV | None | |||
| HLA-B*3901 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-B*4801 | None | None | None | 606 | NQVAVLYQG | |||
| HLA-C*0501 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-C*0602 | None | None | None | 612 | YQGVNCTEV | |||
| HLA-C*0802 | None | 612 | YQDVNCTEV | None | None | |||
| HLA-C*1203 | 610 | VLYQGVNCT | 612 | YQDVNCTEV | None | None | ||
| HLA-C*1402 | None | None | 611 | LYQGVNCTE | None | |||
Fig 2Phylogenetic analysis of the reference S protein of SARS-CoV-2 with four seasonal human CoVs (NL63, HKU1, 229E and OC43).
The phylogenetic tree was performed using the maximum likelihood method and JTT model with a bootstrap of 1000 replicates within the Mega-X software. Scale bar indicates the number of substitutions per site for the trees. The accession number for S proteins of SARS-CoV-2, HCoV-NL63, HCoV-HKU1, HCoV-229E and HCoV-OC43 are QHD43416.1, APF29063.1, BBA20983.1, AAG48592.1 and CAA83661.1, respectively.