Literature DB >> 34056627

Landscape of epitopes targeted by T cells in 852 individuals recovered from COVID-19: Meta-analysis, immunoprevalence, and web platform.

Ahmed Abdul Quadeer1, Syed Faraz Ahmed1, Matthew R McKay1,2.   

Abstract

Knowledge of the epitopes of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) targeted by T cells in recovered (convalescent) individuals is important for understanding T cell immunity against coronavirus disease 2019 (COVID-19). This information can aid development and assessment of COVID-19 vaccines and inform novel diagnostic technologies. Here, we provide a unified description and meta-analysis of SARS-CoV-2 T cell epitopes compiled from 18 studies of cohorts of individuals recovered from COVID-19 (852 individuals in total). Our analysis demonstrates the broad diversity of T cell epitopes that have been recorded for SARS-CoV-2. A large majority are seemingly unaffected by current variants of concern. We identify a set of 20 immunoprevalent epitopes that induced T cell responses in multiple cohorts and in a large fraction of tested individuals. The landscape of SARS-CoV-2 T cell epitopes we describe can help guide immunological studies, including those related to vaccines and diagnostics. A web-based platform has been developed to help complement these efforts.
© 2021 The Author(s).

Entities:  

Keywords:  CD4; CD8; SARS-CoV-2; T cells; convalescent patients; epitopes; immunodominant epitopes; immunoprevalent epitopes; population coverage; variants of concern

Mesh:

Substances:

Year:  2021        PMID: 34056627      PMCID: PMC8139281          DOI: 10.1016/j.xcrm.2021.100312

Source DB:  PubMed          Journal:  Cell Rep Med        ISSN: 2666-3791


Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of coronavirus disease 2019 (COVID-19), has led to a global public health crisis. Development of COVID-19 vaccines and diagnostic tests are aided by an understanding of the natural protective immune responses to SARS-CoV-2. This includes humoral and cellular immunity, mediated by antibodies and T cells, respectively. A significant amount of COVID-19 research has focused on understanding antibody responses,1, 2, 3 but studies informing on the role of T cells have also started to emerge. Initial results suggest a potential key role of T cells in protecting against COVID-19., Studies of individuals recovered from COVID-19 have detected SARS-CoV-2-specific T cells 9 months after infection, showing promising signs for the potential of T cells to provide lasting immunity. Longevity may be a concern for antibody responses, which have been reported to decline within few months after infection.7, 8, 9 Observations consistent with this have also been reported for the most closely related human CoV, SARS-CoV, for which T cells have been shown to persist up to 17 years after infection, whereas antibody responses waned after a few years. Characterizing SARS-CoV-2 T cell epitopes as well as their human leukocyte antigen (HLA) association is important for multiple reasons. It informs on the expected SARS-CoV-2 natural or vaccine-induced T cell responses in a population of specific ethnicity or a specific geographical region, which is tied to the composition of HLA alleles prevalent in that population. It can help with assessing the T cell responses that may be induced by COVID-19 vaccines (which currently mainly focus on the spike [S] protein) and provide possible directions for boosting the T cell response by including specific immunodominant epitopes. It can guide vaccine assessment studies probing whether a vaccine induces T cell responses similar to those commonly generated during natural infection. It can also aid with monitoring potential viral escape from T cell responses via genetic mutations and can facilitate development of T cell-based diagnostics for distinguishing recovered from unexposed individuals. T cell-based diagnostics may have advantages over serological assays, given the uncertainties related to the appearance and persistence of SARS-CoV-2-specific antibody responses in infected individuals.7, 8, 9, Here we present a unified account of the current knowledge (as of April 20, 2021) of SARS-CoV-2 T cell epitopes associated with individuals recovered from COVID-19. We collate and analyze data of T cell epitopes that have been identified experimentally in independent studies of blood samples from different cohorts. These data are compiled from 18 studies (15 published, 3 preprints) of T cell responses in a total of 852 individuals recovered from COVID-19 (Table 1). Our analysis highlights the different characteristics of epitopes reported for SARS-CoV-2 and identifies a specific set of epitopes that appear to induce T cell responses in multiple cohorts and in a large fraction of tested individuals. This information regarding SARS-CoV-2 T cell epitopes can provide directions for future immunological studies. We report a web dashboard we developed to support these ongoing scientific efforts.
Table 1

Summary of immunological studies reporting SARS-CoV-2 T cell epitopes targeted in individuals recovered from COVID-19 (as of April 20, 2021)

No.StudyaGeography (Country)Total IndividualsGender (Female/Male)Median Age (Range)Disease Severityb (Asymptomatic or Mild)Disease Severityb (Moderate or Severe)Blood Collection Time (Days)cInitial Peptide Selection ProceduredProteinsTotal Peptides TestedT cell AssayTotal Distinct Epitopes Identified
1Saini et al.14Denmark186/1243.5 (29–82)711As close as possible to the first positive testNetMHCpan-4.1all2,204multimer qualitative binding122
2Kared et al.15Baltimore/ Washington, USA3012/1842.5 (19–77)N/AN/A27–62 after symptom resolutionGrifoni et al.;16 Prachar et al.17all408multimer qualitative binding46
3Schulien et al.18Germany2614/1232.5 (24–56)26024 (14–70) after symptom onsetANN-4.0, SARS-CoV epitopes16S, N, M, ORF1ab, ORF3a66multimer qualitative binding37
4Poran et al.19N/A3N/AN/AN/AN/AN/AHLAthenaall23multimer qualitative binding11
5Shomuradova et al.20Moscow, Russia3116/1535 (17–59)211033 (17–49) after positive test/after disease onsetNetMHCpan-4.0 and identity with SARS-CoV > 60%S13multimer qualitative binding10
6Nielsen et al.21Denmark20392/11147 (21–79)1718644 (14–129) after symptom onsetN/AS, N, M9multimer qualitative binding9
7Chour et al.22USA20/250 (30–70)026.5 (2–13) post symptom onsetNetMHC-4.0S96multimer qualitative binding6
8Sekine et al.23Sweden6614/4151402653.5 (IQR: 45.5–61) after symptom onsetNetMHCpan-4.1all13multimer qualitative binding4
9Nguyen et al.24Melbourne, Australia71/632 (19–74)6190 (5–145) after disease onsetoverlapping peptide pools and 5 N-specific immunogenic peptides25,18,26,27S, N, MN/AICS IFN-γ release; multimer qualitative binding3
10Rha et al.28South Korea3718/1946 (21–83)231446 (19–125) after symptom onsetN/AS, N, M8multimer qualitative binding2
11Ferretti et al.26New Jersey/Louisiana, USA7855/2319.5 (0–80)552349 (11–111) post diagnosisidentification of 20-mer peptides by TScan screen29 followed by NetMHC-4.0all~240Single-allele ELISA IFN-γ; multimer qualitative bindinge28
12Nelde et al.30Germany11655/6144 (18–75)368037.7 (19–52) after positive testSYFPEITHI-1.0, NetMHCpan-4.0all120ICS IFN-γ release; ELISPOT IFN-γ release47
13Hu et al.31Chongqing, China3716/2147 (20–67)343N/ANetMHCpan-4.0, SARS-CoV epitopes32,16S, N78ELISPOT IFN-γ release15
14Habel et al.25Melbourne, Australia1810/854 (22–76)11747 (36–102) after disease onsetNetCTLpan, NetMHCpanS, N, M, ORF1ab14ICS IFN-γ release14
15Lineburg et al.33Queensland, Australia3723/1451 (20–75)7562 (46–124) after positive testoverlapping peptide pools followed by peptide matrix analysisallN/AICS IFN-γ release4
16Lee et al.34New South Wales, Australia2N/AN/A0230–60 after symptom resolutionNetwork analysis35 followed by NetMHCpan-4.1 and NetMHCIIpan-4.0S, N2ICS IFN-γ release2
17Tarke et al.36San Diego, USA9958/4141 (19–91)90967 (3–184) after symptom onsetNetMHCpan-4.0all7,525AIM assay37803
18Peng et al.27UK4216/2657 (20–95)281442 (30–62) after symptom onset15- to 18-mer peptides overlapping by 10 residues, SARS-CoV epitopes32all except ORF1450ELISPOT IFN-γ release46f

Studies reporting precise epitopes along with the cognate HLA information.

Definition of disease severity varies among studies.

IQR, interquartile range.

NetCTLpan, NetMHC-4.0, NetMHCpan, NetMHCpan-4.0, NetMHCpan-4.1, SYFPEITHI-1.0, ANN-4.0, and HLAthena.

Six (of 28) epitopes were identified using multimer qualitative binding.

Only two (of 46) epitopes that were reported as precise epitopes with the cognate HLA information were considered.

Summary of immunological studies reporting SARS-CoV-2 T cell epitopes targeted in individuals recovered from COVID-19 (as of April 20, 2021) Studies reporting precise epitopes along with the cognate HLA information. Definition of disease severity varies among studies. IQR, interquartile range. NetCTLpan, NetMHC-4.0, NetMHCpan, NetMHCpan-4.0, NetMHCpan-4.1, SYFPEITHI-1.0, ANN-4.0, and HLAthena. Six (of 28) epitopes were identified using multimer qualitative binding. Only two (of 46) epitopes that were reported as precise epitopes with the cognate HLA information were considered.

Results and discussion

In the 18 studies we considered, the set of recovered individuals covers a population across four continents and well-distributed age, gender, disease severity, and blood collection time within and across studies (Table 1). Half of the studies have characterized T cell responses against the whole proteome, whereas the others have focused on responses mounted against subsets of SARS-CoV-2 proteins, typically involving the S and nucleocapsid [N] proteins. In the majority of cases, the T cell response was measured in blood samples of individuals against a set of peptides predicted by bioinformatics tools or earlier bioinformatics-based analyses,, (reviewed in Sohail et al.), whereas a few studies employed overlapping peptide pools spanning the SARS-CoV-2 proteome. All 18 studies reported optimal epitopes along with cognate HLA information. Of these, 10 studies (1–10 in Table 1) experimentally determined HLA restrictions of the reported epitopes by using multimer qualitative binding. Of the remaining eight studies, one study (11 in Table 1) employed mono-allelic cell line assays to identify specific HLA-restricted epitopes, whereas others (12–18 in Table 1) inferred HLA restrictions using standard functional assays (such as activation-induced markers [AIMs] or intracellular cytokine staining [ICS]/enzyme-linked immunospot [ELISPOT] interferon γ [IFN-γ] release) and HLA haplotype information for individuals. Although the latter set of studies involved some predictive element in deconvolving the HLA allele responsible for the observed response from the individual’s haplotype, the reported HLA associations were supported by HLA binding assays or use of accurate peptide-HLA binding prediction methods. A total of 711 unique T cell epitopes with information of a cognate HLA allele have been reported in the 18 immunological studies we considered (see STAR Methods for details). Of these, 635 are CD8+ (HLA class I restricted), and 76 are CD4+ (HLA class II restricted) (Figure 1A). The set of epitopes covers each protein from the canonical reading frames of SARS-CoV-2 except open reading frame 7b [ORF7b] and ORF10. The largest numbers of epitopes fall within S and ORF1a—the most exposed protein and the longest SARS-CoV-2 ORF, respectively. These epitopes broadly cover almost the entire S protein, including the receptor-binding domain, whereas the majority of epitopes in ORF1a lie within the non-structural protein [nsp3] (PL2-PROPapain-like proteinase) and nsp4 proteins (File S1). These nsp3 and nsp4 proteins participate in assembly of virally induced cytoplasmic double-membrane vesicles, which are necessary for viral replication.
Figure 1

Features of SARS-CoV-2-specific T cell epitopes reported to elicit an immune response in blood samples of individuals recovered from COVID-19

(A) The number of epitopes (n = 711) according to HLA class restriction. S, spike; E, envelope; M, membrane; N, nucleocapsid.

(B) The conservation of each epitope among the global SARS-CoV-2 genetic sequences (as of April 20, 2021). Further details of the S-derived epitopes (n = 16) with low genetic conservation (<0.9) and their association with current VOCs are provided in Table S1.

(C) Diversity of HLA associations reported for SARS-CoV-2 epitopes. The number of epitopes associated with a particular HLA allele is shown in brackets.

(D) The number of HLA alleles associated with each epitope.

(E) Estimate of the global population coverage of each epitope (STAR Methods). The median is shown as a black circle and the median absolute deviation as an error bar.

Features of SARS-CoV-2-specific T cell epitopes reported to elicit an immune response in blood samples of individuals recovered from COVID-19 (A) The number of epitopes (n = 711) according to HLA class restriction. S, spike; E, envelope; M, membrane; N, nucleocapsid. (B) The conservation of each epitope among the global SARS-CoV-2 genetic sequences (as of April 20, 2021). Further details of the S-derived epitopes (n = 16) with low genetic conservation (<0.9) and their association with current VOCs are provided in Table S1. (C) Diversity of HLA associations reported for SARS-CoV-2 epitopes. The number of epitopes associated with a particular HLA allele is shown in brackets. (D) The number of HLA alleles associated with each epitope. (E) Estimate of the global population coverage of each epitope (STAR Methods). The median is shown as a black circle and the median absolute deviation as an error bar. All identified epitopes have high genetic conservation (>0.9) among the ∼860,000 SARS-CoV-2 sequences (as of April 20, 2021), except for 24 epitopes (Figure 1B). Of these, three epitopes (612YQDVNCTEV620 in S and 305NVLFSTVFPPTSFGP319 and 309STVFPPTSF317, in ORF1b) have very low conservation (∼0.02) because they encompass mutations (S:D614G and ORF1b:P314L, underlined) that are now dominant globally, with the former mutation reported to have increased virus infectivity and transmission.48, 49, 50 Two-thirds of the 24 epitopes (16 of 24) with low genetic conservation (<0.9) belong to S. These make up ∼8% (16 of 208) of all unique T cell epitopes derived from S—4-fold higher than the fraction of epitopes with low conservation derived from all other proteins (∼2%, 8 of 503). This can be of interest in the context of T cell responses against current COVID-19 vaccines that focus largely on the S protein, assuming that the T cell epitopes targeted in response to vaccination are similar to those seen in recovered individuals. Although limited data are currently available about epitope-specific T cell responses following vaccination, this assumption is partially supported by a study of eight S-derived epitopes reported to elicit T cell responses in vaccinated persons, of which we observe seven to be targeted in recovered individuals (File S1). Of the 16 S-derived T cell epitopes with low genetic conservation (<0.9), all but one encompass sites harboring mutations that define the variants of concern (VOCs) (Table S1)., For example, the epitope 495YGFQPTNGV503 encompasses the N501Y mutation associated with the three VOCs B.1.1.7, B.1.351, and P.1, and the epitopes 142GVYYHKNNK150 and 144YYHKNNKSW152 encompass the deletion at Y144 associated with two variants, B.1.1.7 and B.1.525. The higher genetic variation observed in S as opposed to other proteins is, at least in part, likely to be driven by escape from neutralizing antibodies. Despite the potential of S-derived epitopes to escape T cells elicited to target the non-mutated epitopes (by natural infection or vaccination), responses against the significant majority of S-derived epitopes we studied are not expected to be affected by VOCs (Figure 1B). Moreover, at the level of HLA alleles, for each of the 36 HLA alleles associated with S-derived epitopes, each of them is associated with at least one conserved (>0.9) S-derived epitope (File S1). Collectively, the observed heterogeneity of T cell responses against SARS-CoV-2 provides little evidence to suggest that the observed genetic variation in the S protein may significantly affect T cell immunity, in line with a recent report. Escape from T cell pressure may become an important evolutionary factor when strong and diverse selective pressure is imposed by widespread vaccination, and this requires close monitoring. The overall set of reported epitopes was associated with 52 HLA class I and II alleles (Figure 1C). Most HLA alleles are associated with multiple epitopes, with each of 21 alleles having an association with 14 epitopes or more. The same epitope may be presented by multiple HLA alleles, as evidenced by studies of the related SARS-CoV as well as other viruses. However, for the majority of SARS-CoV-2 epitopes, only a single associated HLA allele has been reported so far (Figure 1D). This appears in part to be due to the limited number of recovered COVID-19 individuals who have been studied (Table 1). The limited number of associated HLA alleles translates to a median global population coverage estimate per epitope of only 12% (Figure 1E). Thus, investigation of additional HLA alleles associated with the identified SARS-CoV-2 epitopes is required to providing a more accurate indication of their individual population coverage. An expanded list of likely HLA associations may be predicted for some of the reported epitopes by using prior knowledge of genetically matched experimentally determined T cell epitopes of SARS-CoV and their associated HLA alleles (for details, see File S1 and Figure S1). These predictions, when confirmed, would provide an increase in the median population coverage of the selected SARS-CoV-2 epitopes from 16.8% to 40%, with a few epitopes having around 60% coverage (Figure S1). Quantifying the immunodominance of each reported epitope-HLA pair using the standard response frequency (RF) metric58, 59, 60 (STAR Methods) revealed that 179 of the reported (812) epitope-HLA pairs had an RF score exceeding 0.5, indicating that they induced T cell responses in over half of the subjects tested across studies (Figure 2A). Confidence in the estimated RF values varies with the number of tested subjects, with higher confidence being attributed to epitope-HLA pairs with larger numbers of tested subjects (STAR Methods). The majority (159 of 179) of epitope-HLA pairs with an RF exceeding 0.5 had been reported in a single study only. Among these, pairs with high confidence appear promising, but responses against them should be investigated in different cohorts to further confirm their immunodominance. Responses against the remaining (20 of 179) epitope-HLA pairs were registered in more than one study, and although per-study variation was observed in the results (Figure S2), these pairs appear to be immunoprevalent. This is because responses against each epitope-HLA pair were recorded for more than half of the tested recovered individuals collectively across multiple studies despite differences in characteristics of the donor cohorts (age, gender, geographical location, and disease severity), blood collection time, and methodology used to determine the epitopes (initial peptide selection procedure and T cell assay) (Table 1).
Figure 2

Identification of immunoprevalent SARS-CoV-2 T cell epitopes

(A) Response frequency (RF) of unique epitope-HLA pairs versus the number of immunological studies reporting a T cell response against them. The size of each circle represents the confidence in the respective RF value (STAR Methods). The 20 immunoprevalent epitope-HLA pairs (having an RF exceeding 0.5 and reported in more than one study) are shown with a shaded background, and five highly immunoprevalent epitope-HLA pairs (having an RF exceeding 0.5 and reported in at least four studies) are labeled.

(B) Details of the identified 20 immunoprevalent epitope-HLA pairs (ordered according to decreasing RF). Epitope-HLA pairs matched genetically to those determined experimentally for SARS-CoV are marked (#).

Identification of immunoprevalent SARS-CoV-2 T cell epitopes (A) Response frequency (RF) of unique epitope-HLA pairs versus the number of immunological studies reporting a T cell response against them. The size of each circle represents the confidence in the respective RF value (STAR Methods). The 20 immunoprevalent epitope-HLA pairs (having an RF exceeding 0.5 and reported in more than one study) are shown with a shaded background, and five highly immunoprevalent epitope-HLA pairs (having an RF exceeding 0.5 and reported in at least four studies) are labeled. (B) Details of the identified 20 immunoprevalent epitope-HLA pairs (ordered according to decreasing RF). Epitope-HLA pairs matched genetically to those determined experimentally for SARS-CoV are marked (#). All of the 20 identified immunoprevalent epitopes have high genetic conservation (>0.9) (Figure 2B), and none of them encompass mutations associated with the current VOCs. Interestingly, 35% of the immunoprevalent epitope-HLA pairs (n = 20) had an identical match to experimentally determined SARS-CoV epitope-HLA pairs (Figure 2B). This fraction of matched epitope-HLA pairs is over three times higher than that (11%) in the complete set of SARS-CoV-2 epitope-HLA pairs (p < 10−3, Fisher’s exact test), suggesting that many of the immunoprevalent T cell epitopes of SARS-CoV-2 are also cross-reactive to SARS-CoV. Of the 20 identified immunoprevalent epitope-HLA pairs, six each belonged to N and ORF1a, four to S, three to ORF3a, and one to the membrane [M] protein (Figure 2B). This indicates that ∼80% of the immunoprevalent epitopes lie in proteins other than S, suggesting that vaccine candidates targeting these proteins may have benefits in terms of T cell immunity. Of the identified immunoprevalent epitopes, five epitopes (HLA-A∗02:01-restricted 269YLQPRTFLL277 in S, HLA-B∗07:02-restricted 105SPRWYFYYL113 in N, HLA-A∗01:01-restricted 207FTSDYYQLY215 in ORF3a and 1637TTDPSFLGRY1646 in ORF1a, and HLA-A∗24:02-restricted 1208QYIKWPWYI1216 in S) appeared to be highly immunoprevalent, eliciting T cell responses in more than ∼60% of the tested individuals recovered from COVID-19 in four immunological studies or more. Collectively, around 71% of the global population is estimated to carry the associated HLA alleles and, hence, may generate a T cell response against at least one of these five epitopes. Two of these highly immunoprevalent epitopes (105SPRWYFYYL113 in N and 269YLQPRTFLL277 in S) are attracting considerable attention, and detailed analyses of T cell responses against them have been reported.,, The SARS-CoV-2 T cell epitope data we compiled and report here were integrated into a web-based dashboard (Figure 3). This dashboard provides exportable data tables listing the SARS-CoV-2 epitopes and graphic displays to summarize different characteristics of the epitopes, including aggregate information as well as specific details of the individual epitopes. We plan to update the dashboard with new experimental information as it becomes available, with the goal of aiding further research to understand T cell responses against SARS-CoV-2 and guide studies related to COVID-19 vaccines and diagnostics. Although we focused the current study on SARS-CoV-2-specific T cell responses recorded in recovered individuals, knowledge of T cell epitopes targeted in animal studies, is also informative. These may inform potential immune targets in COVID-19-infected individuals and can help guide further immunological experiments that seek to probe T cell responses that arise because of natural infection or those elicited by vaccination. The SARS-CoV-2 epitopes reported to be targeted in animal models were also incorporated into the web dashboard.
Figure 3

Snapshot of the web dashboard developed for reporting and analyzing SARS-CoV-2 T cell epitope data (as of April 20, 2021)

The web dashboard provides aggregated information regarding the T cell epitopes and their HLA associations. Exportable data tables are provided to aid further research.

Snapshot of the web dashboard developed for reporting and analyzing SARS-CoV-2 T cell epitope data (as of April 20, 2021) The web dashboard provides aggregated information regarding the T cell epitopes and their HLA associations. Exportable data tables are provided to aid further research. Overall, the data we described, based on recent experimental studies, demonstrates an impressive and diverse list of SARS-CoV-2 T cell epitopes targeted by individuals recovered from COVID-19. Subsets of these epitopes exhibit desirable properties, including high genetic conservation and high RF across multiple cohorts, and they appear to have the potential to collectively induce a T cell response in a large fraction of the population. Current knowledge of the landscape of T cell epitopes for SARS-CoV-2 is still evolving, and further studies of different cohorts of recovered individuals, encompassing a broad diversity of HLA profiles, are required to provide a more comprehensive understanding. Moreover, further systematic studies are required to ascertain possible correlates between the responses against T cell epitopes and disease protection. Knowledge of SARS-CoV-2 T cell epitopes could play an important role in contributing to the fight against COVID-19 by guiding diverse applications and novel technologies, including development, assessment, and monitoring of vaccines and development of improved diagnostic assays.

Limitations of the study

There are multiple limitations of our study. Our analysis is unable to make associations between epitope-specific T cell responses and levels of disease severity, nor can it capture differences between T cell responses according to age or gender. The majority of the immunological studies we investigate do not report epitope-specific T cell responses at this level of detail, and, hence, such an analysis could not be performed. In terms of the HLA associations of the reported SARS-CoV-2 epitopes, our study predicted additional associations based on homology with experimental T cell epitopes of SARS-CoV and their associated HLA alleles (File S1). These additional HLA associations, although promising, still need to be confirmed for SARS-CoV-2 by further immunological studies. Moreover, our analysis mainly involved CD8+ T cell epitopes. This was due to the limited number of CD4+ T cell epitopes that have been reported so far with a unique HLA allele association. Further experimental studies are needed to precisely identify CD4+ T cell epitopes with distinct HLA alleles. This would help to determine potential immunoprevalent CD4+ epitopes in the global population, similar to those reported for CD8+ T cells in this work, and would contribute to providing a more comprehensive understanding of the relationship between T cell responses and convalescence for COVID-19.

STAR★Methods

Key resources table

Resource availability

Lead contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact Matthew R. McKay (m.mckay@ust.hk).

Materials availability

This study did not generate new materials.

Data and code availability

Compiled data of the SARS-CoV-2 T cell epitopes is available to download from the web dashboard, https://www.mckayspcb.com/SARS2TcellEpitopes/. File S1 has been deposited to Mendeley Data: http://dx.doi.org/10.17632/fwn3kbbh6y.1.

Experimental model and subject details

We compiled data from 20 immunological studies that reported SARS-CoV-2 epitopes targeted by T cells in individuals recovered from COVID-19. Two studies, reported responses against synthetic peptide pool libraries using functional or molecular assays, and identified long immunogenic peptides. As complete information of epitopes was not available in these studies, we focused on the remaining 18 studies that provided precise epitopes along with their HLA restriction (Table 1). All statistics related to the patients that participated in each of these 18 studies (age, gender, geographical location, disease severity, blood collection time) are summarized in Table 1. Across the considered 18 studies, the total number of recovered COVID-19 individuals was 852. A total of 1,209 epitopes were obtained from these immunological studies (Table 1). Removing the epitopes with no HLA allele information and those for which the number of tested and responded patients was not reported at a distinct epitope-HLA level resulted in a total of 711 unique epitopes (812 unique epitope-HLA pairs; listed in File S1).

Method details

Sequence data, epitope conservation and coverage

SARS-CoV-2 genomic sequences were obtained from the GISAID database (https://www.gisaid.org/) on 20 April 2021. We downloaded only the complete (full-genome) sequences derived from human hosts with high coverage using the options provided on the GISAID database. All of the 859,233 downloaded sequences were aligned to the SARS-CoV-2 reference genome (GenBank: NC_045512.2) using MAFFT. The genomic MSA was translated using an in-house code to obtain the protein MSAs. The positions of the open reading frames provided with the reference sequence were used to identify the respective protein regions of the full genome. The conservation of each SARS-CoV-2 T cell epitope was calculated as the fraction of SARS-CoV-2 sequences that encompassed the precise epitope sequence. The coverage of SARS-CoV-2 T cell epitopes at any position of a protein was calculated by counting the number of epitopes that included that position.

Response frequency (RF)

RF score was used to quantify the immunodominance of the SARS-CoV-2 epitopes reported to be recognized by T cells in recovered COVID-19 individuals. The RF score of an epitope is defined as follows:where is the number of subjects responding to the epitope in study , is the number of subjects tested for a response against the epitope in study , and is the total number of studies. An RF score calculated using a large number of tested subjects would be more reliable than one calculated using relatively few subjects. To account for this, we computed the 95% confidence interval for the RF score of each epitope using the binomial cumulative distribution function. In Figure 2, we defined the confidence in RF value of an epitope as the inverse of the length of the corresponding 95% confidence interval. That is, values of RF with a smaller 95% confidence interval have higher confidence, and vice versa.

Estimating global population coverage of epitopes

The global population coverage of an epitope refers to the percentage of individuals in the world population that is expected to mount a T cell response against that epitope. The population coverage of a T cell epitope was calculated based on the HLA alleles associated with it using the tool downloaded from the IEDB Analysis Resource (http://tools.iedb.org/population/download/). This tool employs global HLA allele frequency data obtained from the Allele Frequency Net Database (http://www.allelefrequencies.net/) to estimate the population coverage.

Quantification and statistical analysis

Statistical analyses were performed using MATLAB (R2019b) and the R language (version 3.6) using the RStudio server (version 1.3). The web-based platform was developed using the open source R Shiny (version 1.5) development framework. Fisher’s exact test was used to compute the statistical significance associated with enrichment of SARS-CoV epitopes among immunoprevalent SARS-CoV-2 epitopes. The 95% confidence interval for the RF score of each epitope in Figure 2 was determined using the binomial cumulative distribution function.
REAGENT or RESOURCESOURCEIDENTIFIER
Deposited data

T cell epitopes from convalescent COVID-19 patientsChour et al.22Figure 3
T cell epitopes from convalescent COVID-19 patientsShomuradova et al.20Table 1
T cell epitopes from convalescent COVID-19 patientsNelde et al.30Extended Data Tables 2 and 3
T cell epitopes from convalescent COVID-19 patientsPoran et al.19Figure 3
T cell epitopes from convalescent COVID-19 patientsFerretti et al.26Table 1
T cell epitopes from convalescent COVID-19 patientsPeng et al.27Tables 1 and 2
T cell epitopes from convalescent COVID-19 patientsKared et al.15Figure S2
T cell epitopes from convalescent COVID-19 patientsSchulien et al.18Table S1
T cell epitopes from convalescent COVID-19 patientsHabel et al.25Figure 2B
T cell epitopes from convalescent COVID-19 patientsHu et al.31Table 1
T cell epitopes from convalescent COVID-19 patientsNielsen et al.21Figure 5B
T cell epitopes from convalescent COVID-19 patientsTarke et al.36Tables S3 and S5
T cell epitopes from convalescent COVID-19 patientsSekine et al.23Table S2
T cell epitopes from convalescent COVID-19 patientsSaini et al.14Table S5
T cell epitopes from convalescent COVID-19 patientsLee et al.34Figure 13
T cell epitopes from convalescent COVID-19 patientsRha et al.28Figure 1C
T cell epitopes from convalescent COVID-19 patientsLineburg et al.33Table S3
T cell epitopes from convalescent COVID-19 patientsNguyen et al.24Figure 3C
Response frequencies of SARS-CoV-2 T cell epitopesThis paperFile S1; Mendeley data: https://dx.doi.org/10.17632/fwn3kbbh6y.1
Genetic conservation of SARS-CoV-2 T cell epitopes among SARS-CoV-2 sequences (as of 20 April 2021)This paperFile S1; Mendeley data: https://dx.doi.org/10.17632/fwn3kbbh6y.1
Total subjects tested for each SARS-CoV-2 T cell epitope across studiesThis paperFile S1; Mendeley data: https://dx.doi.org/10.17632/fwn3kbbh6y.1
Total subjects responded to each SARS-CoV-2 T cell epitope across studiesThis paperFile S1; Mendeley data: https://dx.doi.org/10.17632/fwn3kbbh6y.1
Additional HLA alleles predicted for SARS-CoV-2 T cell epitopesThis paperFile S1; Mendeley data: https://dx.doi.org/10.17632/fwn3kbbh6y.1
SARS-CoV-2 genome sequence data for conservation analysisGISAID (https://www.gisaid.org)All full genome and high coverage sequences available as of 20 April 2021
SARS-CoV-2 reference genome for aligning the sequenceshttps://www.ncbi.nlm.nih.gov/nuccore/NC_045512.2GenBank: NC_045512.2
SARS-CoV T cell epitope dataIEDB; https://www.iedb.org/IEDB was queried for epitopes with positive MHC binding and positive T cell assays using “Severe acute respiratory syndrome-related coronavirus” as “Organism” on 21 February 2020.

Software and algorithms

SARS-CoV-2 T cell epitope web-dashboardThis paperhttps://www.mckayspcb.com/SARS2TcellEpitopes/
MAFFTKatoh and Standley65https://mafft.cbrc.jp/alignment/software/
Estimated population coverage of SARS-CoV-2 T cell epitopes based on associated HLA allelesIEDB Analysis Resource - Population coverage toolhttp://tools.iedb.org/population/download/
  55 in total

1.  T-Scan: A Genome-wide Method for the Systematic Discovery of T Cell Epitopes.

Authors:  Tomasz Kula; Mohammad H Dezfulian; Charlotte I Wang; Nouran S Abdelfattah; Zachary C Hartman; Kai W Wucherpfennig; Herbert Kim Lyerly; Stephen J Elledge
Journal:  Cell       Date:  2019-08-08       Impact factor: 41.582

2.  SARS-CoV-2 genome-wide T cell epitope mapping reveals immunodominance and substantial CD8+ T cell activation in COVID-19 patients.

Authors:  Ditte Stampe Hersby; Tripti Tamhane; Sunil Kumar Saini; Helle Rus Povlsen; Susana Patricia Amaya Hernandez; Morten Nielsen; Anne Ortved Gang; Sine Reker Hadrup
Journal:  Sci Immunol       Date:  2021-04-14

3.  NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data.

Authors:  Birkir Reynisson; Bruno Alvarez; Sinu Paul; Bjoern Peters; Morten Nielsen
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

4.  SARS-CoV-2-derived peptides define heterologous and COVID-19-induced T cell recognition.

Authors:  Annika Nelde; Tatjana Bilich; Jonas S Heitmann; Yacine Maringer; Helmut R Salih; Malte Roerden; Maren Lübke; Jens Bauer; Jonas Rieth; Marcel Wacker; Andreas Peter; Sebastian Hörber; Bjoern Traenkle; Philipp D Kaiser; Ulrich Rothbauer; Matthias Becker; Daniel Junker; Gérard Krause; Monika Strengert; Nicole Schneiderhan-Marra; Markus F Templin; Thomas O Joos; Daniel J Kowalewski; Vlatka Stos-Zweifel; Michael Fehr; Armin Rabsteyn; Valbona Mirakaj; Julia Karbach; Elke Jäger; Michael Graf; Lena-Christin Gruber; David Rachfalski; Beate Preuß; Ilona Hagelstein; Melanie Märklin; Tamam Bakchoul; Cécile Gouttefangeas; Oliver Kohlbacher; Reinhild Klein; Stefan Stevanović; Hans-Georg Rammensee; Juliane S Walz
Journal:  Nat Immunol       Date:  2020-09-30       Impact factor: 25.606

5.  SARS-CoV-2 elicits robust adaptive immune responses regardless of disease severity.

Authors:  Stine Sf Nielsen; Line K Vibholm; Ida Monrad; Rikke Olesen; Giacomo S Frattari; Marie H Pahus; Jesper F Højen; Jesper D Gunst; Christian Erikstrup; Andreas Holleufer; Rune Hartmann; Lars Østergaard; Ole S Søgaard; Mariane H Schleimann; Martin Tolstrup
Journal:  EBioMedicine       Date:  2021-06-04       Impact factor: 8.143

6.  NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence.

Authors:  Morten Nielsen; Claus Lundegaard; Thomas Blicher; Kasper Lamberth; Mikkel Harndahl; Sune Justesen; Gustav Røder; Bjoern Peters; Alessandro Sette; Ole Lund; Søren Buus
Journal:  PLoS One       Date:  2007-08-29       Impact factor: 3.240

7.  Connecting clusters of COVID-19: an epidemiological and serological investigation.

Authors:  Sarah Ee Fang Yong; Danielle Elizabeth Anderson; Wycliffe E Wei; Junxiong Pang; Wan Ni Chia; Chee Wah Tan; Yee Leong Teoh; Priyanka Rajendram; Matthias Paul Han Sim Toh; Cuiqin Poh; Valerie T J Koh; Joshua Lum; Nur-Afidah Md Suhaimi; Po Ying Chia; Mark I-Cheng Chen; Shawn Vasoo; Benjamin Ong; Yee Sin Leo; Linfa Wang; Vernon J M Lee
Journal:  Lancet Infect Dis       Date:  2020-04-21       Impact factor: 25.071

8.  Suboptimal SARS-CoV-2-specific CD8+ T cell response associated with the prominent HLA-A*02:01 phenotype.

Authors:  Jennifer R Habel; Thi H O Nguyen; Carolien E van de Sandt; Jennifer A Juno; Priyanka Chaurasia; Kathleen Wragg; Marios Koutsakos; Luca Hensen; Xiaoxiao Jia; Brendon Chua; Wuji Zhang; Hyon-Xhi Tan; Katie L Flanagan; Denise L Doolan; Joseph Torresi; Weisan Chen; Linda M Wakim; Allen C Cheng; Peter C Doherty; Jan Petersen; Jamie Rossjohn; Adam K Wheatley; Stephen J Kent; Louise C Rowntree; Katherine Kedzierska
Journal:  Proc Natl Acad Sci U S A       Date:  2020-09-10       Impact factor: 11.205

9.  Unbiased Screens Show CD8+ T Cells of COVID-19 Patients Recognize Shared Epitopes in SARS-CoV-2 that Largely Reside outside the Spike Protein.

Authors:  Andrew P Ferretti; Tomasz Kula; Yifan Wang; Dalena M V Nguyen; Adam Weinheimer; Garrett S Dunlap; Qikai Xu; Nancy Nabilsi; Candace R Perullo; Alexander W Cristofaro; Holly J Whitton; Amy Virbasius; Kenneth J Olivier; Lyndsey R Buckner; Angela T Alistar; Eric D Whitman; Sarah A Bertino; Shrikanta Chattopadhyay; Gavin MacBeath
Journal:  Immunity       Date:  2020-10-20       Impact factor: 31.745

View more
  17 in total

1.  Immunogenic epitope panel for accurate detection of non-cross-reactive T cell response to SARS-CoV-2.

Authors:  Aleksei Titov; Regina Shaykhutdinova; Olga V Shcherbakova; Yana V Serdyuk; Savely A Sheetikov; Ksenia V Zornikova; Alexandra V Maleeva; Alexandra Khmelevskaya; Dmitry V Dianov; Naina T Shakirova; Dmitry B Malko; Maxim Shkurnikov; Stepan Nersisyan; Alexander Tonevitsky; Ekaterina Khamaganova; Anton V Ershov; Elena Y Osipova; Ruslan V Nikolaev; Dmitry E Pershin; Viktoria A Vedmedskia; Michael Maschan; Victoria R Ginanova; Grigory A Efimov
Journal:  JCI Insight       Date:  2022-05-09

2.  Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes.

Authors:  Juan Rodriguez-Rivas; Giancarlo Croce; Maureen Muscat; Martin Weigt
Journal:  Proc Natl Acad Sci U S A       Date:  2022-01-25       Impact factor: 11.205

3.  Cellular Responses to Membrane and Nucleocapsid Viral Proteins Are Also Boosted After SARS-CoV-2 Spike mRNA Vaccination in Individuals With Either Past Infection or Cross-Reactivity.

Authors:  Alejandro Vallejo; Adrián Martín-Hondarza; Sandra Gómez; Héctor Velasco; Pilar Vizcarra; Johannes Haemmerle; José L Casado
Journal:  Front Microbiol       Date:  2022-02-11       Impact factor: 5.640

Review 4.  T Cells Targeting SARS-CoV-2: By Infection, Vaccination, and Against Future Variants.

Authors:  Thi H O Nguyen; Carolyn A Cohen; Louise C Rowntree; Maireid B Bull; Asmaa Hachim; Katherine Kedzierska; Sophie A Valkenburg
Journal:  Front Med (Lausanne)       Date:  2021-12-24

5.  Cellular therapies for the treatment and prevention of SARS-CoV-2 infection.

Authors:  Susan R Conway; Michael D Keller; Catherine M Bollard
Journal:  Blood       Date:  2022-07-21       Impact factor: 25.476

6.  Cytotoxic T-Cell-Based Vaccine against SARS-CoV-2: A Hybrid Immunoinformatic Approach.

Authors:  Alexandru Tirziu; Virgil Paunescu
Journal:  Vaccines (Basel)       Date:  2022-01-30

7.  Landscape and selection of vaccine epitopes in SARS-CoV-2.

Authors:  Christof C Smith; Kelly S Olsen; Benjamin G Vincent; Alex Rubinsteyn; Kaylee M Gentry; Maria Sambade; Wolfgang Beck; Jason Garness; Sarah Entwistle; Caryn Willis; Steven Vensko; Allison Woods; Misha Fini; Brandon Carpenter; Eric Routh; Julia Kodysh; Timothy O'Donnell; Carsten Haber; Kirsten Heiss; Volker Stadler; Erik Garrison; Adam M Sandor; Jenny P Y Ting; Jared Weiss; Krzysztof Krajewski; Oliver C Grant; Robert J Woods; Mark Heise
Journal:  Genome Med       Date:  2021-06-14       Impact factor: 15.266

8.  SARS-CoV-2 T Cell Responses Elicited by COVID-19 Vaccines or Infection Are Expected to Remain Robust against Omicron.

Authors:  Syed Faraz Ahmed; Ahmed Abdul Quadeer; Matthew R McKay
Journal:  Viruses       Date:  2022-01-02       Impact factor: 5.048

9.  A New Epitope Selection Method: Application to Design a Multi-Valent Epitope Vaccine Targeting HRAS Oncogene in Squamous Cell Carcinoma.

Authors:  Kush Savsani; Gabriel Jabbour; Sivanesan Dakshanamurthy
Journal:  Vaccines (Basel)       Date:  2021-12-31

10.  T cell epitopes in SARS-CoV-2 proteins are substantially conserved in the Omicron variant.

Authors:  Seong Jin Choi; Dong-Uk Kim; Ji Yun Noh; Sangwoo Kim; Su-Hyung Park; Hye Won Jeong; Eui-Cheol Shin
Journal:  Cell Mol Immunol       Date:  2022-01-18       Impact factor: 11.530

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.