| Literature DB >> 35143694 |
Paul R Buckley1,2, Chloe H Lee1,2, Mariana Pereira Pinho1, Rosana Ottakandathil Babu1,2, Jeongmin Woo1,2, Agne Antanaviciute1,2, Alison Simmons1, Graham Ogg1, Hashem Koohy1,2.
Abstract
The conditions and extent of cross-protective immunity between severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and common-cold human coronaviruses (HCoVs) remain open despite several reports of pre-existing T cell immunity to SARS-CoV-2 in individuals without prior exposure. Using a pool of functionally evaluated SARS-CoV-2 peptides, we report a map of 126 immunogenic peptides with high similarity to 285 MHC-presented peptides from at least one HCoV. Employing this map of SARS-CoV-2-non-homologous and homologous immunogenic peptides, we observe several immunogenic peptides with high similarity to human proteins, some of which have been reported to have elevated expression in severe COVID-19 patients. After combining our map with SARS-CoV-2-specific TCR repertoire data from COVID-19 patients and healthy controls, we show that public repertoires for the majority of convalescent patients are dominated by TCRs cognate to non-homologous SARS-CoV-2 peptides. We find that for a subset of patients, >50% of their public SARS-CoV-2-specific repertoires consist of TCRs cognate to homologous SARS-CoV-2-HCoV peptides. Further analysis suggests that this skewed distribution of TCRs cognate to homologous or non-homologous peptides in COVID-19 patients is likely to be HLA-dependent. Finally, we provide 10 SARS-CoV-2 peptides with known cognate TCRs that are conserved across multiple coronaviruses and are predicted to be recognized by a high proportion of the global population. These findings may have important implications for COVID-19 heterogeneity, vaccine-induced immune responses, and robustness of immunity to SARS-CoV-2 and its variants.Entities:
Keywords: CD8 T cells; SARS-CoV-2; T cell cross-reactivity; T cell response; coronavirus; immunogenicity; peptide presentation; pre-existing T cell immunity
Mesh:
Substances:
Year: 2022 PMID: 35143694 PMCID: PMC9111820 DOI: 10.1111/imm.13451
Source DB: PubMed Journal: Immunology ISSN: 0019-2805 Impact factor: 7.215
FIGURE 1Overview of the study. (a) An illustration describing the curation of a SARS‐CoV‐2 peptide immunogenicity dataset from published data. (b) A map of potential cross‐reactive SARS‐CoV‐2 peptides and their target counterparts from HCoVs is presented. (c) A set of immunogenic SARS‐CoV‐2 peptides with highsimilarity to the human proteome is reported. (d) The extent of cross‐reactivity and common‐specificity within SARS‐CoV‐2 is examined. (e) The landscape of potential SARS‐CoV‐2‐specific cross‐reactive public TCRs in health and/or COVID‐19 disease is detailed. (f) A set of 10 peptides which are highly conserved across HCoVs, SARS, SARS‐CoV‐2 and MERS are predicted to exhibit high global and regional ‘population coverage’ is identified
FIGURE 2A comprehensive pool of functionally validated SARS‐CoV‐2 peptides. Barplots showing (a) The number of SARS‐CoV‐2 peptides deemed’positive’ or ‘negative’. ‘Negative’ label reflects only negative qualitative observations and ‘positive’ reflects at least one immunogenic observation. (b) The number of total observations, including all assay and HLA combinations for each peptide. (c) The distribution of lengths of unique observations (peptide‐immunogenicity) in our SARS‐CoV‐2 dataset. Left panel shows nonimmunogenic ‘Negative’ peptides. Right panel shows immunogenic or ‘Positive’ peptides. MHC class of the unique observation is colour coded. (d) The frequencies of total immunogenic or non‐immunogenic observations (peptide‐MHC‐immunogenicity), where a specific HLA allele is available for a peptide. Numeric labels show the number of peptides in each group. (e) The logs odd ratio of observed and expected number of presented peptides of lengths 9 and 10 by common HLA alleles for SARS‐CoV‐2 proteins >100 amino acids in length. Significance calculated using binomial distribution. (f) The frequency of immunogenic and nonimmunogenic peptides as presented by HLA alleles arising from SARS‐CoV‐2. Numeric labels show the number of observations per immunogenicity status
FIGURE 3A set of peptides from human coronavirus strains with high similarity to immunogenic SARS‐CoV‐2 peptides. (a) A barplot showing the number of high similarity matches predicted to bind a set of common HLA class I and class II alleles. (b) A barplot showing the number of unique high similarity matches derived from each human common‐cold‐causing coronavirus. Each hit is defined as a unique observation—that fulfills all three criteria defined in our similarity metric—between an immunogenic SARS‐CoV‐2 peptide with length l, and a stretch of lengthlfrom one viral protein. (c) A dot and line plot showing each SARS‐CoV‐2 peptide and to which common‐cold‐causing coronavirus it exhibits a high similarity match. The size of each point represents the MatchScore, which reflects the physicochemical similarity with a counterpart HCoV match. Peptides are grouped by the number of high similarity matches one exhibits to human coronavirus strains (either 1, 2, 3 or 4). (d) Barplots showing the number of unique SARS‐CoV‐2 peptides (left) and SARS‐CoV‐2‐HCoV matches (right), at different thresholds of the sequence homology metric, i.e the % of the amino acids that must be conserved between the SARS‐CoV‐2 peptide and its HCoV match. (e) Sequence logo plots comparing amino acid usage of SARS‐CoV‐2‐homologous and non‐homologous 9‐mer peptide sequences
FIGURE 4A pool of immunogenic SARS‐CoV‐2 peptides with high similarity to human genes. (a) Jittered boxplots showing the similarity—evaluated by the ‘MatchScore’—of nonimmunogenic and immunogenic SARS‐CoV‐2 peptides with sequences derived from the human proteome of lengths 9 (immunogenic n = 906, nonimmunogenic n = 734 peptides) and 10 (immunogenic n = 394 and nonimmunogenic n = 394 peptides). The green dot indicates the median of each group, red line shows the median of the immunogenic group. (b) A barplot showing the physicochemical similarity scores (MatchScores) of peptides from the human proteome which were found to have high similarity with immunogenic SARS‐CoV‐2 peptides, labelled by protein length. (c) A dotplot showing the predicted HLAs of peptides from the human proteome which exhibit ‘high similarity’ to SARS‐CoV‐2 proteome. The size of the point reflects the MatchScore. An ‘X’ shows where the SARS‐CoV‐2 derived peptide and the match are predicted to bind the same allele. (d) A bar chart showing the genes from which the high similarity match peptides arise from the human proteome, grouped by whether the SARS‐CoV‐2 peptide exhibits a high similarity match with HCoV (Homologous) or not (Nonhomologous)
SARS‐CoV‐2 peptides with high similarity to the human proteome, from genes reported to be involved with severe COVID‐19. Note, peptides derived from CD163 were not predicted to bind HLA
| Human match | MatchScore | SARS‐CoV−2 Peptide | HLA Allele | Homologous/Non‐homologous | Protein | Length | Proportion conserved |
|---|---|---|---|---|---|---|---|
| STAALAVLL | 0·806 | STAALGVLM | HLA‐A*26:01 | Non‐homologous | >sp|P10147|CCL3_HUMAN C‐C motif chemokine 3 OS=Homo sapiens OX=9606 GN=CCL3 PE=1 SV=1 | 9 | 0·778 |
| STAALAVLL | 0·806 | STAALGVLM | HLA‐A*26:01 | Non‐homologous | >sp|P16619|CL3L1_HUMAN C‐C motif chemokine 3‐like 1 OS=Homo sapiens OX=9606 GN=CCL3L1 PE=1 SV=1 | 9 | 0·778 |
| ILGVVLLAIF | 0·818 | IVGVALLAVF | HLA class I | Non‐homologous | >sp|Q86VB7|C163A_HUMAN Scavenger receptor cysteine‐rich type 1 protein M130 OS=Homo sapiens OX=9606 GN=CD163 PE=1 SV=2 | 10 | 0·7 |
| ILGVVLLAIF | 0·818 | IVGVALLAVF | HLA class I | Non‐homologous | >tr|F5GZZ9|F5GZZ9_HUMAN Scavenger receptor cysteine‐rich type 1 protein M130 OS=Homo sapiens OX=9606 GN=CD163 PE=1 SV=1 | 10 | 0·7 |
| ILGVVLLAIF | 0·818 | IVGVALLAVF | HLA class I | Non‐homologous | >tr|H0YFM0|H0YFM0_HUMAN Scavenger receptor cysteine‐rich type 1 protein M130 (Fragment) OS=Homo sapiens OX=9606 GN=CD163 PE=1 SV=1 | 10 | 0·7 |
| ILGVVLLAIF | 0·818 | IVGVALLAVF | HLA class I | Non‐homologous | >tr|C9JHR8|C9JHR8_HUMAN Scavenger receptor cysteine‐rich type 1 protein M130 OS=Homo sapiens OX=9606 GN=CD163 PE=1 SV=1 | 10 | 0·7 |
FIGURE 5A landscape of T cell responses against SARS‐CoV‐2‐HCoV Homologous and SARS‐CoV‐2‐non‐homologous peptides in healthy or COVID‐19 convalescent individuals: (a) A bipartite network graph showing SARS‐CoV‐2‐specific public TCRs which recognize homologous or nonhomologous SARS‐CoV‐2 peptides in healthy or convalescent patients. TCRs are colour‐coded by whether they recognize only homologous peptides (blue), only non‐homologous peptides (orange) or both (yellow). COVID‐19 convalescent patients are labelled red while healthy controls are labelled green. Node size reflects degree of connectivity, i.e., the quantity of an individual's TCRs which are shared with other patients. (b) A bipartite network graph showing SARS‐CoV‐2 public TCRs that recognize SARS‐CoV‐2‐HCoV homologous peptides. Patient node size reflects the quantity of their TCRs which are shared with another patient. Healthy patients are labelled green, COVID‐19 convalescent are labelled red, and (public) TCRs are labelled blue. (c) A barplot showing the frequency that each convalescent patient's public TCRs recognize SARS‐CoV‐2‐non‐homologous (red) or SARS‐CoV‐2‐HCoV‐homologous (blue) peptides. Patients with identical frequencies are ordered by the number of TCRs. (d) Barplots showing the quantities of COVID‐19 convalescent patients who carry 14 class I HLA alleles of interest. Patients are grouped by whether their public TCRs predominately recognize “non‐homologous” (PubTCR_NonHomologous, n = 12, sampled 10 times) or “Homologous” (PubTCR_Homologous, n = 12) peptides. For the PubTCR‐NonHomologous group, 12 patients were sampled 10 times and the number of patients carrying alleles was measured. The mean number of patients carrying each allele and the error are visualized. For the PubTCR‐Homologous group, the data contain only 12 patients of interest, thus the number of patients carrying each allele is measured and visualised
FIGURE 6SARS‐CoV‐2 T cell epitopes with known cognate TCRs which are conserved across multiple coronaviruses exhibit broad population coverage: (a) A dot plot showing SARS‐CoV‐2 peptides with high similarity to more than one HCoV that are recognized by TCRs in the MIRA dataset. Size of the dot represents the MatchScore. (b) The frequency of cognate TCRs which recognize these peptides from the COVID‐19 convalescent or healthy cohorts. (c) The HLA alleles predicted to present SARS‐CoV‐2 peptides with high similarity matches to 3 or 4 HCoV strains. (d) Global population coverage as calculated by the ‘IEDB population coverage tool’ for each individual SARS‐CoV‐2 peptide with high similarity matches to 3 or 4 HCoV strains. (e) Accumulated global population coverage predicted by the IEDB population coverage tool. (f) Regional population coverage for the entire set of 10 SARS‐CoV‐2 peptides with matches to 3 or 4 HCoV
Highly conserved CD8+ T cell peptides across SARS‐CoV‐2 and HCoV strains, with high population coverage
| SARS‐CoV‐2 peptide | Virus | Protein | MatchScore |
|---|---|---|---|
| AIMTRCLAV | 229E, OC43, HKU1, NL63, SARS‐CoV, MERS_CoV | Replicase polyprotein 1ab, ORF1ab polyprotein, ORF1ab polyprotein, replicase polyprotein 1ab, ORF1ab polyprotein, 1AB polyprotein | 1, 1, 0·977, 0·977, 1, 0·977 |
| KLFAAETLK | NL63, 229E, HKU1, OC43, SARS‐CoV, MERS_CoV | Replicase polyprotein 1ab, replicase polyprotein 1ab, ORF1ab polyprotein, ORF1ab polyprotein, ORF1ab polyprotein, 1AB polyprotein | 0·881, 0·857, 0·847, 0·847, 1, 0·929 |
| LLLDDFVEI | HKU1, 229E, NL63, OC43, SARS‐CoV | ORF1ab polyprotein, replicase polyprotein 1ab,replicase polyprotein 1ab, ORF1ab polyprotein, ORF1ab polyprotein | 0·894, 0·871, 0·86, 0·777, 1 |
| LQLGFSTGV | OC43, HKU1, 229E, NL63, MERS_CoV, SARS‐CoV | ORF1ab polyprotein, ORF1ab polyprotein, replicase polyprotein 1ab, replicase polyprotein 1ab, 1AB polyprotein, ORF1ab polyprotein | 0·977, 0·955, 0·809, 0·809, 1, 1 |
| LSDDAVVCFNSTY | 229E, HKU1, OC43, SARS‐CoV, MERS_CoV | Replicase polyprotein 1ab,ORF1ab polyprotein, ORF1ab polyprotein, ORF1ab polyprotein, 1AB polyprotein | 0·843, 0·789, 0·789, 0·872, 0·789 |
| VLQAVGACV | HKU1, OC43, 229E, NL63, SARS‐CoV, MERS_CoV | ORF1ab polyprotein,ORF1ab polyprotein, replicase polyprotein 1ab, replicase polyprotein 1ab, ORF1ab polyprotein, 1AB polyprotein | 0·876, 0·876, 0·795, 0·773, 1, 0·832 |
| VQIDRLITGR | HKU1, 229E, NL63, OC43, SARS‐CoV, MERS_CoV | Surface glycoprotein (all) | 0·887, 0·845, 0·845, 0·804, 1, 0·804 |
| YEQYIKWPW | HKU1, OC43, NL63, 229E, SARS‐CoV | Surface glycoprotein (all) | 0·903, 0·873, 0·855, 0·794, 1 |
| YEQYIKWPWY | HKU1, OC43, NL63, SARS‐CoV | Surface glyco protein (all) | 0·913, 0·886, 0·775, 1 |
| YVFCTVNAL | 229E, NL63, HKU1, OC43, SARS‐CoV | Replicase polyprotein 1ab, replicase polyprotein 1ab, ORF1ab polyprotein, ORF1ab polyprotein, ORF1ab polyprotein | 0·84, 0·818, 0·809, 0·809, 1 |