| Literature DB >> 33064841 |
Zeynep Koşaloğlu-Yalçın1, John Sidney1, William Chronister1, Bjoern Peters1,2, Alessandro Sette1,2.
Abstract
Binding prediction tools are commonly used to identify peptides presented on MHC class II molecules. Recently, a wealth of data in the form of naturally eluted ligands has become available and discrepancies between ligand elution data and binding predictions have been reported. Quantitative metrics for such comparisons are currently lacking. In this study, we assessed how efficiently MHC class II binding predictions can identify naturally eluted peptides, and investigated instances with discrepancies between the two methods in detail. We found that, in general, MHC class II eluted ligands are predicted to bind to their reported restriction element with high affinity. But, for several studies reporting an increased number of ligands that were not predicted to bind, we found that the reported MHC restriction was ambiguous. Additional analyses determined that most of the ligands predicted to not bind, are predicted to bind other co-expressed MHC class II molecules. For selected alleles, we addressed discrepancies between elution data and binding predictions by experimental measurements and found that predicted and measured affinities correlate well. For DQA1*05:01/DQB1*02:01 (DQ2.5) however, binding predictions did miss several peptides that were determined experimentally to be binders. For these peptides and several known DQ2.5 binders, we determined key residues for conferring DQ2.5 binding capacity, which revealed that DQ2.5 utilizes two different binding motifs, of which only one is predicted effectively. These findings have important implications for the interpretation of ligand elution data and for the improvement of MHC class II binding predictions.Entities:
Keywords: MHC; binding motif; binding predictions; ligand elution
Mesh:
Substances:
Year: 2020 PMID: 33064841 PMCID: PMC7808151 DOI: 10.1111/imm.13279
Source DB: PubMed Journal: Immunology ISSN: 0019-2805 Impact factor: 7.397
Composition of eluted ligand dataset after each filtering step and the number of peptides selected to be tested experimentally for the four alleles selected for detailed investigation
| Allele | Peptides in ligand elution dataset downloaded from IEDB | Peptides with length 15–20 | Peptides with predicted %rank>20 | Predicted non‐binder redundant peptides removed | Predicted non‐binders selected for testing | Predicted binders selected for testing |
|---|---|---|---|---|---|---|
| HLA‐DQA1*05:01/DQB1*02:01 | 2245 | 2056 | 1097 | 811 | 30 | 20 |
| HLA‐DRB1*01:01 | 638 | 594 | 114 | 97 | 20 | 15 |
| HLA‐DRB1*04:01 | 555 | 463 | 222 | 176 | 30 | 20 |
| HLA‐DRB1*15:01 | 388 | 330 | 178 | 132 | 17 | 18 |
| 4 selected HLA combined | 3,826 | 3,443 | 1,611 | 1,216 | 97 | 73 |
Ligand elution data and associated prediction efficacy
| Allele | ligands | %recovered at 20% |
|---|---|---|
| HLA‐DRB5*01:01 | 131 | 75% |
| HLA‐DRB3*02:02 | 10 | 80% |
| HLA‐DRB1*15:01 | 330 | 46% |
| HLA‐DRB1*13:02 | 17 | 35% |
| HLA‐DRB1*11:01 | 50 | 62% |
| HLA‐DRB1*04:05 | 1206 | 59% |
| HLA‐DRB1*04:04 | 35 | 69% |
| HLA‐DRB1*04:02 | 29 | 31% |
| HLA‐DRB1*04:01 | 463 | 52% |
| HLA‐DRB1*03:01 | 79 | 49% |
| HLA‐DRB1*01:01 | 594 | 81% |
| HLA‐DQA1*05:05/DQB1*03:01 | 3908 | 58% |
| HLA‐DQA1*05:01/DQB1*02:01 | 2055 | 47% |
| HLA‐DQA1*02:01/DQB1*02:02 | 3492 | 36% |
| HLA‐DQA1*01:02/DQB1*06:02 | 27 | 85% |
| HLA‐DQA1*01:01/DQB1*05:01 | 23 | 52% |
Composition of peptide set and associated binding results
| HLA allele | Pred. non‐binders selected | Pred. non‐binders that don't bind | Pred. binders selected | Pred. binders that bound |
|---|---|---|---|---|
| DQA1*05:01/DQB1*02:01 | 30 | 22 (73%) | 20 | 18 (90%) |
| DRB1*01:01 | 20 | 17 (85%) | 15 | 14 (93%) |
| DRB1*04:01 | 30 | 28 (93%) | 20 | 17 (85%) |
| DRB1*15:01 | 17 | 12 (71%) | 18 | 18 (100%) |
| Total | 97 | 79 (81%) | 73 | 67 (92%) |
Figure 1Measured percentile ranks are plotted against predicted ones. Colors indicate different alleles, squares predicted binders, and circles predicted non‐binders. A red outline depicts cases with large discrepancies between measured and predicted affinities, that is measured rank < 10% and predicted rank > 30%.
Peptides showing greatest discrepancy between predicted and measured affinity
| Sequence | HLA allele | Measured %‐Rank | Predicted %‐Rank |
|---|---|---|---|
| APDTRFFVPEPGGRGAAP | HLA‐DRB1*01:01 | 2.19 | 34.31 |
| WISKQEYDESGPSIVHRK | HLA‐DRB1*15:01 | 7.26 | 79.28 |
| SPTEPKNYGSYSTQA | HLA‐DRB1*15:01 | 2.61 | 86.56 |
| GEPDYVNGEVAATEA | HLA‐DQA1*05:01/DQB1*02:01 | 2.98 | 30.02 |
| GDSDLQLDRISVYYNEA | HLA‐DQA1*05:01/DQB1*02:01 | 6.7 | 31.25 |
| RQEEPEYENVVPISRPP | HLA‐DQA1*05:01/DQB1*02:01 | 3.54 | 40.81 |
| KPPTADLFTGVLPNGYNPP | HLA‐DQA1*05:01/DQB1*02:01 | 3.91 | 43.8 |
| LPGRENYSSVDANGIQ | HLA‐DQA1*05:01/DQB1*02:01 | 2.23 | 55.87 |
Figure 2The fold change in IC50 between the unmodified peptide and the corresponding analog with substitutions at each position scanning the length of the respective peptide is shown. The dashed line indicated a 10‐fold reduction in binding. Residues for which substitutions resulted in a greater than 10‐fold reduction in binding were defined as anchor residues and are highlighted in green here.
Figure 3(A) All DQ2.5 ligands were divided into three groups according to whether they fit Motif 1, Motif 2, or no motif and sequence logos were generated. (B) The analyzed 9 DQ2.5 binding peptides are grouped according to the motif they conform to. Residues that were identified as anchor residues in single amino acid substitution experiments are highlighted in green. The red boxes highlight residues that explicitly match the position and specificity of Motif 1 or Motif 2. A + in the column prediction indicates that the peptide was predicted to bind DQ2.5 with a percentile rank of < 20%.
Peptides selected for DQ2.5 single amino acid substitution scan to determine the residues crucial for DQ2.5 binding
| Protein | Pos | Sequence | Set | Measured %‐Rank | Predcited %‐Rank |
|---|---|---|---|---|---|
| PI‐a | 70 | NPTVFFDIAVDGEPL | Eluted | 0.2 | 0.8 |
| TMA | 266 | LAKTAFDEAIAELDT | Eluted | 0.9 | 0.5 |
| aTIP | 430 | EEVDMTPADALDDFD | Epitope | 1.5 | 1.0 |
| DQA | 16 | YQSYGPSGQYTHEFD | Eluted | 10.6 | 55.0 |
| SR | 812 | RQEEPEYENVVPISRPP | Eluted | 2.2 | 40.8 |
| LAM | 246 | KPPTADLFTGVLPNGYNPP | Eluted | 5.6 | 43.8 |
| Cochlin | 160 | LPGRENYSSVDANGIQ | Eluted | 14.7 | 55.9 |
| LAT2 | 21 | GEPDYVNGEVAATEA | Eluted | 2.1 | 30.0 |
| Tubulin | 38 | GDSDLQLDRISVYYNEA | Eluted | 6.9 | 31.3 |