| Literature DB >> 32913247 |
Abstract
The lack of a high throughput assay for screening stabilizing peptides prior to building a library of peptide-major histocompatibility complex class I (pMHC-I) molecules has motivated the continual use of in silico tools without biophysical characterization. Here, based on de novo protein fragmentation, the EASY MHC-I (EZ MHC-I) assay favors peptide antigen screening to an unheralded hands-on time of seconds per peptide due to the empty single chain MHC-I protein instability. Unlike tedious traditional labeling- and antibody-based MHC-I assays, repurposed enterokinase directly fragments the unstable single MHC-I chain protein unless rescued by a stabilizing peptide under luminal condition. Herein, the principle behind EZ MHC-I assay not only characterizes the overlooked stability as a known better indicator of immunogenicity than classical affinity but also the novel use of enterokinase from the duodenum to target destabilized MHC-I protein not bearing the standard Asp-Asp-Asp-Asp-Lys motif, which may protend to other protein instability-based assays.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32913247 PMCID: PMC7483518 DOI: 10.1038/s41598-020-71785-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The destabilized single chain MHC-I protein is directly digested by enterokinase to ease stabilizing peptide screening. (a) Comparisons among pMHC-I stability-based assays such as traditional ELISA workflow, differential scanning fluorimetry (NeoScreen Technology, Immunitrack), NetMHC(stab-1.0) algorithm and EZ-MHC-I assay. (b) Empty MHC-I protein destabilization. In the absence of the original peptide either photolabile or cleavable, both forms of empty unchained MHC-I protein will dissociate unless stabilized by a rescue peptide. In the empty single chain MHC-I protein, the leaving β2m further destabilizes the enchained α chain. (c) EZ MHC-I workflow. Mix the single chain pMHC-I protein, the peptide of interest (preferably single peptide) and intestinal enterokinase in the same tube, incubate at 37 °C and detect fragments in a protein gel. The enterokinase preferably cleaves the unstable single chain MHC-I protein and will result in the loss of a 50 kDa protein band, which is quantified using the Image Lab software version 6.1 (Bio-Rad). (d) A plot of raw data of the stability of the unchained MHC-I protein bearing a photolabile peptide under varying conditions based on the volume intensity. About ~ 50% of 30 kDa α chain still remains as a protein band under different UV, thermal and enterokinase treatment. (e,f) A plot of raw data of the stability of the single chain MHC-I protein bearing a cleavable peptide in the presence of rescue peptides at pH 6.2 and 37 °C or 25 °C. HLA-A*11:01, HLA-A*02:01 and HLA-A*02:07 were digested without or with enterokinase. Significant improvement of up to 95% enchained MHC-I protein fragmentation at 37 °C instead of at 25 °C and is better than unchained photolabile MHC-I protein of only ~ 50% fragmentation. The positive peptide controls are IVTDFSVIK (HLA-A*11:01), NLVPMVATV (HLA-A*02:01) and LLDSDYERL or FLPSDYFFSV (HLA-A*02:07) alongside negative controls CLGGLLTMV for HLA-A*11:01 and IVTDFSVIK for both HLA-A*02:01 and HLA-A*02:07.
HLA-A*11:01 characterization of peptide epitopes at two different pHs retrieved from the Immune Epitope Database (IEDB) with EZ MHC-I assay, NetMHCpan-4.0 and NetMHCstab-1.0 algorithms.
| Index | SEQUENCE | IEDB | Origin | NetMHCpan-4 | NetMHCstab-1 | EZ50kD | EZ50kD |
|---|---|---|---|---|---|---|---|
| 1 | ATIGTAMYK | 5,002 | EBV | 0.0026 | 23.5793 | 1.031625511 | 1.190285064 |
| 2 | ATYGWNLVK | 5,223 | DENV1 | 0.0049 | 8.0727 | 0.870605547 | 1.994035494 |
| 3 | STYGWNIVK | 180,757 | DENV3 | 0.005 | 4.8528 | 0.630630693 | 1.394508024 |
| 4 | TTYLGPLSCK | 94,098 | LCM | 0.0399 | 6.0105 | 1.083751608 | 1.911496583 |
| 5 | IVTDFSVIK (EBV1101) | 29,466 | EBV | 0.0423 | 14.9481 | 1 | 1 |
| 6 | ATVQGQNLK | 5,200 | CMV | 0.044 | 13.4333 | 0.896765373 | 0.766999991 |
| 7 | AVQTKPGLFK | 150,153 | DENV2 | 0.0496 | 16.0378 | 0.955015006 | 1.546255991 |
| 8 | KTFVDLMRR | 124,391 | DENV2 | 0.086 | 1.4929 | 0.82408852 | 0.165047447 |
| 9 | KSGAIKVLK | 180,577 | DENV3 | 0.133 | 1.8625 | 0.818964843 | 0.48653242 |
| 10 | AINSEMFLR | 175,443 | IGRP* | 0.134 | 1.5431 | 0.901224151 | 1.189880277 |
| 11 | RVIDPRRCLK | 150,556 | DENV1, 3, 4 | 0.15 | 15.9042 | 1.07454914 | 0.733917026 |
| 12 | TSGSPIIDK | 150,660 | DENV2 | 0.169 | 1.968 | 0.795327602 | 0.159605484 |
| 13 | RQLANAIFK | 55,437 | DENV3 | 0.17 | 3.7199 | 1.303197077 | 1.118237914 |
| 14 | MSTYGWNIVK | 180,674 | DENV3 | 0.181 | 1.8548 | 0.642319424 | 0.125122315 |
| 15 | MATYGWNLVK | 180,645 | DENV1, 4 | 0.208 | 1.3226 | 0.638430333 | 0.007553278 |
| 16 | RVIDPRRCMK | 56,310 | DENV2 | 0.21 | 15.4 | 1.16173446 | 0.346334735 |
| 17 | SVQPTFSVQR | 144,488 | Influenza A | 0.31 | 4.2372 | 1.051411878 | 0.7376486 |
| 18 | VTRGAVLMHK | 150,715 | DENV2 | 0.393 | 1.9538 | 0.622778602 | − 0.068851052 |
| 19 | YVSAIAQTEK | 150,751 | DENV2 | 0.437 | 2.5672 | 0.303719293 | − 0.063560293 |
| 20 | TTKRDLGMSK | 180,788 | DENV3 | 0.477 | 1.4143 | 0.79595639 | 0.379550736 |
| 21 | MANEMGFLEK | 150,431 | DENV2 | 0.5519 | 3.2158 | 0.526954362 | − 0.019570479 |
| 22 | SSCSSCPLSK | 60,930 | EBV | 0.585 | 19.8885 | 0.957571019 | 0.840850452 |
| 23 | MVSRLLLNR | 180,678 | DENV3 | 0.592 | 0.8642 | 1.195884297 | 0.23955557 |
| 24 | MSYTMCSGK | 180,676 | DENV4 | 0.6211 | 6.1212 | 1.19750423 | 1.094062057 |
| 25 | GAMLFLISGK | 180,469 | DENV3 | 0.76 | 2.3863 | 0.746753143 | 1.036357414 |
| 26 | FTNDSIISH | 92,757 | LCM | 0.798 | 1.2262 | 0.696137865 | 0.133868081 |
| 27 | TLALEVAQQK | 175,503 | Insulin-2 | 0.93 | 1.021 | 0.739914818 | 0.451747794 |
| 28 | *GPISGHVLK | 21,665 | CMV | 2.5 | 0.6046 | − 0.065088414 | − 0.05275318 |
| 29 | *TMVMELIRMIK | 566,925 | Influenza A | 4.98 | 0.86 | − 0.10894061 | − 0.004370881 |
| 30 | *LVSFLLLAGR | 93,350 | LCM | 5.86 | 0.6594 | − 0.184620433 | 0.072058808 |
| 31 | *LALEVARQKR | 175,477 | Insulin-1 | 8.86 | 0.6903 | − 0.10544474 | − 0.012738075 |
| 32 | *LVTFLLLCGR | 119,268 | Lassa virus | 9.38 | 0.7715 | − 0.173186487 | − 0.376760316 |
| 33 | *LYASPQLEGF | 176,343 | Influenza A | 25.231 | 0.2322 | − 0.699688536 | − 0.061802853 |
| 34 | *AYQKRMGVQM | 175,777 | Influenza A | 27.1 | 0.25 | − 0.102005257 | − 0.081022458 |
| 35 | CLGGLLTMV (EBV0201) | 6,568 | EBV | 46.6 | 0.2538 | 0 | 0 |
For EZ MHC-I assay, the EZ50kDa score for each peptide is calculated using positive Epstein-Barr virus peptide control (EBV1101) and negative Epstein-Barr virus peptide control (EBV0201); ranging from above 1 (more stable than control), 1 (equally stable) to below 0 (very unstable), determined with Eq. (4). The most optimal peptides are scored closer to ~ 0 in NetMHCpan rank, greater than 2 h for NetMHCstab-1.0 half-life and greater than 0.3 for EZ50kDa. The seven peptides with EZ50kDa below EBV0201 (negative control) as shown in Fig. 2a, are marked with * in Table 1.
EBV Epstein–Barr virus, DENV Dengue virus, LCM lymphocytic choriomeningitis virus, CMV cytomegalovirus, IGRP Islet-specific glucose-6-phosphatase catalytic subunit-related protein.
Figure 2EZ MHC-I assay reveals additional IEDB peptides, which are not predicted strong candidates. (a) The pH effect on EZ MHC-I assay using 35 peptides retrieved from the IEDB resource and HLA-A*11:01 single chain protein. The NetMHCpan-4.0 cutoff is set as greater than 0% for all binders or less than 1% for potential binders (see Table 1). At pH 6.2, a tighter group of peptides is observed than at pH 8.0. (b) Comparison of traditional ELISA and EZ MHC-I assay with a collection of both IEDB peptide epitopes and peptides predicted using NetMHCpan-4 algorithm. Predicted non-binders tend not to be stabilizing whereas some predicted binders are found not stabilizing in both ELISA and EZ MHC-I assays (see Table 2). (c) A 4-dimensional comparison showing good agreement among the four methods. The bubble size is based on the ELISA assay whereby bigger size correlates to higher stability. The more stable and better binding peptides are clustered away from the less stable and poor binding peptides. Moderately predicted peptides may be confirmed using stability-based assays (see Table 2). (d) Additional peptides from IEDB resource are found stabilizing despite their higher NetMHCpan-4.0 ranks above 0.5%. (e) Additional peptides from IEDB resource are observed stabilizing despite falling below the recommended NetMHCstab-1.0 cutoff of 2 h. (f) A comprehensive EGFR peptide library (n = 177, duplicated) from key driver mutations were compared using EZ50kDa, NetMHCpan and NetMHCstab tools using HLA-A*11:01 single chain protein. The positive EBV1101 and negative EBV0201 control peptides are labeled and colored blue. Two more EGFR mutation-derived peptides (CS1012 and CS1017) are found strongly stabilizing despite their predicted weak affinities using NetMHCpan-4.0 algorithm for HLA-A*11:01. (g) Two stabilizing peptides (CS1012 and CS1017) were also not identified using NetMHCstab-1.0 algorithm based on its recommended cutoff of 2 h. (h) Analysis for understudied HLA-A*02:07 protein with peptides from the EGFR peptide library. The top eleven peptides with ascending NetMHCpan ranks are in the order of HepB0207 (FLPSDYFPSV, 0.028), CS1020 (TQLMPFGSLL, 0.086), CS1030 (ITQLMPFGSLL, 1.078), TM1011 (STVQLIMQL, 1.282), LR1012 (ITDFGRAKL, 1.534), EBV0201 (CLGGLLTMV, 1.596), TM1027 (MQLMPFGCLL, 2.194), CS1044 (QLMPFGSLLDYV, 2.238), CS1041 (LITQLMPFGSLL, 3.090), CS1002 (QLMPFGSL, 3.149) and EBV1101 (IVTDFSVIK, 11.64). For understudied HLA-A*02:07, there is a lack of correlation between NetMHCpan algorithm and measured EZ50kDa stability (n = 11, duplicated). HepB0207 (FLPSDYFPSV) is a known positive control. The cutoff for EZ50kDa at 0.3 is based on IEDB peptides with NetMHCpan-4.0 rank less than 1% and NetMHCstab-1.0 rank greater than 2 h (see Supplementary Fig. S3). Refer to Supplementary Fig. S2 for full gels of insert in (h).
HLA-A*11:01 characterization of peptide epitopes retrieved from the Immune Epitope Database (IEDB) and putative peptides with ELISA, EZ MHC-I assay, NetMHCpan-4.0 and NetMHCstab-1.0 algorithms.
| Index | SEQUENCE | IEDB | Origin | NetMHCpan-4 | NetMHCstab-1 | EZ50kD | ELISA450nm |
|---|---|---|---|---|---|---|---|
| 1 | RSFKDLLKK | 140,739 | Toxoplasma gondii | 0.029 | 7.95 | 0.609199079 | 1.682000041 |
| 2 | TVMDIISRR | 150,674 | DENV2 | 0.047 | 4.48 | 1.086457267 | 1.944100082 |
| 3 | NTLEQTVKK | 922,935 | Papillovirus | 0.052 | 9.78 | 0.564723004 | 1.631200015 |
| 4 | GTSGSPIIDK | 22,818 | DENV1 | 0.08 | 6.45 | 0.633428224 | 1.385050059 |
| 5 | ATVLMGLGK | 150,147 | DENV2 | 0.109 | 5.66 | 1.038798618 | 1.910549939 |
| 6 | AVFDRKSDAK | 5,316 | EBV | 0.125 | 26.36 | 0.624952021 | 1.078250051 |
| 7 | HTLWKAGILYK | 24,943 | HBV | 0.126 | 9.1 | 0.442617707 | 1.58130002 |
| 8 | SSCSSCPLSK | 60,930 | EBV | 0.191 | 19.89 | 0.664854145 | 1.272400141 |
| 9 | STIPETTVVR | N.D | HBV | 0.234 | 2.56 | 0.322579265 | 1.391299963 |
| 10 | STLPETTVIR | N.D | HBV | 0.274 | 3.33 | − 0.113447642 | 1.302249968 |
| 11 | RTLSPRRRR | N.D | HBV | 0.422 | 2.12 | − 1.143584408 | 0.865800023 |
| 12 | *RTQSPRRRR | 922,980 | HBV | 0.723 | 1.41 | − 0.686832534 | 1.20449996 |
| 13 | *SQSPRRRSK | N.D | Leptomonas seymouri | 0.832 | 0.72 | − 0.986875791 | 0.256799996 |
| 14 | SAICSVVRR | 56,814 | HBV | 0.917 | 1.77 | 0.238176504 | 1.618999958 |
| 15 | *KFLPDLYDYK | 144,377 | Influenza A | 1.325 | 0.83 | − 0.748050496 | 0.480850041 |
| 16 | #LVVDFSQFSR | 40,624 | HBV | 1.782 | 3.37 | 0.946052213 | 1.45175004 |
| 17 | *RTLSPRRR | N.D | Streptomyces sp | 3.529 | 2.78 | − 0.986037136 | 0.120900035 |
| 18 | *RTQSPRRR | 922,979 | HBV | 4.146 | 1.83 | − 1.322460579 | 0.331900001 |
| 19 | *TMVMELIRMIK | 566,925 | Influenza A | 4.228 | 0.86 | − 0.993923432 | 0.748042839 |
| 20 | *TVVGRRGRSPR | N.D | HBV | 4.961 | 1.12 | − 0.506637027 | 1.259450078 |
| 21 | *RSQSPRRR | N.D | HBV | 6.236 | 1.07 | − 1.32582118 | 0.185850024 |
| 22 | *DLAQCFFCFK | 922,790 | Human | 6.283 | 0.93 | − 0.822543625 | 0.395600021 |
| 23 | *MVWESGCTV | 140,054 | Human | 9.834 | 0.3 | − 1.156943761 | 0.593700051 |
| 24 | *TVIRRRGR | 923,033 | HBV | 10.848 | 0.61 | − 1.025365624 | 0.148200035 |
| 25 | *TVVRRRCRSPR | N.D | HBV | 11.401 | 1.39 | − 0.974649089 | 0.414450049 |
| 26 | *FLLTKILTI | 16,753 | HBV | 15.359 | 0.26 | − 1.236918075 | 0.179100007 |
| 27 | *TVVRRRCR | N.D | HBV | 16.528 | 0.8 | − 1.340376738 | 0.462499976 |
| 28 | *NILNPFLPLL | N.D | HBV | 16.986 | 0.29 | − 1.178306492 | 0.282749981 |
| 29 | *LNIDLLWSV | 103,368 | Human | 17.661 | 0.29 | − 1.081006694 | 0.273800016 |
| 30 | *GLSPTVWLSA | 21,137 | HBV | 18.362 | 0.35 | − 1.259120664 | 0.248750001 |
| 31 | *FIDSYICQV | 137,402 | Human | 20 | 0.25 | − 1.125312408 | 0.349050045 |
| 32 | *VYALPLKML | 71,976 | Herpes virus | 20.967 | 0.23 | − 0.923903128 | − 0.021949947 |
| 33 | *LLVPFVQWFV | 37,919 | HBV | 25.231 | 0.3 | − 1.193666275 | 0.19264999 |
| 34 | *AYQKRMGVQM | 175,777 | Influenza A | 26.167 | 0.25 | − 1.01135862 | − 0.033149958 |
| 35 | *LLAQFTSAL | N.D | HBV | 30.833 | 0.25 | − 1.231250251 | 0.213950008 |
| 36 | *EYLVSFGVW | 15,061 | HBV | 40.5 | 0.23 | − 1.034783308 | − 0.014099956 |
Peptides that are not stabilizing in at least two out of three stabilizing methods are marked with *. Also, a poorly predicted NetMHCpan peptide 16, which is stabilizing is marked with #. The instability cutoffs for the three stabilizing-based methods are NetMHCstab < 2 h, EZ MHC-I < 0.3 and (ELISA < 0.6 is based on non-HLA-A11:01 peptide 23 (MVWESGCTV), a known HLA-A*02:01 antigen but not non-validated peptide 20).
EBV Epstein–Barr virus, DENV Dengue virus, N.D denotes not determined.
HLA-A*11:01 peptides comparison derived from EGFR mutations.
| Index | Peptide ID | Sequence | EZ50kDa | EZ50kDa rank | NetMHCpan-4.0 rank | NetMHCstab-1.0 score |
|---|---|---|---|---|---|---|
| – | EBV1101 | IVTDFSVIK | 1 | – | 0.0423 | 14.9481 |
| 1 | CS1032 | QLMPFGSLLDY | 0.869 ± 0.174 | 1 | 3.7905 | 0.3434 |
| 2 | LR1011 | KITDFGRAK | 0.859 ± 0.003 | 2 | 0.14 | 9.3274 |
| 3 | CS1027 | SLLDYVREHK | 0.811 ± 0.171 | 3 | 0.4221 | 8.8147 |
| 4 | CS1017 | SLLDYVREH | 0.733 ± 0.140 | 4 | 3.2 | 0.7919 |
| 5 | CS1037 | GSLLDYVREHK | 0.367 ± 0.022 | 5 | 2.67 | 2.2755 |
| 6 | EX1018 | VAIKTSPKANK | 0.222 ± 0.010 | 6 | 2.5 | 2.1099 |
| 7 | LR1044 | KITDFGRAKLLG | 0.168 ± 0.030 | 7 | 30.7 | 0.5073 |
| 8 | TM1050 | MQLMPFGCLLDY | 0.0956 ± 0.004 | 8 | 15.2 | 0.2965 |
| 9 | LR1027 | RAKLLGAEEK | 0.0574 ± 0.002 | 9 | 1.8 | 0.8338 |
| 10 | LR1009 | HVKITDFGR | 0.0553 ± 0.003 | 10 | 2.64 | 0.7457 |
| 11 | CS1036 | FGSLLDYVREH | 0.0420 ± 0.010 | 11 | 49 | 0.2613 |
| 12 | LR1032 | KITDFGRAKLL | 0.0352 ± 0.024 | 12 | 20.4 | 0.2729 |
| 13 | EX1003 | KTSPKANK | 0.0341 ± 0.022 | 13 | 2.45 | 5.6887 |
| 14 | CS1050 | SLLDYVREHKDN | 0.0255 ± 0.005 | 14 | 20.85 | 0.4105 |
| – | EBV0201 | CLGGLLTMV | 0 | – | 46.6 | 0.2538 |
| – | EX1012 | AIKTSPKANK | − 0.007 ± 0.004 | 20 | 0.88 | 1.9707 |
| – | LR1020 | VKITDFGRAK | − 0.056 ± 0.059 | 77 | 1.6 | 0.5672 |
| – | LR1003 | ITDFGRAK | − 0.052 ± 0.001 | 63 | 3.45 | 1.4645 |
| – | LR1030 | HVKITDFGRAK | − 0.004 ± 0.022 | 18 | 3.6 | 2.0781 |
| – | CS1022 | LMPFGSLLDY | − 0.058 ± 0.012 | 80 | 3.655 | 0.2472 |
The top 14 peptides from EGFR mutations out of 177 peptides ranked using the EZ MHC-I assay and HLA-A*11:01 single chain protein. Five poorly scored peptides with EZ50kDa less than 0.1 and NetMHCpan rank less than 5% are also included for comparison. EBV Epstein–Barr virus, EX exon 19 deletion, LR L858R, CS C797S, TM T790M. The EZ50kDa score shows that some poorly predicted NetMHCpan-4.0 peptides (> 2%) could be stabilizing epitopes. The scores from the NetMHCstab-1.0 algorithm are also shown for comparison.