| Literature DB >> 17705876 |
Joo Chuan Tong1, Zong Hong Zhang, J Thomas August, Vladimir Brusic, Tin Wee Tan, Shoba Ranganathan.
Abstract
BACKGROUND: HLA-C locus products are poorly understood in part due to their low expression at the cell surface. Recent data indicate that these molecules serve as major restriction elements for human immunodeficiency virus type 1 (HIV-1) cytotoxic T lymphocyte (CTL) epitopes. We report here a structure-based technique for the prediction of peptides binding to Cw*0401. The models were rigorously trained, tested and validated using experimentally verified Cw*0401 binding and non-binding peptides obtained from biochemical studies. A new scoring scheme facilitates the identification of immunological hot spots within antigens, based on the sum of predicted binding energies of the top four binders within a window of 30 amino acids.Entities:
Year: 2007 PMID: 17705876 PMCID: PMC2040137 DOI: 10.1186/1745-7580-3-7
Source DB: PubMed Journal: Immunome Res ISSN: 1745-7580
HLA-Cw4 dataset used in this study.
| 1 | Training Set | Cw3 consensus | FAMPNFQTL | 651 | 24 |
| 2 | Training Set | Cw6 consensus | IPFPIVRYL | >30000 | 24 |
| 3 | Training Set | Cw7 consensus | KYPDFVDAL | 2.4 | 24 |
| 4 | Training Set | Cw4 consensus | QYDDAVYKL | 18 | 24 |
| 5 | Training Set | Histone H3.3 | RYRPGTVAL | >30000 | 24 |
| 6 | Training Set | Unknown Cw6 natural ligand | YQFTGIKKY | >30000 | 24 |
| 7 | Test Set | Transcripton factor SUPT4H 56–64 | SFDGIIAMM | Binder | 10 |
| 8 | Test Set | Transducin-like 3 120–128 | AFDPTSTLL | Binder | 10 |
| 9 | Test Set | UBE3B variant 1 742–750 | VFDPALNLF | Binder | 10 |
| 10 | Test Set | XP_173235 3–11 | LFDITGQDF | Binder | 10 |
| 11 | Test Set | 8 59–67 | VYDTNPAKF | Binder | 10 |
| 12 | Test Set | Acyl-CoA synthetase 4 82–90 | LFDHAVSKF | Binder | 10 |
| 13 | Test Set | Adenosylhomocyteinehydrolase 566–574 | SFDAHLTEL | Binder | 10 |
| 14 | Test Set | ART-1 Adenocarcinoma antigen 21–29 | SFDLLPREF | Binder | 10 |
| 15 | Test Set | ATP-binding cassette, sub-family F, member 3 500–508 | YYDPKHVIF | Binder | 10 |
| 16 | Test Set | Block of proliferation 1 639–647 | SYDSKLVWF | Binder | 10 |
| 17 | Test Set | BM-015 144–152 | HFDPEVVQI | Binder | 10 |
| 18 | Test Set | CDC45 541–549 | HFDLSVIEL | Binder | 10 |
| 19 | Test Set | Cholesterol acyltransferase 72–80 | HFDDFVTNL | Binder | 10 |
| 20 | Test Set | Chromosome 20 open reading frame 40 318–326 | FFDNISSEL | Binder | 10 |
| 21 | Test Set | Elongation factor 2 265–273 | YFDPANGKF | Binder | 10 |
| 22 | Test Set | Epithelial cell transforming oncogene 21–29 | IFDSKVTEI | Binder | 10 |
| 23 | Test Set | Ethanolamine kinase EKI1 132–141 | HWDPQEVTL | Binder | 10 |
| 24 | Test Set | Eukaryotic translation initiation factor 3, Su 6 interacting protein 478–486 | FLDLTEGEF | Binder | 10 |
| 25 | Test Set | Fatty acid synthetase 544–552 | TFDDIVHSF | Binder | 10 |
| 26 | Test Set | FK506 binding protein 9 303–311 | VFDIHVIDF | Binder | 10 |
| 27 | Test Set | Glutamine:fructose-6-phosphate amidotransferase (GFAT) 345–353 | NFDDYTVNL | Binder | 10 |
| 28 | Test Set | Tousled-like kinase 456–464 | AFDLTEQRY | Binder | 10 |
| 29 | Test Set | Transcription factor IIE 144–152 | LFDPMTGTF | Binder | 10 |
| 30 | Test Set | HSP 70 kDa 1A 179–205 | IFDLGGGTF | Binder | 10 |
| 31 | Test Set | HSPC198 19–27 | SYDLFVNSF | Binder | 10 |
| 32 | Test Set | HSP J2 156–164 | SFDTGFTSF | Binder | 10 |
| 33 | Test Set | Hypothetical protein FLJ00365 80–88 | YFDAIPVTM | Binder | 10 |
| 34 | Test Set | Hypothetical protein FLJ11220 373–381 | YLPDFLDYF | Binder | 10 |
| 35 | Test Set | Hypothetical protein FLJ20343 415–423 | RFDEAYIYM | Binder | 10 |
| 36 | Test Set | Insuline degrading enzyme IDE 150–158 | YFDVSHEHL | Binder | 10 |
| 37 | Test Set | Integrin alpha-V (Vitronectin) 224–232 | KYDPNVYSI | Binder | 10 |
| 38 | Test Set | KCIP-1 186–194 | AFDEAIAEL | Binder | 10 |
| 39 | Test Set | KIAA0461 721–729 | SMDPLPVFL | Binder | 10 |
| 40 | Test Set | KIAA1463 444–452 | FYDERIVVV | Binder | 10 |
| 41 | Test Set | KIAA1921 350–358 | VYDGKIYTL | Binder | 10 |
| 42 | Test Set | Metalloproteinase 10 331–339 | FWPSLPSYL | Binder | 10 |
| 43 | Test Set | Methionine-tRNA synthetase 2 (mitochondrial) 58–66 | YYDEKVVKL | Binder | 10 |
| 44 | Test Set | Mucin-5B 168–176 | TFDGTSYTF | Binder | 10 |
| 45 | Test Set | Myosin phosphatase target subunit 1 37–45 | KFDDGAVFL | Binder | 10 |
| 46 | Test Set | Nuclear autoantigenic sperm protein 599–607 | QYDEAVAQF | Binder | 10 |
| 47 | Test Set | Nucleoporin NUP358 261–269 | SFDSALQSV | Binder | 10 |
| 48 | Test Set | P621 70–78 | VFDKTLAEL | Binder | 10 |
| 49 | Test Set | Phosphate carrier precursor 331–339 | IYDSVKVYF | Binder | 10 |
| 50 | Test Set | PRO2242 61–69 | YFDPQYFEF | Binder | 10 |
| 51 | Test Set | Protein phosphatase 6 102–110 | KWPDRITLL | Binder | 10 |
| 52 | Test Set | Putative prostate tumor suppressor 165–173 | TFDLQRIGF | Binder | 10 |
| 53 | Test Set | Rac1 168–176 | VFDEAIRAV | Binder | 10 |
| 54 | Test Set | RNA Helicase A 697–705 | VFDPVPVGV | Binder | 10 |
| 55 | Test Set | Similiar to KIAA1911 80–88 | FWDGKIVLV | Binder | 10 |
| 56 | Test Set | Topoisomerase I 241–249 | YYDGKVMKL | Binder | 10 |
| 57 | Test Set | Tensin 3 143–151 | FYDDKVSAL | Binder | 10 |
| 58 | Test Set | HIV-1 (BRU) gp120 350–358 | SFNCGGEFF | Binder | 18 |
| 59 | Test Set | HIV-1 (BRU) gag p24 307–315 | QASQEVKNW | Binder | 21 |
| 60 | Test Set | Synthetic peptide | AYDDAVYKL | Binder | 25 |
| 61 | Test Set | Cw7 consensus | AYADFVYAY | >30000 | 24 |
| 62 | Test Set | CKShs2 | KYFDEHYEY | >30000 | 24 |
| 63 | Test Set | Unknown Cw6 natural ligand | YRHDGGNVL | >30000 | 24 |
| 64 | Test Set | Unknown Cw7 natural ligand | NKADVILKY | >30000 | 24 |
Figure 1Predicted start positions of Cw*0401-specific hotspots (sliding window size = 30) along (A) p24gag and (B) gp160gag glycoproteins. Scores are computed based on the sum of predicted binding energies of top four binders within the 30 amino acid sliding window. Predicted hotspots regions are shown in blue boxes with experimentally verified regions shown in red.
Predicted Cw*0401-specific immunological hotspots for p24gag.
| No. | Position | Sequence |
| 1 | 20–75 | RLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILGQLQPSLQTGSE |
| 2 | 89–148 | YNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGHSNQVSQNY |
| 3 | 143–262 | PIVQNIQGQMVHQAISPRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPQDLNT |
| 4 | 311–381 | QEVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARVLA |
| 5 | 447–506 | EFSSEQTRANSPTRRELQVWGRDNNSPSEAGADRQGTVSFNFPQVTLWQRPLVT |
| 6 | 526–590 | LPGRWKPKMIGGIGGFIKVRQYDQILIEICGHKAIGTVLVGPTPVNIIGRNLLT |
| 7 | 664–727 | FRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDEDFRKYTAF |
| 8 | 708–791 | DEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQ |
| 9 | 873–930 | TKALTEVIPLTEEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTY |
| 10 | 996–1053 | TWIPEWEFVNTPPLVKLWYQLEKEPIVGAETFYVDGAANRETKLGKAGYVTNRG |
| 11 | 1054–1157 | VTLTDTTNQKTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDQSESELVNQ |
| 12 | 1157–1316 | DEHEKYHSNWRAMASDFNLPPVVAKEIVASCDKCQLKGEAMHGQVDCSPGIWQL |
| 13 | 1308–1368 | IIGQVRDQAEHLKTAVQMAVFIHNFKRKGGIGGYSAGERIVDIIATDIQTKELQ |
| 14 | 1378–1427 | RNPLWKGPAKLLWKGEGAVVIQDNSDIKVVPRRKAKIIRDYGKQMAGDDC |
Predicted Cw*0401-specific immunological hotspots for gp160gag.
| No. | Position | Sequence |
| 1 | 44–89 | VWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVVLVNV |
| 2 | 102–171 | EQMHEDIISLWDQSLKPCVKLTPLCVSLKCTDLGNATNTNSSNTNSSSGEMM |
| 3 | 174–289 | VQKEYAFFYKLDIIPIDNDTTSYTLTSCNTSVITQACPKVSFEPIPIHYCAP |
| 4 | 341–504 | AKWNATLKQIASKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNS |
| 5 | 489–559 | KYKVVKIEPLGVAPTKAKRRVVQREKRAVGIGALFLGFLGAAGSTMGARSMT |
| 6 | 581–639 | LQARILAVERYLKDQQLLGIWGCSGKLICTTAVPWNASWSNKSLEQIWNNMT |
| 7 | 667–721 | ELDKWASLWNWFNITNWLWYIKIFIMIVGGLVGLRIVFAVLSIVNRVRQGYS |
| 8 | 708–766 | SIVNRVRQGYSPLSFQTHLPTPRGPDRPEGIEEEGGERDRDRSIRLVNGSLA |
| 9 | 793–852 | RGWEALKYWWNLLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEVVQGACRA |