| Literature DB >> 27832200 |
Sudheer Gupta1, Kumardeep Chaudhary1, Sandeep Kumar Dhanda1, Rahul Kumar1, Shailesh Kumar1, Manika Sehgal1, Gandharva Nagpal1, Gajendra P S Raghava1.
Abstract
Due to advancement in sequencing technology, genomes of thousands of cancer tissues or cell-lines have been sequenced. Identification of cancer-specific epitopes or neoepitopes from cancer genomes is one of the major challenges in the field of immunotherapy or vaccine development. This paper describes a platform Cancertope, developed for designing genome-based immunotherapy or vaccine against a cancer cell. Broadly, the integrated resources on this platform are apportioned into three precise sections. First section explains a cancer-specific database of neoepitopes generated from genome of 905 cancer cell lines. This database harbors wide range of epitopes (e.g., B-cell, CD8+ T-cell, HLA class I, HLA class II) against 60 cancer-specific vaccine antigens. Second section describes a partially personalized module developed for predicting potential neoepitopes against a user-specific cancer genome. Finally, we describe a fully personalized module developed for identification of neoepitopes from genomes of cancerous and healthy cells of a cancer-patient. In order to assist the scientific community, wide range of tools are incorporated in this platform that includes screening of epitopes against human reference proteome (http://www.imtech.res.in/raghava/cancertope/).Entities:
Mesh:
Substances:
Year: 2016 PMID: 27832200 PMCID: PMC5104390 DOI: 10.1371/journal.pone.0166372
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Frequency and type of mutations reported for each vaccine candidate.
Each numerical value representing the number of mutations across different cell lines in a vaccine candidate, for instance, vaccine target PRKDC has been mutated 842 times (frame shift insertions) in the different cell lines.
Number of deleterious mutations (fD), polymorphism/neutral variants (fP) and cancer association (fD/fP) in each vaccine target.
| Target | fD | fP | fD/fP | Family/subfamily of target/protein |
|---|---|---|---|---|
| PTEN | 389 | 1 | 389 | NA |
| TP53 | 1353 | 7 | 193.3 | P53_family |
| CTNNB1 | 132 | 1 | 132 | Beta-catenin_family |
| BRAF | 99 | 1 | 99 | Protein_kinase_superfamily,_TKL_Ser/Thr_protein_kinase_family,_RAF_subfamily |
| NF2 | 74 | 1 | 74 | NA |
| EGFR | 188 | 3 | 62.7 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_EGF_receptor_subfamily |
| SMAD4 | 107 | 2 | 53.5 | Dwarfin/SMAD_family |
| VHL | 272 | 6 | 45.3 | NA |
| KIT | 131 | 3 | 43.7 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_CSF-1/PDGF_receptor_subfamily |
| PIK3CA | 174 | 4 | 43.5 | PI3/PI4-kinase_family |
| NRAS | 36 | 1 | 36 | Small_GTPase_superfamily,_Ras_family |
| MSH2 | 103 | 5 | 20.6 | DNA_mismatch_repair_MutS_family |
| GATA1 | 20 | 1 | 20 | NA |
| MLH1 | 118 | 6 | 19.7 | DNA_mismatch_repair_MutL/HexB_family |
| FBXW7 | 67 | 4 | 16.8 | NA |
| MEN1 | 49 | 3 | 16.3 | NA |
| FGFR3 | 31 | 2 | 15.5 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_Fibroblast_growth_factor_receptor_subfamily |
| TSHR | 46 | 3 | 15.3 | G-protein_coupled_receptor_1_family,_FSH/LSH/TSH_subfamily |
| JAK2 | 40 | 3 | 13.3 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_JAK_subfamily |
| RB1 | 102 | 8 | 12.8 | Retinoblastoma_protein_(RB)_family |
| PDGFRA | 35 | 3 | 11.7 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_CSF-1/PDGF_receptor_subfamily |
| NF1 | 65 | 6 | 10.8 | NA |
| FGFR2 | 43 | 4 | 10.8 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_Fibroblast_growth_factor_receptor_subfamily |
| FLT3 | 35 | 4 | 8.8 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_CSF-1/PDGF_receptor_subfamily |
| CDH1 | 68 | 8 | 8.5 | NA |
| TNFAIP3 | 31 | 4 | 7.8 | Peptidase_C64_family |
| CBL | 30 | 4 | 7.5 | NA |
| RET | 58 | 8 | 7.3 | Protein_kinase_superfamily,_Tyr_protein_kinase_family |
| MSH6 | 40 | 8 | 5 | DNA_mismatch_repair_MutS_family |
| ERBB2 | 29 | 6 | 4.8 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_EGF_receptor_subfamily |
| MET | 23 | 5 | 4.6 | Protein_kinase_superfamily,_Tyr_protein_kinase_family |
| ABL1 | 23 | 7 | 3.3 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_ABL_subfamily |
| ALK | 27 | 9 | 3 | Protein_kinase_superfamily,_Tyr_protein_kinase_family,_Insulin_receptor_subfamily |
| ATM | 134 | 52 | 2.6 | PI3/PI4-kinase_family,_ATM_subfamily |
Fig 2The functional characterization of cancer-sensitive and other proteins based on their gene ontologies.
Expression analysis depicting number of cell lines with expression more than a given cutoff (e.g., 3, 7, 9) for each antigen.
| Target | >= 3 | >= 7 | >= 9 | >= 12 |
|---|---|---|---|---|
| AAK1 | 901 | 0 | 0 | 0 |
| ABL1 | 901 | 900 | 595 | 0 |
| AKAP12 | 901 | 509 | 294 | 8 |
| AKAP9 | 901 | 900 | 723 | 2 |
| ALK | 901 | 23 | 12 | 1 |
| ALPK2 | 901 | 179 | 92 | 1 |
| ATM | 901 | 816 | 165 | 0 |
| BRAF | 901 | 282 | 1 | 0 |
| CARD10 | 901 | 591 | 109 | 0 |
| CBL | 901 | 754 | 4 | 0 |
| CDH1 | 901 | 358 | 217 | 0 |
| CHD1 | 901 | 901 | 644 | 0 |
| CREB3L2 | 901 | 849 | 424 | 1 |
| CTBP2 | 901 | 802 | 638 | 0 |
| CTNNB1 | 901 | 807 | 60 | 0 |
| EGFR | 901 | 318 | 21 | 0 |
| ERBB2 | 901 | 368 | 39 | 14 |
| FBXW7 | NA | NA | NA | NA |
| FGFR2 | 901 | 164 | 34 | 1 |
| FGFR3 | 901 | 88 | 6 | 0 |
| FLT3 | 901 | 35 | 27 | 3 |
| FMN2 | 901 | 101 | 22 | 0 |
| GATA1 | 901 | 19 | 17 | 0 |
| GPR112 | 901 | 0 | 0 | 0 |
| HSP90B1 | 901 | 901 | 897 | 282 |
| JAK2 | 901 | 68 | 7 | 1 |
| KIT | 901 | 170 | 82 | 12 |
| MAML2 | 901 | 49 | 0 | 0 |
| MAP3K1 | 901 | 393 | 28 | 0 |
| MAP3K4 | 901 | 900 | 678 | 0 |
| MEN1 | NA | NA | NA | NA |
| MET | NA | NA | NA | NA |
| MLH1 | 901 | 875 | 861 | 2 |
| MLL3 | 0 | 0 | 0 | 0 |
| MSH2 | 901 | 887 | 731 | 0 |
| MSH3 | 901 | 555 | 1 | 0 |
| MSH6 | 901 | 892 | 820 | 7 |
| MYLK | 901 | 473 | 286 | 26 |
| MYST4 | NA | NA | NA | NA |
| NCOA3 | 901 | 825 | 156 | 0 |
| NF1 | 901 | 264 | 2 | 0 |
| NF2 | 901 | 19 | 0 | 0 |
| NR1H2 | 901 | 161 | 0 | 0 |
| NRAS | 901 | 891 | 675 | 4 |
| PDE4DIP | 901 | 198 | 12 | 0 |
| PDGFRA | 901 | 115 | 73 | 9 |
| PIK3C2G | 901 | 14 | 2 | 0 |
| PIK3CA | 901 | 856 | 89 | 0 |
| PRKDC | 901 | 901 | 815 | 3 |
| PTEN | 901 | 834 | 315 | 0 |
| RB1 | 901 | 674 | 26 | 0 |
| RECQL4 | 901 | 804 | 39 | 0 |
| RET | 901 | 52 | 15 | 0 |
| SMAD4 | 901 | 575 | 5 | 0 |
| TNFAIP3 | 901 | 596 | 252 | 9 |
| TNRC6B | 901 | 803 | 8 | 0 |
| TP53 | 901 | 597 | 75 | 0 |
| TSHR | 901 | 22 | 3 | 0 |
| TTBK1 | 901 | 0 | 0 | 0 |
| VHL | 901 | 499 | 5 | 0 |
For example, HSP90B1 has 282 cell lines having expression greater or equal to 12.
Total number of generated neopeptides (9-mer peptides) in each vaccine candidate and number of neopeptides after applying different filters.
| Vaccine Candidate | Total 9-mer | Reference Protein | Reference Proteome | 1000-Genome Proteomes |
|---|---|---|---|---|
| TP53 | 2589 | 2204 | 2204 | 2204 |
| MLL3 | 6570 | 1671 | 1671 | 1670 |
| PDE4DIP | 3468 | 1130 | 1121 | 1013 |
| PRKDC | 5269 | 1149 | 1149 | 1149 |
| TNRC6B | 2730 | 905 | 905 | 886 |
| AKAP9 | 4873 | 974 | 974 | 938 |
| ATM | 4016 | 968 | 968 | 968 |
| GPR112 | 4061 | 989 | 989 | 989 |
| FMN2 | 2322 | 850 | 805 | 797 |
| NF1 | 3650 | 819 | 819 | 810 |
| MYST4 | 2705 | 668 | 668 | 639 |
| PTEN | 1148 | 753 | 753 | 753 |
| CTBP2 | 1550 | 573 | 573 | 573 |
| ALK | 2185 | 573 | 573 | 557 |
| MYLK | 2512 | 615 | 615 | 596 |
| ALPK2 | 2765 | 603 | 603 | 603 |
| AKAP12 | 2265 | 491 | 491 | 430 |
| MAML2 | 1536 | 443 | 443 | 440 |
| MAP3K4 | 2078 | 482 | 482 | 480 |
| PIK3CA | 1573 | 513 | 513 | 513 |
| SMAD4 | 1005 | 461 | 461 | 460 |
| RECQL4 | 1658 | 458 | 458 | 431 |
| PDGFRA | 1563 | 482 | 473 | 461 |
| MSH6 | 1839 | 487 | 487 | 459 |
| CHD1 | 2209 | 507 | 507 | 490 |
| PIK3C2G | 1881 | 444 | 426 | 418 |
| CDH1 | 1279 | 405 | 405 | 405 |
| EGFR | 1628 | 426 | 426 | 426 |
| FGFR3 | 1190 | 390 | 390 | 380 |
| MSH3 | 1489 | 363 | 363 | 334 |
| FBXW7 | 1111 | 412 | 412 | 409 |
| MET | 1813 | 413 | 413 | 401 |
| TNFAIP3 | 1139 | 357 | 357 | 348 |
| CTNNB1 | 1122 | 349 | 349 | 349 |
| RB1 | 1241 | 321 | 321 | 321 |
| RET | 1459 | 353 | 353 | 353 |
| NCOA3 | 1701 | 305 | 305 | 305 |
| KIT | 1280 | 312 | 312 | 303 |
| MLH1 | 1063 | 315 | 315 | 306 |
| MAP3K1 | 1835 | 331 | 330 | 322 |
| BRAF | 1106 | 348 | 348 | 345 |
| FLT3 | 1290 | 305 | 305 | 295 |
| FGFR2 | 1102 | 288 | 288 | 288 |
| ABL1 | 1400 | 259 | 259 | 259 |
| JAK2 | 1379 | 255 | 255 | 255 |
| MSH2 | 1198 | 272 | 272 | 254 |
| CARD10 | 1264 | 241 | 241 | 232 |
| TSHR | 1017 | 261 | 261 | 261 |
| ERBB2 | 1482 | 235 | 235 | 232 |
| CBL | 1123 | 225 | 225 | 223 |
| NRAS | 380 | 199 | 199 | 189 |
| MEN1 | 804 | 197 | 197 | 197 |
| TTBK1 | 1483 | 189 | 189 | 180 |
| NF2 | 798 | 211 | 211 | 201 |
| GATA1 | 569 | 164 | 164 | 164 |
| HSP90B1 | 980 | 185 | 185 | 185 |
| CREB3L2 | 670 | 158 | 158 | 158 |
| NR1H2 | 592 | 139 | 139 | 138 |
| AAK1 | 1080 | 132 | 132 | 122 |
| VHL | 297 | 92 | 92 | 92 |
The filters remove neoepitopes present in reference protein, human reference proteome and 1000 Genomes-based proteomes.
Fig 3A general workflow exhibiting the overall concept of database section of Cancertope workbench.
Fig 4The personalized module of Cancertope workbench.
Fig 5The fully personalized module of Cancertope workbench.