| Literature DB >> 33193746 |
Guang-Ping Li1, Pu-Feng Du1, Zi-Ang Shen1, Hang-Yu Liu1, Tao Luo1.
Abstract
Eukaryotic cells contain numerous components, which are known as subcellular compartments or subcellular organelles. Proteins must be sorted to proper subcellular compartments to carry out their molecular functions. Mis-localized proteins are related to various cancers. Identifying mis-localized proteins is important in understanding the pathology of cancers and in developing therapies. However, experimental methods, which are used to determine protein subcellular locations, are always costly and time-consuming. We tried to identify cancer-related mis-localized proteins in three different cancers using computational approaches. By integrating gene expression profiles and dynamic protein-protein interaction networks, we established DPPN-SVM (Dynamic Protein-Protein Network with Support Vector Machine), a predictive model using the SVM classifier with diffusion kernels. With this predictive model, we identified a number of mis-localized proteins. Since we introduced the dynamic protein-protein network, which has never been considered in existing works, our model is capable of identifying more mis-localized proteins than existing studies. As far as we know, this is the first study to incorporate dynamic protein-protein interaction network in identifying mis-localized proteins in cancers.Entities:
Keywords: differentially gene expression; diffusion kernel; mis-localized proteins; protein subcellular localization; protein-protein interactions
Year: 2020 PMID: 33193746 PMCID: PMC7644922 DOI: 10.3389/fgene.2020.600454
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1The summary of dataset. (A) The number of proteins with different number of subcellular locations. Among the 6461 annotated BioGRID proteins, there were 4112 proteins with only one subcellular location, 1731 proteins with two locations, 503 proteins with three locations, 98 proteins with four locations, 15 proteins with five locations and 2 proteins with six locations. The average multiplicity degree of the dataset was 1.48. (B) The number of locative proteins in different subcellular locations. There are 6461 proteins with experimentally annotated subcellular locations in the dataset. Because one protein may have more than one subcellular location, the number of locative proteins is 9562.
Performance comparison in non-disease state.
| Measuresa | Our method | Hum-mPLoc 3.0 |
| AIM | 72.00% | 68.10% |
| CVR | 69.50% | 65.10% |
| mlACC | 68.60% | 65.00% |
| ATR | 64.30% | 61.80% |
Representative prediction of mis-localized proteins.
| Disorder | Uniprot ID | Mis-localizationsa | Rankb |
| Leukemia | MAGA3_HUMAN | +Cell cortex (+Inf) | 1 |
| F217B_HUMAN | +Peroxisome (+Inf) | 2 | |
| EI24_HUMAN [41] | +Mitochondrion (+3349.02%) | 3 | |
| ROP1A_HUMAN | +Peroxisome(+3086.82%) | 4 | |
| THYN1_HUMAN | +Nucleus (+2461.70%) | 5 | |
| CLGN_HUMAN | +Nucleus (+2425.50%) | 6 | |
| SETBP_HUMAN [40] | +Endoplasmic reticulum (+658.04%) | 30 | |
| TF2L1_HUMAN | −Nucleus (−99.57%) | 1 | |
| UPP1_HUMAN | −Cell cortex (−97.49%) | 2 | |
| AL1A1_HUMAN | −Peroxisome (−96.63%) | 3 | |
| ABCA1_HUMAN | −Lysosome (−95.71%) | 4 | |
| PARP4_HUMAN | −Lysosome (−94.87%) | 5 | |
| AL7A1_HUMAN | −Peroxisome (−93.09%) | 6 | |
| SETBP_HUMAN [40] | −Nucleus (−83.94%) | 28 | |
| EI24_HUMAN [41] | −Endoplasmic reticulum(−50.22%) | 348 | |
| Breast cancer | TM258_HUMAN | +Peroxisome (+Inf) | 1 |
| KCNKI_HUMAN | +Mitochondrion (+Inf) | 2 | |
| MARC2_HUMAN | +Cell cortex (+Inf) | 3 | |
| HEBP2_HUMAN | +Lysosome (+Inf) | 4 | |
| SIT1_HUMAN | +Mitochondrion (+13310.16%) | 5 | |
| PIM3_HUMAN | +Lysosome (+9723.01%) | 6 | |
| PD1L1_HUMAN [42] | +Nucleolus (+290.65%) | 242 | |
| INGR2_HUMAN [43] | +Mitochondrion (+184.50%) | 437 | |
| VGFR3_HUMAN [42] | +Nucleolus (+125.16%) | 755 | |
| ANO4_HUMAN | −Nucleus (−98.91%) | 1 | |
| ABCA1_HUMAN | −Lysosome (−98.78%) | 2 | |
| NDUB7_HUMAN | −Mitochondrion (−98.35%) | 3 | |
| TM127_HUMAN | −Plasma membrane (−98.25%) | 4 | |
| RUBIC_HUMAN | −Endoplasmic reticulum (−96.45%) | 5 | |
| TRIM4_HUMAN | −Nucleus (−96.45%) | 6 | |
| INGR2_HUMAN [43] | −Plasma membrane (−63.74%) | 595 | |
| Hepatitis carcinoma | TBCA_HUMAN | +Cell cortex (+Inf) | 1 |
| F217B_HUMAN | +Peroxisome (+Inf) | 2 | |
| HKDC1_HUMAN | +Nucleus (+65006.48%) | 3 | |
| SYAC_HUMAN | +Peroxisome (+10652.77%) | 4 | |
| RFWD3_HUMAN | +Lysosome (+10599.05%) | 5 | |
| ABCA1_HUMAN [10] | +Lysosome (+8115.45%) | 6 | |
| S10AB_HUMAN [44] | +Peroxisome (+6868.17%) | 12 | |
| FOXP1_HUMAN [10] | +Peroxisome (+612.39%) | 478 | |
| RM14_HUMAN | −Cell cortex (−99.99%) | 1 | |
| RM47_HUMAN | −Cell cortex (−99.99%) | 2 | |
| RT30_HUMAN | −Cell cortex (−99.99%) | 3 | |
| DUS11_HUMAN | −Cell cortex (−99.99%) | 4 | |
| CLGN_HUMAN | −Cell cortex (−99.99%) | 5 | |
| RM01_HUMAN | −Cell cortex (−99.99%) | 6 | |
| ABCA1_HUMAN [10] | −Cell cortex (−99.28%) | 249 |