| Literature DB >> 23940716 |
Wan Li1, Lina Chen, Weiming He, Weiguo Li, Xiaoli Qu, Binhua Liang, Qianping Gao, Chenchen Feng, Xu Jia, Yana Lv, Siya Zhang, Xia Li.
Abstract
The cardiomyopathies are a group of heart muscle diseases which can be inherited (familial). Identifying potential disease-related proteins is important to understand mechanisms of cardiomyopathies. Experimental identification of cardiomyophthies is costly and labour-intensive. In contrast, bioinformatics approach has a competitive advantage over experimental method. Based on "guilt by association" analysis, we prioritized candidate proteins involving in human cardiomyopathies. We first built weighted human cardiomyopathy-specific protein-protein interaction networks for three subtypes of cardiomyopathies using the known disease proteins from Online Mendelian Inheritance in Man as seeds. We then developed a method in prioritizing disease candidate proteins to rank candidate proteins in the network based on "guilt by association" analysis. It was found that most candidate proteins with high scores shared disease-related pathways with disease seed proteins. These top ranked candidate proteins were related with the corresponding disease subtypes, and were potential disease-related proteins. Cross-validation and comparison with other methods indicated that our approach could be used for the identification of potentially novel disease proteins, which may provide insights into cardiomyopathy-related mechanisms in a more comprehensive and integrated way.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23940716 PMCID: PMC3733802 DOI: 10.1371/journal.pone.0071191
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The workflow of our method in prioritizing disease candidate proteins.
First, cardiomyopathy (DCM, HCM or ARVC)-specific PPINs were constructed, which were composed of seed proteins and their direct neighbors (candidate proteins) from human PPIN. Secondly, two weights (interaction confidence scores and functional similarities) were used to measure each protein interaction. The disease relevance score of each protein was measured by using these weights. Finally, the proteins ranked at top of each candidate list in descending order of disease relevance score were taken as potential disease-related proteins.
Official symbols of seed genes of DCM, HCM and ARVC.
| DCM | HCM | ARVC | |
| Seed genes/ | NEXN (Q0ZGT2 | NEXN (Q0ZGT2) | RYR2 (Q92736) |
| proteins | LMNA (P02545) | TNNT2 (P45379) | TMEM43 (Q9BTV4) |
| TNNT2 (P45379) | TTN (Q8WZ42) | RPSA (P08865) | |
| PSEN2 (P49810) | CAV3 (P56539) | DSP (P15924) | |
| ACTN2 (P35609) | MYL3 (P08590) | PKP2 (Q99959) | |
| TTN (Q8WZ42) | TNNC1 (P63316) | TGFB3 (P10600) | |
| DES (P17661) | MYOZ2 (Q9NPC6) | JUP (P14923) | |
| SCN5A (Q14524) | SLC25A4 (P12235) | DSC2 (Q02487) | |
| TNNC1 (P63316) | MYO6 (Q9UM54) | DSG2 (Q14126) | |
| SDHA (P31040) | PLN (P26678) | ||
| SGCD (Q92629) | PRKAG2 (Q9UGJ0) | ||
| DSP (P15924) | VCL (P18206) | ||
| PLN (P26678) | COX15 (Q7KZN9) | ||
| EYA4 (O95677) | CSRP3 (P50461) | ||
| GATAD1 (Q8WUU5) | MYBPC3 (Q14896) | ||
| FKTN (O75072) | MYL2 (P10916) | ||
| VCL (P18206) | MYH6 (P13533 | ||
| LDB3 (O75112) | MYH7 (P12883) | ||
| RBM20 (Q5T481) | ACTC1 (P68032) | ||
| BAG3 (O95817) | TPM1 (P09493) | ||
| CSRP3 (P50461) | CALR3 (Q96L12) | ||
| MYBPC3 (Q14896) | TNNI3 (P19429) | ||
| ABCC9 (O60706) | MYLK2 (Q9H1R3) | ||
| TMPO (P42166) | JPH2 (Q9BR39) | ||
| MYH6 (P13533) | |||
| MYH7 (P12883) | |||
| PSEN1 (P49768) | |||
| ACTC1 (P68032) | |||
| TPM1 (P09493) | |||
| TCAP (O15273) | |||
| DSG2 (Q14126) | |||
| TNNI3 (P19429) | |||
| DMD (P11532) |
Accession number of the corresponding protein.
Top 50 candidate proteins from DCM-specific PPIN.
| Protein | Accession number | Rank | Disease relevance score | Relevance | Literature |
| MYL2 | P10916 | 1 | 107629670.500 | cardiomyopathy |
|
| MYL3 | P08590 | 2 | 73161485.690 | cardiomyopathy |
|
| TNNI1 | P19237 | 3 | 52203517.710 | ||
| MYH14 | Q7Z406 | 4 | 45310956.140 | ||
| NEB | P20929 | 5 | 44470968.060 | ||
| TNNI2 | P48788 | 6 | 29830367.350 | ||
| GJA1 | P17302 | 7 | 24199871.590 | cardiac arrhythmias |
|
| ACTA1 | P68133 | 8 | 23568069.020 | DCM |
|
| MYL1 | P05976 | 9 | 19532022.970 | ||
| TNNC2 | P02585 | 10 | 18200557.600 | ||
| TPM2 | P07951 | 11 | 17743459.410 | cardiac dysfunction |
|
| VIM | P08670 | 12 | 15069691.590 | ||
| TNNT3 | P45378 | 13 | 12691030.660 | ||
| TNNT1 | P13805 | 14 | 10598443.490 | ||
| GJA5 | P36382 | 15 | 9497687.715 | cardiac arrhythmias |
|
| SP4 | Q02446 | 16 | 7947705.183 | ||
| MYL4 | P12829 | 17 | 7586555.815 | ||
| TPM3 | P06753 | 18 | 7337310.387 | ||
| TMOD1 | P28289 | 19 | 7014582.495 | DCM |
|
| MYOT | Q9UBF9 | 20 | 6610783.862 | ||
| TPM4 | P67936 | 21 | 6600584.576 | ||
| MYH3 | P11055 | 22 | 6368705.641 | ||
| MYBPC1 | Q00872 | 23 | 6317707.232 | ||
| MYBPC2 | Q14324 | 24 | 4820328.071 | ||
| CAV3 | P56539 | 25 | 3746694.232 | DCM |
|
| MYOD1 | P15172 | 26 | 2449472.494 | cardiomyopathy |
|
| CALM1 | P62158 | 27 | 2291170.029 | DCM |
|
| ACTB | P60709 | 28 | 2039574.700 | ||
| MYOG | P15173 | 29 | 1304602.403 | ||
| DNAH8 | Q96JB1 | 30 | 1247401.222 | ||
| CKM | P06732 | 31 | 1139350.347 | DCM |
|
| CAPN3 | P20807 | 32 | 1108281.072 | ||
| PRKAG2 | Q9UGJ0 | 33 | 1068188.085 | cardiomyopathy |
|
| ZMPSTE24 | O75844 | 34 | 1050441.814 | DCM |
|
| AMY1A | P04745 | 35 | 1028615.159 | ||
| AMY1B | P04745 | 36 | 983506.614 | ||
| HRAS | P01112 | 37 | 931783.319 | cardiomyopathy |
|
| HLA-DR4 | P13760 | 38 | 926276.147 | DCM |
|
| DNM2 | P50570 | 39 | 896713.915 | ||
| NKX2-5 | P52952 | 40 | 791481.623 | cardiomyopathy |
|
| FXN | Q16595 | 41 | 591821.092 | cardiomyopathy |
|
| DYSF | O75923 | 42 | 591430.047 | DCM |
|
| AMY2A | P04746 | 43 | 547354.552 | ||
| ACTG1 | P63261 | 44 | 532313.791 | ||
| C1QBP | Q07021 | 45 | 491410.248 | cardiac cell damage |
|
| CALD1 | Q05682 | 46 | 485082.005 | ||
| AMY2B | P19961 | 47 | 476690.079 | ||
| DAG1 | Q14118 | 48 | 475398.858 | DCM |
|
| AMY1C | P04745 | 49 | 455785.472 | ||
| PRKCA | P17252 | 50 | 442473.436 |
Proteins are represented in their corresponding gene symbols.
Figure 2DCM pathway.
DCM seed proteins are colored in cyan. Red nodes are proteins which were verified to be DCM-related proteins, and yellow nodes represent proteins which are potential DCM-related proteins.
Figure 3DCM pathway and its relevant pathways.
DCM pathway is colored in yellow. Purple nodes are DCM-related pathways, and green nodes are other pathways. Black edges connect pathways which are directly connected to the DCM pathway.
AUC for three subtypes of cardiomyopathies obtained using five different methods.
| Our developed method | Chen’s protein ranking method | DADA | ToppGene | ToppNet | |
| DCM | 0.963 | 0.956 | 0.854 | 0.884 | 0.741 |
| HCM | 0.919 | 0.916 | 0.979 | 0.911 | 0.716 |
| ARVC | 0.995 | 0.934 | 0.770 | 0.946 | 0.756 |
Figure 4The number of proteins related with DCM.
50 potential disease proteins identified either by our developed method (the top left circle), or by Chen’s protein ranking method (the top right circle), and the number of proteins which have been confirmed to be related with DCM in literature were plotted.
AUC for three subtypes of cardiomyopathies obtained using GeneMANIA and ToppGenet.
| Our developed method | GeneMANIA | ToppGenet | |||||
| Distance to seeds | |||||||
| 1 | 2 | 3 | 4 | ||||
| DCM | 0.963 | 0.466 | Network based | 0.373 | 0.660 | 0.728 | 0.725 |
| Functional annotation based | 0.485 | 0.724 | 0.774 | 0.809 | |||
| HCM | 0.919 | 0.588 | Network based | 0.291 | 0.670 | 0.767 | 0.776 |
| Functional annotation based | 0.369 | 0.834 | 0.905 | 0.904 | |||
| ARVC | 0.995 | 0.569 | Network based | 0.519 | 0.630 | 0.665 | 0.669 |
| Functional annotation based | 0.801 | 0.873 | 0.894 | 0.894 | |||