| Literature DB >> 27466623 |
Shu Tadaka1, Kengo Kinoshita1,2,3.
Abstract
MOTIVATION: The identification of functional modules from protein-protein interaction (PPI) networks is an important step toward understanding the biological features of PPI networks. The detection of functional modules in PPI networks is often performed by identifying internally densely connected subnetworks, and often produces modules with "core" and "peripheral" proteins. The core proteins are the ones having dense connections to each other in a module. The difference between core and peripheral proteins is important to understand the functional roles of proteins in modules, but there are few methods to explicitly elucidate the internal structure of functional modules at gene level.Entities:
Mesh:
Year: 2016 PMID: 27466623 PMCID: PMC5181566 DOI: 10.1093/bioinformatics/btw488
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Summary of extracted clusters from an artificially generated network
| Running time | # of found clusters | Cluster size (avg, SD) | Cluster cliqueness (avg, SD) | |
|---|---|---|---|---|
| NCMine | 10 s | 706 | 4–14 (6.77, 1.95) | 0.60–0.97(0.67, 0.06) |
| MCODE | 8 s | 370 | 3–53 (7.72, 5.84) | 0.09–1.00(0.77, 0.24) |
| CFinder | 10 s | 1340 | 3–13 (8.41, 2.52) | 0.57–1.00(0.87, 0.10) |
| NeMo | 1 m 30 s | 953 | 4–13 (6.54, 2.31) | 0.00–1.00(0.57, 0.37) |
Summary of extracted clusters from the HPRD human protein–protein interaction (PPI) network using each method
| Running time | # of extracted clusters (A) | Cluster size (avg, SD) | Cluster cliqueness (avg, SD) | # of biologically meaningful clusters (B) | Ratio (B/A) | |
|---|---|---|---|---|---|---|
| NCMine | 15 s | 2309 (102 | 3–21 | 0.600–1.000 | 1259 | 0.54 |
| (4.880, 2.887) | (0.764,0.162) | |||||
| MCODE | 7 s | 250 | 2–160 | 0.016–1.000 | 93 | 0.45 |
| (9.359,19.945) | (0.654, 0.337) | |||||
| CFinder | 28 s (10 s | 764 | 3–1010 | 0.017–1.000 | 315 | 0.41 |
| (7.055,38.954) | (0.900,0.164) | |||||
| NeMo | 180 s | 1510 | 4–68 | 0.000–1.000 | 784 | 0.52 |
| (8.940, 7.456) | (0.128, 0.244) |
*The number in the parenthesis is the number of clusters that are involved in core-peripheral structures.
**The number in the parenthesis is calculation time with approx. option = 0.1 s.
Fig. 1.Recall and precision calculated from cluster extraction results by comparing extracted clusters and actual embedded clusters in artificial networks. Higher recall and precision indicate better performance
Parameter settings for MCODE and NCMine used for performance tests with artificially generated networks
| Default value | Range of value used in tests (step) | |
|---|---|---|
| Degree cutoff | 2 | 2 – 10 (2) |
| Node score cutoff | 0.2 | 0.0–1.0 (0.2) |
| K-core cutoff | 2 | 2–10 (2) |
| Degree cutoff | 2 | 2–10 (2) |
| Cliqueness threshold | 0.6 | 0.0–1.0 (0.2) |
| Merge threshold | 0.6 | 0.0–1.0 (0.2) |
| Cliqueness-change threshold | 0.2 | 0.0–1.0 (0.2) |
Fig. 2.(A) Comparison of cliqueness of clusters extracted by each method. (B) Comparison of the sizes of extracted clusters and known protein complexes (C) Relationship between node degree and cluster membership calculated from the NCMine cluster extraction results. Genes included in the plot are listed in Supplementary Table S1(A) and (B)
Fig. 3.(A) Two sets of peripheral proteins (blue and red) with the same core proteins (green) are shown. In this example, most of the proteins were related to cancer development. However, the proteins that participate in blue peripherals and red peripherals had different functions. Genes in the clusters: AKT1: v-akt murine thymoma viral oncogene homolog 1; AR: Androgen receptor; BRCA1: Breast cancer 1, early onset; CREBBP: CREB-binding protein; CTNNB1: Catenin (cadherin-associated protein), beta 1, 88 kDa; EP300: E1A-binding protein p300; ESR1: Estrogen receptor 1; JUN: Jun proto-oncogene; RB1: Retinoblastoma 1; RELA: v-rel avian reticuloendotheliosis viral oncogene homolog A; SMAD1: SMAD family member 1; SMAD2: SMAD family member 2; SMAD3: SMAD family member 3; SMAD4: SMAD family member 4; SP1: Sp1 transcription factor; STAT3: Signal transducer and activator of transcription 3 (acute-phase response factor); TP53: Tumor protein p53; UBE2I: Ubiquitin-conjugating enzyme E2I (B) Most of the core-proteins (green) were involved in the Wnt signaling pathway, which is related to cancer, generally; however, peripheral-proteins involved specific cancer development pathways. Genes in the clusters: AKT1: v-akt murine thymoma viral oncogene homolog 1; AR: Androgen receptor; BRCA1: Breast cancer 1, early onset; CREBBP: CREB-binding protein; CTNNB1: Catenin (cadherin-associated protein), beta 1, 88 kDa; EP300: E1A-binding protein p300; ESR1: Estrogen receptor 1; JUN: Jun proto-oncogene; MAPK1: Mitogen-activated protein kinase 1; RB1: Retinoblastoma 1; SMAD1: SMAD family member 1; SMAD2: SMAD family member 2; SMAD3: SMAD family member 3; SMAD4: SMAD family member 4; SMAD9: SMAD family member 9; SP1: Sp1 transcription factor; STAT3: Signal transducer and activator of transcription 3 (acute-phase response factor); TGFBR1: Transforming growth factor, beta receptor 1; TP53: Tumor protein p53; UBE2I: Ubiquitin-conjugating enzyme E2I
Fig. 4.Cytoscape plugin for NCMine, and basic workflow