| Literature DB >> 27648447 |
Yang Hu1, Ying Zhang2, Jun Ren1, Yadong Wang3, Zhenzhen Wang4, Jun Zhang2.
Abstract
The overall goal is to establish a reliable human protein-protein interaction network and develop computational tools to characterize a protein-protein interaction (PPI) network and the role of individual proteins in the context of the network topology and their expression status. A novel and unique feature of our approach is that we assigned confidence measure to each derived interacting pair and account for the confidence in our network analysis. We integrated experimental data to infer human PPI network. Our model treated the true interacting status (yes versus no) for any given pair of human proteins as a latent variable whose value was not observed. The experimental data were the manifestation of interacting status, which provided evidence as to the likelihood of the interaction. The confidence of interactions would depend on the strength and consistency of the evidence.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27648447 PMCID: PMC5015007 DOI: 10.1155/2016/5313050
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Data sets or databases used to construct the human protein-protein interaction network.
| Method | Organism | Reference |
|---|---|---|
| Y2H | Human | Stelzl et al. [ |
| Y2H | Human | Rual et al. [ |
| MPC | Human | Ewing et al. [ |
| Literature | Human | HPRD [ |
| Y2H | Yeast | Ito et al. [ |
| Y2H | Yeast | Uetz et al. [ |
| MPC | Yeast | Gavin et al. [ |
| MPC | Yeast | Ho et al. [ |
| MPC | Yeast | Gavin et al. [ |
| MPC | Yeast | Krogan et al. [ |
| Literature | Multiple | IntAct [ |
| Literature | Multiple | MIPS [ |
| Multiple | Multiple | DIP [ |
Figure 1Overall scheme to construct the human protein-protein interaction network. The interaction status of a given pair of human proteins and their homolog in other organisms are unobserved (dashed box) and the experimental data and genomic features are observed evidence (solid boxes). Solid arrows represent model hierarchy and dashed arrows represent inference steps.
Algorithm 1Monte Carlo expectation maximization for parameter estimation.
Comparison of parameters based on different data.
| Parameters | High-throughput Y2H | High-throughput MPC | Human PPI data | All PPI data |
|---|---|---|---|---|
|
| 6.8 × 10−3 | 1.9 × 10−3 | 6.1 × 10−3 | 1.4 × 10−2 |
|
| 7.7 × 10−5 | — | 5.3 × 10−5 | 8.9 × 10−5 |
|
| 0.658 | — | 0.543 | 0.933 |
|
| 0.426 | — | 0.496 | 0.852 |
|
| 4.5 × 10−3 | — | 9.7 × 10−4 | 0.007 |
|
| — | 0.738 | 0.755 | 0.809 |
|
| — | 0.623 | 0.764 | 0.788 |
Figure 2The optimization of Q and Q for different ε. Red line and green line correspond to Q and Q separately.
Figure 3A reliable subnetwork for hela cell. Circles correspond to IDPs. And the degree of grey corresponds to the length of intrinsically disordered region for IDP.