| Literature DB >> 25707432 |
Qianghua Xiao, Jianxin Wang, Xiaoqing Peng, Fang-xiang Wu, Yi Pan.
Abstract
Essential proteins are vitally important for cellular survival and development, and identifying essential proteins is very meaningful research work in the post-genome era. Rapid increase of available protein-protein interaction (PPI) data has made it possible to detect protein essentiality at the network level. A series of centrality measures have been proposed to discover essential proteins based on the PPI networks. However, the PPI data obtained from large scale, high-throughput experiments generally contain false positives. It is insufficient to use original PPI data to identify essential proteins. How to improve the accuracy, has become the focus of identifying essential proteins. In this paper, we proposed a framework for identifying essential proteins from active PPI networks constructed with dynamic gene expression. Firstly, we process the dynamic gene expression profiles by using time-dependent model and time-independent model. Secondly, we construct an active PPI network based on co-expressed genes. Lastly, we apply six classical centrality measures in the active PPI network. For the purpose of comparison, other prediction methods are also performed to identify essential proteins based on the active PPI network. The experimental results on yeast network show that identifying essential proteins based on the active PPI network can improve the performance of centrality measures considerably in terms of the number of identified essential proteins and identification accuracy. At the same time, the results also indicate that most of essential proteins are active.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25707432 PMCID: PMC4331804 DOI: 10.1186/1471-2164-16-S3-S1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Number of essential proteins detected by each methods in two different networks. As is shown in Fig.1, the performance of each centrality measures in identifying essential proteins based on APPIN is better than PPIN. Especially, the improvements of SC based on APPIN are more than 50% when predicting 100 proteins, the number of essential proteins identified by LAC and NC based on APPIN achieves to 80.
Figure 2DC, BC, CC, SC, LAC and NC are compared in two different networks by a jackknife methodology. To further illustrate the efficiency of our strategy, we have analyzed by using a jackknife methodology. In Fig.2, proteins are ordered in descending according to their scores. The curve is plotted with the cumulative counters of true essential proteins and the cumulative counters of predicted essential proteins.
The case of overlaps essential proteins in different two networks when predicting 100 proteins
| Centrality measures | ||||
|---|---|---|---|---|
| Degree Centrality (DC) | 56/46 | 26 | 30 | 20 |
| Betweenness Centrality (BC) | 54/44 | 23 | 31 | 21 |
| Closeness Centrality (CC) | 55/41 | 12 | 43 | 29 |
| Subgraph Centrality(SC) | 57/37 | 10 | 47 | 27 |
| Edge Clustering Coefficient (NC) | 80/56 | 26 | 54 | 30 |
| Local Average Connectivity Centrality (LAC) | 82/59 | 35 | 47 | 24 |