| Literature DB >> 22419782 |
Hang T T Phan1, Michael J E Sternberg.
Abstract
MOTIVATION: Analysis of protein-protein interaction networks (PPINs) at the system level has become increasingly important in understanding biological processes. Comparison of the interactomes of different species not only provides a better understanding of species evolution but also helps with detecting conserved functional components and in function prediction. Method andEntities:
Mesh:
Substances:
Year: 2012 PMID: 22419782 PMCID: PMC3338015 DOI: 10.1093/bioinformatics/bts119
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.PINALOG global alignment method comprises of three main steps: (i) Community detection: identifies dense subnetworks of input networks using CFinder (e.g. red and blue 3, 4 and 5-node subnetworks in the lower part of the diagram). (ii) Community mapping: maps similar communities that have high similarity scores, i.e. containing many inter-species proteins with high similarity scores. In the first table (lower part of the diagram), mapped communities with high similarity scores are marked in red numbers. Similar protein pairs from mapped communities are extracted to form a list of core pairs. (iii) Extension mapping: maps proteins in the neighbourhood of the core protein pairs which are then added to the core, e.g. protein I mapped with X and H with Y in the second table in the lower part of the diagram. Extension mapping is repeated until no more pair is added.
Alignment results of different pairs of species by PINALOG (PA) (obtained with the automatically generated θ=0.871), IsoRank (IR), MI-GRAAL (MG) and BLAST (BL) for human/yeast
| Stat | PA | IR | MG | BL |
|---|---|---|---|---|
| NA | 5222 | 5674 | 5674 | 1818 |
| NC | 3319 | 717 | 4107 | 530 |
| NF | 3139 | 734 | 146 | 1347 |
| NH | 454 | 165 | 0 | 818 |
| NH/NA | 0.09 | 0.023 | 0 | 0.45 |
| NI | 460 | 136 | 0 | 465 |
| NI/NC | 0.14 | 0.19 | 0 | 0.88 |
The statistics (Stat) are NA, NC, NF, NH and NI which denote the number of aligned pairs of proteins, the number of conserved interactions, the number of protein pairs with functional similarity >0.5 and the number of Homologene pairs and interlogs. The ratios NH/NA and NI/NC are also given.
Fig. 2.Complex prediction results of PINALOG in comparison with other prediction methods based on recall, precision and F-measure. Complex predictions for yeast by these methods are compared with the gold-standard yeast complexes in MIPS CYDG. A prediction that has overlap with a true protein complex with an OS > 0.2 is considered matched. The number of matched complexes are used to calculate recall, precision and F-measure as defined in the main text.
Protein function prediction assessment by 100-fold cross validation, comparison with PSI-BLAST and IsoRank
| PINALOG-A | PSI-BLAST | IsoRank | ||||
|---|---|---|---|---|---|---|
| BP | MF | BP | MF | BP | MF | |
| Recall | 0.14 | 0.28 | 0.07 | 0.29 | 0.08 | 0.17 |
| Precision | 0.28 | 0.43 | 0.29 | 0.47 | 0.2 | 0.32 |
BP and MF stands for biological process and molecular function terms in GO.