Asa Pérez-Bercoff, Takashi Makino, Aoife McLysaght.
Abstract
BACKGROUND: There is increasing interest in the evolution of protein-protein interactions because this should ultimately be informative of the patterns of evolution of new protein functions within the cell. One model proposes that the evolution of new protein-protein interactions and protein complexes proceeds through the duplication of self-interacting genes. This model is supported by data from yeast. We examined the relationship between gene duplication and self-interaction in the human genome.
Year: 2010 PMID: 20509897 PMCID: PMC2894830 DOI: 10.1186/1471-2148-10-160
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1. Data collection. Flow chart illustrating how the human interaction data were collected from HPRD release 7, and subsequently matched with blastable Ensembl Core release 50 identifiers in order to extract the final 8881 genes involved in 34808 protein-protein interactions.
Figure 2. Definition of mouse-specific duplicate genes in human. Human genes (H) were classified as singletons if they had a one-to-one orthologous relationship with mouse (M), and as mouse-specific duplicate genes if the relationship with mouse was one-to-many, i.e. if one human gene had at least two orthologous genes in the mouse lineage.
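The classification rule in this caption can be sketched in a few lines of Python. This is only an illustration, not the authors' pipeline: the input format (a list of human/mouse ortholog pairs), the function name, and the extra "other" label for many-to-one or many-to-many relationships are all assumptions made here.

```python
from collections import defaultdict

def classify_by_mouse_orthology(ortholog_pairs):
    """Label human genes from (human_gene, mouse_gene) ortholog pairs.

    "singleton"                : one-to-one with mouse
    "mouse-specific duplicate" : one human gene, two or more mouse orthologs
    "other"                    : a mouse ortholog is shared with another
                                 human gene (assumed category, not in the caption)
    """
    mouse_of = defaultdict(set)   # human gene -> its mouse orthologs
    human_of = defaultdict(set)   # mouse gene -> its human orthologs
    for human_gene, mouse_gene in ortholog_pairs:
        mouse_of[human_gene].add(mouse_gene)
        human_of[mouse_gene].add(human_gene)

    labels = {}
    for human_gene, mice in mouse_of.items():
        if any(len(human_of[m]) > 1 for m in mice):
            labels[human_gene] = "other"
        elif len(mice) == 1:
            labels[human_gene] = "singleton"
        else:
            labels[human_gene] = "mouse-specific duplicate"
    return labels

# Hypothetical identifiers, for illustration only.
pairs = [("H1", "M1"), ("H2", "M2a"), ("H2", "M2b"), ("H3", "M3"), ("H4", "M3")]
labels = classify_by_mouse_orthology(pairs)
```

With the toy pairs, `H1` is labelled a singleton and `H2` a mouse-specific duplicate, while `H3` and `H4` share a mouse ortholog and fall outside both classes.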
Proportion of self-interacting singletons and duplicates.
| Classification system | Gene class | Self-interacting | Total |
|---|---|---|---|
| Human BLASTP | Singletons | 433 (16%) | 2595 |
| Human BLASTP | Duplicates | 1285 (23%) | 5531 |
| Mouse duplicability | 1:1 orthologs | 1682 (21%) | 7968 |
| Mouse duplicability | 1:many orthologs | 51 (27%) | 186 |
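One way to check whether the difference between singletons and duplicates in the Human BLASTP rows is larger than chance is a Pearson chi-square test on the 2x2 table of counts. The sketch below is an assumption about a reasonable test, not a statement of the test the authors used; it relies only on the standard library and on the identity that the upper tail of a chi-square distribution with one degree of freedom equals `erfc(sqrt(x / 2))`.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for [[a, b], [c, d]].

    Returns (statistic, p_value).
    """
    n = a + b + c + d
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value

# Counts taken from the Human BLASTP rows of the table above.
singleton_self, singleton_total = 433, 2595
duplicate_self, duplicate_total = 1285, 5531

stat, p_value = chi_square_2x2(
    singleton_self, singleton_total - singleton_self,   # singletons: self / non-self
    duplicate_self, duplicate_total - duplicate_self,   # duplicates: self / non-self
)
```

The same function can be applied to the mouse duplicability rows, although with only 186 genes in the 1:many class an exact test may be the safer choice.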
Proportion of self-interacting duplicate genes generated by different mechanisms.
| Duplication Mechanism | self-interacting | Total |
|---|---|---|
| WGD | 717 (25%) | 2877 |
| SSD | 630 (21%) | 2961 |
Figure 3. Relationship of duplication type and number of interactions. The proportion of WGD genes among all duplicate (WGD and SSD) genes increases with the number of protein-protein interactions, irrespective of self-interactions. a) Proportion of WGD genes among all duplicate genes with respect to the number of interactions. (Bins were created to contain similar numbers of genes.) b) Proportion of self-interacting WGD genes among all self-interacting duplicate genes with respect to the number of interactions. (Bins were created to contain similar numbers of genes.) c) Relationship between synonymous divergence and the number of PPI partners of each gene in the duplicate pair. The x-axis displays the synonymous substitution rate (KS) between a duplicate pair, while the y-axis is the mean of the total number of PPIs of all genes in each KS bin (category).
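The binning described for panels a and b (bins holding similar numbers of genes) can be approximated by sorting genes by interaction count and cutting the sorted list into near-equal chunks. The following is a minimal sketch under that assumption; the function name, the input format, and the toy data are made up for illustration and do not come from the paper.

```python
def wgd_proportion_by_bin(genes, n_bins):
    """Summarise the WGD fraction in bins of near-equal gene count.

    genes  : list of (n_interactions, is_wgd) tuples for duplicate genes
    n_bins : number of bins to cut the sorted list into
    Returns a list of (min_interactions, max_interactions, wgd_fraction).
    """
    ordered = sorted(genes, key=lambda g: g[0])
    n = len(ordered)
    summary = []
    for i in range(n_bins):
        # Integer arithmetic keeps bin sizes within one gene of each other.
        chunk = ordered[i * n // n_bins:(i + 1) * n // n_bins]
        if not chunk:
            continue
        wgd_count = sum(1 for _, is_wgd in chunk if is_wgd)
        summary.append((chunk[0][0], chunk[-1][0], wgd_count / len(chunk)))
    return summary

# Toy data, not taken from the paper.
toy = [(1, False), (1, False), (2, True), (3, False), (8, True), (20, True)]
bins = wgd_proportion_by_bin(toy, 3)
```

Note that this simple cut can place genes with the same interaction count in adjacent bins; a real analysis might move bin edges so that ties stay together, at the cost of less even bin sizes.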
Over-represented GO terms when self- and nonself-interacting genes are compared against each other.
| GO IDs | Term | Observed | Mean | SD | Z score | P value (a) |
|---|---|---|---|---|---|---|
| GO:0008219 | cell death | 197 | 116.651 | 8.8 | 9.1 | 2.12E-16 |
| GO:0007154 | cell communication | 588 | 480.984 | 15.4 | 6.9 | 2.78E-11 |
| GO:0050789 | regulation of biological process | 855 | 748.686 | 15.3 | 7.0 | 4.33E-11 |
| GO:0051704 | multi-organism process | 116 | 66.152 | 6.7 | 7.5 | 1.41E-10 |
| GO:0006928 | cell motion | 113 | 71.586 | 7.0 | 5.9 | 8.45E-07 |
| GO:0050896 | response to stimulus | 321 | 252.834 | 12.2 | 5.6 | 1.42E-06 |
| GO:0007610 | behavior | 86 | 52.189 | 6.0 | 5.6 | 3.96E-06 |
| GO:0009987 | cellular process | 833 | 764.383 | 15.3 | 4.5 | 8.67E-05 |
| GO:0030154 | cell differentiation | 222 | 173.684 | 10.3 | 4.7 | 1.66E-04 |
| GO:0043170 | macromolecule metabolic process | 697 | 628.169 | 15.9 | 4.3 | 1.79E-04 |
| GO:0007275 | multicellular organismal development | 374 | 321.383 | 12.9 | 4.1 | 2.29E-03 |
| GO:0032501 | multicellular organismal process | 222 | 188.816 | 10.8 | 3.1 | 4.20E-02 |
| GO:0005515 | protein binding | 1026 | 874.792 | 14.8 | 10.2 | 1.25E-23 |
| GO:0016301 | kinase activity | 208 | 118.109 | 9.3 | 9.7 | 2.34E-19 |
| GO:0016740 | transferase activity | 213 | 145.362 | 10.3 | 6.6 | 1.52E-09 |
| GO:0004871 | signal transducer activity | 113 | 75.003 | 7.3 | 5.2 | 2.24E-05 |
| GO:0004872 | receptor activity | 208 | 169.553 | 11.0 | 3.5 | 1.22E-02 |
a The estimated P values were adjusted by Bonferroni correction.
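The Z scores in the table are consistent with the usual definition, assuming the three numeric columns before the Z score are an observed count, an expected mean, and a standard deviation (an inference from the values, not something the extracted table states). The sketch below recomputes two rows and shows a generic Bonferroni adjustment; how the authors derived the unadjusted P values and how many tests they corrected for are not given here, so the `bonferroni` helper is illustrative only.

```python
def z_score(observed, mean, sd):
    """Standard score of an observed count against an expected mean and SD."""
    return (observed - mean) / sd

def bonferroni(p_values):
    """Multiply each P value by the number of tests, capping at 1.0."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

# Rows copied from the table above: (observed, mean, sd).
z_cell_death = z_score(197, 116.651, 8.8)          # GO:0008219
z_protein_binding = z_score(1026, 874.792, 14.8)   # GO:0005515

# Hypothetical unadjusted P values, only to show the correction.
adjusted = bonferroni([0.01, 0.5])
```

Because the standard deviations in the table are rounded to one decimal place, recomputed Z scores for some rows can differ from the printed ones in the last digit.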
Over- and under-represented (italics) GO terms when duplicated genes are compared against singleton genes.
| GO IDs | Term | Observed | Mean | SD | Z score | P value (a) |
|---|---|---|---|---|---|---|
| GO:0007154 | cell communication | 2009 | 1819.3 | 17.5 | 10.8 | 5.86E-27 |
| - | ||||||
| - | ||||||
| GO:0050789 | regulation of biological process | 3130 | 3001.9 | 18.3 | 7.0 | 5.91E-11 |
| GO:0007275 | multicellular organismal development | 1346 | 1241.1 | 15.8 | 6.7 | 1.76E-10 |
| GO:0032501 | multicellular organismal process | 787 | 712.5 | 12.4 | 6.0 | 2.34E-08 |
| - | ||||||
| - | ||||||
| GO:0030154 | cell differentiation | 726 | 673.2 | 12.1 | 4.4 | 3.02E-04 |
| GO:0007610 | behavior | 218 | 189.6 | 7.1 | 4.0 | 3.39E-04 |
| GO:0006928 | cell motion | 300 | 267.4 | 8.3 | 3.9 | 6.67E-04 |
| - | ||||||
| - | ||||||
| - | ||||||
| GO:0005488 | binding | 2905 | 2686.899 | 18.3 | 11.9 | 3.85E-29 |
| GO:0004872 | receptor activity | 714 | 608.911 | 12.3 | 8.6 | 2.06E-19 |
| GO:0016301 | kinase activity | 496 | 420.462 | 10.0 | 7.5 | 4.99E-14 |
| GO:0015267 | channel activity | 203 | 159.644 | 6.6 | 6.6 | 1.28E-12 |
| GO:0015075 | ion transmembrane transporter activity | 300 | 247.312 | 8.1 | 6.5 | 2.48E-11 |
| GO:0004871 | signal transducer activity | 315 | 265.132 | 8.2 | 6.1 | 2.76E-09 |
| - | ||||||
| GO:0016787 | hydrolase activity | 870 | 813.748 | 13.8 | 4.1 | 7.61E-04 |
| - | ||||||
| GO:0003774 | motor activity | 68 | 55.557 | 3.7 | 3.4 | 1.09E-02 |
| - | ||||||
a The estimated P values were adjusted by Bonferroni correction.
Over- and under-representation (italics) of GO terms in self-interacting WGD with respect to SSD genes.
| GO IDs | Term | Observed | Mean | SD | Z score | P value (a) |
|---|---|---|---|---|---|---|
| - | ||||||
| GO:0050789 | regulation of biological process | 484 | 456.64 | 8.2 | 3.3 | 0.017011463 |
| GO:0043170 | macromolecule metabolic process | 399 | 372.058 | 9.1 | 2.9 | 0.039105725 |
| GO:0016301 | kinase activity | 143 | 111.112 | 6.7 | 4.8 | 3.34E-05 |
| GO:0016740 | transferase activity | 141 | 113.834 | 6.8 | 4.0 | 1.17E-03 |
| GO:0030528 | transcription regulator activity | 134 | 107.982 | 6.7 | 3.9 | 1.67E-03 |
| - | ||||||
| - | ||||||
| - | ||||||
a The estimated P values were adjusted by Bonferroni correction.