| Literature DB >> 20829394 |
Tanusree Bhattacharya1, Tapash Chandra Ghosh.
Abstract
It has previously been reported that protein complexity (i.e. number of subunits in a protein complex) is negatively correlated to gene duplicability in yeast as well as in humans. However, unlike in yeast, protein connectivity in a protein-protein interaction network has a positive correlation with gene duplicability in human genes. In the present study, we have analyzed 1732 human and 1269 yeast proteins that are present both in a protein-protein interaction network as well as in a protein complex network. In the human case, we observed that both protein connectivity and protein complexity complement each other in a mutually exclusive manner over gene duplicability in a positive direction. Analysis of human haploinsufficient proteins and large protein complexes (complex size >10) shows that when protein connectivity does not have any direct association with gene duplicability, there exists a positive correlation between gene duplicability and protein complexity. The same trend, however, is not found in case of yeast, where both protein connectivity and protein complexity independently guide gene duplicability in the negative direction. We conclude that the higher rate of duplication of human genes may be attributed to organismal complexity either by increasing connectivity in the protein-protein interaction network or by increasing protein complexity.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20829394 PMCID: PMC2955712 DOI: 10.1093/dnares/dsq019
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Connectivity, complexity and average number of paralogs per gene across haploinsufficient proteins and after removal of haploinsufficient proteins in human
| Haploinsufficint proteins present in both the protein–protein interaction network and protein complex network | After removing haploinsufficient proteins from proteins present in both the protein–protein interaction network and protein complex network | Level of significance | |
|---|---|---|---|
| Average complexity | 8.6511 | 12.9321 | |
| Average interaction | 26.3720 | 27.2717 | |
| Average number of paralogs per gene | 1.8720 | 2.2857 |
Note. Average complexity means average number of subunits per complex in a particular group of proteins and average interactions indicate average protein connectivity in the protein–protein interaction network of a particular group of proteins.
Figure 1Average number of paralogs per gene of highly connected (hub) and lowly connected (non-hub) protein present in the protein complex.
Correlation between protein connectivity and gene duplicability as well as between gene duplicability and protein complexity in large complexes in human genes
| Spearman rank correlation for Gene duplicability | Level of significance | Partial correlation for gene duplicability with control | Level of significance | |
|---|---|---|---|---|
| Protein complexity | 0.150 | 0.179 (Connectivity control) | ||
| Protein connectivity | 0.075 | 0.124 (Complexity control) |
Figure 2The distribution showing the relation of gene duplicability in overall complex, large complex and protein connectivity in human.
Analysis of human Spliceosome, Nop56p-associated pre-rRNA and overall complex to estimate the percentage of overhang generated in each case
| Name | Total subunit | Total subunit duplicated | Number of paralogs produced | Percentage of overhang produced (%) |
|---|---|---|---|---|
| Spliceosome | 148 | 36 | 47 | 61.70 |
| Nop56p-associated pre-rRNA | 104 | 50 | 134 | 89.6 |
| Overall complex | 2106 | 720 | 1290 | 71.78 |
Figure 3Comparison of relationship between protein connectivity, protein complexity over gene duplicability (A) when both are influencing gene duplication; (B) when only complexity influencing gene duplication.