| Literature DB >> 19056822 |
Ching-Ti Liu1, Shinsheng Yuan, Ker-Chau Li.
Abstract
Many successful functional studies by gene expression profiling in the literature have led to the perception that profile similarity is likely to imply functional association. But how true is the converse of the above statement? Do functionally associated genes tend to be co-regulated at the transcription level? In this paper, we focus on a set of well-validated yeast protein complexes provided by Munich Information Center for Protein Sequences (MIPS). Using four well-known large-scale microarray expression data sets, we computed the correlations between genes from the same complex. We then analyzed the relationship between the distribution of correlations and the complex size (the number of genes in a protein complex). We found that except for a few large protein complexes, such as mitochondrial ribosomal and cytoplasmic ribosomal proteins, the correlations are on the average not much higher than that from a pair of randomly selected genes. The global impact of large complexes on the expression of other genes in the genome is also studied. Our result also showed that the expression of over 85% of the genes are affected by six large complexes: the cytoplasmic ribosomal complex, mitochondrial ribosomal complex, proteasome complex, F0/F1 ATP synthase (complex V) (size 18), rRNA splicing (size 24) and H+- transporting ATPase, vacular (size 15).Entities:
Mesh:
Substances:
Year: 2008 PMID: 19056822 PMCID: PMC2632914 DOI: 10.1093/nar/gkn972
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Comparison of correlation distributions for protein pairs with respect to functional association (shown in left panel) and complex size (shown in right panel). The terms ‘cc’, ‘yg’, ‘rst’ and ‘st1’ represent four different data sets: cell cycle, segregation genetics, rosetta and stress data, respectively. Protein complex pairs are abbreviated as ‘rel’ and unrelated pairs are abbreviated as ‘unrel’.
Figure 2.(a) Histogram of protein complex size. (b) Boxplots are used to sort out the relationship between the size of protein complex and the median correlation for gene pairs within a complex.
List of highly co-expressed complexes with size ≥15
| Complex name | Complex size |
|---|---|
| Cytoplasmic ribosomal complex | 138 |
| Mitochondrial ribosomal complex | 75 |
| Proteasome complex | 33 |
| rRNA splicing complex | 24 |
| F0/F1 ATP synthase (complex V) | 18 |
| H+-transporting ATPase, vacular | 15 |
The number of genes whose expression level can be explained by complex
| Mitochondrial | Cytoplasmic | rRNA splicing | Proteasome | ATP synthase | H+-transporting | |
|---|---|---|---|---|---|---|
| Cellcycle (5878) | ||||||
| Mitochondrial | 2373 (0.026) | 677 (0) | 1040 (0) | 1901 (0) | 1456 (0) | 1349 (0) |
| Cytoplasmic | 1859 (0) | 1141 (0) | 819 (0) | 805 (0) | 935 (0) | |
| rRNA splicing | 2545 (0) | 1200 (0) | 1049 (0) | 1022 (0) | ||
| Proteasome | 2802 (0) | 1568 (0) | 1490 (0) | |||
| ATP synthase | 2343 (0) | 1680 (0) | ||||
| H+transporting | 2490 (0) | |||||
| Genetic (6229) | ||||||
| Mitochondrial | 3349 (0) | 2116 (0) | 1985 (0) | 2353 (0) | 1652 (0) | 1738 (0) |
| Cytoplasmic | 4008 (0) | 3241 (0) | 2628 (0) | 1635 (0) | 2089 (0) | |
| rRNA splicing | 3933 (0) | 2505 (0) | 1551 (0) | 2019 (0) | ||
| Proteasome | 3734 (0) | 1471 (0) | 1840 (0) | |||
| ATP synthase | 2601 (0) | 1304 (0) | ||||
| H+transporting | 3169 (0) | |||||
| Rosetta (6283) | ||||||
| Mitochondrial | 4001 (0) | 2791 (0) | 2723 (0) | 2094 (0) | 2302 (0) | 2187 (0) |
| Cytoplasmic | 4379 (0) | 3101 (0) | 1923 (0) | 2468 (0) | 2503 (0) | |
| rRNA splicing | 4237 (0) | 1868 (0) | 2433 (0) | 2391 (0) | ||
| Proteasome | 2819 (0) | 1539 (0) | 1510 (0) | |||
| ATP synthase | 3432 (0) | 1933 (0) | ||||
| H+transporting | 3508 (0) | |||||
| Stress (6152) | ||||||
| Mitochondrial | 3083 (0.024) | 3258 (0) | 3283 (0) | 2274 (0) | 3063 (0) | 3123 (0) |
| Cytoplasmic | 5184 (0) | 4795 (0) | 2924 (0) | 4228 (0) | 4592 (0) | |
| rRNA splicing | 5129 (0) | 2864 (0) | 4224 (0) | 4438 (0) | ||
| Proteasome | 3554 (0) | 2579 (0) | 2836 (0) | |||
| ATP synthase | 4642 (0) | 4083 (0) | ||||
| H+transporting | 5053 (0) |
Figure 3.Hierarchical clustering with centroid method. The cluster result using cell-cycle expression data clearly shows three subclusters in the replication protein complex.