| Literature DB >> 23160177 |
David Alvarez-Ponce1, Mario A Fares.
Abstract
Genes show a bewildering variation in their patterns of molecular evolution, as a result of the action of different levels and types of selective forces. The factors underlying this variation are, however, still poorly understood. In the last decade, the position of proteins in the protein-protein interaction network has been put forward as a determinant factor of the evolutionary rate and duplicability of their encoding genes. This conclusion, however, has been based on the analysis of the limited number of microbes and animals for which interactome-level data are available (essentially, Escherichia coli, yeast, worm, fly, and humans). Here, we study, for the first time, the relationship between the position of proteins in the high-density interactome of a plant (Arabidopsis thaliana) and the patterns of molecular evolution of their encoding genes. We found that genes whose encoded products act at the center of the network are more evolutionarily constrained than those acting at the network periphery. This trend remains significant when potential confounding factors (gene expression level and breadth, duplicability, function, and length of the encoded products) are controlled for. Even though the correlation between centrality measures and rates of evolution is generally weak, for some functional categories, it is comparable in strength to (or even stronger than) the correlation between evolutionary rates and expression levels or breadths. In addition, genes encoding interacting proteins in the network evolve at relatively similar rates. Finally, Arabidopsis proteins encoded by duplicated genes are more highly connected than those encoded by singleton genes. This observation is in agreement with the patterns observed in humans, but in contrast with those observed in E. coli, yeast, worm, and fly (whose duplicated genes tend to act at the periphery of the network), implying that the relationship between duplicability and centrality inverted at least twice during eukaryote evolution. Taken together, these results indicate that the structure of the A. thaliana network constrains the evolution of its components at multiple levels.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23160177 PMCID: PMC3542556 DOI: 10.1093/gbe/evs101
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FCorrelation between ω and betweenness.
Spearman’s Correlations among the Parameters Considered in the Study
| ω | Degree | Closeness | Betweenness | Expression Level | Expression Breadth | |||
|---|---|---|---|---|---|---|---|---|
| 0.732 | ||||||||
| −0.219 | 0.310 | |||||||
| Degree | −0.030 | −0.026 | −0.019 | |||||
| Closeness | −0.034 | −0.032 | 0.001 | 0.412 | ||||
| Betweenness | −0.053 | −0.051 | −0.017 | 0.875 | 0.436 | |||
| Expression level | −0.331 | −0.342 | −0.049 | −0.013 | 0.066 | 0.046 | ||
| Expression breadth | −0.245 | −0.264 | −0.072 | −0.006 | 0.024 | 0.035 | 0.692 | |
| Protein length | 0.050 | −0.009 | −0.124 | −0.017 | −0.042 | −0.023 | −0.088 | 0.012 |
*P < 0.05.
**P < 0.01.
***P < 10−6.
Correlation between Degree and ω for Each Functional Category
| Correlation | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KOG Category | Description | Average ω | Average Degree | Average Expression Level | Average Expression Breadth | ω–Degree | ω–Expression Level | ω–Expression Breadth | |||||
| Comparison | |||||||||||||
| J | Translation, ribosomal structure, and biogenesis | 138 | 0.122 | 2.36 | 1,240.4 | 76.3 | 0.086 | 0.316 | 0.149 | −0.379 | 2.10 × 10−5 | −0.254 | 0.005 |
| A | RNA processing and modification | 148 | 0.169 | 5.89 | 327.4 | 74.3 | −0.162 | 0.049 | 0.105 | −0.360 | 2.06 × 10−5 | −0.211 | 0.015 |
| K | Transcription | 313 | 0.204 | 6.43 | 178.8 | 61.0 | −0.118 | 0.037 | 0.143 | −0.233 | 7.71 × 10−5 | −0.167 | 0.005 |
| L | Replication, recombination, and repair | 73 | 0.162 | 3.85 | 147.9 | 62.7 | −0.203 | 0.086 | 0.138 | −0.223 | 0.074 | −0.171 | 0.173 |
| B | Chromatin structure and dynamics | 45 | 0.167 | 3.91 | 357.6 | 73.2 | −0.357 | 0.016 | 0.025 | −0.436 | 0.003 | −0.198 | 0.202 |
| D | Cell cycle control, cell division, and chromosome partitioning | 107 | 0.189 | 10.24 | 244.6 | 63.5 | −0.092 | 0.344 | 0.526 | −0.444 | 5.09 × 10−6 | −0.487 | 4.26 × 10−7 |
| T | Signal transduction mechanisms | 305 | 0.128 | 6.47 | 223.8 | 65.8 | −0.168 | 0.003 | 0.009 | −0.291 | 5.11 × 10−7 | −0.314 | 5.19 × 10−8 |
| Z | Cytoskeleton | 68 | 0.138 | 4.99 | 472.7 | 66.3 | −0.205 | 0.094 | 0.149 | −0.312 | 0.014 | −0.252 | 0.050 |
| U | Intracellular trafficking, secretion, and vesicular transport | 180 | 0.117 | 4.91 | 369.3 | 73.8 | −0.040 | 0.592 | 0.876 | −0.359 | 2.47 × 10−6 | −0.280 | 2.99 × 10−4 |
| O | Posttranslational modification, protein turnover, and chaperones | 342 | 0.137 | 6.35 | 580.3 | 70.0 | −0.069 | 0.203 | 0.450 | −0.309 | 4.90 × 10−8 | −0.181 | 0.002 |
| C | Energy production and conversion | 121 | 0.106 | 3.24 | 936.5 | 73.9 | −0.197 | 0.031 | 0.067 | −0.342 | 2.72 × 10−4 | −0.418 | 5.99 × 10−6 |
| G | Carbohydrate transport and metabolism | 129 | 0.099 | 4.45 | 803.4 | 71.0 | −0.165 | 0.062 | 0.129 | −0.461 | 1.67 × 10−7 | −0.296 | 0.001 |
| E | Amino acid transport and metabolism | 91 | 0.128 | 2.45 | 720.2 | 73.3 | −0.069 | 0.517 | 0.733 | −0.389 | 5.14 × 10−4 | −0.332 | 0.003 |
| F | Nucleotide transport and metabolism | 32 | 0.103 | 2.97 | 697.2 | 76.0 | −0.063 | 0.733 | 0.861 | −0.130 | 0.518 | 0.025 | 0.900 |
| H | Coenzyme transport and metabolism | 26 | 0.100 | 3.38 | 480.5 | 72.2 | −0.030 | 0.883 | 0.998 | −0.237 | 0.253 | −0.250 | 0.228 |
| I | Lipid transport and metabolism | 63 | 0.141 | 2.49 | 431.0 | 71.9 | −0.165 | 0.197 | 0.292 | −0.443 | 4.41 × 10−4 | −0.249 | 0.055 |
| P | Inorganic ion transport and metabolism | 68 | 0.155 | 3.46 | 465.4 | 67.3 | −0.134 | 0.275 | 0.392 | −0.044 | 0.736 | −0.047 | 0.722 |
| Q | Secondary metabolites biosynthesis, transport, and catabolism | 47 | 0.128 | 2.09 | 308.7 | 57.6 | −0.190 | 0.200 | 0.284 | −0.453 | 0.003 | −0.279 | 0.077 |
| R | General function prediction only | 425 | 0.160 | 4.12 | 297.5 | 69.4 | 0.013 | 0.783 | 0.364 | −0.235 | 3.56 × 10−6 | −0.175 | 6.18 × 10−4 |
| S | Function unknown | 156 | 0.158 | 3.35 | 306.1 | 72.1 | −0.053 | 0.512 | 0.776 | −0.185 | 0.030 | −0.031 | 0.718 |
Note.—Only functional categories comprising more than n = 25 A. thaliana network genes are presented.
aComparison of the correlation for genes within the category versus the correlation for all other network genes. Comparisons were conducted using Fisher’s z-transformation, followed by standard normal comparison. The suitability of this test for Spearman’s correlations is discussed in Myers and Sirois (2006).
*P < 0.05.
**P < 0.01.
***P < 10−6.
FNormalized absolute difference between the ω values of pairs of interacting genes in the network (X). The arrow points to the observed value and the histogram represent the null distribution obtained from 10,000 random networks.
FAverage number of interactors for proteins encoded by singleton, whole-genome duplication (WGD), and small-scale duplication (SSD) genes. Error bars represent the standard error of the means.