| Literature DB >> 22056313 |
Corey M Hudson1, Emily E Puckett, Michaël Bekaert, J Chris Pires, Gavin C Conant.
Abstract
The evolutionary origins of the multitude of duplicate genes in the plant genomes are still incompletely understood. To gain an appreciation of the potential selective forces acting on these duplicates, we phylogenetically inferred the set of metabolic gene families from 10 flowering plant (angiosperm) genomes. We then compared the metabolic fluxes for these families, predicted using the Arabidopsis thaliana and Sorghum bicolor metabolic networks, with the families' duplication propensities. For duplications produced by both small scale (small-scale duplications) and genome duplication (whole-genome duplications), there is a significant association between the flux and the tendency to duplicate. Following this global analysis, we made a more fine-scale study of the selective constraints observed on plant sodium and phosphate transporters. We find that the different duplication mechanisms give rise to differing selective constraints. However, the exact nature of this pattern varies between the gene families, and we argue that the duplication mechanism alone does not define a duplicated gene's subsequent evolutionary trajectory. Collectively, our results argue for the interplay of history, function, and selection in shaping the duplicate gene evolution in plants.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22056313 PMCID: PMC3240960 DOI: 10.1093/gbe/evr115
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPlant species used in reconciling gene trees. The phylogenetic relationships (branch lengths are arbitrary) among these species have been described previously (Moore et al. 2007; Paterson et al. 2009). The histograms depict the number of duplications per gene family. Thus, on the x-axes is the number of duplicates observed in a family at that node in the tree (on a natural log scale). The y-axes are then the frequency of families with that number of duplications. The scale is consistent across histograms. Black circles indicate whole-genome duplication events.
FIon transporter gene trees used in this study. Branches demarcating speciation events are colored orange, whole-genome duplication events green, and non-WGDs purple. (a) High-affinity phosphate transporters AtPHT1; 1-AtPHT1; 9 with 16 P. trichocarpa and 7 C. papaya homologs. (b) Low-affinity phosphate transporters AtPHT2; 1 with 2 poplar and 1 papaya homologs. (c) Mitochondrial phosphate transporters AtPHT3; 1-ATPHT3; 3 with 6 poplar and 1 papaya homologs. (d) Chloroplast phosphate transporters AtPHT4; 1-AtPHT4; 6 with 10 poplar and 6 papaya homologs. (e) Sodium ion transporters AtNHX1-AtNHX6, AtNHX8 with 6 poplar and 4 papaya homologs.
Correlations Between Duplication and Flux by Gene Family
| All Flux Values | Excluding Null-Flux | |||
| Duplications per gene family | ||||
| All conditions | 0.245 | <10−15 | 0.336 | <10−15 |
| C3 leaves | 0.218 | <10−15 | 0.328 | <10−15 |
| C4 leaves | 0.176 | <10−8 | 0.218 | <10−4 |
| Roots | 0.223 | <10−15 | 0.359 | <10−15 |
| Duplications per species per gene family | ||||
| All conditions | 0.227 | <10−15 | 0.306 | <10−15 |
| C3 leaves | 0.203 | <10−15 | 0.272 | <10−14 |
| C4 leaves | 0.163 | <10−7 | 0.206 | <10−4 |
| Roots | 0.211 | <10−15 | 0.342 | <10−15 |
Flux values equaling 0 can have confounding biological and computational meanings.
Spearman's r.
Correlations and statistical significance calculated in R.
Number of duplication events per gene family divided by the number of species in that family.
Duplication Status per Gene Family Split by Cellular Compartment
| Duplication versus Flux | Duplication | ||||
| Cellular Compartment | |||||
| Nucleus | 56 | 0.320 | 0.016 | 3.481 | |
| Cytosol | 74 | 0.133 | 0.258 | 4.910 | |
| Chloroplast and plastid | 273 | 0.275 | −0.646 | 0.518 | |
| Mitochondria | 134 | 0.434 | 1.012 | 0.311 | |
| Plasma membrane | 97 | 0.135 | 0.187 | 7.371 | |
| Endoplasmic reticulum | 44 | 0.304 | 0.045 | −1.161 | 0.246 |
| Golgi apparatus | 12 | 0.401 | 0.196 | 2.280 | 0.023 |
| Cell wall | 52 | 0.343 | 0.013 | 3.210 | |
| Extracellular | 51 | 0.089 | 0.532 | 5.115 | |
Bold values are significant at a Bonferroni corrected α = 0.0055.
Duplications per gene family versus the maximum flux.
Wilcoxon rank test of difference across compartments (positive values: overduplication; negative values: underduplication).
Spearman's r, calculated in SAS (v9.2.2, Cary, NC).
Wilcoxon's Z, calculated in SAS (v9.2.2, Cary, NC).
Duplication Status per Gene Family Split by Functional Annotation
| Duplication versus Flux | Duplication | ||||
| Function | |||||
| Cell organization and biogenesis | 29 | 0.209 | 0.274 | 1.010 | 0.312 |
| Developmental processes | 20 | −0.132 | 0.578 | 1.693 | 0.090 |
| DNA or RNA binding or metabolism | 26 | 0.760 | −2.284 | 0.022 | |
| Electron transport | 7 | 0.860 | 0.013 | 0.460 | 0.645 |
| Hydrolase activity | 114 | 0.310 | <0.001 | −1.661 | 0.097 |
| Kinase activity | 62 | −0.031 | 0.817 | 1.167 | 0.243 |
| Nucleic acid or Nucleotide binding | 94 | 0.062 | 0.554 | 0.011 | 0.991 |
| Protein binding or metabolism | 121 | 0.139 | 0.128 | 1.463 | 0.143 |
| Signal transduction | 13 | 0.104 | 0.735 | 2.450 | 0.014 |
| Stimulus or stress response | 199 | 0.302 | 2.650 | 0.008 | |
| Transferase activity | 166 | 0.211 | 0.006 | −0.833 | 0.405 |
| Transporters or transport | 56 | −0.034 | 0.801 | 1.755 | 0.079 |
Bold values are significant at a Bonferroni corrected α = 0.0042.
Duplications per gene family versus the maximum flux.
Wilcoxon rank test of difference across compartments (positive values: overduplication; negative values: underduplication).
Spearman's r, calculated in SAS (v9.2.2, Cary, NC).
Wilcoxon's Z, calculated in SAS (v9.2.2, Cary, NC).
Selective Constraint Estimated with Three Models of Gene Evolution for Ion Transporters of A. thaliana, C. papaya, and P. trichocarpa
| PHT1–High-Affinity Phosphate Transporter | PHT2–Low-Affinity Phosphate Transporter | PHT3-Mitochondrial Phosphate Transporter | PHT4-Chloroplast Phosphate Transporter | NHX-Sodium Ion Transporter | |||||||
| Model | Branches | −lnL | −lnL | −lnL | −lnL | −lnL | |||||
| R_Null | All | 0.076 | 0.207 | 0.114 | 0.148 | 0.049 | |||||
| 15379.5 | 3577.0 | 5898.3 | 24869.3 | 11210.1 | |||||||
| R_Dupl | Speciation | ||||||||||
| Duplication | |||||||||||
| 15376.7 | 3570.2 | 5894.1 | 24847.0 | 11188.5 | |||||||
| R_WGD | Speciation | 0.063 | — | 0.123 | |||||||
| WGD | 0.081 | — | 0.233 | ||||||||
| SSD | 0.085 | — | 0.265 | ||||||||
| 15376.6 | — | 5891.2 | 24846.7 | 11175.3 | |||||||
Bold values indicate a significant improvement over the model immediately above at P < 0.05; nested likelihood ratio test (distributed χ2, P < 0.05, degrees of freedom = 1).
No small scale duplications in PHT2, so model R_Dupl is equivalent to model R_WGD.
WGD: determined by syntenic paralogy using the Plant Genome Duplication Database (Tang, Bowers, et al. 2008).
SSD: determined either by a lack of syntenic paralogy and/or by tandem duplication status.