| Literature DB >> 23926418 |
Timothy D O'Connor1, Nicholas I Mundy.
Abstract
Variation in substitution rates across a phylogeny can be indicative of shifts in the evolutionary dynamics of a protein or non-protein coding regions. One way to understand these signals is to seek the phenotypic correlates of rate variation. Here, we extended a previously published likelihood method designed to detect evolutionary associations between genotypic evolutionary rate and phenotype over a phylogeny. In simulation with two discrete categories of phenotype, the method has a low false-positive rate and detects greater than 80% of true-positives with a tree length of three or greater and a three-fold or greater change in substitution rate given the phenotype. In addition, we successfully extend the test from two to four phenotype categories and evaluated its performance. We then applied the method to two major hypotheses for rate variation in the mitochondrial genome of primates-longevity and generation time as well as body mass which is correlated with many aspects of life history-using three categories of phenotype through discretization of continuous values. Similar to previous results for mammals, we find that the majority of mitochondrial protein-coding genes show associations consistent with the longevity and body mass predictions and that the predominant signal of association comes from the third codon position. We also found a significant association between maximum lifespan and the evolutionary rate of the control region of the mtDNA. In contrast, 24 protein-coding genes from the nuclear genome do not show a consistent pattern of association, which is inconsistent with the generation time hypothesis. These results show the extended method can robustly identify genotype-phenotype associations up to at least four phenotypic categories, and demonstrate the successful application of the method to study factors affecting neutral evolutionary rate in protein-coding and non-coding loci.Entities:
Keywords: generation time hypothesis; genotype-phenotype; longevity hypothesis; mitochondria; primate
Year: 2013 PMID: 23926418 PMCID: PMC3733722 DOI: 10.4137/EBO.S11600
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Phenotype data summary statistics.
| Variable | N | Mean | |||
|---|---|---|---|---|---|
| Female body mass (ln (grams)) | 210 | 7.11 | 8.68 | 7.72 | 1.46 |
| Time to female sexual maturity (ln (days)) | 112 | 6.68 | 7.29 | 6.90 | 0.68 |
| Maximum lifespan (ln (days)) | 149 | 9.12 | 9.43 | 9.26 | 0.36 |
Notes: The three key traits used with the number of primates used to estimate the percentiles (N) and the percentiles used to discretize the variables into three groups. P0.33 and P0.66 are the first third and second third log values of the distribution. The discrete phenotypic mapping was: 0 (x < P0.33), 1 (P0.33 < x < P0.66), and 2 (P0.66 < x).
Figure 1Results of simulation for binary phenotype. The Z-axis is percentage significant out of 50 tests done for each permutation of tree length (sum of all branch lengths) and W1. Tree length and W1 take on values from 1 to 10 at each integer. All tests have W0 set to 1, a 16 taxa tree, and 50% sites associated.
Strict vs. relaxed alternative simulations.
| Taxa | Hypothesis | Model | χ20.05 | χ20.01 | |
|---|---|---|---|---|---|
| 16 | 2 | Null | 3.5 | 0.0 | |
| 16 | 2 | Null | 2.0 | 0.0 | |
| 16 | 3 | Null | 2.5 | 0.5 | |
| 16 | 3 | Null | 1.0 | 0.5 | |
| 16 | 4 | Null | 0.5 | 0.0 | |
| 16 | 4 | Null | 0.0 | 0.0 | |
| 16 | 2 | Alt. | 81.0 | 71.5 | |
| 16 | 2 | Alt. | 86.5 | 71.5 | |
| 16 | 3 | Alt. | 82.5 | 68.0 | |
| 16 | 3 | Alt. | 87.5 | 74.0 | |
| 16 | 4 | Alt. | 69.5 | 53.5 | |
| 16 | 4 | Alt. | 67.0 | 53.5 | |
| 32 | 2 | Null | 4.0 | 1.5 | |
| 32 | 2 | Null | 4.5 | 1.0 | |
| 32 | 3 | Null | 3.5 | 0.0 | |
| 32 | 3 | Null | 0.5 | 0.0 | |
| 32 | 4 | Null | 0.5 | 0.5 | |
| 32 | 4 | Null | 1.5 | 0.0 | |
| 32 | 2 | Alt. | 88.0 | 78.5 | |
| 32 | 2 | Alt. | 91.5 | 78.5 | |
| 32 | 3 | Alt. | 84.0 | 73.5 | |
| 32 | 3 | Alt. | 85.5 | 76.5 | |
| 32 | 4 | Alt. | 78.5 | 67.5 | |
| 32 | 4 | Alt. | 66.0 | 50.5 |
Notes: Simulation results from a symmetrical tree for either 16 or 32 taxa. N are the number of phenotypes being tested simulated under either the null or alternative hypotheses (Alt.). These are examined with a test using a strict (W < W+1) or relaxed (W ≠ 6 W+1) alternative hypothesis. The ALRSs are compared against a χ2 with degrees of freedom equal to the number of phenotypes minus one and here are reported as the percentage out of 200 simulated tests that were significant.
Results of mitochondrial coding and non-protein-coding genes under three different phenotypic hypotheses.
| Gene | N | Length | Sexual maturity | Max. lifespan | Body mass | |||
|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||
| ALRS | ALRS | ALRS | ||||||
| Protein-coding | ||||||||
| | 26 | 678 | 1.962 | 1.00 | 38.518 | 1.8 × 10−7** | 21.887 | 7.2 × 10−4** |
| | 26 | 207 | 0.201 | 1.00 | −0.056 | 1.00 | 0.935 | 1.00 |
| | 26 | 1536 | 1.648 | 1.00 | 56.729 | 2.0 × 10−11** | 14.735 | 2.6 × 10−2* |
| | 26 | 681 | 0.485 | 1.00 | 0.000 | 1.00 | 0.001 | 1.00 |
| | 26 | 781 | 0.001 | 1.00 | 72.172 | 8.7 × 10−15** | 19.758 | 2.1 × 10−3** |
| | 26 | 1141 | 1.457 | 1.00 | 67.133 | 1.1 × 10−13** | 33.677 | 2.0 × 10−6** |
| | 26 | 954 | 17.741 | 6.0 × 10−3** | 49.468 | 7.4 × 10−10** | 18.118 | 4.8 × 10−3** |
| | 26 | 1057 | 0.033 | 1.00 | 58.806 | 7.0 × 10−12** | −0.019 | 1.00 |
| | 26 | 344 | 4.394 | 1.00 | 39.372 | 1.2 × 10−7** | 27.643 | 4.1 × 10−5** |
| | 26 | 1378 | −0.037 | 1.00 | 115.289 | 3.8 × 10−24** | 14.915 | 2.4 × 10−2* |
| | 26 | 294 | 0.037 | 1.00 | 4.039 | 1.00 | 6.657 | 1.00 |
| | 26 | 1812 | 21.683 | 1.0 × 10−3** | 192.480 | 6.5 × 10−41** | 39.047 | 1.4 × 10−7** |
| | 26 | 528 | −0.001 | 1.00 | 7.762 | 0.85 | 3.120 | 1.00 |
| Other | ||||||||
| | 26 | 1031 | 0.015 | 1.00 | −0.023 | 1.00 | −0.001 | 1.00 |
| | 26 | 1680 | 0.000 | 1.00 | 0.536 | 1.00 | 0.602 | 1.00 |
| | 26 | 1636 | 1.646 | 1.00 | 0.000 | 1.00 | 0.000 | 1.00 |
| | 24 | 1746 | −0.007 | 1.00 | 21.806 | 1.0 × 10−3** | −0.023 | 1.00 |
Notes: Time to sexual maturity, maximum lifespan, and body mass are used as indicators for the generation time, longevity, and general life history hypotheses respectively. P-values are corrected by Bonferroni multiple test correction (N = 41). Significant values with a P-value less than 0.05 or 0.01 are signified with * and ** respectively using a χ2 with two degrees of freedom. The control region is signified as “Cont. reg.”.
ALRS by codon position.
| Gene | Phenotype | Codon position ALRS | |||
|---|---|---|---|---|---|
|
| |||||
| 1st | 2nd | 3rd | 3rd—1st—3rd | ||
| Body mass | 7.06 | −15.98 | 30.81 | 39.73 | |
| Body mass | −30.98 | −20.47 | 66.18 | 117.63 | |
| Body mass | −2.63 | −8.89 | 31.06 | 42.58 | |
| Body mass | 5.16 | −33.56 | 62.09 | 90.49 | |
| Body mass | −13.86 | −6.8 | 38.78 | 59.44 | |
| Body mass | 1.71 | 2.8 | 23.07 | 18.57 | |
| Body mass | −22.28 | −21.2 | 58.17 | 101.65 | |
| Body mass | 1.35 | −53.1 | 90.47 | 142.22 | |
| Body mass | 6.69 | 6.5 | 10.22 | −2.97 | |
| Body mass | 11.14 | 25.96 | 12.68 | −24.41 | |
| Body mass | 4.52 | 7.48 | 2.33 | −9.67 | |
| Max. lifespan | −9.89 | −13.11 | 61.52 | 84.52 | |
| Max. lifespan | −39.06 | −28.49 | 124.28 | 191.84 | |
| Max. lifespan | −12.11 | 1.13 | 82.96 | 93.94 | |
| Max. lifespan | −17.38 | −29.70 | 114.27 | 161.35 | |
| Max. lifespan | −9.61 | −48.24 | 107.32 | 165.17 | |
| Max. lifespan | 2.26 | −42.45 | 98.99 | 139.18 | |
| Max. lifespan | −20.23 | 3.03 | 56.57 | 73.77 | |
| Max. lifespan | −11.72 | −33.84 | 160.6 | 206.15 | |
| Max. lifespan | 32.68 | −46.87 | 206.67 | 220.87 | |
| Max. lifespan | 15.07 | 33.12 | 14.84 | −33.35 | |
| Sexual maturity | 18.72 | −4.91 | 3.93 | −9.87 | |
| Sexual maturity | 19.25 | 14.96 | −12.53 | −46.74 | |
| Sexual maturity | 1.54 | 10.88 | 3.44 | −8.98 | |
| Sexual maturity | 9.06 | −0.14 | 10.63 | 1.71 | |
Notes: Break down of the ALRS by codon position for significant results (see Tables 3 and 4). Negative values occur because the parameters are optimized for the entire gene. The final column shows the strength of signal from the third codon position compared to the other two. Genes from the mitochondrial genome are in bold.
Results of nuclear genes under three different phenotypic hypotheses.
| Gene | N | Length | Sexual maturity | Max. lifespan | Body mass | |||
|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||
| ALRS | ALRS | ALRS | ||||||
| Male associated | ||||||||
| | 13 | 1161 | −0.105 | 1.00 | 2.054 | 1.00 | 2.463 | 1.00 |
| | 11 | 2893 | −0.005 | 1.00 | −0.012 | 1.00 | 23.393 | 3.4 × 10−4** |
| | 20 | 2514 | 13.010 | 6.1 × 10−2 | 63.034 | 8.4 × 10−13** | 49.779 | 6.4 × 10−10** |
| | 18 | 315 | 0.000 | 1.00 | 0.133 | 1.00 | 1.158 | 1.00 |
| | 12 | 786 | 0.230 | 1.00 | 0.185 | 1.00 | 0.191 | 1.00 |
| | 13 | 441 | −0.035 | 1.00 | −0.001 | 1.00 | −0.001 | 1.00 |
| | 11 | 786 | −0.007 | 1.00 | 2.212 | 1.00 | −0.517 | 1.00 |
| | 14 | 2649 | 15.865 | 1.5 × 10−2* | 4.458 | 1.00 | 14.330 | 3.2 × 10−2* |
| | 16 | 4245 | 12.635 | 7.4 × 10−2 | 0.100 | 1.00 | 4.831 | 1.00 |
| | 37 | 648 | 1.584 | 1.0 | 12.568 | 7.7 × 10−2 | −0.014 | 1.00 |
| | 15 | 2055 | 0.893 | 1.00 | 0.000 | 1.00 | 0.000 | 1.00 |
| | 15 | 1479 | 0.000 | 1.00 | 0.000 | 1.00 | 0.000 | 1.00 |
| | 14 | 1212 | 19.557 | 2.3 × 10−3** | 3.687 | 1.00 | −0.005 | 1.00 |
| | 16 | 555 | 0.000 | 1.00 | 0.761 | 1.00 | 2.460 | 1.00 |
| Mt complex IV | ||||||||
| | 19 | 438 | 2.790 | 1.00 | 0.251 | 1.00 | 0.064 | 1.00 |
| | 19 | 459 | 0.467 | 1.00 | 0.000 | 1.00 | 0.000 | 1.00 |
| | 12 | 228 | 0.219 | 1.00 | 3.247 | 1.00 | −0.225 | 1.00 |
| | 10 | 243 | 4.499 | 1.00 | 7.640 | 0.90 | 8.507 | 0.58 |
| | 12 | 192 | 3.528 | 1.00 | 4.609 | 1.00 | 0.122 | 1.00 |
| | 19 | 210 | 1.070 | 1.00 | 0.669 | 1.00 | 3.809 | 1.00 |
| Other | ||||||||
| | 14 | 3963 | −0.003 | 1.00 | −0.003 | 1.00 | −0.004 | 1.00 |
| | 8 | 444 | 0.784 | 1.00 | 1.598 | 1.00 | 0.021 | 1.00 |
| | 26 | 447 | −0.001 | 1.00 | 0.002 | 1.00 | 0.665 | 1.00 |
| | 18 | 954 | 2.271 | 1.00 | 4.287 | 1.00 | 0.753 | 1.00 |
Notes: Male associated genes are genes expressed in semen, found on the Y chromosome, or implicated in sperm competition. P-values are corrected by Bonferroni multiple test correction (N = 41). Significant values with a P-value less than 0.05 or 0.01 are signified with * and ** respectively using a χ2 with two degrees of freedom.