| Literature DB >> 23563973 |
Ting-Yan Chang1, Ben-Yang Liao.
Abstract
Understanding the general rules governing the rate of protein evolution is fundamental to evolutionary biology. However, attempts to address this issue in yeasts and mammals have revealed considerable differences in the relative importance of determinants for protein evolutionary rates. This phenomenon was previously explained by the fact that yeasts and mammals are different in many cellular and genomic properties. Flagellated algae species have several cellular and genomic characteristics that are intermediate between yeasts and mammals. Using partial correlation analyses on the evolution of 6,921 orthologous proteins from Chlamydomonas reinhardtii and Volvox carteri, we examined factors influencing evolutionary rates of proteins in flagellated algae. Previous studies have shown that mRNA abundance and gene compactness are strong determinants for protein evolutionary rates in yeasts and mammals, respectively. We show that both factors also influence algae protein evolution with mRNA abundance having a larger impact than gene compactness on the rates of algae protein evolution. More importantly, among all the factors examined, coding sequence (CDS) length has the strongest (positive) correlation with protein evolutionary rates. This correlation between CDS length and the rates of protein evolution is not due to alignment-related issues or domain density. These results suggest no simple and universal rules governing protein evolutionary rates across different eukaryotic lineages. Instead, gene properties influence the rate of protein evolution in a lineage-specific manner.Entities:
Keywords: expression level; functional density; gene compactness; mRNA abundance; protein length
Mesh:
Substances:
Year: 2013 PMID: 23563973 PMCID: PMC3673635 DOI: 10.1093/gbe/evt055
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Spearman’s Rank Correlations of Chlamydomonas reinhardtii Gene Properties with dN and dN/dS
| Gene Properties | All 6,921 Orthologs | 3,480 Orthologs | ||
|---|---|---|---|---|
| ρ ( | ρ ( | ρ ( | ρ ( | |
| 0.549 (<10−300) | 0.401 (<10−265) | 0.552 (<10−275) | 0.403 (<10−135) | |
| # | 0.269 (<10−114) | 0.191 (<10−57) | 0.281 (<10−63) | 0.205 (<10−33) |
| Gene expression level | ||||
| | NA | NA | −0.546 (<10−268) | −0.356 (<10−104) |
| | −0.287 (<10−130) | −0.184 (<10−52) | −0.281 (<10−63) | −0.175 (<10−24) |
| Gene compactness | ||||
| | −0.055 (<10−5) | −0.069 (<10−8) | −0.085 (<10−6) | −0.075 (<10−5) |
| 5′- | 0.154 (<10−37) | 0.073 (<10−8) | 0.147 (<10−17) | 0.064 (<10−3) |
| 3′- | −0.020 (0.09) | −0.032 (<10−2) | −0.020 (0.23) | −0.038 (0.03) |
Note.—NA, not applicable.
aAll gene properties were based on C. reinhardtii genes.
bThe subset of one-to-one orthologs with RNA-seq expression levels in C. reinhardtii genes.
cP values show the probabilities of the observations under the hypothesis of no correlation.
FThe network showing the inter-relatedness of the gene properties in (A) Chlamydomonas reinhardtii or (B) Volvox carteri. Gene properties are represented by nodes. Edges between nodes represent a highly significant rank correlation coefficient (ρ) between the two corresponding features. Edge thickness corresponds to the magnitude of ρ; edge color corresponds to the sign of ρ (orange = positive, blue = negative). Volvox carteri genes have no ExpLevmRNA data.
Partial Rank Correlations of the Gene Properties with dN or dN/dS After Controlling for All the Other Gene Properties
| Gene Properties | ||||
|---|---|---|---|---|
| ρp ( | ρp ( | ρp ( | ρp ( | |
| 0.340 (<10−99) | 0.247 (<10−50) | 0.592 (<10−300) | 0.446 (<10−188) | |
| # | 0.041 (0.01) | 0.074 (<10−4) | 0.035 (0.04) | −0.050 (<10−2) |
| Gene expression level | ||||
| | −0.276 (<10−63) | −0.128 (<10−13) | NA | NA |
| | −0.140 (<10−16) | −0.067 (<10−4) | −0.173 (<10−24) | −0.043 (0.01) |
| Gene compactness | ||||
| | −0.077 (<10−5) | −0.062 (<10−3) | −0.016 (0.34) | −0.039 (0.02) |
| 5′- | −0.002 (0.93) | −0.063 (<10−3) | −0.073 (<10−4) | 0.025 (0.13) |
| 3′- | −0.019 (0.25) | −0.012 (0.47) | −0.099 (<10−8) | −0.030 (0.08) |
Note.—NA, not applicable.
aAll gene properties were based on C. reinhardtii genes.
bGene properties can be defined based on C. reinhardtii genes or Volvox carteri genes, as indicated.
cP values show the probabilities of the observations under the hypothesis of no correlation.
Effect of Alignment on Rank Correlations of CDS Length or ExpLevmRNA with dN and dN/dS
| Orthologs | ClustalW | MUSCLE | ||
|---|---|---|---|---|
| ρ ( | ρ ( | ρ ( | ρ ( | |
| Similar | ||||
| | 0.687 (<10−243) | 0.483 (<10−101) | 0.666 (<10−222) | 0.444 (<10−84) |
| | −0.647 (<10−206) | −0.361 (<10−54) | −0.636 (<10−197) | −0.319 (<10−41) |
| Dissimilar | ||||
| | 0.390 (<10−63) | 0.318 (<10−41) | 0.371 (<10−57) | 0.295 (<10−35) |
| | −0.418 (<10−73) | −0.340 (<10−47) | −0.410 (<10−70) | −0.323 (<10−42) |
| Strongly alignable (bottom 50% | ||||
| | 0.661 (<10−218) | 0.467 (<10−94) | 0.592 (<10−163) | 0.361 (<10−53) |
| | −0.630 (<10−192) | −0.346 (<10−49) | −0.573 (<10−151) | −0.236 (<10−22) |
| Poorly aligned (top 50% | ||||
| | 0.382 (<10−60) | 0.311 (<10−39) | 0.371 (<10−57) | 0.319 (<10−41) |
| | −0.414 (<10−72) | −0.340 (<10−47) | −0.416 (<10−73) | −0.353 (<10−51) |
aP values show the probabilities of the observations under the hypothesis of no correlation.
F(A) Genes with longer CDS (larger CDS Length) encode proteins with a lower percent of the total sequence length annotated as protein domains (domain%). Controlling for domain%, CDS Length remains positively correlated with (B) dN and (C) dN/dS. The linear regression line and the Spearman’s rank correlation coefficient are shown for each domain% bin.