| Literature DB >> 22844194 |
Gideon C-T Wu1, Feng-Chi Chen.
Abstract
What causes the variations in evolutionary rates is fundamental to molecular evolution. However, in plants, the causes of within-gene evolutionary rate variations remain underexplored. Here we use the principal component regression to examine the contributions of eleven exon features to the within-gene variations in nonsynonymous substitution rate (d(N)), synonymous substitution rate (d(S)), and the d(N)/d(S) ratio in Arabidopsis species. We demonstrate that exon features related to protein structural-functional constraints and mRNA splicing account for the largest proportions of within-gene variations in d(N)/d(S) and d(N). Meanwhile, for d(S), a combination of expression level, exon length, and structural-functional features explains the largest proportion of within-gene variances. Our results suggest that the determinants of within-gene variations differ from those of between-gene variations in evolutionary rates. Furthermore, the relative importance of different exon features also differs between plants and animals. Our study thus may shed a new light on the evolution of plant genes.Entities:
Keywords: Arabidopsis thaliana; exonic evolutionary rates; nonsynonymous substitution rate; principal component regression; synonymous substitution rate
Year: 2012 PMID: 22844194 PMCID: PMC3399485 DOI: 10.4137/EBO.S9743
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
The Pearson’s coefficient of correlation between each of the eleven exon features and dN/dS, dN, and dS.
| Exon feature | Pearson’s coefficient of correlation (rank) | ||
|---|---|---|---|
|
| |||
| % Pfam domain | −0.2338 (3) | −0.2292 (5) | 0.0126 (8) |
| % solvent-accessible amino acid residues | 0.2372 (2) | 0.2434 (3) | 0.0055 (10) |
| % intrinsically disordered regions | 0.2694 (1) | 0.2842 (2) | 0.0165 (7) |
| ASE/CSE exon type | −0.0294 (11) | −0.0210 (10) | 0.0171 (5) |
| Weighted exon frequency | −0.0309 (10) | −0.0232 (9) | 0.0169 (6) |
| 5′ intron length | 0.0496 (7) | 0.0475 (7) | −0.0068 (9) |
| 3′ intron length | 0.0421 (8) | 0.0096 (11) | −0.0537 (3) |
| Exon length | 0.2180 (4) | 0.3133 (1) | 0.1560 (1) |
| Exonic expression level | −0.1852 (5) | −0.2382 (4) | −0.0892 (2) |
| Exon duplicability | −0.0625 (6) | −0.0898 (6) | −0.0394 (4) |
| G+C content | −0.0349 (9) | −0.0356 (8) | −0.0016 (11) |
Notes:
The number in the parenthesis indicates the “rank” of the specific exon feature according to the absolute value of the coefficient of correlation;
statistical significance.
P < 0.05;
P < 10−3;
P < 10−6;
P < 10−9.
The percent contributions of individual features to the principal components in the within-gene analysis.
| Exon features | Comp 1 | Comp 2 | Comp 3 | Comp 4 | Comp 5 | Comp 6 | Comp 7 | Comp 8 | Comp 9 | Comp 10 | Comp 11 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASE/CSE exon type | 0.070 | 0.421 | 0.002 | 0.001 | 0.003 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 | 0.501 |
| Weighted exon frequency (WEF) | 0.070 | 0.418 | 0.002 | 0.001 | 0.007 | 0.002 | 0.001 | 0.000 | 0.000 | 0.000 | 0.498 |
| Average exonic expression level | 0.042 | 0.018 | 0.454 | 0.000 | 0.004 | 0.006 | 0.000 | 0.000 | 0.462 | 0.013 | 0.000 |
| 5′ intron length | 0.014 | 0.000 | 0.017 | 0.088 | 0.330 | 0.161 | 0.362 | 0.025 | 0.002 | 0.001 | 0.000 |
| 3′ intron length | 0.001 | 0.000 | 0.003 | 0.422 | 0.161 | 0.001 | 0.389 | 0.010 | 0.012 | 0.001 | 0.000 |
| Exon length | 0.092 | 0.028 | 0.330 | 0.005 | 0.001 | 0.000 | 0.063 | 0.002 | 0.434 | 0.046 | 0.000 |
| G+C content | 0.005 | 0.001 | 0.017 | 0.428 | 0.252 | 0.089 | 0.133 | 0.017 | 0.038 | 0.020 | 0.000 |
| Exon duplicability | 0.000 | 0.013 | 0.001 | 0.022 | 0.216 | 0.708 | 0.035 | 0.000 | 0.002 | 0.000 | 0.001 |
| % solvent-accessible amino acid residues | 0.208 | 0.031 | 0.087 | 0.030 | 0.024 | 0.029 | 0.000 | 0.270 | 0.032 | 0.289 | 0.000 |
| % intrinsically disordered regions | 0.298 | 0.042 | 0.053 | 0.002 | 0.001 | 0.002 | 0.002 | 0.022 | 0.012 | 0.566 | 0.000 |
| % Pfam domain | 0.198 | 0.027 | 0.035 | 0.000 | 0.001 | 0.001 | 0.013 | 0.655 | 0.006 | 0.063 | 0.000 |
Notes:
The numbers larger than 0.2 are color-shaded. The color code in the table is the same as that used in Figure 1. eleven principal components in the within-gene analysis are given in Supplementary Figure S2.
The percent contributions of individual features to the principal components in the between-gene analysis.
| Exon features | Comp 1 | Comp 2 | Comp 3 | Comp 4 | Comp 5 | Comp 6 | Comp 7 | Comp 8 | Comp 9 | Comp 10 | Comp 11 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ASE/CSE exon type | 0.003 | 0.485 | 0.000 | 0.005 | 0.001 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | 0.500 |
| Weighted exon frequency (WEF) | 0.003 | 0.483 | 0.001 | 0.006 | 0.000 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | 0.500 |
| Average exonic expression level | 0.094 | 0.001 | 0.409 | 0.004 | 0.003 | 0.003 | 0.003 | 0.001 | 0.481 | 0.002 | 0.000 |
| 5′ intron length | 0.018 | 0.009 | 0.002 | 0.301 | 0.080 | 0.058 | 0.525 | 0.001 | 0.000 | 0.000 | 0.000 |
| 3′ intron length | 0.008 | 0.004 | 0.003 | 0.385 | 0.097 | 0.035 | 0.458 | 0.005 | 0.008 | 0.000 | 0.000 |
| Exon length | 0.137 | 0.006 | 0.259 | 0.008 | 0.071 | 0.026 | 0.001 | 0.050 | 0.422 | 0.025 | 0.000 |
| G+C content | 0.000 | 0.000 | 0.094 | 0.207 | 0.267 | 0.355 | 0.004 | 0.014 | 0.061 | 0.014 | 0.000 |
| Exon duplicability | 0.010 | 0.004 | 0.006 | 0.014 | 0.462 | 0.480 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 |
| % solvent accessible amino acid residues | 0.187 | 0.003 | 0.127 | 0.062 | 0.013 | 0.035 | 0.001 | 0.300 | 0.025 | 0.248 | 0.000 |
| % intrinsically disordered regions | 0.302 | 0.003 | 0.076 | 0.008 | 0.005 | 0.000 | 0.000 | 0.006 | 0.002 | 0.600 | 0.000 |
| % Pfam domain | 0.237 | 0.002 | 0.024 | 0.000 | 0.002 | 0.000 | 0.002 | 0.622 | 0.001 | 0.109 | 0.000 |
Notes:
The numbers larger than 0.2 are color-shaded. The color code in the table is the same as that used in Figure 1.
Figure 1The percent of variances in dN/dS (upper panel), dN (middle panel), and dS (lower panel) explained by the principal components for (A) the within-gene comparison; and (B) the between-gene comparison.
Notes: Only the five most important components are shown here. The leftmost bar in each panel indicates the total variances explained by all of the eleven components.
Figure 2The linear correlation between difference in exonic expression level (ΔLn(expL)) and difference in dN/dS (ΔLn(dN/dS), upper panel) or difference in dN (ΔLn(dN), lower panel) for (A) the within-gene comparison; and (B) the between-gene comparison.
Figure 3The linear correlation between difference in percent of solvent-accessible amino acid residues (ΔPSA) and difference in dN/dS (ΔLn(dN/dS), upper panel) or difference in dN (ΔLn(dN), lower panel) for (A) the within-gene comparison; and (B) the between-gene comparison.
The within-gene variance in exonic dN/dS explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 1 | Comp 2 | Comp 9 | Comp 10 | Comp 3 | Comp 4 | Comp 5 | Comp 6 | Comp 7 | Comp 8 | Comp 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8.00 | 6.47 | 1.09 | 0.27 | 0.11 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.00 | |
| ASE/CSE exon type | 0.913 | 0.4536 | 0.4589 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Weighted exon frequency (WEF) | 0.909 | 0.4536 | 0.4556 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0001 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Average exonic expression level | 0.423 | 0.2722 | 0.0196 | 0.1247 | 0.0014 | 0.0045 | 0.0000 | 0.0000 | 0.0001 | 0.0000 | 0.0000 | 0.0000 |
| 5′ intron length | 0.101 | 0.0907 | 0.0000 | 0.0005 | 0.0001 | 0.0002 | 0.0009 | 0.0033 | 0.0016 | 0.0036 | 0.0003 | 0.0000 |
| 3′ intron length | 0.020 | 0.0065 | 0.0000 | 0.0032 | 0.0001 | 0.0000 | 0.0042 | 0.0016 | 0.0000 | 0.0039 | 0.0001 | 0.0000 |
| Exon length | 0.753 | 0.5962 | 0.0305 | 0.1172 | 0.0051 | 0.0033 | 0.0001 | 0.0000 | 0.0000 | 0.0006 | 0.0000 | 0.0000 |
| G+C content | 0.055 | 0.0324 | 0.0011 | 0.0103 | 0.0022 | 0.0002 | 0.0043 | 0.0025 | 0.0009 | 0.0013 | 0.0002 | 0.0000 |
| Exon duplicability | 0.025 | 0.0000 | 0.0142 | 0.0005 | 0.0000 | 0.0000 | 0.0002 | 0.0022 | 0.0071 | 0.0004 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 1.426 | 1.3478 | 0.0338 | 0.0086 | 0.0318 | 0.0009 | 0.0003 | 0.0002 | 0.0003 | 0.0000 | 0.0027 | 0.0000 |
| % intrinsically disordered regions | 2.043 | 1.9310 | 0.0458 | 0.0032 | 0.0623 | 0.0005 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0002 | 0.0000 |
| % Pfam domain | 1.328 | 1.2830 | 0.0294 | 0.0016 | 0.0069 | 0.0004 | 0.0000 | 0.0000 | 0.0000 | 0.0001 | 0.0066 | 0.0000 |
| Structural-functional features | 4.798 | 4.562 | 0.109 | 0.014 | 0.101 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.009 | 0.000 |
| Splicing features | 1.822 | 0.907 | 0.915 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Exon length | 0.753 | 0.596 | 0.031 | 0.117 | 0.005 | 0.003 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 |
| Expression level | 0.423 | 0.272 | 0.020 | 0.125 | 0.001 | 0.005 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Compactness features | 0.121 | 0.097 | 0.000 | 0.004 | 0.000 | 0.000 | 0.005 | 0.005 | 0.002 | 0.008 | 0.000 | 0.000 |
| Other features | 0.080 | 0.032 | 0.015 | 0.011 | 0.002 | 0.000 | 0.005 | 0.005 | 0.008 | 0.002 | 0.000 | 0.000 |
The within-gene variance in exonic dN explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 1 | Comp 2 | Comp 9 | Comp 3 | Comp 10 | Comp 7 | Comp 6 | Comp 4 | Comp 5 | Comp 11 | Comp 8 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 13.49 | 9.38 | 2.33 | 0.7 | 0.52 | 0.24 | 0.21 | 0.05 | 0.02 | 0.02 | 0.02 | 0.00 | |
| ASE/CSE exon type | 1.650 | 0.6580 | 0.9809 | 0.0000 | 0.0010 | 0.0000 | 0.0002 | 0.0000 | 0.0000 | 0.0001 | 0.0100 | 0.0000 |
| Weighted exon frequency (WEF) | 1.643 | 0.6580 | 0.9739 | 0.0000 | 0.0010 | 0.0000 | 0.0002 | 0.0001 | 0.0000 | 0.0001 | 0.0100 | 0.0000 |
| Average exonic expression level | 1.000 | 0.3948 | 0.0419 | 0.3234 | 0.2361 | 0.0031 | 0.0000 | 0.0003 | 0.0000 | 0.0001 | 0.0000 | 0.0000 |
| 5′ intron length | 0.235 | 0.1316 | 0.0000 | 0.0014 | 0.0088 | 0.0002 | 0.0760 | 0.0081 | 0.0018 | 0.0066 | 0.0000 | 0.0000 |
| 3′ intron length | 0.113 | 0.0094 | 0.0000 | 0.0084 | 0.0016 | 0.0002 | 0.0817 | 0.0001 | 0.0084 | 0.0032 | 0.0000 | 0.0000 |
| Exon length | 1.430 | 0.8648 | 0.0652 | 0.3038 | 0.1716 | 0.0110 | 0.0132 | 0.0000 | 0.0001 | 0.0000 | 0.0000 | 0.0000 |
| G+C content | 0.136 | 0.0470 | 0.0023 | 0.0266 | 0.0088 | 0.0048 | 0.0279 | 0.0045 | 0.0086 | 0.0050 | 0.0000 | 0.0000 |
| Exon duplicability | 0.080 | 0.0000 | 0.0303 | 0.0014 | 0.0005 | 0.0000 | 0.0074 | 0.0354 | 0.0004 | 0.0043 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 2.167 | 1.9552 | 0.0722 | 0.0224 | 0.0452 | 0.0694 | 0.0000 | 0.0015 | 0.0006 | 0.0005 | 0.0000 | 0.0000 |
| % intrinsically disordered regions | 3.071 | 2.8012 | 0.0979 | 0.0084 | 0.0276 | 0.1358 | 0.0004 | 0.0001 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| % Pfam domain | 1.964 | 1.8612 | 0.0629 | 0.0042 | 0.0182 | 0.0151 | 0.0027 | 0.0001 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Structural-functional features | 7.203 | 6.618 | 0.233 | 0.035 | 0.091 | 0.220 | 0.003 | 0.002 | 0.001 | 0.001 | 0.000 | 0.000 |
| Splicing features | 3.294 | 1.316 | 1.955 | 0.000 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.020 | 0.000 |
| Exon length | 1.430 | 0.865 | 0.065 | 0.304 | 0.172 | 0.011 | 0.013 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Expression level | 1.000 | 0.395 | 0.042 | 0.323 | 0.236 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Compactness features | 0.348 | 0.141 | 0.000 | 0.010 | 0.010 | 0.000 | 0.158 | 0.008 | 0.010 | 0.010 | 0.000 | 0.000 |
| Other features | 0.215 | 0.047 | 0.033 | 0.028 | 0.009 | 0.005 | 0.035 | 0.040 | 0.009 | 0.009 | 0.000 | 0.000 |
The within-gene variance in exonic dS explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 3 | Comp 2 | Comp 1 | Comp 7 | Comp 6 | Comp 9 | Comp 4 | Comp 10 | Comp 11 | Comp 5 | Comp 8 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3.12 | 1.48 | 0.44 | 0.32 | 0.26 | 0.23 | 0.19 | 0.1 | 0.07 | 0.03 | 0.00 | 0.00 | |
| ASE/CSE exon type | 0.226 | 0.0030 | 0.1852 | 0.0224 | 0.0003 | 0.0000 | 0.0000 | 0.0001 | 0.0000 | 0.0150 | 0.0000 | 0.0000 |
| Weighted exon frequency (WEF) | 0.225 | 0.0030 | 0.1839 | 0.0224 | 0.0003 | 0.0005 | 0.0000 | 0.0001 | 0.0000 | 0.0149 | 0.0000 | 0.0000 |
| Average exonic expression level | 0.783 | 0.6719 | 0.0079 | 0.0134 | 0.0000 | 0.0014 | 0.0878 | 0.0000 | 0.0009 | 0.0000 | 0.0000 | 0.0000 |
| 5′ intron length | 0.170 | 0.0252 | 0.0000 | 0.0045 | 0.0941 | 0.0370 | 0.0004 | 0.0088 | 0.0001 | 0.0000 | 0.0000 | 0.0000 |
| 3′ intron length | 0.151 | 0.0044 | 0.0000 | 0.0003 | 0.1011 | 0.0002 | 0.0023 | 0.0422 | 0.0001 | 0.0000 | 0.0000 | 0.0000 |
| Exon length | 0.633 | 0.4884 | 0.0123 | 0.0294 | 0.0164 | 0.0000 | 0.0825 | 0.0005 | 0.0032 | 0.0000 | 0.0000 | 0.0000 |
| G+C content | 0.134 | 0.0252 | 0.0004 | 0.0016 | 0.0346 | 0.0205 | 0.0072 | 0.0428 | 0.0014 | 0.0000 | 0.0000 | 0.0000 |
| Exon duplicability | 0.182 | 0.0015 | 0.0057 | 0.0000 | 0.0091 | 0.1628 | 0.0004 | 0.0022 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 0.245 | 0.1288 | 0.0136 | 0.0666 | 0.0000 | 0.0067 | 0.0061 | 0.0030 | 0.0202 | 0.0000 | 0.0000 | 0.0000 |
| % intrinsically disordered regions | 0.235 | 0.0784 | 0.0185 | 0.0954 | 0.0005 | 0.0005 | 0.0023 | 0.0002 | 0.0396 | 0.0000 | 0.0000 | 0.0000 |
| % Pfam domain | 0.136 | 0.0518 | 0.0119 | 0.0634 | 0.0034 | 0.0002 | 0.0011 | 0.0000 | 0.0044 | 0.0000 | 0.0000 | 0.0000 |
| Expression level | 0.783 | 0.672 | 0.008 | 0.013 | 0.000 | 0.001 | 0.088 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 |
| Exon length | 0.633 | 0.488 | 0.012 | 0.029 | 0.016 | 0.000 | 0.082 | 0.001 | 0.003 | 0.000 | 0.000 | 0.000 |
| Structural-functional features | 0.617 | 0.259 | 0.044 | 0.225 | 0.004 | 0.007 | 0.010 | 0.003 | 0.064 | 0.000 | 0.000 | 0.000 |
| Splicing features | 0.451 | 0.006 | 0.369 | 0.045 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.030 | 0.000 | 0.000 |
| Compactness features | 0.321 | 0.030 | 0.000 | 0.005 | 0.195 | 0.037 | 0.003 | 0.051 | 0.000 | 0.000 | 0.000 | 0.000 |
| Other features | 0.315 | 0.027 | 0.006 | 0.002 | 0.044 | 0.183 | 0.008 | 0.045 | 0.001 | 0.000 | 0.000 | 0.000 |
The between-gene variance in exonic dN/dS explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 1 | Comp 3 | Comp 4 | Comp 7 | Comp 8 | Comp 5 | Comp 2 | Comp 10 | Comp 6 | Comp 11 | Comp 9 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 13.678 | 13.088 | 0.237 | 0.157 | 0.070 | 0.039 | 0.030 | 0.020 | 0.017 | 0.015 | 0.003 | 0.002 | |
| ASE/CSE exon type | 0.046 | 0.0340 | 0.0001 | 0.0008 | 0.0000 | 0.0000 | 0.0000 | 0.0097 | 0.0000 | 0.0000 | 0.0015 | 0.0000 |
| Weighted exon frequency (WEF) | 0.057 | 0.0446 | 0.0002 | 0.0009 | 0.0000 | 0.0000 | 0.0000 | 0.0097 | 0.0000 | 0.0001 | 0.0015 | 0.0000 |
| Average exonic expression level | 1.332 | 1.2329 | 0.0967 | 0.0006 | 0.0002 | 0.0000 | 0.0001 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0011 |
| 5′ intron length | 0.324 | 0.2365 | 0.0004 | 0.0471 | 0.0367 | 0.0000 | 0.0024 | 0.0002 | 0.0000 | 0.0008 | 0.0000 | 0.0000 |
| 3′ intron length | 0.201 | 0.1036 | 0.0008 | 0.0604 | 0.0320 | 0.0002 | 0.0029 | 0.0001 | 0.0000 | 0.0005 | 0.0000 | 0.0000 |
| Exon length | 1.863 | 1.7945 | 0.0612 | 0.0012 | 0.0001 | 0.0019 | 0.0021 | 0.0001 | 0.0004 | 0.0004 | 0.0000 | 0.0010 |
| G+C content | 0.070 | 0.0004 | 0.0222 | 0.0324 | 0.0003 | 0.0006 | 0.0081 | 0.0000 | 0.0002 | 0.0052 | 0.0000 | 0.0001 |
| Exon duplicability | 0.160 | 0.1354 | 0.0013 | 0.0023 | 0.0003 | 0.0000 | 0.0141 | 0.0001 | 0.0000 | 0.0070 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 2.509 | 2.4524 | 0.0302 | 0.0098 | 0.0001 | 0.0116 | 0.0004 | 0.0001 | 0.0042 | 0.0005 | 0.0000 | 0.0001 |
| % intrinsically disordered regions | 3.983 | 3.9533 | 0.0179 | 0.0012 | 0.0000 | 0.0002 | 0.0001 | 0.0001 | 0.0101 | 0.0000 | 0.0000 | 0.0000 |
| % Pfam domain | 3.133 | 3.1008 | 0.0057 | 0.0001 | 0.0002 | 0.0242 | 0.0001 | 0.0000 | 0.0018 | 0.0000 | 0.0000 | 0.0000 |
| Structural-functional features | 9.625 | 9.507 | 0.054 | 0.011 | 0.000 | 0.036 | 0.001 | 0.000 | 0.016 | 0.001 | 0.000 | 0.000 |
| Exon length | 1.863 | 1.794 | 0.061 | 0.001 | 0.000 | 0.002 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 |
| Expression level | 1.332 | 1.233 | 0.097 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 |
| Compactness features | 0.593 | 0.340 | 0.062 | 0.109 | 0.069 | 0.002 | 0.008 | 0.000 | 0.000 | 0.002 | 0.000 | 0.001 |
| Other features | 0.230 | 0.136 | 0.024 | 0.035 | 0.001 | 0.001 | 0.022 | 0.000 | 0.000 | 0.012 | 0.000 | 0.000 |
| Splicing features | 0.103 | 0.079 | 0.000 | 0.002 | 0.000 | 0.000 | 0.000 | 0.019 | 0.000 | 0.000 | 0.003 | 0.000 |
The between-gene variance in exonic dN explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 1 | Comp 3 | Comp 4 | Comp 8 | Comp 9 | Comp 2 | Comp 6 | Comp 10 | Comp 5 | Comp 7 | Comp 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17.973 | 15.941 | 1.269 | 0.267 | 0.207 | 0.121 | 0.059 | 0.050 | 0.042 | 0.008 | 0.005 | 0.003 | |
| ASE/CSE exon type | 0.077 | 0.0457 | 0.0003 | 0.0014 | 0.0001 | 0.0000 | 0.0286 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0013 |
| Weighted exon frequency (WEF) | 0.091 | 0.0590 | 0.0010 | 0.0016 | 0.0000 | 0.0000 | 0.0285 | 0.0002 | 0.0000 | 0.0000 | 0.0000 | 0.0013 |
| Average exonic expression level | 2.082 | 1.5039 | 0.5183 | 0.0011 | 0.0002 | 0.0583 | 0.0001 | 0.0001 | 0.0001 | 0.0000 | 0.0000 | 0.0000 |
| 5′ intron length | 0.380 | 0.2908 | 0.0022 | 0.0797 | 0.0002 | 0.0000 | 0.0005 | 0.0033 | 0.0000 | 0.0008 | 0.0025 | 0.0000 |
| 3′ intron length | 0.241 | 0.1272 | 0.0042 | 0.1027 | 0.0010 | 0.0010 | 0.0002 | 0.0015 | 0.0000 | 0.0009 | 0.0022 | 0.0000 |
| Exon length | 2.583 | 2.1880 | 0.3275 | 0.0021 | 0.0105 | 0.0511 | 0.0004 | 0.0013 | 0.0011 | 0.0006 | 0.0000 | 0.0000 |
| G+C content | 0.206 | 0.0005 | 0.1197 | 0.0553 | 0.0029 | 0.0075 | 0.0000 | 0.0175 | 0.0006 | 0.0022 | 0.0000 | 0.0000 |
| Exon duplicability | 0.204 | 0.1647 | 0.0072 | 0.0040 | 0.0000 | 0.0000 | 0.0002 | 0.0242 | 0.0000 | 0.0038 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 3.239 | 2.9830 | 0.1619 | 0.0167 | 0.0622 | 0.0031 | 0.0003 | 0.0017 | 0.0104 | 0.0001 | 0.0000 | 0.0000 |
| % intrinsically disordered regions | 4.933 | 4.8077 | 0.0961 | 0.0021 | 0.0012 | 0.0002 | 0.0002 | 0.0000 | 0.0252 | 0.0000 | 0.0000 | 0.0000 |
| % Pfam domain | 3.935 | 3.7704 | 0.0307 | 0.0001 | 0.1290 | 0.0001 | 0.0001 | 0.0000 | 0.0046 | 0.0000 | 0.0000 | 0.0000 |
| Structural-functional features | 12.107 | 11.561 | 0.289 | 0.019 | 0.192 | 0.003 | 0.001 | 0.002 | 0.040 | 0.000 | 0.000 | 0.000 |
| exon length | 2.583 | 2.188 | 0.328 | 0.002 | 0.010 | 0.051 | 0.000 | 0.001 | 0.001 | 0.001 | 0.000 | 0.000 |
| expression level | 2.082 | 1.504 | 0.518 | 0.001 | 0.000 | 0.058 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Compactness features | 0.621 | 0.418 | 0.006 | 0.182 | 0.001 | 0.001 | 0.001 | 0.005 | 0.000 | 0.002 | 0.005 | 0.000 |
| Other features | 0.410 | 0.165 | 0.127 | 0.059 | 0.003 | 0.007 | 0.000 | 0.042 | 0.001 | 0.006 | 0.000 | 0.000 |
| Splicing features | 0.169 | 0.105 | 0.001 | 0.003 | 0.000 | 0.000 | 0.057 | 0.000 | 0.000 | 0.000 | 0.000 | 0.003 |
The between-gene variance in exonic dS explained by the principal components broken down to eleven exon features (upper half); or to six feature categories (lower half).
| Var. explained | Comp 3 | Comp 1 | Comp 9 | Comp 6 | Comp 8 | Comp 5 | Comp 7 | Comp 2 | Comp 4 | Comp 10 | Comp 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2.991 | 1.329 | 0.411 | 0.298 | 0.247 | 0.239 | 0.198 | 0.112 | 0.073 | 0.043 | 0.038 | 0.004 | |
| ASE/CSE exon type | 0.040 | 0.0003 | 0.0011 | 0.0000 | 0.0008 | 0.0001 | 0.0002 | 0.0000 | 0.0355 | 0.0002 | 0.0000 | 0.0018 |
| Weighted exon frequency (WEF) | 0.041 | 0.0011 | 0.0014 | 0.0000 | 0.0009 | 0.0000 | 0.0000 | 0.0000 | 0.0354 | 0.0002 | 0.0000 | 0.0018 |
| Average exonic expression level | 0.726 | 0.5426 | 0.0388 | 0.1430 | 0.0006 | 0.0003 | 0.0005 | 0.0003 | 0.0001 | 0.0002 | 0.0001 | 0.0000 |
| 5′ intron length | 0.113 | 0.0023 | 0.0074 | 0.0001 | 0.0153 | 0.0002 | 0.0160 | 0.0584 | 0.0006 | 0.0133 | 0.0000 | 0.0000 |
| 3′ intron length | 0.107 | 0.0043 | 0.0033 | 0.0024 | 0.0082 | 0.0011 | 0.0197 | 0.0513 | 0.0003 | 0.0169 | 0.0000 | 0.0000 |
| Exon length | 0.559 | 0.3432 | 0.0564 | 0.1254 | 0.0063 | 0.0121 | 0.0141 | 0.0002 | 0.0004 | 0.0003 | 0.0010 | 0.0000 |
| G+C content | 0.296 | 0.1248 | 0.0000 | 0.0183 | 0.0872 | 0.0034 | 0.0529 | 0.0004 | 0.0000 | 0.0087 | 0.0005 | 0.0000 |
| Exon duplicability | 0.223 | 0.0074 | 0.0042 | 0.0000 | 0.1189 | 0.0000 | 0.0910 | 0.0006 | 0.0003 | 0.0006 | 0.0000 | 0.0000 |
| % solvent-accessible amino acid residues | 0.349 | 0.1699 | 0.0769 | 0.0075 | 0.0084 | 0.0716 | 0.0024 | 0.0001 | 0.0002 | 0.0027 | 0.0094 | 0.0000 |
| % intrinsically disordered regions | 0.251 | 0.1008 | 0.1240 | 0.0006 | 0.0001 | 0.0014 | 0.0009 | 0.0000 | 0.0001 | 0.0003 | 0.0229 | 0.0000 |
| % Pfam domain | 0.283 | 0.0321 | 0.0972 | 0.0002 | 0.0001 | 0.1488 | 0.0005 | 0.0003 | 0.0001 | 0.0000 | 0.0042 | 0.0000 |
| Structural-functional features | 0.884 | 0.303 | 0.298 | 0.008 | 0.009 | 0.222 | 0.004 | 0.000 | 0.000 | 0.003 | 0.037 | 0.000 |
| expression level | 0.726 | 0.543 | 0.039 | 0.143 | 0.001 | 0.000 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| exon length | 0.559 | 0.343 | 0.056 | 0.125 | 0.006 | 0.012 | 0.014 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 |
| Other features | 0.519 | 0.132 | 0.004 | 0.018 | 0.206 | 0.003 | 0.144 | 0.001 | 0.000 | 0.009 | 0.001 | 0.000 |
| Compactness features | 0.221 | 0.007 | 0.011 | 0.002 | 0.023 | 0.001 | 0.036 | 0.110 | 0.001 | 0.030 | 0.000 | 0.000 |
| Splicing features | 0.081 | 0.001 | 0.002 | 0.000 | 0.002 | 0.000 | 0.000 | 0.000 | 0.071 | 0.000 | 0.000 | 0.004 |
The Pearson’s coefficient of correlation between each of the eleven principal components and dN/dS, dN, and dS in the within-gene analysis.
| Principal component | Pearson correlation coefficient | ||
|---|---|---|---|
|
| |||
| Component 1 | −0.0763 | −0.0450 | 0.0515 |
| Component 2 | 0.0745 | 0.1481 | 0.1011 |
| Component 3 | 0.0504 | 0.0519 | 0.0034 |
| Component 4 | 0.1210 | 0.2049 | 0.1123 |
| Component 5 | 0.0398 | 0.1307 | 0.1278 |
| Component 6 | 0.0745 | 0.1205 | 0.0596 |
| Component 7 | −0.1415 | −0.2097 | −0.0941 |
| Component 8 | −0.0391 | −0.0388 | 0.0060 |
| Component 9 | 0.0291 | 0.0663 | 0.0499 |
| Component 10 | −0.1976 | −0.2229 | −0.0236 |
| Component 11 | 0.0561 | 0.1260 | 0.0970 |
Notes:
Statistical significance.
P < 0.05;
P < 0.01;
P < 10−3.