| Literature DB >> 22220197 |
Abstract
BACKGROUND: A mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the average fixation probability depending on the type of amino acid replacement, has advantages over nucleotide, amino acid, and empirical codon substitution models in evolutionary analysis of protein-coding sequences. It can approximate a wide range of codon substitution processes. If no selection pressure on amino acids is taken into account, it will become equivalent to a nucleotide substitution model. If mutation rates are assumed not to depend on the codon type, then it will become essentially equivalent to an amino acid substitution model. Mutation at the nucleotide level and selection at the amino acid level can be separately evaluated.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22220197 PMCID: PMC3248416 DOI: 10.1371/journal.pone.0028892
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Brief description of models.
| A. Empirical amino acid substitution models converted into codon substitution models | |
| JTT- | The empirical amino acid exchangeabilities of JTT |
In the models specified with the suffix “F”, equilibrium codon frequencies are assumed to be equal to codon frequencies in codon sequences. dGm[rs], i.e., dGmr or dGms, means that the variation of mutation rate or selective constraint over site is approximated by a discrete gamma distribution with m categories [32], respectively; means no variation and the suffix dG1[rs] is omitted.
Comparisons between various codon substitution models in the concatenating analysis of cpDNA-9.
| Codon substitution model | K |
|
|
|
|
|
|
|
|
| cpREV64-1-F-dG4r | 62 |
| 6358.6 | 6351.2 | (0.0) | 0.977 | |||
| LG-1-F-dG4r | 62 |
| 5823.6 | 5816.1 | (0.0) | 0.973 | |||
| JTT-1-F-dG4r | 62 |
| 5214.8 | 5207.4 | (0.0) | 1.020 | |||
| WAG-1-F-dG4r | 62 |
| 5000.5 | 4993.0 | (0.0) | 1.148 | |||
| cpREV10-1-F-dG4r | 62 |
| 3149.3 | 3141.9 | (0.0) | 1.195 | |||
| cpREV64-1-F-dG4s | 62 |
| 3006.4 | 2999.0 | (0.0) | 0.505 | |||
| LG-1-F-dG4s | 62 |
| 2640.8 | 2633.4 | (0.0) | 0.496 | |||
| WAG-1-F-dG4s | 62 |
| 2250.5 | 2243.1 | (0.0) | 0.573 | |||
| JTT-1-F-dG4s | 62 |
| 2090.0 | 2082.6 | (0.0) | 0.519 | |||
| cpREV10-1-F-dG4s | 62 |
| 566.0 | 558.6 | (0.0) | 0.591 | |||
| KHG-2-F-dG4r | 63 |
| 2475.7 | 2475.7 | 0.031 | 1.301 | |||
| KHG-2-F-dG4s | 63 | 0.0 | 0.0 | 0.0 | 0.290 | 0.575 | |||
| No-Constraint-10-F-dG4r | 71 |
| 38801.2 | 38860.7 | (1.0) | 0.000 | 0.040 | 2.541 | 1.830 |
| Equal-Constraint-11-F-dG4r | 72 |
| 2729.2 | 2796.1 | 0.021 | 0.424 | 0.292 | 2.053 | 1.178 |
| EI-12-F-dG4r | 73 |
| 526.3 | 600.6 | 0.023 | 0.000 | 0.494 | 2.217 | 1.160 |
| JTT-ML91+-12-F-dG4r | 73 | 288.7 |
|
| 0.018 | 0.002 | 0.569 | 1.702 | 1.131 |
| WAG-ML91+-12-F-dG4r | 73 | 477.4 |
|
| 0.015 | 0.272 | 0.526 | 2.184 | 1.126 |
| KHG-ML200-12-F-dG4r | 73 | 562.9 |
|
| 0.039 | 0.000 | 0.325 | 1.610 | 1.122 |
| LG-ML91+-12-F-dG4r | 73 | 627.3 |
|
| 0.023 | 0.000 | 0.485 | 2.158 | 1.144 |
| Equal-Constraint-11-F-dG4s | 72 | 680.2 |
|
| 0.063 | 0.414 | 0.208 | 2.196 | 0.384 |
| EI-12-F-dG4s | 73 | 1935.2 |
|
| 0.060 | 0.000 | 0.431 | 2.307 | 0.390 |
| JTT-ML91+-12-F-dG4s | 73 | 2640.2 |
|
| 0.052 | 0.125 | 0.461 | 1.774 | 0.363 |
| KHG-ML200-12-F-dG4s | 73 | 2646.5 |
|
| 0.106 | 0.170 | 0.215 | 1.705 | 0.388 |
| WAG-ML91+-12-F-dG4s | 73 | 2827.2 |
|
| 0.048 | 0.313 | 0.405 | 2.349 | 0.359 |
| LG-ML91+-12-F-dG4s | 73 | 2956.6 |
|
| 0.064 | 0.201 | 0.364 | 2.369 | 0.370 |
| LG-ML91+-11s-F-dG4s | 72 | 2412.8 |
|
| 0.066 | 2.335 | (0.0) | 2.667 | 0.297 |
| LG-ML91+-11-F-dG4s | 72 | 2942.4 |
|
| 0.066 | (0.0) | 0.409 | 2.292 | 0.385 |
| LG-ML91+-12-F | 72 |
| 3684.4 | 3751.4 | 0.026 | 0.878 | 0.622 | 2.039 |
The prior probability of each category for the mechanistic codon models of “dG4s” is , and ; equal probability is used in other models.
The number of adjustable parameters.
Differences from the reference state; , , and .
The average of over all amino acid pairs ; .
The value parenthesized means that the parameter is fixed at the value specified.
The scale parameter of a distribution for the variation of mutation rate over time.
The ratio of double to single and of triple to double nucleotide change exchangeability; .
The ratio of mean transitional to mean transversional exchangeability; .
The shape parameter of a discrete gamma distribution for the variation of mutation rate or selective constraint over sites.
Comparisons between various codon substitution models in the concatenating analysis of cpDNA-55.
| Codon substitution model | K |
|
|
|
|
|
|
|
|
| LG-1-F-dG4r | 62 |
| 31370.2 | 31362.6 | (0.0) | 1.055 | |||
| WAG-1-F-dG4r | 62 |
| 26221.8 | 26214.3 | (0.0) | 1.094 | |||
| cpREV10-1-F-dG4r | 62 |
| 22399.4 | 22391.8 | (0.0) | 1.096 | |||
| JTT-1-F-dG4r | 62 |
| 20913.6 | 20906.1 | (0.0) | 1.092 | |||
| cpREV64-1-F-dG4r | 62 |
| 13792.0 | 13784.5 | (0.0) | 1.091 | |||
| LG-1-F-dG4s | 62 |
| 22206.2 | 22198.7 | (0.0) | 0.289 | |||
| WAG-1-F-dG4s | 62 |
| 17425.2 | 17417.7 | (0.0) | 0.300 | |||
| cpREV10-1-F-dG4s | 62 |
| 13422.7 | 13415.1 | (0.0) | 0.298 | |||
| JTT-1-F-dG4s | 62 |
| 11639.7 | 11632.1 | (0.0) | 0.299 | |||
| cpREV64-1-F-dG4s | 62 |
| 3915.4 | 3907.8 | (0.0) | 0.299 | |||
| KHG-2-F-dG4r | 63 |
| 6322.5 | 6322.5 | 0.068 | 1.073 | |||
| KHG-2-F-dG4s | 63 | 0.0 | 0.0 | 0.0 | 0.150 | 0.277 | |||
| No-Constraint-10-F-dG4r | 71 | 1705.9 |
|
| (1.0) | 0.000 | 0.018 | 3.557 | 1.055 |
| Equal-Constraint-11-F-dG4r | 72 | 26281.9 |
|
| 0.156 | 0.000 | 0.101 | 2.671 | 1.107 |
| EI-12-F-dG4r | 73 | 26941.5 |
|
| 0.143 | 0.000 | 0.107 | 2.732 | 1.100 |
| JTT-ML91+-12-F-dG4r | 73 | 27198.5 |
|
| 0.122 | 0.000 | 0.122 | 2.501 | 1.111 |
| WAG-ML91+-12-F-dG4r | 73 | 27378.4 |
|
| 0.125 | 0.000 | 0.115 | 2.690 | 1.100 |
| LG-ML91+-12-F-dG4r | 73 | 27664.8 |
|
| 0.142 | 0.000 | 0.112 | 2.707 | 1.109 |
| KHG-ML200-12-F-dG4r | 73 | 27683.4 |
|
| 0.163 | 0.000 | 0.099 | 2.479 | 1.106 |
| Equal-Constraint-11-F-dG4s | 72 | 34659.7 |
|
| 0.276 | 0.124 | 0.056 | 2.664 | 0.259 |
| EI-12-F-dG4s | 73 | 35716.3 |
|
| 0.235 | 0.103 | 0.071 | 2.727 | 0.247 |
| KHG-ML200-12-F-dG4s | 73 | 36243.5 |
|
| 0.251 | 0.116 | 0.058 | 2.477 | 0.285 |
| JTT-ML91+-12-F-dG4s | 73 | 36257.9 |
|
| 0.204 | 0.072 | 0.098 | 2.438 | 0.231 |
| WAG-ML91+-12-F-dG4s | 73 | 36362.6 |
|
| 0.222 | 0.109 | 0.074 | 2.670 | 0.234 |
| LG-ML91+-12-F-dG4s | 73 | 36583.3 |
|
| 0.233 | 0.105 | 0.073 | 2.701 | 0.256 |
| LG-ML91+-11s-F-dG4s | 72 | 36336.9 |
|
| 0.250 | 0.260 | (0.0) | 2.788 | 0.237 |
| LG-ML91+-11-F-dG4s | 72 | 36479.9 |
|
| 0.213 | (0.0) | 0.123 | 2.623 | 0.273 |
| LG-ML91+-12-F | 72 | 14390.7 |
|
| 0.135 | 0.000 | 0.182 | 2.569 |
The prior probability of each category for the “dG4s” is , and .
The number of adjustable parameters.
Differences from the reference state; , , and .
The average of over all amino acid pairs ; .
The value parenthesized means that the parameter is fixed at the value specified.
The scale parameter of a distribution for the variation of mutation rate over time.
The ratio of double to single and of triple to double nucleotide change exchangeability; .
The ratio of mean transitional to mean transversional exchangeability; .
The shape parameter of a discrete gamma distribution for the variation of mutation rate or selective constraint over sites.
Comparisons between various codon substitution models in the concatenating analysis of mammalian-mtDNA.
| Codon substitution model | K |
|
|
|
|
|
|
|
|
| WAG-1-F-dG4r | 62 |
| 10454.0 | 10454.0 | (0.0) | 0.786 | |||
| LG-1-F-dG4r | 62 |
| 10309.8 | 10309.8 | (0.0) | 0.771 | |||
| JTT-1-F-dG4r | 62 |
| 7347.7 | 7347.7 | (0.0) | 0.783 | |||
| mtREV-1-F-dG4r | 62 |
| 3725.9 | 3725.9 | (0.0) | 0.870 | |||
| WAG-1-F-dG4s | 62 |
| 5325.2 | 5325.2 | (0.0) | 0.327 | |||
| LG-1-F-dG4s | 62 |
| 5256.1 | 5256.1 | (0.0) | 0.314 | |||
| JTT-1-F-dG4s | 62 |
| 2570.2 | 2570.2 | (0.0) | 0.329 | |||
| mtREV-1-F-dG4s | 62 | 0.0 | 0.0 | 0.0 | (0.0) | 0.339 | |||
| No-Constraint-10-F-dG4r | 71 |
| 127247.9 | 127303.6 | (1.0) | 0.000 | 0.000 | 4.908 | 1.965 |
| Equal-Constraint-11-F-dG4r | 72 | 464.7 |
|
| 0.013 | 0.000 | 0.108 | 4.508 | 0.495 |
| EI-12-F-dG4r | 73 | 4336.4 |
|
| 0.007 | 0.000 | 0.271 | 4.697 | 0.928 |
| KHG-ML200-12-F-dG4r | 73 | 5340.3 |
|
| 0.022 | 0.000 | 0.088 | 3.238 | 0.480 |
| JTT-ML91+-12-F-dG4r | 73 | 5501.7 |
|
| 0.006 | 0.000 | 0.228 | 3.679 | 0.452 |
| WAG-ML91+-12-F-dG4r | 73 | 5728.6 |
|
| 0.006 | 0.000 | 0.206 | 5.614 | 0.492 |
| LG-ML91+-12-F-dG4r | 73 | 6315.1 |
|
| 0.009 | 0.000 | 0.147 | 5.921 | 0.515 |
| Equal-Constraint-11-F-dG4s | 72 | 6961.9 |
|
| 0.036 | 1.313 | 0.031 | 4.984 | 0.269 |
| EI-12-F-dG4s | 73 | 10402.4 |
|
| 0.024 | 1.137 | 0.124 | 5.426 | 0.267 |
| KHG-ML200-12-F-dG4s | 73 | 11621.0 |
|
| 0.063 | 1.119 | 0.039 | 3.658 | 0.306 |
| JTT-ML91+-12-F-dG4s | 73 | 11698.3 |
|
| 0.022 | 1.637 | 0.091 | 4.189 | 0.259 |
| WAG-ML91+-12-F-dG4s | 73 | 11997.4 |
|
| 0.020 | 1.686 | 0.092 | 6.588 | 0.259 |
| LG-ML91+-12-F-dG4s | 73 | 12532.5 |
|
| 0.028 | 1.826 | 0.065 | 7.158 | 0.262 |
| LG-ML91+-11-F-dG4s | 72 | 12113.1 |
|
| 0.035 | (0.0) | 0.128 | 6.009 | 0.290 |
| LG-ML91+-11s-F-dG4s | 72 | 12268.3 |
|
| 0.028 | 3.066 | (0.0) | 7.600 | 0.252 |
| LG-ML91+-12-F | 72 |
| 9627.1 | 9689.0 | 0.011 | 3.713 | 0.196 | 5.477 |
The prior probability of each category for the “dG4s” is , and .
The number of adjustable parameters.
Differences from the reference state; , , and .
The average of over all amino acid pairs ; .
The value parenthesized means that the parameter is fixed at the value specified.
The scale parameter of a distribution for the variation of mutation rate over time.
The ratio of double to single and of triple to double nucleotide change exchangeability; .
The ratio of mean transitional to mean transversional exchangeability; .
The shape parameter of a discrete gamma distribution for the variation of mutation rate or selective constraint over sites.
Comparisons between various codon substitution models in the concatenating analysis of human-mtDNA.
| Codon substitution model | K |
|
|
|
|
|
|
|
|
| LG-1-F-dG4r | 62 |
| 84.8 | 84.8 | (0.0) | 0.338 | |||
| WAG-1-F-dG4r | 62 |
| 80.1 | 80.1 | (0.0) | 0.343 | |||
| JTT-1-F-dG4r | 62 | 5.9 |
|
| (0.0) | 0.345 | |||
| mtREV-1-F-dG4r | 62 | 7.7 |
|
| (0.0) | 0.331 | |||
| LG-1-F-dG4s | 62 |
| 98.3 | 98.3 | (0.0) | 0.080 | |||
| WAG-1-F-dG4s | 62 |
| 92.5 | 92.5 | (0.0) | 0.079 | |||
| JTT-1-F-dG4s | 62 |
| 3.7 | 3.7 | (0.0) | 0.091 | |||
| mtREV-1-F-dG4s | 62 | 0.0 | 0.0 | 0.0 | (0.0) | 0.085 | |||
| No-Constraint-10-F-dG4r | 71 | 315.1 |
|
| (1.0) | 0.000 | 0.000 | 47.760 | 0.465 |
| Equal-Constraint-11-F-dG4r | 72 | 515.6 |
|
| 0.092 | 0.066 | 0.000 | 39.685 | 0.612 |
| EI-12-F-dG4r | 73 | 525.8 |
|
| 0.076 | 0.000 | 0.000 | 36.208 | 0.620 |
| KHG-ML200-12-F-dG4r | 73 | 530.8 |
|
| 0.110 | 0.000 | 0.000 | 31.182 | 0.648 |
| WAG-ML91+-12-F-dG4r | 73 | 535.5 |
|
| 0.069 | 0.000 | 0.000 | 41.404 | 0.635 |
| LG-ML91+-12-F-dG4r | 73 | 535.7 |
|
| 0.089 | 0.002 | 0.000 | 42.787 | 0.627 |
| JTT-ML91+-12-F-dG4r | 73 | 541.0 |
|
| 0.051 | 0.000 | 0.000 | 32.733 | 0.646 |
| Equal-Constraint-11-F-dG4s | 72 | 517.4 |
|
| 0.108 | 0.000 | 0.000 | 36.267 | 0.106 |
| EI-12-F-dG4s | 73 | 528.3 |
|
| 0.079 | 0.000 | 0.000 | 34.994 | 0.123 |
| KHG-ML200-12-F-dG4s | 73 | 536.6 |
|
| 0.106 | 0.000 | 0.000 | 30.457 | 0.227 |
| LG-ML91+-12-F-dG4s | 73 | 538.4 |
|
| 0.078 | 0.000 | 0.000 | 39.024 | 0.233 |
| WAG-ML91+-12-F-dG4s | 73 | 539.3 |
|
| 0.059 | 0.000 | 0.000 | 38.794 | 0.207 |
| JTT-ML91+-12-F-dG4s | 73 | 542.6 |
|
| 0.049 | 0.000 | 0.000 | 32.064 | 0.168 |
| JTT-ML91+-11-F-dG4s | 72 | 542.6 |
|
| 0.049 | (0.0) | 0.000 | 32.067 | 0.168 |
| JTT-ML91+-11s-F-dG4s | 72 | 542.6 |
|
| 0.049 | 0.000 | (0.0) | 32.067 | 0.168 |
| JTT-ML91+-12-F | 72 | 522.3 |
|
| 0.052 | 0.000 | 0.000 | 32.207 |
The prior probability of each category for the “dG4s” is , and .
The number of adjustable parameters.
Differences from the reference state; , , and .
The average of over all amino acid pairs ; .
The value parenthesized means that the parameter is fixed at the value specified.
The scale parameter of a distribution for the variation of mutation rate over time.
The ratio of double to single and of triple to double nucleotide change exchangeability; .
The ratio of mean transitional to mean transversional exchangeability; .
The shape parameter of a discrete gamma distribution for the variation of mutation rate or selective constraint over sites.
Comparisons between various codon substitution models in the concatenating analysis of nDNA.
| Codon substitution model | K |
|
|
|
|
|
|
|
|
| LG-1-F-dG4r | 62 |
| 111.7 | 106.7 | (0.0) | 0.429 | |||
| WAG-1-F-dG4r | 62 |
| 109.6 | 104.6 | (0.0) | 0.431 | |||
| JTT-1-F-dG4r | 62 |
| 83.3 | 78.3 | (0.0) | 0.434 | |||
| LG-1-F-dG4s | 62 |
| 163.1 | 158.1 | (0.0) | 0.102 | |||
| WAG-1-F-dG4s | 62 |
| 160.1 | 155.0 | (0.0) | 0.103 | |||
| JTT-1-F-dG4s | 62 |
| 135.3 | 130.3 | (0.0) | 0.110 | |||
| KHG-2-F-dG4r | 63 | 20.5 |
|
| 0.082 | 0.472 | |||
| KHG-2-F-dG4s | 63 | 0.0 | 0.0 | 0.0 | 0.214 | 0.118 | |||
| No-Constraint-3-F-dG4r | 64 |
| 187.9 | 192.9 | (1.0) | 0.000 | 0.000 | 4.368 | 0.499 |
| Equal-Constraint-4-F-dG4r | 65 | 137.1 |
|
| 0.083 | 0.000 | 0.020 | 2.638 | 0.465 |
| KHG-ML200-5-F-dG4r | 66 | 143.2 |
|
| 0.085 | 0.000 | 0.018 | 2.500 | 0.461 |
| EI-5-F-dG4r | 66 | 144.6 |
|
| 0.061 | 0.000 | 0.026 | 2.675 | 0.458 |
| LG-ML91+-5-F-dG4r | 66 | 146.6 |
|
| 0.065 | 0.000 | 0.025 | 2.756 | 0.456 |
| JTT-ML91+-5-F-dG4r | 66 | 147.3 |
|
| 0.050 | 0.000 | 0.027 | 2.515 | 0.463 |
| WAG-ML91+-5-F-dG4r | 66 | 147.7 |
|
| 0.053 | 0.000 | 0.025 | 2.737 | 0.459 |
| Equal-Constraint-4-F-dG4s | 65 | 119.4 |
|
| 0.092 | 0.003 | 0.032 | 2.470 | 0.109 |
| KHG-ML200-5-F-dG4s | 66 | 124.4 |
|
| 0.080 | 0.000 | 0.039 | 2.333 | 0.157 |
| EI-5-F-dG4s | 66 | 125.9 |
|
| 0.060 | 0.000 | 0.049 | 2.450 | 0.152 |
| LG-ML91+-5-F-dG4s | 66 | 125.9 |
|
| 0.063 | 0.000 | 0.050 | 2.466 | 0.172 |
| JTT-ML91+-5-F-dG4s | 66 | 128.0 |
|
| 0.051 | 0.000 | 0.053 | 2.344 | 0.133 |
| WAG-ML91+-5-F-dG4s | 66 | 128.4 |
|
| 0.056 | 0.000 | 0.049 | 2.503 | 0.127 |
| WAG-ML91+-4s-F-dG4r | 65 | 146.1 |
|
| 0.055 | 0.016 | (0.0) | 2.755 | 0.449 |
| WAG-ML91+-4-F-dG4r | 65 | 147.7 |
|
| 0.053 | (0.0) | 0.025 | 2.737 | 0.459 |
| WAG-ML91+-4s-F-dG4s | 65 | 127.5 |
|
| 0.057 | 0.079 | (0.0) | 2.572 | 0.133 |
| WAG-ML91+-4-F-dG4s | 65 | 128.4 |
|
| 0.056 | (0.0) | 0.049 | 2.507 | 0.129 |
| WAG-ML91+-5-F | 65 | 109.7 |
|
| 0.055 | 0.001 | 0.059 | 2.535 |
In the models specified with the suffix “-3-”, “-4-”, “-4s-” or “-5-”, three, four or five parameters are optimized with and . The prior probability of each category for the “dG4s” is , and .
The number of adjustable parameters.
Differences from the reference state; , , and .
The average of over all amino acid pairs ; .
The value parenthesized means that the parameter is fixed at the value specified.
The scale parameter of a distribution for the variation of mutation rate over time.
The ratio of double to single and of triple to double nucleotide change exchangeability; .
The ratio of mean transitional to mean transversional exchangeability; .
The shape parameter of a discrete gamma distribution for the variation of mutation rate or selective constraint over sites.
Figure 1Comparisons of of each gene in each dataset among models.
of each gene in cpDNA-9 (A), cpDNA-55 (B), mammalian-mtDNA (C), human-mtDNA (D), and nDNA (E) for each specified model is plotted against its log-likelihood value for the best model in the concatenating analysis of the genes. The horizontal dotted line of shows the reference model for each dataset. The best model is shown by the lowest dot-dashed line. The lower dotted line in each figure shows the data points for the EI model. The lower and the upper broken lines show the Equal-Constraint and the No-Constraint models, respectively. The No-Constraint model is not shown for cpDNA-9 and mammalian-mtDNA, because its values are too large. In the models specified with the suffix “-5-” for human-mtDNA and nDNA, five parameters were optimized with and .
Figure 2Comparison of of each gene in each dataset between the models with variable mutation rates and with variable selective constraints over sites.
of each gene in cpDNA-9 (A), cpDNA-55 (B), mammalian-mtDNA (C), human-mtDNA (D), and nDNA (E) is compared between the models with the variation of mutation rate (dG4r) and with the variation of selection constraint (dG4s) over sites. The dotted line shows the line of equal values between the ordinate and the abscissa. In the models specified with the suffix “-5-” for human-mtDNA, five parameters were optimized with and .
Figure 3Site dependences of selective constraints.
Site dependences of selective constraints in the photosystem II CP47 protein (psbB gene) (A) and cytochrome c oxidase subunit 1 mitochondrial protein (COX1 gene) (B) are shown. Residue sites are categorized by the number of van der Waals contacts with surrounding non-solvent atoms in the protein structure; neighboring residues along a polypeptide chain are not counted. The degree of van der Waals contact for an atom pair, which is separated by and whose van der Waals distance is equal to , is defined as for and for . The van der Waals contacts are evaluated for the psbB in the 38-meric state of the photosystem II complex from Thermosynechococcus vulcanus, and for the COX1 in the biological 26-meric state of bovine heart cytochrome C oxidase in the fully reduced state; the protein coordinates 3ARC and 2EIJ in the PDB database were used. Posterior mean of selective constrains () averaged over sites in each residue category is shown in the ordinate. The posterior mean of selective constrains were calculated by the LG-ML91+-12-F-dG4s for the concatenated sequences of the datasets cpDNA-9 and mammalian-mtDNA.
Figure 4Comparisons of branch lengths estimated by the models with a uniform rate, variable mutation rates, and variable selective constraints over sites.
Branch lengths estimated for the phylogenetic trees of cpDNA-9 (A), cpDNA-55 (B), mammalian-mtDNA (C), human-mtDNA (D), and nDNA (E) are compared among models. The abscissa shows the branch lengths estimated by the model with the variation of selection constraint (dG4s). The LG-ML91+-12-F-dG4s is the best model except for human-mtDNA and nDNA. The best model is JTT-ML91+-12-F-dG4s for human-mtDNA and WAG-ML91+-5-F-dG4r for nDNA. The models with the variation of mutation rate (dG4r) and with a uniform substitution rate over sites are shown by cross and plus marks, respectively. The model with the variation of selection constraint (dG4s) is shown by the middle dotted line. The dotted lines in each figure are ones connecting the origin and the respective estimates for the longest branch on the abscissa. In the models specified with the suffix “-5-” for human-mtDNA and nDNA, five parameters were optimized with and .
Figure 5The estimates of branch lengths for the phylogenetic tree of each dataset under the different types of models.
Branch lengths estimated for cpDNA-9 (A) and mammalian-mtDNA (B) are compared among models. The abscissa shows the branch lengths estimated by the best model with the variation of selective constraint, LG-ML91+-12-F-dG4s. The dotted line in each figure shows branch lengths estimated by the best model shown on the abscissa.