| Literature DB >> 15743518 |
Bjarne Knudsen1, Michael M Miyamoto.
Abstract
BACKGROUND: The f factor is a new parameter for accommodating the influence of both the starting and ending states in the rate matrices of "generalized weighted frequencies" (+gwF) models for sequence evolution. In this study, we derive an expected value for f, starting from a nearly neutral model of weak selection, and then assess the biological interpretation of this factor with evolutionary simulations.Entities:
Mesh:
Year: 2005 PMID: 15743518 PMCID: PMC554786 DOI: 10.1186/1471-2148-5-21
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Adjustment factor as a function of the ratio of . The adjustment factor is given by (equation (7)).
Starting equilibrium base frequencies and results for the simulations with either homogeneous or heterogeneous sequences (i.e., those with sites from single versus multiple categories, respectively).
| Categories | Biasb | ||||||
| A | 0.10 | 0.40 | 0.30 | 0.20 | 0.154 | 0.50 ± 0.01 | 0.00 ± 0.01 |
| B | 0.30 | 0.30 | 0.30 | 0.10 | 0.105 | 0.50 ± 0.01 | 0.00 ± 0.01 |
| C | 0.30 | 0.20 | 0.20 | 0.30 | 0.029 | 0.50 ± 0.02 | 0.00 ± 0.03 |
| D | 0.40 | 0.20 | 0.20 | 0.20 | 0.078 | 0.49 ± 0.01 | 0.01 ± 0.01 |
| E | 0.20 | 0.40 | 0.20 | 0.20 | 0.078 | 0.51 ± 0.01 | -0.01 ± 0.02 |
| F | 0.20 | 0.20 | 0.40 | 0.20 | 0.078 | 0.50 ± 0.02 | -0.01 ± 0.01 |
| A+B | 0.20 | 0.35 | 0.30 | 0.15 | 0.074 | 0.43 ± 0.01 | -0.11 ± 0.02 |
| A+C | 0.20 | 0.30 | 0.25 | 0.25 | 0.015 | 0.34 ± 0.03 | -0.16 ± 0.03 |
| B+C | 0.30 | 0.25 | 0.25 | 0.20 | 0.015 | 0.24 ± 0.02 | -0.33 ± 0.03 |
| A+B+Ce | 0.23 | 0.30 | 0.27 | 0.20 | 0.016 | 0.39 ± 0.04 | -0.13 ± 0.04 |
| D+E+Fe | 0.27 | 0.27 | 0.27 | 0.20 | 0.010 | 0.68 ± 0.03 | 0.29 ± 0.04 |
aExpected nucleotide distribution.
bNucleotide bias, as information content measured in bits: .
cMean ± twice the standard error of the estimate.
df = 0:0 for these simulations with the HKY model. With f = 0:0, the HKY+gwF variant is reduced in these simulations to its more standard F81 based model.
eThe heterogeneous sequences in these simulations were of length 9,999, rather than 10,000, since the latter is not a multiple of 3.
Figure 2Two situations where . (A) The effect of a change in Non the value of f. This change in Noccurs in the most recent common ancestor of the four simulated sequences. Population ratio refers to its Nafter versus before this change. (B) The effect of an increased C to T substitution rate. Categories A, B, and C are defined in Table 1.