| Literature DB >> 18826599 |
Daniel J Gaffney1, Peter D Keightley.
Abstract
BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18826599 PMCID: PMC2576242 DOI: 10.1186/1471-2148-8-265
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Examples of CpG and non-CpG misassignment. Arrows along lineages denote a single nucleotide substitution.
Figure 2Substitution rates at mutational equilibrium. Ratio of estimated to true CpG differences and estimated to true non-CpG differences as a function of increasing sequence divergence for phylogenies derived from randomly generated sequences evolved to be at mutational equilibrium. Results are shown for four different levels of hypermutability: no hypermutability, 5-fold, 10-fold and 20-fold hypermutability (a, b, c and d). Each line represents 50 data points, each of which was estimated from the evolution of a single, randomly-generated 3 Mb sequence.
Figure 3Substitution rates in phylogenies derived from real sequence data. Ratio of estimated (CpG) to true (CpG) CpG differences and estimated (nCpG) to true (nCpG) non-CpG differences plotted against increasing sequence divergence for phylogenies derived from real mouse noncoding (a, b) and coding (c, d) sequences. Separate lines show results for different levels of hypermutability (5-fold, 10-fold and 20-fold hypermutability). Each line is composed of is estimated from 80 simulated replicates of an ~8 Mb sequence.
Nucleotide substitution rates estimated at non-CpG (K) and non-CpG prone (K) sites at fourfold degenerate and intronic sites in 1470 human-chimpanzee orthologues.
| 4-fold | 0.0061 (0.0058,0.0064) | 0.0079 (0.0073,0.0084) | 0.771 |
| Intronic | 0.0092 (0.0089,0.0095) | 0.0105 (0.0102,0.0109) | 0.876 |
| 0.663 (0.621,0.704) | 0.750 (0.692,0.810) |
Non CpG sites were identified using CpG/non-CpG assignment. Numbers in parentheses show 95% confidence intervals estimated by bootstrapping the data by gene, 10000 times.