| Literature DB >> 19043566 |
Steffen Schmidt1, Anna Gerasimova, Fyodor A Kondrashov, Ivan A Adzhubei, Ivan A Adzuhbei, Alexey S Kondrashov, Shamil Sunyaev.
Abstract
Mutation rate varies greatly between nucleotide sites of the human genome and depends both on the global genomic location and the local sequence context of a site. In particular, CpG context elevates the mutation rate by an order of magnitude. Mutations also vary widely in their effect on the molecular function, phenotype, and fitness. Independence of the probability of occurrence of a new mutation's effect has been a fundamental premise in genetics. However, highly mutable contexts may be preserved by negative selection at important sites but destroyed by mutation at sites under no selection. Thus, there may be a positive correlation between the rate of mutations at a nucleotide site and the magnitude of their effect on fitness. We studied the impact of CpG context on the rate of human-chimpanzee divergence and on intrahuman nucleotide diversity at non-synonymous coding sites. We compared nucleotides that occupy identical positions within codons of identical amino acids and only differ by being within versus outside CpG context. Nucleotides within CpG context are under a stronger negative selection, as revealed by their lower, proportionally to the mutation rate, rate of evolution and nucleotide diversity. In particular, the probability of fixation of a non-synonymous transition at a CpG site is two times lower than at a CpG site. Thus, sites with different mutation rates are not necessarily selectively equivalent. This suggests that the mutation rate may complement sequence conservation as a characteristic predictive of functional importance of nucleotide sites.Entities:
Mesh:
Year: 2008 PMID: 19043566 PMCID: PMC2583910 DOI: 10.1371/journal.pgen.1000281
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Estimates of the impact of CpG context on the mutation rates of transitions and transversions.
|
| Kondrashov (2003) | Hwang & Green (2004) |
| (human-chimp)-orangutan |
|
| 15.4 | 13.7 | 14.5 | 12.2 |
|
| 2.8 | 4.2 | 3.5 | 2.4 |
The last column contains ratios computed using a ((human-chimp)-orangutan) alignment.
Non-synonymous substitutions in human-chimp divergence and human polymorphism data.
| Divergence | Diversity | |||||||||||||
| Macaque as outgroup | Orangutan as outgroup | |||||||||||||
| CpG | ⌝CpG | CpGtarget | ⌝CpGtarget | CpGimpact | CpG | ⌝CpG | CpGtarget | ⌝CpGtarget | CpGimpact | CpG | ⌝CpG | CpGimpact (orangutan) | ||
|
| V→I | 368 | 481 | 11,714 | 128,771 |
| 318 | 353 | 10,814 | 103,463 |
| 247 | 190 |
|
| V→M | 197 | 217 | 14,951 | 92,303 |
| 176 | 154 | 13,593 | 76,037 |
| 166 | 88 |
| |
| A→T | 352 | 582 | 26,266 | 269,079 |
| 345 | 408 | 22,865 | 212,630 |
| 213 | 237 |
| |
| G→S | 177 | 238 | 14,793 | 118,403 |
| 150 | 154 | 12,891 | 93,855 |
| 130 | 83 |
| |
| G→R | 124 | 205 | 12,561 | 131,213 |
| 96 | 155 | 10,913 | 101,692 |
| 93 | 70 |
| |
| D→N | 119 | 236 | 16,226 | 191,210 |
| 119 | 160 | 14,172 | 150,842 |
| 104 | 99 |
| |
| E→K | 150 | 307 | 21,964 | 286,222 |
| 106 | 229 | 19,512 | 223,672 |
| 121 | 132 |
| |
| S→L | 162 | 79 | 14,345 | 56,932 |
| 122 | 62 | 12,596 | 43,892 |
| 101 | 29 |
| |
| P→L | 323 | 401 | 20,988 | 229,606 |
| 215 | 284 | 18,088 | 185,826 |
| 179 | 137 |
| |
| A→V | 275 | 477 | 22,769 | 260,480 |
| 235 | 363 | 19,841 | 215,997 |
| 176 | 198 |
| |
|
| V→L | 26 | 151 | 26,938 | 227,412 |
| 43 | 161 | 39,449 | 284,400 |
| 27 | 85 |
|
| V→F | 9 | 41 | 9,937 | 97,770 |
| 7 | 22 | 9,088 | 78,368 |
| 8 | 16 |
| |
| A→P | 24 | 120 | 25,938 | 268,617 |
| 22 | 89 | 22,542 | 212,311 |
| 12 | 58 |
| |
| A→S | 69 | 242 | 25,983 | 268,739 |
| 69 | 171 | 22,589 | 212,393 |
| 26 | 87 |
| |
| G→R | 37 | 104 | 27,601 | 253,250 |
| 18 | 31 | 23,805 | 196,963 |
| 12 | 31 |
| |
| G→C | 19 | 39 | 14,635 | 118,204 |
| 14 | 20 | 12,755 | 93,721 |
| 5 | 14 |
| |
| G→W | 1 | 17 | 8,088 | 54,094 |
| 4 | 10 | 7,229 | 43,326 |
| 4 | 5 |
| |
| D→H | 11 | 91 | 16,118 | 191,065 |
| 9 | 65 | 14062 | 150,747 |
| 6 | 39 |
| |
| D→Y | 8 | 54 | 16,115 | 191,028 |
| 8 | 36 | 14,061 | 150,718 |
| 4 | 21 |
| |
| E→Q | 32 | 203 | 21,846 | 286,118 |
| 24 | 149 | 19,430 | 223,592 |
| 13 | 74 |
| |
| P→Q | 27 | 37 | 20,692 | 74,730 |
| 24 | 23 | 17,897 | 59,091 |
| 11 | 15 |
| |
| T→K | 22 | 43 | 17,764 | 66,600 |
| 11 | 30 | 15,900 | 53,435 |
| 10 | 13 |
| |
| A→E | 30 | 50 | 22,524 | 70,891 |
| 28 | 30 | 19,634 | 56,882 |
| 12 | 13 |
| |
| P→R | 18 | 120 | 20,683 | 229,325 |
| 13 | 100 | 17,886 | 185,642 |
| 15 | 51 |
| |
| A→G | 25 | 144 | 22,519 | 260,147 |
| 20 | 114 | 19,626 | 215,748 |
| 11 | 79 |
| |
| T→R | 16 | 50 | 17,758 | 66,607 |
| 10 | 34 | 19,416 | 223,477 |
| 6 | 11 |
| |
| F→L | 13 | 68 | 6,884 | 77,474 |
| 7 | 47 | 6,173 | 62,281 |
| 3 | 19 |
| |
| C→W | 6 | 13 | 5,004 | 47,722 |
| 2 | 7 | 4,293 | 37,600 |
| 0 | 4 |
| |
| H→Q | 15 | 82 | 6,737 | 55,640 |
| 17 | 46 | 5,903 | 43,819 |
| 5 | 25 |
| |
| N→K | 9 | 95 | 7,605 | 71,708 |
| 13 | 68 | 6,812 | 57,756 |
| 5 | 39 |
| |
| S→R | 25 | 74 | 8,433 | 71,141 |
| 20 | 49 | 7,227 | 56,443 |
| 4 | 35 |
| |
| D→E | 31 | 145 | 10,551 | 90,245 |
| 14 | 104 | 9,121 | 72,379 |
| 11 | 40 |
| |
Column heads: identity of an amino acid change; the data for transitions is shown in the upper part of the table, transversions below. Columns: divergence data with macaque or orangutan as outgroup followed by diversity data computed from human non-synonymous SNP. CpG/⌝CpG: number of changes within/outside CpG context; CpGtarget/⌝CpGtarget: number of targets within/outside CpG context. CpGimpact impact of CpGs as calculated in formula 1. The nsSNP data shown here is based on the Applera data set using human-chimp alignments to determine the direction of the mutation.
Figure 1CpG impact on transitions in amino acid changes.
The effect on human-chimpanzee divergence is shown in blue; the effect on non-synonymous SNPs in human in red.
Average impacts of CpG context for different types of sites using orangutan as outgroup.
| CpGimpact | Type | Divergence | Diversity |
| non-synonymous | transition | 7.1 | 11.2 |
| transversion | 2.5 | 2.4 | |
| synonymous | transition | 8.6 | 11.7 |
| transversion | 2.1 | 2.3 | |
| non-coding | transition | 12.2 | 13.7 |
| transversion | 2.4 | 2.0 |