| Literature DB >> 20846930 |
David N Cooper1, Matthew Mort, Peter D Stenson, Edward V Ball, Nadia A Chuzhanova.
Abstract
The cytosine-guanine (CpG) dinucleotide has long been known to be a hotspot for pathological mutation in the human genome. This hypermutability is related to its role as the major site of cytosine methylation with the attendant risk of spontaneous deamination of 5-methylcytosine (5mC) to yield thymine. Cytosine methylation, however, also occurs in the context of CpNpG sites in the human genome, an unsurprising finding since the intrinsic symmetry of CpNpG renders it capable of supporting a semi-conservative model of replication of the methylation pattern. Recently, it has become clear that significant DNA methylation occurs in a CpHpG context (where H = A, C or T) in a variety of human somatic tissues. If we assume that CpHpG methylation also occurs in the germline, and that 5mC deamination can occur within a CpHpG context, then we might surmise that methylated CpHpG sites could also constitute mutation hotspots causing human genetic disease. To test this postulate, 54,625 missense and nonsense mutations from 2,113 genes causing inherited disease were retrieved from the Human Gene Mutation Database (http://www.hgmd.org). Some 18.2 per cent of these pathological lesions were found to be C → T and G → A transitions located in CpG dinucleotides (compatible with a model of methylation-mediated deamination of 5mC), an approximately ten-fold higher proportion than would have been expected by chance alone. The corresponding proportion for the CpHpG trinucleotide was 9.9 per cent, an approximately two-fold higher proportion than would have been expected by chance. We therefore estimate that ∼5 per cent of missense/nonsense mutations causing human inherited disease may be attributable to methylation-mediated deamination of 5mC within a CpHpG context.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20846930 PMCID: PMC3525222 DOI: 10.1186/1479-7364-4-6-406
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Numbers of C → T and G → A mutations found in CpG dinucleotides and CpHpG trinucleotides in a dataset of 54,625 missense and nonsense mutations in 2,113 different human genes (HGMD) and the numbers of possible C → T and G → A mutations in CpG dinucleotides and CpHpG trinucleotides within the coding regions of the mutated genes.
| Di/trinucleotide | Dataset | Number of mutations in | ||
|---|---|---|---|---|
| in di/trinucleotide | not in di/trinucleotide | |||
| CpG | HGMD | 9,947 | 44,678 | < 10-230 |
| Possible | 292,147 | 13,269,850 | ||
| CpHpG | HGMD | 5,402 | 49,223 | < 10-230 |
| Possible | 610,714 | 12,951,283 | ||
Numbers of C → T and G → A mutations found in CpG dinucleotides and CpHpG trinucleotides in a dataset of 1,766 regulatory mutations of 191 gene promoters (HGMD) and the numbers of possible C → T and G → A mutations in CpG dinucleotides and CpHpG trinucleotides.
| Di/trinucleotide | Dataset | Number of mutations in | ||
|---|---|---|---|---|
| in di/trinucleotide | not in di/trinucleotide | |||
| CpG | HGMD | 94 | 1,672 | 6.03 × 10-9 |
| Possible | 1,940 | 64,213 | ||
| CpHpG | HGMD | 54 | 1,712 | 0.011 |
| Possible | 2,838 | 63,315 | ||