| Literature DB >> 16916449 |
Hui-Qi Qu1, Steve G Lawrence, Fan Guo, Jacek Majewski, Constantin Polychronakos.
Abstract
BACKGROUND: Complementary single-nucleotide polymorphisms (SNPs) may not be distributed equally between two DNA strands if the strands are functionally distinct, such as in transcribed genes. In introns, an excess of A<-->G over the complementary C<-->T substitutions had previously been found and attributed to transcription-coupled repair (TCR), demonstrating the valuable functional clues that can be obtained by studying such asymmetry. Here we studied asymmetry of human synonymous SNPs (sSNPs) in the fourfold degenerate (FFD) sites as compared to intronic SNPs (iSNPs).Entities:
Mesh:
Substances:
Year: 2006 PMID: 16916449 PMCID: PMC1559705 DOI: 10.1186/1471-2164-7-213
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The asymmetry pattern of A↔G and C↔T iSNPs and FFD SNPs
| 1411 (19.5%) | 1030 (14.2%) | 1200 (16.6%) | 1052 (14.5%) | 4693 (64.8%) | |
| 678 (9.4%) | 561 (7.8%) | 664 (9.2%) | 643 (8.9%) | 2546 (35.3%) | |
| χ2 = 3.2, | χ2 = 2.1, | ||||
| 2089 (28.9%) | 1591 (22.0%) | 1864 (25.8%) | 1695 (23.4%) | 7239 (100%) | |
| 148(6.2%) | 127(5.3%) | 207(8.7%) | 307(12.9%) | 789(33.2%) | |
| 316(13.3%) | 163(6.9%) | 654(27.5%) | 452(19.0%) | 1585(66.8%) | |
| χ2 = 10.9, | χ2 = 50.1, | ||||
| 464(19.5%) | 290(12.2%) | 861(36.3%) | 759(32.0%) | 2374(100.0%) | |
* χ2 test of the difference of complementary substitution between non-CpG site and CpG site.
The nucleotide composition of human genome introns and FFD codons
| 126,530,426 (28.0%) | 139,073,101 (30.7%) | 94,268,606 (20.8%) | 92,418,338 (20.4%) | 452,290,471 (100.0%) | |
| 575,500 (19.5%) | 615,324 (20.8%) | 822,103 (27.8%) | 939,137 (31.8%) | 2,952,064 (100.0%) |
* The first introns as well as the first and last 200 bp of each intron were excluded;
Figure 1The proportion of each type of A↔G and C↔T substitution (the ancestral allele vs. the recent allele of 7,239 iSNPs and 2,374 FFD SNPs. (a) The proportion of each type of substitution corrected by nucleotide compositions. (b) The FFD SNP distribution corrected by FFD codon compositions. The non-CpG site sSNP distribution is corrected by four types of FFD CpG codons, i.e. NDA, NNT|H, NDG, and NNC|H (D represents A or G or T, and H represents A or C or T). FFD CpG site SNP distribution is corrected by four types of FFD CpG codon compositons, i.e., NCA, NNT|G, NCG, and NNC|G.
The proportions of A→G and C→T FFD substitutions corrected by codon compositions
| A→G | 148 | NDA | 265,987 | 0.350 | χ2 = 13.6, | A→G vs. T→C 1.56 (1.23, 1.97) | |
| T→C | 127 | NNT|H | 355,455 | 0.225 | |||
| G→A | 207 | NDG | 689,990 | 0.189 | χ2 = 6.2, | C→T vs. G→A 1.25 (1.05, 1.49) | |
| C→T | 307 | NNC|H | 818,126 | 0.236 | |||
| A→G | 316 | NCA | 309,513 | 0.099 | χ2 = 26.0, | A→G vs. T→C 1.63 (1.35, 1.97) | |
| T→C | 163 | NNT|G | 259,869 | 0.061 | |||
| G→A | 654 | NCG | 132,113 | 0.479 | χ2 = 21.3, | C→T vs. G→A 0.75 (0.67, 0.85) | |
| C→T | 452 | NNC|G | 121,011 | 0.361 |
* N represents A or C or G or T, D represents A or G or T, and H represents A or C or T
Decreased FFD SNPs at A|T dinucleotides
| G→A | 161 | 46 | χ2 = 7.9, | |
| A→G | 95 | 53 | ||
| G→A | 505 | 149 | χ2 = 19.1, | |
| A→G | 202 | 114 |