| Literature DB >> 18226240 |
Andrew T M Bagshaw1, Joel P W Pitt, Neil J Gemmell.
Abstract
BACKGROUND: Microsatellites are highly abundant in eukaryotic genomes but their function and evolution are not yet well understood. Their elevated mutation rate makes them ideal markers of genetic difference, but high levels of unexplained heterogeneity in mutation rates among microsatellites at different genomic locations need to be elucidated in order to improve the power and accuracy of the many types of study that use them as genetic markers. Recombination could contribute to this heterogeneity, since while replication errors are thought to be the predominant mechanism for microsatellite mutation, meiotic recombination is involved in some mutation events. There is also evidence suggesting that microsatellites could function as recombination signals. The yeast S. cerevisiae is a useful model organism with which to further explore the link between microsatellites and recombination, since it is very amenable to genetic study, and meiotic recombination hotspots have been mapped throughout its entire genome.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18226240 PMCID: PMC2267716 DOI: 10.1186/1471-2164-9-49
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Total number of microsatellite repeats and percentage of regions with at least one repeat in the S. cerevisiae genome. The e value denotes the number of bases in any part of a repeat within which no more than one mismatch was allowed with respect to the consensus motif. A lower e value therefore results in the detection of more imperfect repeats.
| Motif length | Copy number | Mis-matches allowed | Hot (n = 473) | Non hot (n = 5520) | Hot (n = 297) | Non hot (n = 5683) | ||||
| No. of repeats | % of IGRs with a rpt. | No. of repeats | % of IGRs with a rpt. | No. of repeats | % of ORFs with a rpt. | No. of repeats | % of ORFs with a rpt | |||
| 6+ | perfect | 1277 | 83.1 | 12547 | 77.4 | 339 | 57.6 | 13556 | 74.7 | |
| e = 10 | 1236 | 82.2 | 12262 | 77.0 | 338 | 57.6 | 13495 | 74.8 | ||
| e = 6 | 1470 | 85.4 | 15153 | 82.2 | 437 | 64.3 | 17657 | 80.8 | ||
| 14+ | perfect | 79 | 15.6 | 409 | 6.99 | 4 | 1.35 | 30 | 0.475 | |
| e = 10 | 146 | 27.5 | 741 | 12.2 | 5 | 1.68 | 73 | 1.16 | ||
| e = 6 | 173 | 31.9 | 917 | 14.7 | 7 | 2.02 | 132 | 2.16 | ||
| 6+ | perfect | 33 | 6.55 | 241 | 4.09 | 32 | 10.4 | 474 | 7.80 | |
| e = 10 | 32 | 6.34 | 240 | 4.08 | 32 | 10.4 | 474 | 7.80 | ||
| e = 6 | 46 | 8.67 | 307 | 5.16 | 44 | 13.8 | 641 | 10.3 | ||
| 14+ | perfect | 2 | 0.423 | 2 | 0.0362 | 0 | 0 | 0 | 0 | |
| e = 10 | 2 | 0.423 | 2 | 0.0362 | 0 | 0 | 0 | 0 | ||
| e = 6 | 2 | 0.423 | 2 | 0.0362 | 0 | 0 | 0 | 0 | ||
| 6+ | perfect | 57 | 10.4 | 357 | 6.05 | 8 | 2.36 | 21 | 0.352 | |
| e = 10 | 100 | 18.7 | 668 | 11.1 | 15 | 4.38 | 137 | 2.32 | ||
| e = 6 | 130 | 23.5 | 1016 | 16.3 | 24 | 7.07 | 246 | 4.12 | ||
| 10+ | perfect | 19 | 3.81 | 117 | 2.08 | 3 | 1.01 | 6 | 0.106 | |
| e = 10 | 28 | 5.71 | 171 | 3.04 | 5 | 1.68 | 12 | 0.211 | ||
| e = 6 | 33 | 6.77 | 213 | 3.77 | 5 | 1.68 | 16 | 0.282 | ||
| 6+ | perfect | 7 | 1.27 | 27 | 0.435 | 8 | 2.36 | 165 | 2.46 | |
| e = 10 | 11 | 2.11 | 66 | 1.12 | 20 | 5.39 | 316 | 4.43 | ||
| e = 6 | 21 | 4.02 | 118 | 1.96 | 28 | 7.74 | 478 | 6.49 | ||
| 10+ | perfect | 1 | 0.211 | 8 | 0.145 | 0 | 0 | 29 | 0.493 | |
| e = 10 | 3 | 0.634 | 17 | 0.308 | 0 | 0 | 64 | 1.09 | ||
| e = 6 | 3 | 0.634 | 20 | 0.362 | 0 | 0 | 100 | 1.57 | ||
| 6+ | perfect | 0 | 0 | 5 | 0.0906 | 0 | 0 | 1 | 0.0176 | |
| e = 10 | 0 | 0 | 12 | 0.217 | 0 | 0 | 1 | 0.0176 | ||
| e = 6 | 0 | 0 | 19 | 0.344 | 0 | 0 | 2 | 0.0352 | ||
| 10+ | perfect | 0 | 0 | 1 | 0.0181 | 0 | 0 | 0 | 0 | |
| e = 10 | 0 | 0 | 1 | 0.0181 | 0 | 0 | 0 | 0 | ||
| e = 6 | 0 | 0 | 1 | 0.0181 | 0 | 0 | 0 | 0 | ||
| 6+ | perfect | 0 | 0 | 2 | 0.0362 | 0 | 0 | 0 | 0 | |
| e = 10 | 1 | 0.211 | 4 | 0.0725 | 0 | 0 | 3 | 0.0528 | ||
| e = 6 | 1 | 0.211 | 5 | 0.0906 | 0 | 0 | 4 | 0.0704 | ||
| 10+ | perfect | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| e = 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| e = 6 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.0176 | ||
| 6+ | perfect | 1 | 0.211 | 3 | 0.0543 | 0 | 0 | 3 | 0.0528 | |
| e = 10 | 1 | 0.211 | 21 | 0.326 | 2 | 0.673 | 15 | 0.246 | ||
| e = 6 | 1 | 0.211 | 10 | 0.181 | 4 | 1.35 | 11 | 0.176 | ||
| 10+ | perfect | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| e = 10 | 0 | 0 | 9 | 0.145 | 1 | 0.337 | 1 | 0.0176 | ||
| e = 6 | 0 | 0 | 4 | 0.0725 | 1 | 0.337 | 5 | 0.0704 | ||
Microsatellite types with a significant difference in frequency either between hot and non-hot IGRs, or hot and non-hot ORFs, in the S. cerevisiae genome. Significance was inferred where p < 0.0026, with the level of alpha adjusted for 19 independent classes of repeat using Bonferroni's correction. The Mann-Whitney U Test or T Test was used, depending whether samples were normally distributed. The e value denotes the number of bases in any part of a repeat within which no more than one mismatch was allowed with respect to the consensus motif. A lower e value therefore results in the detection of more imperfect repeats.
| Motif length | Copy number | Mis-matches allowed | Mean per kb freq. | Freq. ratio (hot/non hot) | P value | Mean per kb freq. | Freq. ratio (hot/non hot) | P value | ||
| Hot | Non hot | Hot | Non hot | |||||||
| 3 to 5 | perfect | 35.0 | 39.9 | 0.88 | < 0.0001 | 29.2 | 36.1 | 0.81 | < 0.0001 | |
| e = 10 | 34.3 | 39.4 | 0.87 | < 0.0001 | 29.1 | 36.0 | 0.81 | < 0.0001 | ||
| e = 6 | 31.8 | 36.7 | 0.87 | < 0.0001 | 28.0 | 34.7 | 0.81 | < 0.0001 | ||
| 6+ | perfect | 5.42 | 4.61 | 1.17 | < 0.0001 | 0.981 | 1.64 | 0.60 | < 0.0001 | |
| e = 10 | 5.24 | 4.50 | 1.16 | < 0.0001 | 0.978 | 1.64 | 0.60 | < 0.0001 | ||
| e = 6 | 6.12 | 5.53 | 1.11 | 0.00173 | 1.28 | 2.13 | 0.60 | < 0.0001 | ||
| 14+ | perfect | 0.418 | 0.171 | 2.45 | < 0.0001 | 0.0134 | 0.00733 | 1.83 | n/s | |
| e = 10 | 0.733 | 0.311 | 2.36 | < 0.0001 | 0.0182 | 0.0166 | 1.10 | n/s | ||
| e = 6 | 0.854 | 0.377 | 2.26 | < 0.0001 | 0.0218 | 0.0271 | 0.80 | n/s | ||
| 3 to 5 | perfect | 9.18 | 7.25 | 1.27 | < 0.0001 | 12.9 | 10.1 | 1.28 | < 0.0001 | |
| e = 10 | 9.16 | 7.24 | 1.27 | < 0.0001 | 12.9 | 10.1 | 1.28 | < 0.0001 | ||
| e = 6 | 8.89 | 7.13 | 1.25 | < 0.0001 | 12.6 | 9.93 | 1.27 | < 0.0001 | ||
| 6+ | e = 6 | 0.160 | 0.0931 | 1.72 | 0.00124 | 0.135 | 0.0798 | 1.69 | n/s | |
| 14+ | perfect | 0.0035 | 0.000725 | 4.83 | 0.00179 | 0 | 0 | n/a | n/a | |
| e = 10 | 0.0035 | 0.000725 | 4.83 | 0.00179 | 0 | 0 | n/a | n/a | ||
| e = 6 | 0.0035 | 0.000725 | 4.83 | 0.00179 | 0 | 0 | n/a | n/a | ||
| 3 to 5 | perfect | 4.67 | 3.96 | 1.18 | < 0.0001 | 1.82 | 1.74 | 1.05 | n/s | |
| e = 10 | 4.34 | 3.68 | 1.18 | 0.000266 | 1.78 | 1.71 | 1.04 | n/s | ||
| e = 6 | 6.17 | 5.52 | 1.12 | 0.000234 | 3.18 | 3.14 | 1.01 | n/s | ||
| 6+ | perfect | 0.368 | 0.196 | 1.88 | 0.000248 | 0.0405 | 0.00572 | 7.09 | n/s | |
| e = 10 | 0.599 | 0.356 | 1.68 | < 0.0001 | 0.0720 | 0.0229 | 3.14 | n/s | ||
| e = 6 | 0.797 | 0.529 | 1.51 | < 0.0001 | 0.109 | 0.0398 | 2.73 | n/s | ||
| 10+ | perfect | 0.158 | 0.0606 | 2.62 | n/s | 0.0163 | 0.00173 | 9.44 | < 0.0001 | |
| e = 10 | 0.221 | 0.0931 | 2.38 | 0.00164 | 0.0207 | 0.00275 | 7.53 | < 0.0001 | ||
| e = 6 | 0.252 | 0.109 | 2.32 | 0.00132 | 0.0159 | 0.00584 | 2.72 | < 0.0001 | ||
| 6+ | e = 6 | 0.00552 | 0.00341 | 1.62 | n/s | 0.0135 | 0.000877 | 15.40 | < 0.0001 | |
Figure 1Frequencies of high-copy, short-motif repeats in yeast intergenic regions. Mean microsatellite frequencies in S. cerevisiae IGRs divided according to DSB intensity into 473 hot, 89 cold and 5431 other regions, which were all IGRs not categorized as either hot or cold. Poly-AT arrays comprised the majority of dinucleotide repeats and are highlighted in grey. Error bars are plus and minus one SEM.