| Literature DB >> 17573970 |
Abstract
BACKGROUND: Microsatellites (repeated subsequences based on motifs of one to six nucleotides) are widely used as codominant genetic markers because of their frequent polymorphism and relative selective neutrality. Minisatellites are repeats of motifs having seven or more nucleotides. The large number of EST sequences now available in public databases offers an opportunity to compare microsatellite and minisatellite properties and evaluate their evolution over a broad range of plant taxa.Entities:
Mesh:
Year: 2007 PMID: 17573970 PMCID: PMC1940011 DOI: 10.1186/1471-2164-8-173
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Abundance of microsatellites and minisatellites versus motif length, with zero, one, or two allowed mismatched repeat positions.
Mean microsatellite frequency, repeat count, and polymorphism by motif length
| Mismatches | Type | Motif length | |||||
| 1 | 2 | 3 | 4 | 5 | 6 | ||
| 0 | simpfreq | 13.574 | 24.427 | 50.259 | 7.826 | 14.244 | 17.152 |
| 0 | compfreq | 67.573 | 114.02 | 230.762 | 517.736 | 203.929 | 144.361 |
| 0 | totfreq | 81.147 | 138.447 | 281.02 | 525.563 | 218.174 | 161.515 |
| 0 | fracsimp | 0.155 | 0.106 | 0.153 | 0.012 | 0.054 | 0.099 |
| 0 | simprepe | 18.701 | 11.593 | 5.786 | 4.39 | 3.227 | 3.161 |
| 0 | comprepe | 10.287 | 5.018 | 3.363 | 2.038 | 2.034 | 2.066 |
| 0 | simppoly | 0.092 | 0.135 | 0.167 | 0.174 | 0.151 | 0.185 |
| 0 | comppoly | 0.095 | 0.112 | 0.141 | 0.113 | 0.121 | 0.144 |
| 1 | simpfreq | 23.588 | 27.15 | 79.092 | 16.725 | 33.478 | 19.782 |
| 1 | compfreq | 132.425 | 135.595 | 268.245 | 146.506 | 3001.555 | 1474.271 |
| 1 | totfreq | 156.013 | 162.745 | 347.337 | 163.232 | 3035.033 | 1494.054 |
| 1 | fracsimp | 0.141 | 0.124 | 0.223 | 0.1 | 0.009 | 0.011 |
| 1 | simprepe | 18.417 | 10.901 | 5.759 | 4.248 | 3.122 | 3.147 |
| 1 | comprepe | 12.317 | 6.526 | 4.533 | 3.134 | 2.007 | 2.018 |
| 1 | simppoly | 0.102 | 0.126 | 0.159 | 0.165 | 0.147 | 0.185 |
| 1 | comppoly | 0.112 | 0.125 | 0.157 | 0.125 | 0.128 | 0.138 |
| 2 | simpfreq | 16.906 | 34.942 | 45.113 | 11.936 | 17.453 | 20.905 |
| 2 | compfreq | 2.928 | 5.148 | 5.326 | 1.259 | 1.796 | 3.169 |
| 2 | totfreq | 19.834 | 40.091 | 50.439 | 13.195 | 19.249 | 24.074 |
| 2 | fracsimp | 0.879 | 0.832 | 0.903 | 0.919 | 0.93 | 0.879 |
| 2 | simprepe | 24.525 | 13.555 | 8.206 | 5.495 | 4.297 | 4.213 |
| 2 | comprepe | 27.141 | 15.596 | 8.785 | 5.695 | 4.172 | 3.728 |
| 2 | simppoly | 0.086 | 0.145 | 0.169 | 0.181 | 0.176 | 0.205 |
| 2 | comppoly | 0.093 | 0.167 | 0.199 | 0.184 | 0.191 | 0.159 |
Spearman rank correlation between features in relation to microsatellite motif length
| x | y | mismatches | 3-mer (8 d.f.) | 4-mer (31 d.f.) | 5-mer (99 d.f.) | 6-mer (348 d.f.) | all (499 d.f.) |
| compfreq | simpfreq | 0 | 0.915*** | 0.714*** | 0.860*** | 0.789*** | 0.840*** |
| compfreq | simpfreq | 1 | 0.964*** | 0.900*** | 0.901*** | 0.693*** | 0.732*** |
| compfreq | simpfreq | 2 | 0.960*** | 0.809*** | 0.844*** | 0.755*** | 0.805*** |
| simprepe | comprepe | 0 | 0.736* | 0.19 | 0.138 | 0.390*** | 0.313*** |
| simprepe | comprepe | 1 | 0.467 | 0.556*** | 0.183 | 0.519*** | 0.660*** |
| simprepe | comprepe | 2 | 0.758* | 0.494** | 0.067 | 0.370*** | 0.488*** |
| simppoly | comppoly | 0 | 0.024 | 0.029 | 0.201* | 0.068 | 0.127** |
| simppoly | comppoly | 1 | 0.701* | 0.452** | 0.291** | 0.160** | 0.222*** |
| simppoly | comppoly | 2 | 0.284 | -0.217 | 0.037 | 0.143** | 0.090* |
| simpfreq | simprepe | 0 | 0.079 | 0.327 | -0.053 | 0.486*** | 0.487*** |
| simpfreq | simprepe | 1 | 0.2 | 0.073 | 0.062 | 0.487*** | 0.404*** |
| simpfreq | simprepe | 2 | 0.03 | 0.303 | -0.159 | 0.315*** | 0.363*** |
| compfreq | comprepe | 0 | 0.517 | 0.167 | 0.236* | 0.296*** | 0.100* |
| compfreq | comprepe | 1 | 0.333 | 0.015 | 0.433*** | 0.204*** | -0.05 |
| compfreq | comprepe | 2 | -0.17 | 0.432* | 0.314** | 0.609*** | 0.601*** |
| simprepe | simppoly | 0 | -0.018 | 0.073 | 0.168 | 0.185*** | 0.136** |
| simprepe | simppoly | 1 | -0.255 | 0.263 | 0.055 | 0.130* | 0.115* |
| simprepe | simppoly | 2 | -0.122 | 0.181 | 0.087 | 0.228*** | 0.129** |
| comprepe | comppoly | 0 | -0.25 | 0.534** | -0.049 | -0.029 | 0.068 |
| comprepe | comppoly | 1 | 0.105 | 0.298 | -0.099 | 0.201*** | 0.142** |
| comprepe | comppoly | 2 | -0.614 | 0.082 | 0.292** | 0.434*** | 0.394*** |
| *, p <= 0.05; **, p <= 0.01; ***, p <= 0.001 |
Mean frequency, repeat count, and polymorphism by motif length and GC content in perfect microsatellites
| Length | GC content | simpfreq | compfreq | totfreq | fracsimp | simprepe | comprepe | simppoly | comppoly |
| 3 | 0 | 3.464 | 16.996 | 20.46 | 0.144 | 6.254 | 3.412 | 0.168 | 0.115 |
| 0.333 | 5.178 | 24.711 | 29.889 | 0.142 | 5.864 | 3.369 | 0.166 | 0.142 | |
| 0.667 | 4.66 | 21.682 | 26.342 | 0.163 | 5.646 | 3.354 | 0.169 | 0.146 | |
| 1 | 7.442 | 28.194 | 35.636 | 0.167 | 5.564 | 3.323 | 0.158 | 0.146 | |
| 4 | 0 | 0.8 | 29.279 | 30.079 | 0.022 | 4.407 | 2.058 | 0.131 | 0.106 |
| 0.25 | 0.353 | 24.872 | 25.225 | 0.012 | 4.555 | 2.042 | 0.147 | 0.108 | |
| 0.5 | 0.158 | 12.031 | 12.189 | 0.013 | 4.317 | 2.039 | 0.195 | 0.121 | |
| 0.75 | 0.148 | 10.598 | 10.746 | 0.012 | 4.345 | 2.03 | 0.169 | 0.113 | |
| 1 | 0.085 | 9.502 | 9.587 | 0.003 | 4.373 | 2.018 | 0.217 | 0.091 | |
| 5 | 0 | 0.663 | 6.274 | 6.937 | 0.076 | 3.171 | 2.051 | 0.123 | 0.103 |
| 0.2 | 0.245 | 3.426 | 3.671 | 0.05 | 3.194 | 2.031 | 0.144 | 0.112 | |
| 0.4 | 0.118 | 2.056 | 2.174 | 0.051 | 3.242 | 2.03 | 0.149 | 0.121 | |
| 0.6 | 0.091 | 1.293 | 1.384 | 0.056 | 3.264 | 2.04 | 0.163 | 0.135 | |
| 0.8 | 0.083 | 1.084 | 1.167 | 0.055 | 3.174 | 2.033 | 0.146 | 0.109 | |
| 1 | 0.104 | 1.927 | 2.03 | 0.055 | 3.18 | 2.026 | 0.122 | 0.099 | |
| 6 | 0 | 0.099 | 1.074 | 1.173 | 0.052 | 3.093 | 2.054 | 0.108 | 0.109 |
| 0.167 | 0.048 | 0.674 | 0.721 | 0.046 | 3.085 | 2.05 | 0.162 | 0.122 | |
| 0.333 | 0.034 | 0.399 | 0.433 | 0.067 | 3.188 | 2.046 | 0.188 | 0.142 | |
| 0.5 | 0.047 | 0.39 | 0.436 | 0.098 | 3.202 | 2.065 | 0.188 | 0.147 | |
| 0.667 | 0.05 | 0.286 | 0.335 | 0.133 | 3.106 | 2.086 | 0.192 | 0.152 | |
| 0.833 | 0.081 | 0.46 | 0.541 | 0.151 | 3.202 | 2.092 | 0.187 | 0.151 | |
| 1 | 0.084 | 0.462 | 0.546 | 0.137 | 3.079 | 2.068 | 0.183 | 0.116 |
Mean frequency, repeat count, and polymorphism by motif length and GC content in microsatellites with one allowed mismatched repeat position
| Length | GC content | simpfreq | compfreq | totfreq | fracsimp | simprepe | comprepe | simppoly | comppoly |
| 3 | 0 | 4.578 | 17.571 | 22.148 | 0.197 | 5.912 | 4.614 | 0.151 | 0.123 |
| 0.333 | 7.923 | 27.527 | 35.45 | 0.211 | 5.789 | 4.508 | 0.156 | 0.157 | |
| 0.667 | 7.718 | 25.59 | 33.307 | 0.236 | 5.705 | 4.553 | 0.167 | 0.166 | |
| 1 | 11.949 | 38.208 | 50.157 | 0.246 | 5.698 | 4.472 | 0.144 | 0.158 | |
| 4 | 0 | 1.466 | 11.444 | 12.91 | 0.121 | 4.224 | 3.156 | 0.122 | 0.11 |
| 0.25 | 0.773 | 7.478 | 8.251 | 0.091 | 4.277 | 3.152 | 0.168 | 0.127 | |
| 0.5 | 0.366 | 3.009 | 3.375 | 0.105 | 4.25 | 3.13 | 0.19 | 0.13 | |
| 0.75 | 0.303 | 2.466 | 2.768 | 0.103 | 4.24 | 3.136 | 0.141 | 0.117 | |
| 1 | 0.217 | 2.475 | 2.692 | 0.067 | 4.179 | 3.064 | 0.13 | 0.132 | |
| 5 | 0 | 1.364 | 71.877 | 73.241 | 0.016 | 3.11 | 2.015 | 0.117 | 0.113 |
| 0.2 | 0.627 | 47.195 | 47.822 | 0.01 | 3.094 | 2.009 | 0.14 | 0.12 | |
| 0.4 | 0.295 | 31.143 | 31.438 | 0.008 | 3.123 | 2.007 | 0.15 | 0.129 | |
| 0.6 | 0.201 | 21.117 | 21.317 | 0.008 | 3.145 | 2.007 | 0.158 | 0.133 | |
| 0.8 | 0.173 | 17.65 | 17.823 | 0.008 | 3.115 | 2.007 | 0.138 | 0.13 | |
| 1 | 0.248 | 25.358 | 25.607 | 0.007 | 3.065 | 2.008 | 0.12 | 0.125 | |
| 6 | 0 | 0.019 | 6.452 | 6.471 | 0.003 | 3.096 | 2.009 | 0.141 | 0.11 |
| 0.167 | 0.039 | 6.483 | 6.523 | 0.005 | 3.136 | 2.01 | 0.155 | 0.118 | |
| 0.333 | 0.045 | 4.697 | 4.741 | 0.008 | 3.121 | 2.011 | 0.176 | 0.13 | |
| 0.5 | 0.061 | 4.169 | 4.231 | 0.012 | 3.18 | 2.017 | 0.194 | 0.142 | |
| 0.667 | 0.055 | 2.999 | 3.054 | 0.015 | 3.158 | 2.025 | 0.196 | 0.147 | |
| 0.833 | 0.094 | 3.778 | 3.871 | 0.019 | 3.199 | 2.031 | 0.191 | 0.146 | |
| 1 | 0.089 | 3.137 | 3.226 | 0.015 | 2.58 | 2.019 | 0.193 | 0.137 |
Mean frequency, repeat count, and polymorphism by motif length and GC content in microsatellites with two allowed mismatched repeat positions
| Length | GC content | simpfreq | compfreq | totfreq | fracsimp | simprepe | comprepe | simppoly | comppoly |
| 3 | 0 | 3.226 | 0.447 | 3.673 | 0.882 | 8.739 | 9.487 | 0.189 | 0.145 |
| 0.333 | 4.822 | 0.595 | 5.417 | 0.905 | 8.309 | 9.217 | 0.163 | 0.189 | |
| 0.667 | 4.037 | 0.483 | 4.521 | 0.896 | 8.051 | 8.418 | 0.172 | 0.214 | |
| 1 | 6.45 | 0.566 | 7.017 | 0.942 | 7.888 | 7.817 | 0.162 | 0.231 | |
| 4 | 0 | 1.107 | 0.101 | 1.208 | 0.917 | 5.497 | 5.513 | 0.133 | 0.152 |
| 0.25 | 0.577 | 0.074 | 0.651 | 0.919 | 5.585 | 5.878 | 0.191 | 0.169 | |
| 0.5 | 0.225 | 0.018 | 0.244 | 0.928 | 5.475 | 5.772 | 0.185 | 0.206 | |
| 0.75 | 0.237 | 0.024 | 0.262 | 0.909 | 5.504 | 5.578 | 0.174 | 0.176 | |
| 1 | 0.14 | 0.015 | 0.156 | 0.899 | 5.224 | 5.109 | 0.201 | 0.166 | |
| 5 | 0 | 0.763 | 0.073 | 0.836 | 0.918 | 4.188 | 4.283 | 0.129 | 0.144 |
| 0.2 | 0.314 | 0.035 | 0.348 | 0.925 | 4.276 | 4.333 | 0.176 | 0.185 | |
| 0.4 | 0.14 | 0.014 | 0.154 | 0.925 | 4.318 | 4.507 | 0.167 | 0.245 | |
| 0.6 | 0.11 | 0.009 | 0.119 | 0.939 | 4.341 | 3.825 | 0.19 | 0.142 | |
| 0.8 | 0.109 | 0.013 | 0.121 | 0.939 | 4.242 | 3.99 | 0.178 | 0.207 | |
| 1 | 0.143 | 0.022 | 0.165 | 0.879 | 4.12 | 4.299 | 0.168 | 0.137 | |
| 6 | 0 | 0.101 | 0.016 | 0.117 | 0.878 | 4.213 | 4.385 | 0.148 | 0.165 |
| 0.167 | 0.06 | 0.008 | 0.069 | 0.867 | 4.167 | 3.86 | 0.168 | 0.116 | |
| 0.333 | 0.042 | 0.006 | 0.048 | 0.889 | 4.19 | 3.392 | 0.195 | 0.115 | |
| 0.5 | 0.058 | 0.009 | 0.067 | 0.9 | 4.323 | 3.682 | 0.234 | 0.164 | |
| 0.667 | 0.056 | 0.01 | 0.067 | 0.839 | 4.103 | 3.846 | 0.206 | 0.19 | |
| 0.833 | 0.104 | 0.014 | 0.118 | 0.897 | 4.261 | 4.23 | 0.188 | 0.207 | |
| 1 | 0.102 | 0.01 | 0.112 | 0.916 | 4.137 | 3.57 | 0.143 | 0.219 |
Figure 2GC content versus motif length for tandemly repeated subsequences. There are separate traces for zero, one, and two allowed mismatched repeat positions.
Mean within-group Euclidean distance of genera for total motif frequency, mean repeat count, and polymorphism in perfect microsatellite loci
| Group | Mean Euclidean distance | ||
| Total frequency | Repeat count | Polymorphism | |
| Lower vascular plants: | 128.957 | 41.14 | 1.534** |
| 52.489** | 52.622 | 7.458 | |
| 53.881**** | 54.362 | 6.594 | |
| Other gymnosperms: | 127.215 | 33.696 | 2.375 |
| 252.658 | 44.591 | 4.749 | |
| 106.051 | 45.096 | 5.529 | |
| 117.741 | 51.308 | 6.615 | |
| 183.203 | 53.741 | 5.709 | |
| 126.63 | 47.548 | 5.882 | |
| 161.912 | 38.581 | 3.548 | |
| 62.281*** | 44.787 | 6.565 | |
| 87.183** | 43.732 | 4.179 | |
| 248.316 | 49.556 | 5.014 | |
| 179.427 | 49.285 | 5.018 | |
| 176.441 | 53.458 | 6.143 | |
| 205.429 | 47.349 | 5.285 | |
| 160.725 | 36.687 | 3.221 | |
| Other monocots: | 185.791 | 37.821 | 2.937 |
| Group 1: | 88.991 | 36.793 | 1.892** |
| Group 2: | 63.819**** | 41.682 | 3.425 |
| Group 3: | 73.179** | 52.924 | 5.982 |
| Group 4: | 152.077 | 43.471 | 3.194 |
| Group 5: | 85.727** | 43.56 | 4.838 |
| Group 6: | 113.958 | 49.053 | 6.698 |
| Group 7: | 51.524*** | 32.473 | 2.117 |
| Group 8: | 271.98 | 51.628 | 6.96 |
| Group 9: | 54.605 | 37.591 | 1.207 |
| Group 10: | 107.381 | 48.805 | 7.114 |
| Group 11: | 69.613** | 44.171 | 6.412 |
| Group 12: | 51.245**** | 42.076 | 4.515 |
| Group 13: | 104.739 | 35.615 | 2.156** |
| Group 14: | 134.97 | 44.042 | 6.047 |
| Group 15: | 177.011 | 49.198 | 6.187 |
| Group 16: | 95.789 | 39.884 | 2.992 |
Mean within-group Euclidean distance of genera for total motif frequency, mean repeat count, and polymorphism in microsatellite loci with one allowed mismatch
| Group | Mean Euclidean distance | ||
| Total frequency | Repeat count | Polymorphism | |
| Lower vascular plants: | 229.155 | 41.982 | 2.604 |
| 139.117 | 50.474 | 7.411 | |
| 131.475** | 51.5 | 6.603 | |
| Other gymnosperms: | 255.205 | 35.919 | 3.384 |
| 330.728 | 44.916 | 5.173 | |
| 181.561 | 46.964 | 5.931 | |
| 252.623 | 52.162 | 6.718 | |
| 287.749 | 48.202 | 6.216 | |
| 259.861 | 48.064 | 6.67 | |
| 128.443*** | 45.334 | 5.81 | |
| 261.175 | 36.538 | 3.962 | |
| 167.495** | 43.556 | 4.585 | |
| 403.092 | 47.223 | 5.678 | |
| 302.884 | 45.347 | 5.808 | |
| 297.79 | 47.58 | 6.977 | |
| 303.835 | 45.552 | 5.879 | |
| 270.545 | 35.907 | 4.317 | |
| Other monocots: | 218.631 | 37.964 | 3.657 |
| Group 1: | 164.376** | 35.474 | 3.313 |
| Group 2: | 165.313 | 51.348 | 6.861 |
| Group 3: | 238.256 | 39.407 | 3.454 |
| Group 4: | 145.794** | 46.777 | 6.325 |
| Group 5: | 184.201 | 42.602 | 5.33 |
| Group 6: | 152.767** | 43.374 | 6.477 |
| Group 7: | 114.516*** | 35.681 | 3.258 |
| Group 8: | 138.397** | 33.618 | 2.432*** |
| Group 9: | 361.635 | 47.201 | 7.402 |
| Group 10: | 141.561**** | 40.511 | 3.768 |
| Group 11: | 256.198 | 43.848 | 5.768 |
| Group 12: | 278.936 | 42.717 | 6.583 |
| Group 13: | 126.825** | 42.493 | 3.569 |
Mean within-group Euclidean distance of genera for total motif frequency, mean repeat count, and polymorphism in microsatellite loci with two allowed mismatches
| Group | Mean Euclidean distance | ||
| Total frequency | Repeat count | Polymorphism | |
| Lower vascular plants: | 36.903 | 56.8 | 1.411*** |
| 10.519**** | 69.929 | 7.196 | |
| 14.546*** | 67.788 | 6.506 | |
| Other gymnosperms: | 33.035 | 50.695 | 3.059 |
| 67.331 | 62.875 | 4.632 | |
| 39.626 | 60.01 | 5.567 | |
| 30.364 | 66.912 | 6.765 | |
| 23.864 | 59.554 | 6.267 | |
| 88.122 | 67.092 | 6.054 | |
| 13.763**** | 58.708 | 6.594 | |
| 46.143 | 48.69 | 3.589 | |
| 25.452** | 57.543 | 4.426 | |
| 50.746 | 63.157 | 5.731 | |
| 43.303 | 62.033 | 5.498 | |
| 49.094 | 65.451 | 6.753 | |
| 37.194 | 60.681 | 6.531 | |
| 42.527 | 44.446 | 3.14 | |
| Other monocots: | 131.502 | 50.358 | 3.087 |
| Group 1: | 69.26 | 54.761 | 3.502 |
| Group 2: | 23.313 | 43.599 | 3.106 |
| Group 3: | 14.049** | 57.31 | 8.335 |
| Group 4: | 15.270** | 60.331 | 6.408 |
| Group 5: | 31.587 | 62.644 | 6.379 |
| Group 6: | 24.786** | 51.956 | 3.878 |
| Group 7: | 30.045 | 59.185 | 6.25 |
| Group 8: | 17.538** | 58.194 | 8.405 |
| Group 9: | 15.427*** | 55.703 | 4.072 |
| Group 10: | 42.779 | 49.258 | 2.600** |
| Group 11: | 130.585 | 45.719 | 2.757 |
| Group 12: | 35.261 | 61.135 | 4.616 |
| Group 13: | 30.553 | 58.252 | 4.258 |
| Group 14: | 31.365 | 60.619 | 6.723 |
| Group 15: | 46.006 | 63.169 | 6.435 |
Figure 3Maximum likelihood tree based on perfect microsatellites with motifs from two through five bases long. The names have been truncated to 10 letters.
Figure 4Maximum likelihood tree based on microsatellites with one allowed mismatch and motifs of two through five bases.
Figure 5Maximum likelihood tree based on microsatellites with two allowed mismatches and motifs of two through five bases.
DNA content of genera with lowest total microsatellite frequencies
| Genus | Total frequency/MB | 2c DNA content (pg) | Reference |
| 647.59 | 5.45 | [28] | |
| 676.63 | 9.30 | [27] | |
| 741.21 | 25.50 | [26] | |
| 804.71 | Not available | ||
| 805.14 | 14.40 | [26] | |
| 822.56 | 19.90 | [26] | |
| 844.84 | 22.09 | [26] | |
| 848.09 | Not available | ||
| 851.01 | 33.50 | [28] | |
| 853.76 | 38.10 | [26] |
Figure 6Two examples of imperfect minisatellite loci.