| Literature DB >> 23208373 |
Liang-Wei Xiong1, Qun Wang, Gao-Feng Qiu.
Abstract
Microsatellites are simple sequence repeats with a high degree of polymorphism in the genome; they are used as DNA markers in many molecular genetic studies. Using traditional methods such as the magnetic beads enrichment method, only a few microsatellite markers have been isolated from the Chinese mitten crab Eriocheir sinensis, as the crab genome sequence information is unavailable. Here, we have identified a large number of microsatellites from the Chinese mitten crab by taking advantage of Solexa genomic surveying. A total of 141,737 SSR (simple sequence repeats) motifs were identified via analysis of 883 Mb of the crab genomic DNA information, including mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeat motifs. The number of di-nucleotide repeat motifs was 82,979, making this the most abundant type of repeat motif (58.54%); the second most abundant were the tri-nucleotide repeats (42,657, 30.11%). Among di-nucleotide repeats, the most frequent repeats were AC motifs, accounting for 67.55% of the total number. AGG motifs were the most frequent (59.32%) of the tri-nucleotide motifs. A total of 15,125 microsatellite loci had a flanking sequence suitable for setting the primer of a polymerase chain reaction (PCR). To verify the identified SSRs, a subset of 100 primer pairs was randomly selected for PCR. Eighty two primer sets (82%) produced strong PCR products matching expected sizes, and 78% were polymorphic. In an analysis of 30 wild individuals from the Yangtze River with 20 primer sets, the number of alleles per locus ranged from 2--14 and the mean allelic richness was 7.4. No linkage disequilibrium was found between any pair of loci, indicating that the markers were independent. The Hardy-Weinberg equilibrium test showed significant deviation in four of the 20 microsatellite loci after sequential Bonferroni corrections. This method is cost- and time-effective in comparison to traditional approaches for the isolation of microsatellites.Entities:
Mesh:
Year: 2012 PMID: 23208373 PMCID: PMC3546693 DOI: 10.3390/ijms131216333
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The percentages of mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeats in motif sequences from the Chinese mitten crab.
Figure 2Observed counts of identified microsatellite loci for different repeat sequence motifs of (A) di- nucleotide and (B) tri-nucleotide repeats from the Chinese mitten crab.
Figure 3Observed number of microsatellites with mono-, di-, tri-, tetra-, penta- and hexa-nucleotide motifs in 883 Mb sequence.
Frequency of simple sequence repeats (SSRs) in Eriocheir sinensis.
| Motif length | Repeat numbers | Total number | % | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4–5 | 6–7 | 8–9 | 10–11 | 12–13 | 14–15 | 16–17 | 18–19 | 20–21 | 22–23 | 24–25 | >25 | |||
| Mono- | 184 | 100 | 149 | 433 | 0.31 | |||||||||
| Di- | 38990 | 15208 | 6239 | 3541 | 3052 | 2830 | 2441 | 2070 | 8608 | 82979 | 58.54 | |||
| Tri- | 30358 | 7936 | 2189 | 969 | 497 | 264 | 151 | 73 | 46 | 192 | 42675 | 30.11 | ||
| Tetra- | 9278 | 1093 | 208 | 61 | 8 | 2 | 6 | 1 | 7 | 3 | 1 | 10668 | 7.53 | |
| Penta- | 2403 | 909 | 125 | 33 | 16 | 3 | 2 | 2 | 1 | 1 | 1 | 3496 | 2.47 | |
| Hexa- | 1460 | 23 | 2 | 1 | 1486 | .05 | ||||||||
| Total | 3863 | 10210 | 31578 | 47168 | 17474 | 7219 | 4042 | 3324 | 2983 | 2706 | 2219 | 8951 | 141737 | 100 |
| % | 2.73 | 7.20 | 22.28 | 33.28 | 12.33 | 5.09 | 2.85 | 2.35 | 2.10 | 1.91 | 1.57 | 6.32 | ||
Characteristics of 20 microsatellite loci in Eriocheir sinensis, tested with 30 samples.
| Locus | Primer sequence (5′–3′) | Repeat motif | Number of alleles | Allele size range (bp) | GeneBank Accession No. | ||||
|---|---|---|---|---|---|---|---|---|---|
| F:GATAGACCGTAAATGAGACGGCTG | (GGA)9 | 63 | 8 | 151–174 (157) | 0.643 | 0.759 | 0.130 | KC143114 | |
| F:GGATTTACTTAAGTTGGGGCTCGT | (GAG)8 | 63 | 10 | 119–161 (146) | 0.750 | 0.918 | 0.057 | KC143100 | |
| F:CAGCGAAAAACAGGAAGCATTTAG | (AC)11 | 63 | 12 | 140–212 (166) | 0.793 | 0.950 | 0.000 | KC143117 | |
| F:TTCTTTGAGCGACATGCAAAAGT | (TG)29 | 62 | 14 | 132–192 (150) | 0.950 | 0.971 | 0.100 | KC143110 | |
| F:TAGGGGGTTTTAGGTGTGGTGATA | (TGA)9 | 62 | 8 | 125–167 (143) | 0.658 | 0.855 | 0.361 | KC143115 | |
| F:CAACCACTACAACTATCAAAACCACC | (CA)26 | 62 | 9 | 91–181 (109) | 0.792 | 0.897 | 0.151 | KC143108 | |
| F:TAACCTAAACAGCAACAGCAGCAA | (AGT)8 | 63 | 4 | 128–146 (134) | 0.567 | 0.693 | 0.251 | KC143118 | |
| F:TGTTGAGTGTGATGTTTGTGATGC | (GT)12 | 65 | 11 | 160–226 (196) | 0.958 | 0.935 | 0.001 | KC143116 | |
| F:TGCATATTGTTGTTTTTACTGACGTGT | (AC)18 | 63 | 12 | 140–188 (154) | 0.884 | 0.945 | 0.641 | KC143109 | |
| F:TACCTTTTTCAGGGTGAGTGAAGG | (GT)23 | 63 | 7 | 138–208 (160) | 0.865 | 0.917 | 0.317 | KC143102 | |
| F:ATGTTTATTTTCACAACGCGAAGC | (ACAT)6 | 63 | 4 | 157–177 (157) | 0.583 | 0.732 | 0.002 | KC143103 | |
| F:ACCCATCTCAAGTCCAGACTCATC | (CCT)8 | 63 | 7 | 152–170 (161) | 0.700 | 0.784 | 0.113 | KC143113 | |
| F:AAGGAAGGCAGTTAGGAGGGTATG | (AC)11 | 65 | 8 | 189–259 (221) | 0.668 | 0.950 | 0.147 | KC143106 | |
| F:TGTTGTGTTGTCATGTCTTGTCTTTT | (TCTCA)5 | 62 | 2 | 142–147 (142) | 0.420 | 0.510 | 0.000 | KC143119 | |
| F:TCACCCCTTACTGAGCATAACACA | (CA)12 | 63 | 5 | 93–109 (105) | 0.736 | 0.773 | 0.250 | KC143105 | |
| F:TCCTCCCTATGCTCTTTGTAGGTG | (AC)11 | 63 | 8 | 150–192 (156) | 0.699 | 0.786 | 0.115 | KC143111 | |
| F:CAGCATGTCCAGTCTCTTCTGTGT | (GT)15 | 65 | 6 | 141–159 (145) | 0.772 | 0.864 | 0.054 | KC143107 | |
| F: TGGCATTGATTGATGTGAGTAGTG | (GGT)10 | 62 | 5 | 85–106 (94) | 0.326 | 0.551 | 0.269 | KC143104 | |
| F:CAGACCCTCCCGATGATACACTAC | (ATCT)8 | 63 | 4 | 133–155 (143) | 0.563 | 0.668 | 0.531 | KC143112 | |
| F: GAGATGGAGGTAGATGATCGAGGA | (GAAG)7 | 63 | 4 | 120–144 (132) | 0.447 | 0.532 | 0.516 | KC143101 |
annealing temperature;
allele size (size of sequenced allele);
p-values of test for deviation from Hardy-Weinberg equilibrium;
microsatellite loci revealed the presence of null alleles with MICRO-CHEKER 2.2.3.