| Literature DB >> 27547349 |
Abhijeet B Shah1, Holger Schielzeth2, Andreas Albersmeier3, Joern Kalinowski3, Joseph I Hoffman1.
Abstract
Despite recent advances in high-throughput sequencing, difficulties are often encountered when developing microsatellites for species with large and complex genomes. This probably reflects the close association in many species of microsatellites with cryptic repetitive elements. We therefore developed a novel approach for isolating polymorphic microsatellites from the club-legged grasshopper (Gomphocerus sibiricus), an emerging quantitative genetic and behavioral model system. Whole genome shotgun Illumina MiSeq sequencing was used to generate over three million 300 bp paired-end reads, of which 67.75% were grouped into 40,548 clusters within RepeatExplorer. Annotations of the top 468 clusters, which represent 60.5% of the reads, revealed homology to satellite DNA and a variety of transposable elements. Evaluating 96 primer pairs in eight wild-caught individuals, we found that primers mined from singleton reads were six times more likely to amplify a single polymorphic microsatellite locus than primers mined from clusters. Our study provides experimental evidence in support of the notion that microsatellites associated with repetitive elements are less likely to successfully amplify. It also reveals how advances in high-throughput sequencing and graph-based repetitive DNA analysis can be leveraged to isolate polymorphic microsatellites from complex genomes.Entities:
Keywords: Acrididae; Gomphocerus sibiricus; Orthoptera; genetic marker development; high‐throughput sequencing; microsatellite; transposable elements
Year: 2016 PMID: 27547349 PMCID: PMC4983586 DOI: 10.1002/ece3.2305
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Flowchart detailing the bioinformatic pipeline used to identify polymorphic microsatellites in the club‐legged grasshopper, Gomphocerus sibricus.
Figure 2Results of RepeatExplorer analysis showing the numbers of reads classified as clusters or singletons.
Figure 3A summary of the RepeatMasker analysis showing the number of reads annotated for repeative elements with at least 550 reads using a metazoan repeat database.
In vitro verification of the primer pairs. Shown are the polymorphism characteristics of 17 microsatellite loci that amplified clearly interpretable and polymorphic PCR products in eight unrelated Gomphocerus sibricus individuals
| Locus | Repeat motif | Tandem repeats | Forward primer | Reverse primer | Number of alleles |
|
| HWE |
|---|---|---|---|---|---|---|---|---|
| Gsib01 | TC | 16 | AGAGGGAGACAGATAGACGGC | TTCCACACTTTTAAGACTGAATGC | 10 | 1.00 | 0.93 | 1.00 |
| Gsib02 | TC | 10 | CTGATTCACAGATAGGGGCG | GTCCATATCCTCCTCCCTCC | 5 | 0.50 | 0.82 | 0.09 |
| Gsib07 | AC | 8 | ACACACAACTGCAAACTCCG | TCTTCAGAAAAGATCTCTCCCC | 11 | 1.00 | 0.93 | 1.00 |
| Gsib08 | TC | 8 | AGAGACCACAGGCAGAGAGC | CCCTTTATTGATCGCAAAGC | 2 | 0.17 | 0.53 | 0.15 |
| Gsib13 | TC | 21 | TGAAATCCATGTAGCATCGC | CGGACTTCAACGAAGATTCC | 9 | 0.88 | 0.12 | 0.88 |
| Gsib16 | TC | 8 | TGTGCGATCTTACTCGACCC | GGCCACTTCTTTGTCAGAGC | 6 | 0.38 | 0.86 | 0.01 |
| Gsib18 | TC | 11 | AAGGGAGAAGGAAGACGTGC | GAGAAACATGATGTCGACCG | 8 | 0.75 | 0.91 | 0.08 |
| Gsib19 | ATC | 10 | TCTATGCTCCAGACGGAACG | CAGACATGAAGCCAAAACCC | 6 | 0.88 | 0.82 | 0.57 |
| Gsib21 | ATC | 9 | ACACAAAATATTCCGTGCCC | GACTTACACCAGGTAGGGCG | 3 | 0.50 | 0.66 | 0.38 |
| Gsib24 | ATC | 9 | AGTCTAACGGCCAGAAATGC | TAGTTTTGGCGAAGGAGTCG | 3 | 0.75 | 0.67 | 1.00 |
| Gsib28 | ATT | 8 | ATGTTCATGGTGACAATGCC | CCCCTCACAGGTTATCTTTGC | 2 | 0.25 | 0.50 | 0.22 |
| Gsib29 | ATT | 8 | TCTAGAACCTTTGGTCTGTTGC | ACGAATGTCCCAAGAACAGG | 3 | 0.12 | 0.61 | 0.01 |
| Gsib32 | TCC | 10 | CTACCTTCCTCCTATCGCCC | ATGTGGTTCCCTGTTTCTGC | 6 | 0.75 | 0.84 | 0.38 |
| Gsib35 | ATC | 10 | TATGCTGCAATATGCTTGGC | TCCTCACAAGTGCAGAATGC | 3 | 0.50 | 0.42 | 1.00 |
| Gsib42 | ATAC | 9 | GAGGCTGTAGCCATTTCTCG | GTCTTCACTTCCCATGAGGC | 3 | 0.17 | 0.71 | 0.01 |
| Gsib45 | AAAG | 8 | CAAGGCCACAGTTAAGGAGG | AATGTCTGTGAAATATTACGTGCC | 3 | 0.62 | 0.67 | 0.66 |
| Gsib46 | AATC | 9 | TATTGCCTCTGAATCTGCCC | ATATAGCTGTCCTCAGCGCC | 2 | 1.00 | 0.53 | 0.03 |
HWE, Hardy–Weinberg equilibrium.
Polymorphism characteristics of 20 microsatellite loci that were multiplexed and amplified in 40 unrelated Gomphocerus sibricus individuals. The initial PCR mixes (1 and 2) were modified to minimize interference between loci, resulting in mixes 1a, 1b, 2a, and 2b. * denotes HWE tests that remained significant after table‐wide false discovery rate correction for multiple statistical testing
| Locus | Mix | Dye | Size range (bp) | Number of alleles | HO | HE | HWE | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1a | 1b | 2 | 2a | 2b | |||||||
| Gsib01 | x | x | FAM | 99–151 | 22 | 0.53 | 0.93 | <0.0001* | ||||
| Gsib02 | x | x | x | PET | 244–255 | 10 | 0.64 | 0.82 | 0.0517 | |||
| Gsib03 | x | FAM | 280–284 | 2 | 0.11 | 0.10 | 1.00 | |||||
| Gsib07 | x | x | FAM | 93–132 | 19 | 0.64 | 0.89 | <0.0001* | ||||
| Gsib09 | x | x | x | NED | 182 | 1 | 1.00 | 1.00 | NA | |||
| Gsib13 | x | x | x | PET | 167–232 | 17 | 0.38 | 0.90 | <0.0001* | |||
| Gsib14 | x | x | x | VIC | 119–129 | 5 | 0.21 | 0.71 | <0.0001* | |||
| Gsib16 | x | FAM | 188–232 | 14 | 0.72 | 0.86 | 0.1961 | |||||
| Gsib17 | x | x | FAM | 180–212 | 11 | 0.24 | 0.68 | <0.0001* | ||||
| Gsib18 | x | x | FAM | 162–223 | 22 | 0.74 | 0.93 | <0.0001* | ||||
| Gsib19 | x | FAM | 236–276 | 12 | 0.74 | 0.91 | 0.0304 | |||||
| Gsib21 | x | x | x | VIC | 171–186 | 6 | 0.36 | 0.68 | <0.0001* | |||
| Gsib23 | x | FAM | 232–253 | 5 | 0.29 | 0.74 | 0.0001* | |||||
| Gsib24 | x | x | x | VIC | 157–169 | 3 | 0.44 | 0.52 | 0.2274 | |||
| Gsib28 | x | x | x | NED | 223–235 | 5 | 0.37 | 0.53 | 0.0937 | |||
| Gsib29 | x | x | x | PET | 234–265 | 5 | 0.11 | 0.61 | <0.0001* | |||
| Gsib32 | x | x | x | VIC | 202–230 | 11 | 0.67 | 0.86 | 0.029 | |||
| Gsib35 | x | x | x | VIC | 204–226 | 3 | 0.38 | 0.32 | 0.706 | |||
| Gsib36 | x | x | x | PET | 154–170 | 8 | 0.28 | 0.80 | <0.0001* | |||
| Gsib45 | x | x | x | NED | 220–232 | 5 | 0.63 | 0.67 | 0.2345 | |||
HWE, Hardy–Weinberg equilibrium; HO, observed heterozygosity; HE, expected heterozygosity.