| Literature DB >> 25780924 |
Luis Peñarrubia1, Nuria Sanz1, Carles Pla1, Oriol Vidal1, Jordi Viñas1.
Abstract
The zebra mussel (Dreissena polymorpha, Pallas, 1771) is one of the most invasive species of freshwater bivalves, due to a combination of biological and anthropogenic factors. Once this species has been introduced to a new area, individuals form dense aggregations that are very difficult to remove, leading to many adverse socioeconomic and ecological consequences. In this study, we identified, tested, and validated a new set of polymorphic microsatellite loci (also known as SSRs, Single Sequence Repeats) using a Massive Parallel Sequencing (MPS) platform. After several pruning steps, 93 SSRs could potentially be amplified. Out of these SSRs, 14 were polymorphic, producing a polymorphic yield of 15.05%. These 14 polymorphic microsatellites were fully validated in a first approximation of the genetic population structure of D. polymorpha in the Iberian Peninsula. Based on this polymorphic yield, we propose a criterion for establishing the number of SSRs that require validation in similar species, depending on the final use of the markers. These results could be used to optimize MPS approaches in the development of microsatellites as genetic markers, which would reduce the cost of this process.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25780924 PMCID: PMC4364119 DOI: 10.1371/journal.pone.0120732
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Experimental design and protocol pipeline.
Gray steps: Laboratory protocols. White steps: 454 GS System. Black steps: Bioinformatic analysis.
Summary of MPS results and of polymorphic microsatellites identification.
| Output description | Default assembly | Modified assembly | ||
|---|---|---|---|---|
| Phase A | Sequencing step | Reads number | 110,593 | |
| Largest read size (bp) | 691 | |||
| Shorter read size (bp) | 24 | |||
| Reads mean size (bp) | 331 | |||
| Sequenced bases (Kb) | 36,555 | |||
| Assembling step | Aligned reads (of the total right reads) | 31,330 (28.33%) | 35,374 (31.99%) | |
| Aligned bases (Kb) (of the sequenced bases) | 7,620 (20.84%) | 9,030 (24.71%) | ||
| Assembled reads | 18,758 | 20,570 | ||
| Contigs number | 2,326 | 3,885 | ||
| Bases number into contigs (kb) | 1,063 | 1,635 | ||
| Largest contig size (bp) | 8,697 | 8,717 | ||
| Shorter contig size (bp) | 100 | 100 | ||
| Contig mean size (bp) | 457 | 421 | ||
| Contig N50 value (bp) | 825 | 860 | ||
| Singletons number | 70,208 | 68,463 | ||
| Phase B | Initial | - | 288 | |
| Potential Amplifiable Loci (% initial SSR) | - | 93 (32.29%) | ||
| Phase C | Positive PCR amplification (% initial SSR) | - | 81 (28.13%) | |
| Polymorphic markers (% SSR) | - | 14 (4.86%) | ||
Phase A: MPS data processing; Phase B: SSR isolation and primer design; Phase C: Microsatellite validation
Polymorphic microsatellites description.
| Locus | Repeat motif |
| Primer Sequences 5’ → 3’ |
|
|
|
|
|
| GenBank Accession |
|---|---|---|---|---|---|---|---|---|---|---|
|
| [ATA]21 | 279–299 |
| 60 | 5 | 0.83 / 0.81 | 0.76 | No | No | JQ812984 |
|
| ||||||||||
|
| [TGA]17 | 410–440 |
| 60 | 9 | 0.20 / 0.66 | 0.62 | Yes | No | JQ812985 |
|
| ||||||||||
|
| [TAA]11 4 [ATT]5 48 [TAT]11 71 [ATA]14 | 411–416 |
| 50 | 2 | 0.28 / 0.49 | 0.37 | No | No | JQ812986 |
|
| ||||||||||
|
| [TTG]7 57 [GTT]6 | 214–234 |
| 60 | 6 | 0.58 / 0.67 | 0.62 | No | No | JQ812987 |
|
| ||||||||||
|
| [ATT]13 | 231–306 |
| 50 | 13 | 0.79 / 0.86 | 0.83 | No | No | JQ812988 |
|
| ||||||||||
|
| [GGCG]11 | 385–393 |
| 50 | 3 | 0.56 / 0.66 | 0.58 | No | No | JQ812989 |
|
| ||||||||||
|
| [GTTG]9 | 243–256 |
| 50 | 3 | 0.60 / 0.59 | 0.52 | No | No | JQ812990 |
|
| ||||||||||
|
| [TTA]10 | 224–259 |
| 50 | 5 | 0.34 / 0.38 | 0.35 | Yes | No | JQ812991 |
|
| ||||||||||
|
| [GACC]8 | 139–153 |
| 50 | 5 | 0.60 / 0.67 | 0.60 | No | No | JQ812992 |
|
| ||||||||||
|
| [TGTTC]5 | 291–297 |
| 50 | 3 | 0.23 / 0.21 | 0.19 | No | No | JQ812993 |
|
| ||||||||||
|
| [GGTA]8 | 378–382 |
| 50 | 2 | 0.31 / 0.41 | 0.32 | Yes | No | JQ812994 |
|
| ||||||||||
|
| [CGTC]9 | 332–360 |
| 60 | 3 | 0.57 / 0.58 | 0.49 | No | No | JQ812995 |
|
| ||||||||||
|
| [CGTC]5 | 306–315 |
| 50 | 4 | 0.73 / 0.71 | 0.65 | No | No | JQ812996 |
|
| ||||||||||
|
| [CGTC]8 | 260–284 |
| 50 | 4 | 0.56 / 0.53 | 0.42 | No | No | JQ812997 |
|
|
Repeat motif, size range in base pairs (S), forward (F) and reverse (R) (5′– 3′) sequences, annealing temperatures in°C (Ta), number of alleles (A), Observed (H ) and Expected (H ) heterozygosities, PIC index, presence of null alleles (NA), linkage disequilibrium (LD) and GenBank accession number for the 14 polymorphic microsatellites loci tested on 8 individuals for each of the 6 populations (n = 48) in D. polymorpha.
Microsatellites described in Dreissena polymorpha.
| Initial SSR identified | PAL loci | Polymorphic microsatellites | Polymorphic yield | Reference | ||
|---|---|---|---|---|---|---|
| (% of initial SSR) | ||||||
| Traditional techniques | 65 | 16 (24.62%) | 5 | 31.25% | [ | |
| 48 | 8 (16.67%) | 8 | 100.00% | [ | ||
| 28 | 8 (28.57%) | 5 | 62.50% | [ | ||
| Mean | 47 | 10.67 (23.40%) | 6 | 64.58% | ||
|
| 18.52 | 4.62 | 1.73 | 34.42 | ||
| This study | 288 | 93 (32.29%) | 14 | 15.05% | This study |
Initial SSR identified, PAL loci, Polymorphic microsatellites, and Polymorphic yield in all studies describing microsatellite loci in D. polymorpha. SD: Standard deviation. PAL: Potential Amplifiable Loci. Polymorphic yield is the ratio between the numbers of polymorphic microsatellites divided by the number of PCR validated loci.
Microsatellite validation effort depending of the application.
| Genetic application using microsatellite markers |
| Range | Mean ( | Median | Initial SSR markers to be validated |
|---|---|---|---|---|---|
| Genetic mapping | 19 | 31–2000 | 337.89 (442.37) | 209 | 1388.70 |
| Individual DNA identification and parentage assignment | 15 | 1–14 | 7.73 (4.03) | 8 | 53.16 |
| Phylogeny, population and conservation genetics | 30 | 1–33 | 8.77 (6.32) | 7 | 46.51 |
| Molecular epidemiology and pathology | 26 | 1–227 | 26.15 (49.56) | 9 | 59.80 |
| Quantitative trait loci mapping | 13 | 15–428 | 96.54 (77.38) | 77 | 511.63 |
| Marker-assisted selection | 6 | 1–7 | 4.17 (1.94) | 4 | 26.58 |
Studies analyzed (n), SSR markers used and validation effort to get markers required in each genetic application using microsatellite markers. Validation effort was determined using the median values of the number of SSR used in each microsatellite-containing analysis according with 104 studies revised in Chistiakov et al. [24]. The number of locus required to be validated were calculated using the median of SSR analyzed and the polymorphism yield (15.05%) calculated in this study.