Literature DB >> 25360247

Validation of microsatellite multiplexes for parentage analysis and species discrimination in two hybridizing species of coral reef fish (Plectropomus spp., Serranidae).

Hugo B Harrison1, Kevin A Feldheim2, Geoffrey P Jones3, Kayan Ma4, Hicham Mansour5, Sadhasivam Perumal5, David H Williamson1, Michael L Berumen6.   

Abstract

Microsatellites are often considered ideal markers to investigate ecological processes in animal populations. They are regularly used as genetic barcodes to identify species, individuals, and infer familial relationships. However, such applications are highly sensitive the number and diversity of microsatellite markers, which are also prone to error. Here, we propose a novel framework to assess the suitability of microsatellite datasets for parentage analysis and species discrimination in two closely related species of coral reef fish, Plectropomus leopardus and P. maculatus (Serranidae). Coral trout are important fisheries species throughout the Indo-Pacific region and have been shown to hybridize in parts of the Great Barrier Reef, Australia. We first describe the development of 25 microsatellite loci and their integration to three multiplex PCRs that co-amplify in both species. Using simulations, we demonstrate that the complete suite of markers provides appropriate power to discriminate between species, detect hybrid individuals, and resolve parent-offspring relationships in natural populations, with over 99.6% accuracy in parent-offspring assignments. The markers were also tested on seven additional species within the Plectropomus genus with polymorphism in 28-96% of loci. The multiplex PCRs developed here provide a reliable and cost-effective strategy to investigate evolutionary and ecological dynamics and will be broadly applicable in studies of wild populations and aquaculture brood stocks for these closely related fish species.

Entities:  

Keywords:  Plectropomus spp; coral trout; hybridization; microsatellite multiplex; parentage analysis

Year:  2014        PMID: 25360247      PMCID: PMC4201420          DOI: 10.1002/ece3.1002

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


Introduction

Microsatellite loci are commonly used in ecology to measure genetic variability within and among populations (Hartl and Clark 1989; Slatkin 1995). Their high allelic diversity and relative ease of development also make them ideal for individual genotyping to assist in species identification (Guichoux et al. 2011a), to uniquely identify individuals (Lukacs and Burnham 2005), and to infer phylogenetic or genealogical relationships (Blouin 2003; Jones et al. 2010). In the marine environment, these genetic tools may be the only means to measure important ecological processes such as larval dispersal (Planes et al. 2009; Saenz-Agudelo et al. 2011; Berumen et al. 2012), adult migrations (Hansen et al. 2001), and reproductive success (Araki et al. 2007; Beldade et al. 2012). However, applying these methods accurately can require numerous, highly polymorphic markers (Harrison et al. 2013a,b2013b), and optimized PCR multiplexes can maximize the cost-effectiveness of using microsatellites. Coral trout (Plectropomus, Serranidae) are large predatory coral reef fishes that are widely distributed throughout the Indo-Pacific. They are among the highest market priced reef fish and often heavily exploited by both artisanal and commercial fisheries. The emergence of an international trade in live reef fish has significantly increased the demand for coral trout (Sadovy et al. 2003) with two of eight species in the genus now listed as “Vulnerable” in the IUCN species assessment (Sadovy de Mitcheson et al. 2013). On the Great Barrier Reef (GBR), Plectropomus leopardus (Lacepède, 1802) and P. maculatus (Bloch, 1790) are the most common coral trout species and frequently occur on the same reefs; however, their relative densities vary according to the cross-shelf position of individual reefs (Mapstone et al. 1998). Densities of P. maculatus are generally highest on inner-shelf reefs, while P. leopardus are at higher densities on mid- and outer-shelf reefs (Heemstra and Randall 1993; Mapstone et al. 1998; Russ et al. 2008). Adult P. leopardus and P. maculatus are easily identified in field observations by their characteristic spot patterns; however, the two species are not readily distinguishable as juveniles under approximately 60 mm in length. This poses a significant challenge in assessing the recruitment levels of P. leopardus and P. maculatus in areas where the two species co-occur. Furthermore, tank experiments have also shown that the two species can produce viable hybrid offspring (Frisch and van Herwerden 2006), and hybridization has led to genetic introgression in wild populations (van Herwerden et al. 2006). In this study, we describe the development and validation of three multiplex PCR kits for individual barcoding and species discrimination of P. leopardus and P. maculatus DNA tissue samples, and for determining parent–offspring relationships in natural populations. Using an enriched cloning library developed for P. maculatus and 454 pyrosequencing libraries for both species, we identified and characterized 25 polymorphic microsatellite loci that amplified in multiplex PCR for both species. Using simulated datasets, we demonstrate the capacity of the marker set to discriminate between the two species, identify putative hybrid individuals, and resolve parent-offspring relationships in natural populations. Finally, we assess the transferability of each locus in seven species and subspecies in the genus Plectropomus.

Material and Methods

Sampling and DNA isolation

Tissue samples for 285 adult Plectropomus leopardus were collected from the Capricorn Bunker group (23°30′S, 152°50′E), a mid-shelf reef system of the southern Great Barrier Reef where P. leopardus and P. maculatus occur in sympatry. A further, 285 adult P. maculatus were collected from the Keppel Island group (23°10′S, 150°57′E), an inshore island group dominated by P. maculatus. All samples were collected between November 2007 and September 2012 under Marine Parks permit No. G06/17981.1 and G11/33554.1, and Queensland General Fisheries permit No. 87381 and 148534. Fish were captured using either baited, barbless hooks or on SCUBA with the use of biopsy probes attached to spears following Evans (2008). All individuals were identified to species level by trained observers and released alive at the capture site. Either fin or muscle tissue were removed from each fish and preserved in 95–100% ethanol. DNA extractions were performed from fish fin and muscle tissue using a Nucleospin-96 Tissue kit (Macherey-Nagel) with a double elution for final eluates of 200 μL. Average DNA concentrations were 59.7 ng μL−1 ± 8.9 SE for P. leopardus and 59.4 ng μL−1 ± 5.5 SE for P. maculatus. In addition, samples of congeneric species were collected in order to examine cross-species amplification of microsatellite loci. These included 23 P. areolatus (Saudi Arabia), 24 P. pessuliferus marisburi (subspecies, Saudi Arabia), 29 P. pessuliferus pessuliferus (subspecies, Thailand and Maldives), 8 P. oligacanthus (Philippines), and 8 P. laevis (Maldives). Fin clips were removed from each fish and preserved in 95% ethanol. Total DNA was extracted using the QIAamp Tissue Kit (Qiagen, Germany), following the manufacturer's protocol.

Microsatellite enrichment protocols

Cloning library

Microsatellite markers were developed for P. maculatus using an enrichment protocol developed by Glenn and Schable (2005). Approximately 4 mg of genomic DNA (gDNA) from one individual was digested with RsaI and XmnI, and SuperSNX24 linkers were ligated onto the ends of gDNA fragments. Linkers act as priming sites for polymerase chain reactions (PCRs) in subsequent steps. Five biotinylated tetranucleotide probes were hybridized to gDNA: (AAAT)8, (AACT)8, (AAGT)8, (ACAT)8, and (AGAT)8. The biotinylated probe-gDNA complex was added to magnetic beads coated with streptavidin (Dynabeads® M-280 Invitrogen, Carlsbad, USA). This mixture was washed twice with 2× SSC, 0.1% SDS and four times with 1×SSC, 0.1% SDS at 52°C. For the final two washes, the mixture was incubated for 1 min in a 52°C water bath. Between washes, a magnetic particle-collecting unit was used to capture the magnetic beads, which are bound to the biotin-gDNA complex. Enriched fragments were removed from the biotinylated probe by denaturing at 95°C and precipitated with 95% ethanol and 3 M sodium acetate. To increase the amount of enriched fragments, a “recovery” PCR was performed in 25 μL reactions containing 1× PCR buffer (10 mM Tris-HCl, 50 mM KCl, pH 8.3), 1.5 mM MgCl2, 0.16 mM of each dNTP, 10× BSA, 0.52 μM of the SuperSNX24 forward primer, 1U Taq DNA polymerase, and approximately 25 ng enriched gDNA fragments. Thermal cycling was performed in an MJ Research DYAD as follows: 95°C for 2 min followed by 25 cycles of 95°C for 20 s, 60°C for 20 s, and 72°C for 90 s, and a final elongation step of 72°C for 30 min. Subsequent PCR fragments were cloned using the TOPO-TA Cloning® kit (Invitrogen) following the manufacturer's protocol. Bacterial colonies containing a vector with gDNA (i.e., white colonies) were used as a template for subsequent PCR in a 25 μL reaction containing 1× PCR buffer (10 mM Tris-HCl, 50 mM KCl, pH 8.3), 1.5 mM MgCl2, 0.12 mM of each dNTP, 10× BSA, 0.25 μM of the M13 primers, and 1U Taq DNA polymerase. Thermal cycling was as follows: an initial denaturing step of 95°C for 7 min was followed by 35 cycles of 95°C for 20 s, 50°C for 20 s, and 72°C for 90 s, and PCR products were desalted using MultiScreen-PCR Filter Plates (Millipore, Billerica, USA) following the manufacturer's protocol. DNA sequencing was performed using the BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, USA). Sequencing reactions were precipitated with ethanol and 125 mM EDTA and run on an ABI 3730 DNA Analyzer.

454 pyrosequencing

454 pyrosequencing was performed on genomic DNA extracted from Plectropomus leopardus and P. maculatus, using the Genome Sequencer FLX, following the manufacturer's instructions (Roche 454 Life Sciences, Basel, Switzerland). Briefly, 500 ng of gDNA from each species was randomly sheared via nebulization, and double-stranded DNA adaptors were blunt-ligated to fragment ends using the GS FLX Titanium Rapid Library MID Adaptors Kit. The final single-stranded DNA library was isolated via magnetic streptavidin-coated beads binding to biotinylated adaptors. The library was then quantified via fluoro-spectrometer nanodrop 3300 (ThermoScientific, Wilmington, USA) and the size of the insert checked by 2100 Bioanalyser (Agilent biotechnologies, Inc, Santa Clara, USA) prior to emulsion PCR. Genomic shotgun library molecules were clonally amplified via emulsion PCR employing a GS FLX Titanium LV emPCR Kit. Following amplification, emPCRs were collected, and emulsions were broken. Beads containing sufficient copies of clonally amplified library fragments were selected via the specified enrichment procedure and counted with a Multisizer Coulter Counter (Beckman Coulter, Fullerton, USA) prior to sequencing. Following emulsion PCR enrichment, beads were deposited into the wells of a PicoTiterPlate device, and sequencing was performed. Image analysis, signal processing and base calling were performed using the Genome Sequencer FLX System Software supplied by 454 Life Sciences.

Multiplex PCR development

Primer design

Microsatellite markers were isolated from two genomic libraries designed for Plectropomus leopardus and P. maculatus, and an enriched cloning library for P. maculatus of which, five loci were previously published in Harrison et al. (2012). Each library was screened for microsatellite loci containing uninterrupted tri- and tetra-nucleotide repeats with sufficiently large flanking regions for the design of oligonucleotide primers. Primers were designed in msatcommander (Rozen and Skaletsky 2000; Faircloth 2008) targeting 26 base pair oligonucleotides, melting temperatures of 60°C and 3′ G/C clamps for higher specificity. Tertiary structure formations were minimized between forward and reverse primers at each locus and between loci to reduce primer heteroduplexing in multiplex PCRs. In total, 46 markers were selected, 7 P. leopardus microsatellites, 14 and 25 P. maculatus microsatellites from the cloning library and genomic library, respectively. For each marker, the reverse primer was labeled with one of the fluorescent dies 6-FAM, HEX, Atto-550 or Atto-565. All novel loci were amplified in simplex PCRs on three individuals from each species using the Invitrogen Platinum PCR kit with the following protocol. Each 20 μL PCR contained, 2 μL 10× buffer, 0.4 μL dNTP (10 mM), 0.8 μL MgCl (50 mM), 0.08 μL HotStart Taq (5 U μL−1), 2 μL forward and reverse primers (2 μM), 1 μL genomic DNA and 11.72 μL distilled water. PCRs were performed on Veriti thermal cyclers (Applied Biosystems) with the following sequence: 5 min initial denaturation at 95°C, 5 cycles of 30 s at 95°C, 30 s at 62°C, and 30 s at 72°C, then 5 cycles of 30 s at 95°C, 30 s at 60°C, and 30 s at 72°C, then 20 cycles of 30 s at 95°C, 30 s at 58°C, and 30 s at 72°C, followed by 10 min at 72°C. Touchdown PCRs greatly increase the binding specificity of primers and thus reduce noise and artifacts; these are commonly used for multiplex PCRs where optimal annealing temperatures of each locus differ. PCR products were screened on an ABI 3370xl DNA Analyzer (Applied Biosystems) following a 1:10 dilution. Markers that did not amplify in both species, with low-quality profiles (poor amplification or stutter), or with overlapping ranges were excluded from further experiments.

Multiplex PCR optimization

Microsatellites were selected for multiplex PCRs based on their likely size range, taking into account primer heteroduplexing. Selected primer pairs were combined in a primer premix for in-reaction concentrations ranging from 10 to 50 pM. Multiplex PCR amplification was performed on eight individuals of each species using the Qiagen Microsatellite Type-it kit (Qiagen, Germany). All three multiplex reactions were performed in a total volume of 10 μL containing 5 μL of Qiagen Multiplex Master Mix (2×), 3 μL of distilled water, 1 μL of primer premix, and 1 μL template DNA. Multiplex PCRs were performed on Veriti thermal cyclers with the following sequence: 15 min initial denaturation at 95°C, 5 cycles of 30 s at 95°C, 90 s at 62°C, and 60 s at 72°C, then 5 cycles of 30 s at 95°C, 90 s at 60°C, and 60 s at 72°C, then 20 cycles of 30 s at 95°C, 90 s at 58°C, and 60 s at 72°C, followed by 30 min at 60°C. PCR products were screened on an ABI 3370xl DNA Analyzer (Applied Biosystems) with the GeneScan 500 or 600 LIZ (Applied Biosystems) internal size standard following a 1:15 dilution. The concentrations of primers were adjusted with each run for even amplification of all microsatellites. Individual genotypes were scored in genemapper v4.0, and unique alleles were distinguished using marker-specific binsets msatallele (Alberto 2009).

Diversity and power analyses

For both P. leopardus and P. maculatus, observed genotypes were tested for departures from Hardy–Weinberg equilibrium due to heterozygote deficiency at each locus using the exact test (Guo and Thompson 1992) based on 1 000 000 Markov chain iterations as implemented in arlequin v3.5 (Excoffier and Lischer 2010). Significance of multiple P-values (α = 0.05) was assessed with strict Bonferroni correction applied for multiple comparisons (P < 0.001; Rice 1989). The number of alleles (Na), observed heterozygosity (Ho), expected heterozygosity (He), and the fixation index (Fis) were measured at each locus using genalex v6.5 (Peakall and Smouse 2012). The exclusion probability (PE) and cumulative exclusion probability were calculated according to Jamieson and Taylor (1997) in genalex v6.5. In order to estimate locus-specific genotypic error rates, 96 PCRs were repeated for each multiplex kit, which included 48 P. leopardus and 48 P. maculatus individuals.

Species discrimination analysis

Using species-specific allelic frequency estimates, we simulated 5000 individuals for each species and 5000 F1 hybrid individuals in hybridlab v1.0 (Nielsen et al. 2006). We applied a model-based Bayesian clustering method implemented in structure 2.3.3 (Pritchard et al. 2000; Falush et al. 2007), using a Markov Chain Monte Carlo (MCMC) resampling procedure, to estimate the distribution of posterior probabilities of all 15 000 simulated individuals. With the number of groups (K) set to 2 corresponding to the two species and assuming population admixture, we performed a single run using 50 000 MCMC iterations with a burn-in period of 50 000 steps, first with each individual multiplex kit, and three combinations of multiplex kits combined. Following Vähä and Primmer (2006), we examined the distribution of posterior probabilities and identified species-specific thresholds of assignment to correctly allocate individuals to either species or hybrid individuals. The accuracy of assignments to purebred of hybrid groups was measured as the proportion of individuals in a group that were correctly identified.

Accuracy of parentage analyses

Species-specific allelic frequencies were again used to determine the accuracy of parentage analyses in simulated populations of P. leopardus and P. maculatus. For each species, we generated 500 adult genotypes in the software program mykiss (Kalinowski 2009) with a sex ratio of two females per male, which approximates the observed sex ratio of Plectropomus spp. on inshore reefs of the Great Barrier Reef (Adams et al. 2000). We then generated 1000 offspring genotypes of known and unknown decent, simulating incomplete sampling of the adult populations. The proportion of known parents in the sample was fixed at 20%, and genotyping error was introduced at a rate of 1% for each locus. Simulated datasets were analyzed using the pairwise likelihood score method implemented in famoz (Gerber et al. 2003). As the sex of individuals in wild-caught samples is generally unknown, our analyses combined simulated female and male genotypes. This approach accounts for genotyping error by introducing a calculation error rate when estimating the likelihood of each putative dyad. Here, a calculation error rate of 0.01% was used in all analyses (Gerber et al. 2000; Morrissey and Wilson 2005), with LOD score thresholds of 5 and 10 for single parents and parent pair assignments, respectively. For each simulated offspring, the assigned parent or parent pair was compared with the known true parents. When an offspring was assigned to a parent that was not its true parent or not assigned (excluded), we determined whether the true parent was in the sample and identified it as either false-positive (type I) or false-negative (type II) errors (Fig. S1). The overall accuracy was measured as the proportion of correct assignments to single parents or parent pairs and the number of correct exclusions over all possible assignments (Harrison et al. 2013a). Assignments to a single parent when both parents were present in the sample were also considered as incorrect assignments (Fig. S1). Processing of all software outputs was performed in r with scripts uploaded as online supporting information.

Results

Multiplex PCR optimization

Among the 46 microsatellite loci isolated from either the enriched cloning library or the two genomic libraries for P. leopardus and P. maculatus, 21 were excluded due to incompatibility of product lengths (5 loci), low polymorphism (4 loci), poor amplification (6 loci), or because they did not amplify in one of the two species (6 loci). Of the 25 loci retained for multiplex PCRs, one contained tri-nucleotide (Ple04) and 24 contained tetra-nucleotide repeats (Table 1). All loci had good amplification qualities for both species, with sharp and even peaks standardized with variable primer concentrations. Loci covered an optimal range of product lengths allowing 8 or 9 loci per kit (Fig. 1) and additional loci may be included with further optimization.
Table 1

Description of 25 microsatellite loci isolated from a cloning and genomic libraries in P. leopardus and P. maculatus. The expected size range, fluorescent dye, and concentration of primers in PCRs and GenBank Accession Number is indicated for each locus

LocusForward primer (5′–3′)Reverse primer (5′–3′)GenBank accession no.LibraryDyeRepeat motifReaction conc. (μM)Size range (bp)
kit-1
Pma0361GGGTCTGCAGGCAACACAAAGACATTGGAGAAAATTGTTGAGTGAAGAGTGGJN222545CloningAtto565TAGA0.040328–596
Pma043TGACTAACACTCAAATTGTCACCTTCATGCTGATAGGATGGTTTAATACAGCKF992554Cloning6FAMTATC0.020288–430
Pma0971AGTGGGGCCATGTTTAACAACAGCAACGAGTTTTGTGAGATGGATGGGTGGAJN222546CloningAtto565ATCT0.01096–206
Pma1041CCATAACGGGGACTTTGGCCAATCACTGCACTTGTAGAACAGCCATGGGAJN222547CloningAtto550TATC0.010160–266
Pma1061CAGGAGCCATTGAGACAGGGAGAGGAGTGTTGGTGGTTTCGCTGATGCTTJN222548CloningHEXGATA0.010129–245
Pma109TGCCCGACTCGATTTGTAACAGTGCACTCAGATATCTTGAGGTTAGAGGTCKF992557CloningHEXATCT0.020376–460
Pma112CTGCACTTTAATACCCATGAAATAGCTGGAAACCAGTTAAATAATCCCTGACKF992558Cloning6FAMTATC0.020136–196
Pma1141CTTGAACAGGCAGTGTAAAGGGGGCACCTGGAGCCAGTCATGTTCATGGTJN222549CloningAtto565TATC0.010216–281
Pma180AAATGGATATGACACAGAGATAGGACAATGAGAAGACATGTTGAAGCTGGKF992560GenomicAtto550AGAT0.010270–362
kit-2
Ple002TACTCGCAATTATAACACAGATCCAGTTTGTCCAGCACTGTATTTATCTATCKF992545GenomicHEXAGAT0.030196–284
Ple004ATTAGTATACAAGGAGCCACAGAATCTCACTACGGCATTCCTAATAATTGTCKF992546Genomic6FAMAAT0.030366–430
Pma012ATATGGCCATTATTGTGAGTTAGGTGAAATCTTTAAACCTACCACTGATCCCKF992548Genomic6FAMAATG0.010128–182
Pma025AGTAGACTCCGATAACTCATTCTCACTGAGACAAGAAGCTTTACAAGTGAAGKF992551GenomicAtto565ACTC0.010326–438
Pma038TATGGAGGGATGATGCTATCTAAGAGATGCTAAACTGGATGCACTACAATCKF992553GenomicAtto550AGAT0.010286–400
Pma090GATGTCCAAATATCACCTCTAACCAGAGAGGCTCAATATTATCATGTGAACGKF992555CloningHEXTAGA0.010352–408
Pma101TGTTCTGTCAGATATGTAATGTGCTGGGGGATAGACAAGAGGAAAGAGAGGGGAKF992556Cloning6FAMTATC0.030246–316
Pma412AAAGTTAGCCATTTAAACACAGAGCCTAGGTAGAGGTCACTGTTGCATTATCKF992563GenomicHEXACAT0.010132–188
kit-3
Ple001CATCACTGATCACACTGCCTCCAACCTTCACTACAGTTAATACCACACKF992544Genomic6FAMACGC0.020142–215
Ple005AACTACAATGAAACCTGCCTCTTATGTTTGATTATGACTCAATGATCGCAAGKF992547GenomicHEXACAG0.050396–452
Pma020TATGGAGGGATGATGCTATCTAAGAGATGCTAAACTGGATGCACTACAATCKF992549GenomicAtto550AGAT0.015285–399
Pma022AAGATGTGCACTGTCAATACACTATGGATGTCAGATATCAGGCTCCTAAATGKF992550GenomicHEXAGAT0.020278–340
Pma027TAGACTAGTTCAGGGTGTCAGTTGAAGGGAATGGAAATAAACTGTCATACKF992552GenomicAtto565AAAT0.015280–394
Pma121CTATTAGTTTCACTGAGGAAGAGTCGATATGAAGTTCACACCTCAGTTGAACKF992559Cloningv6FAMATAG0.050361–433
Pma191GCCTTCGGAAACAATCATTATTCATCGGGAAATTAAGAAGTCTACATTGAAGCKF992561GenomicAtto550AAAT0.010174–232
Pma288TTGTATGTAATTTCGCCATGTTTGAGTGTTGTCCGGTCATATTAATTGAGAGKF992562GenomicAtto565AAAG0.010140–210

Previously published in Harrison et al. (2012).

Figure 1

Allelic size ranges and fluorescent dyes used in each of three multiplex PCR kits designed for species discrimination and individual barcoding in natural populations of P. leopardus and P. maculatus. Horizontal axis shows the size ranges for each locus in base pairs (bp).

Description of 25 microsatellite loci isolated from a cloning and genomic libraries in P. leopardus and P. maculatus. The expected size range, fluorescent dye, and concentration of primers in PCRs and GenBank Accession Number is indicated for each locus Previously published in Harrison et al. (2012). Allelic size ranges and fluorescent dyes used in each of three multiplex PCR kits designed for species discrimination and individual barcoding in natural populations of P. leopardus and P. maculatus. Horizontal axis shows the size ranges for each locus in base pairs (bp).

Locus characteristics

We found no discernible differences in the quality and polymorphism of microsatellites developed from the cloning or genomic libraries. Among 285 P. leopardus and 285 P. maculatus individuals genotyped with all 25 loci, we found similar levels of genetic diversity in P. leopardus than in P. maculatus samples (Table 2). For P. leopardus, the mean number of alleles per locus was 17.9 ± 1.6 SE with an average observed heterozygosity of 0.723 ± 0.038 SE across all samples. For P. maculatus, the mean number of alleles per locus was 18.2 ± 2.2 SE with an average observed heterozygosity of 0.796 ± 0.019 SE across all samples. Several loci showed significant departure from Hardy–Weinberg equilibrium, though only one (Pma112) showed significant departure from expectations in both species. One locus, Pma191, was almost monomorphic in P. leopardus with one allele observed at a frequency of 0.98, but polymorphic in P. maculatus. The high level of polymorphism provide an average power of exclusion per locus of 0.463 ± 0.042 SE for P. leopardus and 0.479 ± 0.034 SE for P. maculatus, and each kit provided a cumulative power of exclusion of ∼0.99 (Table 2). The overall combined power of exclusion with 25 loci was approximately 1 – 2 × 10−8. Scoring errors in repeated PCR samples were rare, and error rates were estimated from the combined P. leopardus and P. maculatus genotypes (Table 2). Only Pma112 exhibited high levels of scoring error, which were due to the presence of null alleles.
Table 2

Characteristics of three multiplex kits based on 285 P. leopardus and 285 P. maculatus individuals

Plectropomus leopardus (N = 285)Plectropomus maculatus (N = 285)


LocusNaHoHeFisHWE P valuePECumPENaHoHeFisHWE P valuePECumPEScoring error, %
kit-1
Pma036430.8000.9340.144<0.0010.7680.768590.9610.957−0.0050.7420.8420.8421
Pma043200.8000.787−0.0170.9530.4190.865200.8070.8390.0380.0920.5360.9270
Pma097170.8530.8570.005<0.0010.5530.940250.8880.887−0.0010.6510.6380.9730
Pma104200.7720.7820.0130.9930.4390.966400.9120.9240.0130.8120.7360.9930
Pma106250.9020.9280.0290.8930.7470.991170.8390.8710.0380.9270.5950.9970
Pma109310.6140.9500.354<0.0010.8170.998170.8110.807−0.0040.4520.4800.9991
Pma112150.6070.8540.289<0.0010.5550.999140.6460.7850.178<0.0010.4220.9998
Pma11470.5510.532−0.0360.1980.1460.999160.8280.815−0.0160.0120.4891.0000
Pma180190.7580.7770.0240.4970.4391.000180.7510.7590.0100.3680.4151.0000
kit-2
Ple002210.8770.868−0.0100.2780.5870.587140.8600.854−0.0070.9650.5480.5480
Ple004160.7470.7590.0150.7540.4020.753110.7440.740−0.0050.1720.3500.7060
Pma012140.8670.854−0.0150.2560.5410.887100.6810.7160.0490.3610.3180.8000
Pma025190.7650.7740.0120.9910.4260.935220.9020.9080.0070.6510.6860.9370
Pma038230.8910.882−0.0100.4090.6190.975200.9160.891−0.0270.7430.6490.9780
Pma090100.4600.4720.0250.3910.1130.978100.7580.740−0.0250.4290.3580.9860
Pma101100.6910.7280.0510.3750.3290.985150.7090.683−0.0380.4600.3050.9900
Pma41290.6320.6570.0390.1470.2700.98990.6560.640−0.0250.5460.2460.9930
kit-3
Ple001210.8530.9030.0550.4110.6730.673200.7930.765−0.0360.7360.4160.4161
Ple005140.7820.8080.0320.0770.4670.82680.6980.662−0.0550.8200.2400.5560
Pma020230.8950.883−0.0140.5100.6200.934200.9160.892−0.0270.7580.6490.8440
Pma022140.6000.6160.0270.9340.2370.94990.7400.7410.0010.9030.3580.9000
Pma027230.8950.865−0.0340.7300.5860.979250.8390.8580.0230.9700.5690.9570
Pma121150.8250.8750.0570.2560.5940.992150.7960.760−0.0480.0600.3840.9730
Pma191100.0420.042−0.0121.0000.0010.992120.8560.8590.0030.3130.5550.9880
Pma28890.6040.6300.0420.0200.2240.993100.6040.567−0.0650.5860.1800.9900

N, number of individuals genotyped; Na, total number of alleles; Ho, observed heterozygosity; He, expected heterozygosity; Fis, fixation index; HWE, exact test for Hardy–Weinberg Equilibrium; PE, Probability of Exclusion; CumPE, Cumulative Probability of Exclusion for each multiplex PCR.

Significant departure from HWE are indicated in bold.

Characteristics of three multiplex kits based on 285 P. leopardus and 285 P. maculatus individuals N, number of individuals genotyped; Na, total number of alleles; Ho, observed heterozygosity; He, expected heterozygosity; Fis, fixation index; HWE, exact test for Hardy–Weinberg Equilibrium; PE, Probability of Exclusion; CumPE, Cumulative Probability of Exclusion for each multiplex PCR. Significant departure from HWE are indicated in bold.

Species discrimination

Assignment tests on 15 000 simulated genotypes successfully distinguished between P. leopardus, P. maculatus, and interspecific hybrids with a high degree of confidence. Combining all 3 multiplex kits, each group was clearly delimitated resulting in 100% confidence in assignments (Fig. 2). Independently, multiplex kits still resolved each group with ∼99% confidence (Table 3). Assignment thresholds were based on the posterior probability of assignment to the P. leopardus group and set ad lib to minimize the overall number of errors. Individuals with a posterior probability of assignment ≤0.240 were identified as P. maculatus, >0.780 were P. leopardus and between 0.240 and 0.780 were identified as interspecific hybrids. Assignment thresholds can be modified to meet study-specific objectives, for example maximizing the accuracy of species identification only. Furthermore, if the aim is to discriminate P. leopardus from P. maculatus, a single multiplex kit can provide sufficient power. Multiplex kit-3 provided to most discriminatory power of any single kit due to the presence of Pma191, which is almost monomorphic in the sampled population of P. leopardus.
Figure 2

Distribution of the posterior probability of assignment for 5000 simulated P. leopardus genotypes, 5000 P. maculatus genotypes, and 5000 interspecific F1 hybrids as determined in structure with 25 microsatellite loci.

Table 3

Proportion of incorrect assignments of simulated genotypes with optimum intervals of 0–0.24 (P. leopardus), 0.25–0.78 (F1 hybrids), and 0.79–1 (P. maculatus) with single and combined multiplex kits

KitNo. lociP. leopardusF1 hybridsP. maculatus
kit-190.221.150.16
kit-280.120.860.16
kit-380.141.330.08
kits 1 & 217000
kits 2 & 316000
kits 1, 2 & 325000
Proportion of incorrect assignments of simulated genotypes with optimum intervals of 0–0.24 (P. leopardus), 0.25–0.78 (F1 hybrids), and 0.79–1 (P. maculatus) with single and combined multiplex kits Distribution of the posterior probability of assignment for 5000 simulated P. leopardus genotypes, 5000 P. maculatus genotypes, and 5000 interspecific F1 hybrids as determined in structure with 25 microsatellite loci.

Accuracy of parentage

By analyzing simulated datasets with known parent–offspring relationships, we were able to measure the accuracy of parentage analyses and identify the frequency of type I (false positive) and type II (false negative) assignment errors. Combining all three multiplex kits with an introduced error rate of 1% per locus, we were able to correctly assign or correctly exclude 99.8 ± 0.1% SE of P. leopardus and 99.6 ± 0.1% of P. maculatus offspring. All assignment errors were due to genotypic mismatches in the simulated datasets. Two types of errors were identified at low frequency (Table 4): in order of abundance, these included i) wrongly assigning an offspring to a single parent when neither of the true parents were in the sample; and ii) assigning an offspring to a parent pair, where one parent was correctly assigned and the other falsely assigned when the true parent was present in the sample. Increasing LOD thresholds may not reduce the rate of Type I error because all errors were caused by simulated genotyping error.
Table 4

Accuracy of parent–offspring assignment in simulated populations of P. leopardus and P. maculatus based on species-specific allelic frequencies

P. leopardusP. maculatus


sim-1sim-2sim-3Averagesim-1sim-2sim-3Average
Accuracy of
 Single parent assignments0.9970.9970.9970.9971111.000
 Assignments to parent pairs1111.0001111.000
 Exclusions10.9990.9970.9980.9920.9920.9970.994
False-positive rate (Type I)0.0010.0020.0030.0020.0050.0050.0020.004
False-negative rate (Type II)0000.0000000.000
Overall accuracy0.9990.9980.9970.9980.9950.9950.9980.996
Accuracy of parent–offspring assignment in simulated populations of P. leopardus and P. maculatus based on species-specific allelic frequencies

Marker transferability

All but one species (P. punctatus) within the genus Plectropomus (Serranidae) were tested for the transferability of loci developed here for P. leopardus and P. maculatus. The overall transferability rate of loci across congeneric species ranged from 28 to 96%, notwithstanding monomorphic loci (Table 5). Given the small sample sizes for some species, it is possible that not all true alleles were observed. Of the 25 markers developed, 25 successfully amplified in samples of P. pessuliferus pessuliferus, 19 in P. areolatus, 17 in P. pessuliferus marisburi, 16 in P. laevis, and 14 in P. oligocanthus.
Table 5

Transferability of microsatellite multiplex PCRs within the genus Plectropomus

Species Sampling locationP. areolatus Thuwal, Saudi ArabiaP. pessuliferus marisburi Thuwal, Saudi ArabiaP. pessuliferus pessuliferus Maldives (N = 9) & Thailand (N = 20)P. oligacanthus Bohol, PhilippinesP. laevis Maldives





LocusNNaRange (bp)NNaRange (bp)NNaRange (bp)NNaRange (bp)NNaRange (bp)
kit-1
Pma0362113520–6162215458–5502916456–53677522–59468495–551
Pma0432615356–424
Pma09723594–118227102–12529486–1048294–1028186
Pma104298177–209
Pma106228158–1981911174–2152812157–19984145–16184200–212
Pma109179394–446107348–419268358–406
Pma112185176–192263148–160
Pma114236252–272245218–228293220–2288121981219
Pma1802212277–337217277–3342910293–32985333–35885292–308
kit-2
Ple0022110217–2611610244–2922812207–263
Ple004294384–394
Pma012235132–148236116–1522912132–17883128–13684132–148
Pma025231236243234–238298336–3688123681236
Pma0382311304–344239281–319299280–32445305–32972281–297
Pma090239385–417292365–369
Pma101232251–255243248–256297252–29862250–27471252
Pma412201153144149–165295153–17382139–14383153–161
kit-3
Ple001295150–178
Ple005276406–430
Pma0202211304–344197281–317288296–32451282
Pma022221291214283–295294283–30352283–29151291
Pma0271911294–342162304–3082810308–34886328–34484308–332
Pma1211810383–4631414388–472279379–41945379–415
Pma191202179–1831211832711838118381183
Pma288274148–16487162–19881133

N, number of individulals genotyped; Na, total number of alleles.

Transferability of microsatellite multiplex PCRs within the genus Plectropomus N, number of individulals genotyped; Na, total number of alleles.

Discussion

The three multiplex PCR kits developed herein allow fast, accurate, and cost-effective genotyping of individuals of two closely related species of coral trout: Plectropomus leopardus and P. maculatus. Each kit is composed of a selection of 8 or 9 highly polymorphic microsatellite loci that, independently, provide confident identification of each species and interspecific hybrid individuals. When combined, the three kits accurately also identified parent–offspring relationships with over 99.6% accuracy, providing unprecedented resolution of individual barcoding for these highly valued species. Given the high transferability of the markers among species, the PCR kits will be useful for investigating a range of population and evolutionary processes in this important genus. Investigating life-history processes in coral reef fishes can be challenging and technical advances in both the isolation of molecular markers and high throughput screening of multilocus genotypes have introduced new tools to ecologists tackling questions that were once intractable (Gardner et al. 2011). P. leopardus and P. maculatus, two important fishery species, have a complex and intertwined evolutionary history (van Herwerden et al. 2006). This novel set of microsatellite loci provides a strong basis to investigate whether contemporary hybridization is occurring in mixed populations of P. leopardus and P. maculatus throughout the Great Barrier Reef. As juveniles are morphologically indistinguishable, it also provides a simple assay to examine early life-history processes that are important in determining the distribution, abundance, and fishery stocks of these species. However, assignment thresholds for each class of individuals are likely to depend on the degree of introgression of sampled populations. Where the repeated backcrossing of interspecific hybrids with either parent species has resulted in multiple hybrid categories, other approaches that directly estimate the probability of individuals belonging to each category (Anderson and Thompson 2002) may be more appropriate. As such, simulations should always be performed with study-specific allele frequencies for each species. Recent work has already demonstrated the potential of parentage analysis in coral reef fish to provide invaluable insight into the reproductive success and juvenile dispersal of Plectropomus spp. (Harrison et al. 2012; Almany et al. 2013). When combined, the multiplex kits developed here provide unprecedented accuracy in the resolution of parent–offspring relationships in natural populations, minimizing both false-negative and false-positive assignments (Harrison et al. 2013a,b2013b). These kits may also be used to infer other genealogical relationships, determine pedigrees in aquaculture brood stocks, and provide insight into the heritability of desirable traits for aquaculture. Many experimental methods are effective for screening microsatellite (Gardner et al. 2011; Guichoux et al. 2011b). Overall, we found no discernible differences in the quality of markers identified from either cloning libraries or 454 pyrosequencing. However, the high throughput of next generation sequencing technologies greatly facilitated the optimization of multiplex PCRs by avoiding primer incompatibilities. Aside from one locus showing high levels of genotyping error (Pma112) in both species, the overall marker set of 25 loci demonstrated remarkably high specificity in not only P. leopardus and P. maculatus, but also other species in the genus. In conclusion, the genetic tools developed here will provide the means to answer a broad range of ecological and evolutionary questions for this important genus of groupers. Globally, serranids are one of the most important fishery species on coral reefs, and they represent important apex predators that are increasingly threatened by overexploitation (Sadovy de Mitcheson et al. 2013). The ability to discriminate species, identify individuals, and determine parent–offspring relationships will greatly aid in the development of both sustainable harvesting and conservation measures.
  31 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  The potential costs of accounting for genotypic errors in molecular parentage analyses.

Authors:  Michael B Morrissey; Alastair J Wilson
Journal:  Mol Ecol       Date:  2005-11       Impact factor: 6.185

3.  Genetic effects of captive breeding cause a rapid, cumulative fitness decline in the wild.

Authors:  Hitoshi Araki; Becky Cooper; Michael S Blouin
Journal:  Science       Date:  2007-10-05       Impact factor: 47.728

4.  Rise of the machines--recommendations for ecologists when using next generation sequencing for microsatellite development.

Authors:  Michael G Gardner; Alison J Fitch; Terry Bertozzi; Andrew J Lowe
Journal:  Mol Ecol Resour       Date:  2011-06-16       Impact factor: 7.090

5.  Connectivity dominates larval replenishment in a coastal reef fish metapopulation.

Authors:  Pablo Saenz-Agudelo; Geoffrey P Jones; Simon R Thorrold; Serge Planes
Journal:  Proc Biol Sci       Date:  2011-02-16       Impact factor: 5.349

6.  A practical guide to methods of parentage analysis.

Authors:  Adam G Jones; Clayton M Small; Kimberly A Paczolt; Nicholas L Ratterman
Journal:  Mol Ecol Resour       Date:  2009-10-22       Impact factor: 7.090

7.  Dispersal of grouper larvae drives local resource sharing in a coral reef fishery.

Authors:  Glenn R Almany; Richard J Hamilton; Michael Bode; Manuai Matawai; Tapas Potuku; Pablo Saenz-Agudelo; Serge Planes; Michael L Berumen; Kevin L Rhodes; Simon R Thorrold; Garry R Russ; Geoffrey P Jones
Journal:  Curr Biol       Date:  2013-03-28       Impact factor: 10.834

8.  Contrasting patterns of genetic structure in two species of the coral trout Plectropomus (Serranidae) from east and west Australia: introgressive hybridisation or ancestral polymorphisms.

Authors:  L van Herwerden; J H Choat; C L Dudgeon; G Carlos; S J Newman; A Frisch; M van Oppen
Journal:  Mol Phylogenet Evol       Date:  2006-05-10       Impact factor: 4.286

9.  A measure of population subdivision based on microsatellite allele frequencies.

Authors:  M Slatkin
Journal:  Genetics       Date:  1995-01       Impact factor: 4.562

10.  Larval dispersal connects fish populations in a network of marine protected areas.

Authors:  Serge Planes; Geoffrey P Jones; Simon R Thorrold
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-23       Impact factor: 11.205

View more
  3 in total

1.  Natal philopatry increases relatedness within groups of coral reef cardinalfish.

Authors:  Theresa Rueger; Hugo B Harrison; Peter M Buston; Naomi M Gardiner; Michael L Berumen; Geoffrey P Jones
Journal:  Proc Biol Sci       Date:  2020-07-08       Impact factor: 5.349

2.  Minimum size limits and the reproductive value of numerous, young, mature female fish.

Authors:  Charles P Lavin; Geoffrey P Jones; David H Williamson; Hugo B Harrison
Journal:  Proc Biol Sci       Date:  2021-03-10       Impact factor: 5.349

3.  MultiplexSSR: A pipeline for developing multiplex SSR-PCR assays from resequencing data.

Authors:  Liang Guo; Quan Yang; Jing-Wen Yang; Nan Zhang; Bao-Suo Liu; Ke-Cheng Zhu; Hua-Yang Guo; Shi-Gui Jiang; Dian-Chang Zhang
Journal:  Ecol Evol       Date:  2020-03-04       Impact factor: 2.912

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.