
Historical demography and genetic differentiation of the giant freshwater prawn Macrobrachium rosenbergii in Bangladesh based on mitochondrial and ddRAD sequence variation.

M M Mahbub Alam^1,2, Kristen M Westfall^1,3, Snæbjörn Pálsson^1.

Abstract

Macrobrachium rosenbergii, the giant freshwater prawn, is an important source of high-quality protein and occurs naturally in rivers, as well as in commercial farms, in South and South-East Asia, including Bangladesh. This study investigated the genetic variation and population structure of M. rosenbergii sampled from four rivers in Bangladesh (sample size ranged from 19 to 20), assessing sequence variation both in the mitochondrial cytochrome oxidase subunit 1 (CO1) gene and in 106 single nucleotide polymorphisms (SNPs) sampled randomly from the genome with double digest RAD sequencing (ddRADseq). The mitochondrial variation presented a shallow genealogy with high haplotype diversity (h = 0.95), reflecting an expansion in population size over the last ~82 kyr. Based on the CO1 variation, the current effective population size (Ne) was 9.7 × 10^6 (CI: 1.33 × 10^6 – 35.84 × 10^6) individuals. Significant population differentiation was observed both with the mitochondrial CO1 sequence variation and with the ddRADseq variation, and could be traced to the divergence of the population in the Naf River, on the south-eastern border with Myanmar, from the other populations. A differentiation in mtDNA haplotype frequencies was also observed between the Bishkhali River and the Karnaphuli River in eastern Bangladesh. This study demonstrated the use of high-throughput genotyping based on the ddRADseq method to reveal population structure at a small geographical scale for an important freshwater prawn. The information from this study can be used for the management and conservation of this species in Bangladesh.


Keywords: conservation; cytochrome oxidase subunit 1; management; population structure; single nucleotide polymorphisms

Year: 2017 PMID: 28649344 PMCID: PMC5478082 DOI: 10.1002/ece3.3023

Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912

Introduction

Macrobrachium rosenbergii (De Man 1879; Decapoda, Palaemonidae), the giant freshwater prawn, is a highly valued commercial aquaculture species. The species is found naturally in rivers and adjacent freshwater reservoirs (i.e. lakes, swamps, and canals) in South and South‐East Asia, from western Pakistan to western Java of Indonesia (FAO 2002; De Bruyn, Nugroho, Hossain, Wilson, & Mather, 2005; Hurwood et al., 2014). Because of its importance for both fishing and aquaculture, it has been introduced into 40 countries (Iketani et al., 2011). Macrobrachium rosenbergii has a catadromous life cycle. Copulation occurs in freshwater, and ovigerous females then migrate to estuaries, holding the fertilized eggs in the brood chamber. There the eggs hatch as free‐swimming zoeae and, after progressing through 12 larval stages in the brackish environment, the postlarvae (PL) enter the freshwater system to grow until sexual maturity (FAO 2002, 2004). During the last two decades, M. rosenbergii aquaculture has attracted considerable attention in Bangladesh for its export potential (Ahmed, Demaine, & Muir, 2008; Wahab, Ahmad‐Al‐Nahid, Ahmed, Haque, & Karim, 2012). Aquaculture operations have expanded in the South and South‐western districts of Bangladesh due to the availability of PL in the coastal areas (Azad, Lin, & Jensen, 2008). Moreover, a large number of freshwater ponds in Bangladesh have high potential for culturing the freshwater prawn (Alam & Alam, 2014), an activity that mostly depends on wild‐caught PL (Ahamed, Hossain, Fulanda, Ahmed, & Ohtomi, 2012; Ahmed, Occhipinti‐Ambrogi, & Muir, 2013; Ahmed & Troell, 2010). Concerns about the effects of overfishing wild prawn PL on coastal ecosystem biodiversity, and on the production of other species caught as bycatch, led the Department of Fisheries (DOF) in Bangladesh to impose a ban on wild prawn PL harvest in 2000 (Ahmed & Troell, 2010; Department of Fisheries Bangladesh 2002). 
To meet the increased demand for PL, 27 freshwater prawn hatcheries have been established since 1992, producing about 27,000,000 PL in 2014 (Department of Fisheries Bangladesh 2015). Ovigerous females used in the hatcheries are collected directly either from the rivers or from aquaculture (Alam & Alam, 2014). The genetic diversity of M. rosenbergii in Bangladesh is under continuous threat from human activities (i.e. overexploitation, natural postlarvae collection, escape from aquaculture, and use of banned gears) and climate change effects (i.e. sea level rise, saline water intrusion) (Department of Fisheries Bangladesh 2013; Quader, 2010). Assessments of population structure and genetic diversity are essential to inform management of harvested populations, as ignorance of structure can lead to overexploitation, and escapees from aquaculture can be a risk for locally adapted populations (Koljonen, 2001; Laikre, Palm, & Ryman, 2005; Olsson et al., 2007; Palsbøll, Berube, & Allendorf, 2007; Ward, 2000). Escapes from aquaculture have occurred during cyclones and the resulting floods (Department of Fisheries Bangladesh 2008). Genetic diversity facilitates further evolution under environmental change and may in addition play a key role in ecosystem function (Reusch & Hughes, 2006). Genetic variation in M. rosenbergii in Bangladesh has until recently been mostly unexplored, but recent studies on penaeid shrimps in Bangladesh have uncovered high levels of genetic diversity (Alam, de Croos, & Pálsson, 2016; Alam & Pálsson, 2016; Alam, Westfall, & Pálsson, 2015, 2016; Hurwood et al., 2014). Several types of genetic marker have been applied to M. rosenbergii, including allozymes (reviewed in Agarwal et al., 2016), mtDNA (De Bruyn et al., 2005; Hurwood et al., 2014), microsatellites (Hurwood et al., 2014; Khan et al., 2014), and more recently single nucleotide polymorphisms (SNPs) (Agarwal et al., 2016; Jung et al., 2014). 
The analysis of mtDNA variation revealed three distinct lineages within the species: a western lineage west of the biogeographic barrier at the Isthmus of Kra, a central lineage mainly from the Sunda‐Shelf region, and an eastern lineage mainly in Indonesia (De Bruyn et al., 2005; Hurwood et al., 2014). Microsatellite variation, however, showed four distinct clusters, in which the Bangladesh samples clustered together with the samples from South‐western Thailand (Hurwood et al., 2014). A study on genetic patterns within M. rosenbergii sampled from South‐western Bangladesh (the Pashur and the Paira Rivers) and South‐eastern Bangladesh (the Naf River), based on seven microsatellites, did not reveal significant differences among the sites, but the pairwise distances corresponded with the geographical distances (Khan et al., 2014). The study by Agarwal et al. (2016) identified a high number of SNPs in transcribed regions of the M. rosenbergii genome sampled from India (0.16 to 6.02 per 100 bp), in comparison with other species. Another transcriptome study of the same species by Jung et al. (2014) found 2.5 SNPs per 100 bp. Double digest restriction‐site associated DNA sequencing (herein ddRADseq) and other variants of reduced‐representation sequencing have proven to be effective tools for delineating genetic structure in various species, with greater power than traditional genetic markers, by analyzing variation in thousands of loci, for example, in the American lobster Homarus americanus (Benestan et al., 2015) and three spine stickleback Gasterosteus aculeatus (Baird et al., 2008; Hohenlohe et al., 2010; Jones et al., 2012). The aim of this study was to investigate the genetic variation and population structure of M. rosenbergii within Bangladesh by assessing genomic variation in samples from four of its main rivers: the Bishkhali in the West, the Meghna and the Karnaphuli in the East, and the Naf River at the boundary with Myanmar in the South‐East. More genetic markers were applied than in the previous study by Khan et al. (2014), and other rivers were included. This was carried out by analyzing geographical and historical patterns in mitochondrial DNA sequences and in double digest restriction‐site associated DNA sequence (ddRADseq; Peterson, Weber, Kay, Fisher, & Hoekstra, 2012) variation.

Materials and Methods

Sample collection and DNA extraction

A total of 83 wild‐origin M. rosenbergii were collected from December 2012 to September 2013 by artisanal fishermen from four rivers in Bangladesh: Bishkhali River (BR, 20 ind.), Meghna River (MR, 20 ind.), Karnaphuli River (KR, 22 ind.), and postlarvae (PL) from the Naf River (NR, 21 ind.) (Figure 1). All samples were preserved in 96% ethanol. Total genomic DNA was extracted for mtDNA sequencing from ~1 mg pleopod tissue through overnight incubation at 56°C in a mixture of 6% Chelex and 0.2 mg/ml proteinase K, using a Thermomixer (Eppendorf Thermomixer Compact), and for ddRADseq from ~20 mg pleopod tissue using standard Phenol–Chloroform extraction (Maniatis, Fritsch, & Sambrook, 1982). The quality and concentration of the DNA were examined with an ND‐1000 spectrophotometer using ND‐software (Thermo Fisher Scientific). For ddRAD sequencing, 500 ng gDNA was run on a 2% agarose gel to check quality. All gDNA samples (1,000 ng) were treated with RNase to remove RNA before library preparation.

Figure 1

Sampling sites along the coast of Bangladesh. Capital letters indicate the four sampling sites: BR, Bishkhali River; MR, Meghna River; KR, Karnaphuli River; NR, Naf River

Mitogenomic variation

Polymerase chain reaction (PCR) and sequencing

Variation in the mtDNA was assessed by sequencing a 1,316 bp fragment of the CO1 gene from 83 specimens (GenBank accession numbers: KX585687–KX585769), corresponding to positions 132–1,447 of the M. rosenbergii mitochondrial genome (GenBank accession number: AY659990.1; Miller, Murphy, Burridge, & Austin, 2005). The region includes the barcoding region (CO1b) and a downstream region (CO1d). The CO1b fragment was amplified using the standard barcode primers LCO‐1490 and HCO‐2198 (Folmer, Black, Hoeh, Lutz, & Vrijenhoek, 1994), and the CO1d fragment with COIF (Palumbi & Benzie, 1991) and TL2N (Quan et al., 2001). PCR was performed in a volume of 10 μl, including 30–150 ng DNA, 0.2 mmol/L dNTP, 0.1% Tween 20, 1× Standard Taq Buffer (New England Biolabs), 0.5 mg Bovine Serum Albumin, 0.5 U Taq DNA Polymerase, and 0.34 mmol/L each of forward and reverse primers. The amplification protocol for the CO1b fragment included an initial denaturation at 94°C for 4 min, followed by 37 cycles of denaturation at 94°C for 30 s, annealing at 45°C for 45 s, and extension at 72°C for 1 min, then a final extension at 72°C for 6 min. The PCR protocol for the CO1d region comprised an initial denaturation at 94°C for 5 min, followed by 38 cycles of denaturation at 94°C for 30 s, annealing at 55°C for 30 s, and extension at 72°C for 30 s, then a final extension at 72°C for 7 min. All PCR products were examined on a 1.5% agarose gel with 1 μl Bromophenol Blue and visualized under UV light after staining with Ethidium Bromide. An ExoSAP reaction was performed to remove excess nucleotides and primers from the PCR products (5 μl) in a 10 μl reaction volume. The DNA template (1 μl) was sequenced with the Big Dye Terminator kit 3.1 (AB), precipitated with ethanol, and run on a Genetic Analyser (3500xL Applied Biosystems). The sequences were edited using BioEdit Sequence Alignment Editor (Hall, 1999) and aligned using the ClustalW multiple alignment.

Genetic diversity and population differentiation

Genetic diversity of the combined CO1 fragments of M. rosenbergii, including haplotype diversity (h) and nucleotide diversity (π), and the partition of variation among sample sites were calculated using ARLEQUIN v3.5 (Excoffier & Lischer, 2011). The partition was assessed with analysis of molecular variance (AMOVA), applying both the distance method (Φ) and conventional F‐statistics from haplotype frequencies. The significance of the genetic partition was tested by 1,000 permutations of individuals among samples. Haplotype richness (H_R) was calculated using the allele richness function in the HIERFSTAT package in R (Goudet, 2005). Evolutionary relationships were investigated with an unrooted cladogram, using a median‐joining algorithm (Bandelt, Forster, & Röhl, 1999), implemented in NETWORK v 5.0.0.0 (www.fluxus-engineering.com).
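As a rough illustration of the haplotype diversity statistic, the sketch below assumes the standard unbiased estimator h = n/(n − 1) × (1 − Σ p_i²); the function name and haplotype labels are placeholders and this is not the ARLEQUIN implementation. The Bishkhali River sample in Table 1 (20 individuals, 19 haplotypes) can only consist of 18 singletons plus one haplotype carried by two individuals, so it can be reproduced from the counts alone.

```python
from collections import Counter

def haplotype_diversity(haplotypes):
    """Unbiased haplotype diversity: h = n/(n-1) * (1 - sum(p_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)

# Bishkhali River (Table 1): 20 individuals carrying 19 haplotypes, which
# forces 18 singletons plus one haplotype shared by two individuals.
# The labels are placeholders, not the published haplotype names.
br_sample = [f"hap{i}" for i in range(18)] + ["hap18", "hap18"]
h_br = haplotype_diversity(br_sample)
print(round(h_br, 2))  # 0.99, matching the value reported for BR
```

Because the estimator depends only on haplotype counts, a sample in which nearly every individual carries a different haplotype approaches the maximum value of 1.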

Demographic history and population expansion

Population demographic changes and deviation from neutrality in M. rosenbergii were estimated by analyzing the mismatch distribution, using the sum of squared deviations (SSD) (Excoffier, 2004) and the raggedness index (Harpending, 1994), and with Tajima's D (Tajima, 1993) and Fu's Fs (Fu, 1997), using ARLEQUIN v3.5 (Excoffier & Lischer, 2011). The time since expansion was based on the median of the mismatch distribution (τ) and the mutation rate, μ = 0.7 to 1.3% per site per Myr, for the CO1 (Knowlton & Weigt, 1998; Knowlton, Weigt, Solorzano, Mills, & Bermingham, 1993; Schubart, Diesel, & Hedges, 1998), as t = τ/(2μL), where L is the length of the sequence. The demographic changes were further analyzed with the Bayesian Skyline Plot (BSP), based on the 83 CO1 sequences, to estimate the past population dynamics from the time of sampling, assuming no population structure. The BSP analysis was implemented in BEAST v1.7.5 (Drummond, Ho, Rawlence, & Rambaut, 2007), following a strict molecular clock and the TN93 model with invariant sites (I), derived from a PhyML test (Guindon et al., 2010) using the APE package (Paradis, 2012) in R (R Core Team 2015). The posterior probability of the effective population size (N_e) was estimated with the BSP analysis, using MCMC procedures by moving backward until the time of the most recent common ancestor was reached (Liao et al., 2010). Markov chains were run for 5.0 × 10^7 generations and sampled every 1,000 generations. Log files were visualized for the posterior probabilities of the Markov chain statistics using TRACER v1.5 (Rambaut & Drummond, 2009), and 10% of the samples were discarded as burn‐in during “skyline” reconstruction. Skyline data were exported from TRACER v1.5, and the skyline plot was redrawn using the package APE (Paradis, 2012) in R (R Core Team 2015).
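The expansion-time formula t = τ/(2μL) can be sketched as follows. The value of τ is not reported in the text; the figure used here (3.43) is an assumption chosen only so that the example lands near the range reported in the Results, and the function and variable names are illustrative.

```python
def expansion_time_years(tau, mu_per_site_per_year, seq_length):
    """Time since expansion in years, t = tau / (2 * mu * L)."""
    return tau / (2.0 * mu_per_site_per_year * seq_length)

L_BP = 1316               # length of the CO1 fragment (from the text)
MU_LOW = 0.7e-2 / 1e6     # 0.7% per site per Myr, converted to a per-year rate
MU_HIGH = 1.3e-2 / 1e6    # 1.3% per site per Myr, converted to a per-year rate
TAU = 3.43                # ASSUMED value, not reported in the text

t_recent = expansion_time_years(TAU, MU_HIGH, L_BP)  # faster clock, younger date
t_old = expansion_time_years(TAU, MU_LOW, L_BP)      # slower clock, older date
print(f"{t_recent / 1e3:.1f} to {t_old / 1e3:.1f} kyr")
```

With these inputs the two estimates fall at roughly 100 and 186 kyr. The width of the interval comes entirely from the uncertainty in μ, since t scales inversely with the assumed mutation rate.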

Double digest RADseq variation

Library preparation

A double digest RADseq library for 83 individuals was prepared following protocols modified from Peterson et al. (2012) and Elshire et al. (2011). Total gDNA (500 ng) was double digested using Sau3AI (1 U) and ApeKI (2 U) in sequential incubations of 4 hr each, in NEB Buffer 4. The digested DNA (100 ng from each individual) was ligated to adapters containing a unique combination of two inline barcodes for each individual and ends complementary to the restriction overhangs (adapter sequences and barcodes from Elshire et al., 2011). Eleven unique barcodes on the ApeKI adapters (5 bps) and eight unique barcodes on the Sau3AI adapters (6 bps), each pair with a minimum distance of two, were used to generate one unique combinatorial barcode for each individual. Adapters were used in molar excess, at approximately a 6:1 molar ratio of each adapter to fragmented DNA, calculated from the median fragment size as assessed on agarose gel. Adapters were ligated using NEB T4 DNA ligase and the supplied buffer at 21°C for 4 hr. The ligase was heat inactivated and then all individuals were pooled for size selection. The pool of fragmented DNA, ligated with uniquely barcoded adapters, was purified with magnetic beads (Macherey Nagel NGS Clean‐up and Size Select) following the manufacturer's protocol and eluted in a sufficient volume of water for size selection. The fragmented DNA was size selected using a Pippin Prep (Sage Science) with 2% ethidium‐free gels and an external size standard. Size selection used the narrow setting with a median fragment size of 400 bp (± 18 bp). The resulting fragments were amplified using PCR primers from Elshire et al. (2011) (final concentration 0.5 mmol/L) and NEB One Taq 2× Master Mix with standard buffer. The Pippin Prep eluate was divided into eight separate PCRs (10 μl per reaction). The PCR cycling conditions were 72°C, 3′; 98°C, 30″; (98°C, 10″; 65°C, 30″; 72°C, 30″) × 12; 72°C, 5′; 4°C, ∞. 
PCR products were purified using magnetic beads (Macherey Nagel NGS Clean‐up and Size Select) following the manufacturer's protocol. The PCR product was stained with SYBR Gold (Invitrogen) and quantified in a Tecan (Genios) using a standard concentration curve generated from serial dilutions of lambda DNA (Table S1). Libraries were diluted 1:200 and 1:400 and an average concentration was calculated. Molarity was calculated based on the median fragment size of 400 bp. The library was run on an Illumina MiSeq2000 for 300 cycles (2 × 150 paired‐end) using v2 chemistry. Dilution and preparation for sequencing followed the manufacturer's protocol, with the exception of a final library concentration of 38 pM.
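The combinatorial barcoding scheme can be illustrated with a short sketch. The barcode sequences below are hypothetical (the real ones come from Elshire et al., 2011, and are not listed in the text), and "distance" is assumed here to mean Hamming distance; only the counts of 11 and 8 barcodes and the 83 individuals are taken from the description above.

```python
from itertools import combinations, product

def hamming(a, b):
    """Number of mismatching positions between two equal-length barcodes."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))

def min_pairwise_distance(barcodes):
    """Smallest Hamming distance over all barcode pairs in one set."""
    return min(hamming(a, b) for a, b in combinations(barcodes, 2))

# HYPOTHETICAL barcodes for illustration only (not the published sequences).
apeki_barcodes = ["AACGT", "ACGTA", "CAGTC", "GTTAC"]   # 5 bp
sau3ai_barcodes = ["AACCGG", "CCGGTT", "GGTTAA"]        # 6 bp

# Each individual receives one (ApeKI barcode, Sau3AI barcode) pair.
pairs = list(product(apeki_barcodes, sau3ai_barcodes))
print(len(pairs), min_pairwise_distance(apeki_barcodes),
      min_pairwise_distance(sau3ai_barcodes))

# Design described in the text: 11 x 8 barcodes for 83 individuals.
n_combinations = 11 * 8
print(n_combinations >= 83)
```

A minimum distance of two within each barcode set means that a single sequencing error cannot turn one valid barcode into another, and 11 × 8 = 88 combinations are enough to label all 83 individuals uniquely.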

Bioinformatics and genotyping

Raw FASTQ files from the MiSeq runs were demultiplexed into unique reads for each individual using the process_radtags command in STACKS v.1.09 (Catchen, Hohenlohe, Bassham, Amores, & Cresko, 2013). Reads were truncated to 140 bp to obtain sequences of equal length and filtered for an overall quality of 90% (raw phred score 10). Individuals with <300,000 reads were discarded (n = 10). Variant detection and genotyping were performed using the de novo pipeline programs ustacks, cstacks, and sstacks in STACKS v.1.09 (Catchen et al., 2013). The minimum depth of coverage to create a stack was 7, the maximum distance allowed between stacks was 2, and the maximum distance allowed to align secondary reads was 4. The maximum number of stacks allowed per de novo locus was 3. A table with a single SNP per locus, selected as the SNP with the least missing data at that locus, was read into R (R Core Team 2015) for statistical analysis. Deviation from random association of the variable sites within populations was assessed by comparing expected and observed heterozygosity, calculating the inbreeding coefficient (F_IS) with bootstrapped confidence intervals, and testing the deviation from Hardy–Weinberg equilibrium, using the HIERFSTAT (Goudet, 2005) and PEGAS (Paradis, Jombart, Schliep, Potts, & Winter, 2016) packages in R. HIERFSTAT was also used to infer the population structure by calculating F_ST (Weir & Cockerham, 1984) across all populations and for pairwise comparisons. The F_ST values were tested by 1,000 permutations of individuals across samples. To evaluate the effect of loci that were not in Hardy–Weinberg equilibrium, whether due to natural selection or deviation from Mendelian segregation, the calculations were repeated after omitting those loci that failed the exact test within populations, applying Fisher's combined probability (e.g. Sokal & Rohlf, 1995), with p < .05. To analyze further the putative effect of selection, PGD Spider (Lischer & Excoffier, 2012) was used to convert the single SNP per locus file to BayeScan (Foll, 2012) format. 
Given enough statistical power, BayeScan enables the identification of the effect of natural selection on the population subdivision at different loci, due to either diversifying or balancing/directional selection, which are characterized by positive and negative alphas; these were summarized using the package BOA (Smith, 2015) in R (R Core Team 2015). Ordination of the SNP genotypes was investigated using discriminant analysis of principal components (DAPC), followed by the assignment of individuals to clusters defined with and without prior information, as implemented in the ADEGENET package (Jombart et al., 2015) in R.
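The comparison of observed and expected heterozygosity behind F_IS can be sketched for a single biallelic locus. This is a simplified version, assuming F_IS = 1 − H_o/H_e with H_e = 2pq and no sample-size correction, so it will not exactly match the estimators in the R packages named above; the genotype counts are made up for illustration.

```python
def fis_from_genotype_counts(n_aa, n_ab, n_bb):
    """Per-locus inbreeding coefficient, F_IS = 1 - H_o / H_e.

    Uses the simple expectation H_e = 2pq, without the sample-size
    corrections applied by standard estimators.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    h_exp = 2 * p * (1 - p)           # expected heterozygosity under HWE
    if h_exp == 0:
        raise ValueError("locus is monomorphic")
    h_obs = n_ab / n                  # observed heterozygosity
    return 1.0 - h_obs / h_exp

# Made-up genotype counts (AA, AB, BB); not data from the study.
all_het = fis_from_genotype_counts(0, 20, 0)    # every individual heterozygous
no_het = fis_from_genotype_counts(10, 0, 10)    # no heterozygotes at all
hwe = fis_from_genotype_counts(5, 10, 5)        # exact Hardy-Weinberg proportions
print(all_het, no_het, hwe)
```

A locus at which every individual is heterozygous gives F_IS = −1, the extreme heterozygote excess, while a locus with no heterozygotes gives F_IS = +1. Strongly negative values across many individuals are often a sign of merged paralogous loci rather than of a biological process, which is one reason to repeat analyses without such loci.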

Results

Mitochondrial DNA diversity

The mitochondrial variation was characterized by a high overall haplotype diversity close to its maximum value (h = 0.90–0.99), and nucleotide diversities ranged from 0.0022 to 0.0031 for the different samples (Table 1). The combined CO1 sequences produced 45 unique haplotypes from 83 individuals, of which 33 were singletons (Figure 2). Haplotypes 1 and 2 were found in all locations, representing 6 and 11 individuals respectively. Two haplotypes (Haplotypes 3 and 4), representing 13 and 3 individuals respectively, were found in three of the four sampling locations (Figure 2). One to two mutations distinguished most of the sequences from four common haplotypes, forming a shallow network that indicates recent ancestry within the mtDNA lineage (Figure 2).

Table 1

Genetic diversity in Macrobrachium rosenbergii from Bangladesh, based on mitochondrial CO1 gene (1,316 bps)

Sampling location		N	N_h	H_R	h ± SE	π ± SE	S
BR	Bishkhali river	20	19	19	0.99 ± 0.02	0.0031 ± 0.0018	22
KR	Karnaphuli river	22	14	12.9	0.90 ± 0.06	0.0024 ± 0.0014	18
MR	Meghna river	20	15	15	0.96 ± 0.03	0.0024 ± 0.0015	18
NR	Naf river, Bangladesh‐Myanmar	21	13	12.6	0.94 ± 0.03	0.0022 ± 0.0014	15
All		83	45	14.8	0.95 ± 0.01	0.0026 ± 0.0015	46

N, No. of individuals; N_h, No. of haplotypes; H_R, haplotype richness; h, haplotype diversity; π, nucleotide diversity; S, No. of segregating sites; SE, standard error.

Figure 2

Median‐joining haplotype network based on mitochondrial CO1 (1,316 bps) of 83 Macrobrachium rosenbergii sampled from four rivers in Bangladesh. Each pie represents a haplotype and its size reflects the frequency. Distances between pies correspond to the number of mutational differences between haplotypes. Shadings (black to white) denote the four sampling locations, BR, MR, KR, and NR, respectively (see Figure 1)

Differentiation was observed among the samples along the coast of Bangladesh based on the CO1 sequences. Marginal difference was observed when genetic distances among haplotypes were considered (overall ΦST = 0.030, p = .045), but not for the haplotype frequencies (overall F ST = 0.009, p > .05). The greatest differentiation occurred between the two most geographically distant locations, BR and NR (ΦST = 0.089, p = .012) and between BR and KR based on haplotype frequencies (F ST = 0.042, p = .010) (Table 3).

Table 3

Pairwise F_ST and Φ_ST based on mtCO1 and 106 SNPs with corresponding p values between Macrobrachium rosenbergii populations sampled from four rivers (BR, KR, MR, and NR; see Figure 1) in Bangladesh. p values were obtained with 1,000 permutations of individuals across sites. F_ST‐nhw and p‐nhw are based on results where loci which were not in HWE within populations were excluded from the dataset

Pop	mtCO1				SNPs
Pop	F_ST	p	Φ_ST	p	F_ST	p	F_ST‐nhw	p‐nhw
BR‐KR	0.042	.009	0.042	.058	0.003	.168	0.013	0.064
BR‐MR	0.006	.305	0.001	.340	0.005	.170	0.013	0.102
BR‐NR	0.017	.077	0.089	.012	0.011	.020	0.020	0.033
KR‐MR	0.005	.280	0.001	.367	0	.804	0	0.679
KR‐NR	0	.570	0.009	.258	0.034	<.001	0.076	<0.001
MR‐NR	0	.878	0.028	.111	0.035	<.001	0.072	<0.001
Overall	0.009	.136	0.030	.045	0.011	<.001	0.024	<0.001

The mismatch distribution for the samples (Figure 3) followed the sudden expansion model both for the SSD and the raggedness index (p > .98). Deviation from equilibrium was also observed with Tajima's D and Fu's Fs, which were both negative and significant (Tajima's D = −2.035, p = .003; Fu's Fs = −26.220, p < .001), suggesting an expansion in population size or range following a bottleneck, or alternatively recovery from a selective sweep. The BSP analysis showed that the population has a current effective population size (N_e) of 9.7 × 10^6 (CI: 1.33 × 10^6 – 35.84 × 10^6) individuals and has undergone a steady increase in N_e for the last 81.6 kyr (with a lower confidence limit close to 35 kyr) (Figure 4). Time since expansion (t) of the Bangladesh M. rosenbergii, based on the median of the mismatch distributions (τ), was 100.35–186.37 kyr ago.
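For readers unfamiliar with the term, a mismatch distribution is simply the histogram of pairwise sequence differences. The sketch below uses four hypothetical aligned sequences (not the CO1 data) arranged as one central haplotype with three derived haplotypes one mutation away, the star-like pattern expected after a recent expansion.

```python
from collections import Counter
from itertools import combinations

def mismatch_distribution(sequences):
    """Count sequence pairs by their number of pairwise differences."""
    dist = Counter()
    for a, b in combinations(sequences, 2):
        if len(a) != len(b):
            raise ValueError("sequences must be aligned to equal length")
        dist[sum(x != y for x, y in zip(a, b))] += 1
    return dict(sorted(dist.items()))

# Toy aligned sequences (hypothetical): one central haplotype plus three
# derived haplotypes, each differing from it at a single, different site.
toy = ["ACGTACGT", "ACGTACGA", "ACGTACCT", "TCGTACGT"]
print(mismatch_distribution(toy))  # {1: 3, 2: 3}
```

In a star-like genealogy most pairs differ by only one or two mutations, which produces the smooth, unimodal distribution that the sudden expansion model predicts.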

Figure 3

Mismatch distributions based on 83 sequences of mitochondrial CO1 (1,316 bps) of Macrobrachium rosenbergii from four rivers in Bangladesh, under the sudden expansion model

Figure 4

Bayesian Skyline Plot showing the past population dynamics of Macrobrachium rosenbergii in Bangladesh estimated from 83 sequences of mitochondrial CO1 (1,316 bps). Dotted lines represent the 95% confidence intervals. Effective population size (N_e × 10^6) per generation is traced back in time from the present to the past


SNP Variation

Genetic diversity

In total, 141 stacks with 106 SNPs were obtained from the ddRADseq analysis of 73 individuals, with an average depth of 18.6 (SD = 20.7) using the minimum stack depth of 7. The deviations of observed from expected heterozygosity (F_IS) for the SNP dataset were overall negative, ranging from −0.22 to −0.66 within populations (Table 2). Nineteen loci were found to deviate from Hardy–Weinberg equilibrium: 17 with an excess of heterozygotes (negative F_IS values from −0.55 to −1.00; 14 with p < .001) and two with positive F_IS values (0.88 and 1.00; p < .005). When these 19 loci were omitted, the overall heterozygosity decreased from 0.15 to 0.10, and F_IS did not differ from 0 (Table 2). The expected heterozygosity for the single SNP dataset was similar among the sampling locations (Table 2).

Table 2

Genetic diversity in Macrobrachium rosenbergii populations sampled from four rivers (BR, KR, MR, and NR; see Figure 1) in Bangladesh, based on 106 SNPs

Pop	N	H_e	F_IS (CI)	H_e‐hw	F_IS‐hw (CI)
BR	17	0.14	−0.62/−0.33	0.12	−0.21/0.04
KR	20	0.14	−0.63/−0.37	0.13	−0.15/−0.01
MR	17	0.15	−0.54/−0.22	0.15	−0.09/0.10
NR	19	0.13	−0.66/−0.37	0.14	−0.22/−0.03
Total	73	0.14	−0.60/−0.34	0.08	−0.12/0.08

N, number of individuals; H_e, expected heterozygosity; F_IS (CI), 95% confidence interval for the inbreeding coefficient obtained with bootstrap; H_e‐hw and F_IS‐hw, values excluding loci which were not in Hardy–Weinberg equilibrium within samples.


Detection of selection and demographic changes

Despite the deviation from Hardy–Weinberg equilibrium at some loci, variation at all SNP markers was consistent with the neutral expectation obtained from BayeScan. The alpha values ranged from −0.06 to 0.51 with a mean of 0.00, and all were nonsignificant: the p values corrected for multiple testing with the FDR method (q values; Foll, 2012) ranged from 0.66 to 0.90.

Clustering of individuals and population differentiation

The DAPC analysis with prior information on the sampling sites, based on the SNP dataset, revealed distinct differentiation among populations, with the highest overlap between KR and MR (Figure 5a; see Figures 5c and 6 for the proportion of individuals in each cluster). The DAPC analysis without prior information resulted in three clusters (K = 3, stat = 86.25, alpha score = 0.16; Figure 5b). Individuals from BR, KR, and MR were found in all clusters, although in different relative frequencies, whereas individuals from NR in South‐eastern Bangladesh were found in only two of the clusters (Figure 5d). The overall population comparison (F_ST) for SNPs among all sampling locations was significant (p < .001, Table 3). The pairwise comparisons among BR, KR, MR, and NR support the result from the DAPC analyses that NR differed most from the others (Figure 7). All comparisons with NR were significant (p < .02), whereas the others were not (Table 3). The pairwise differentiation between populations (F_ST) was about two times larger when the loci that deviated from Hardy–Weinberg equilibrium were omitted (similar values were observed when omitting all such loci or only those with an excess of heterozygotes), but the pattern remained the same. The differentiation between BR and KR, observed with the mtDNA, increased when omitting the loci which were not in HWE (F_ST = 0.013, p = .064).
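The clustering of the rivers by pairwise F_ST can be sketched with a small pure-Python UPGMA. The input values are the SNP F_ST estimates from Table 3; whether the published dendrogram used exactly these values is an assumption, and the function below is an illustrative implementation rather than the authors' code.

```python
from itertools import combinations

def upgma_merges(labels, dist):
    """Return UPGMA merge steps as (cluster_a, cluster_b, distance) tuples.

    `dist` maps frozenset({x, y}) to the distance between labels x and y.
    Cluster-to-cluster distance is the mean over all member pairs.
    """
    clusters = [(label,) for label in labels]
    merges = []

    def avg(a, b):
        total = sum(dist[frozenset((x, y))] for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        # Join the two clusters with the smallest average distance.
        a, b = min(combinations(clusters, 2), key=lambda pair: avg(*pair))
        merges.append((a, b, avg(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges

# Pairwise SNP F_ST values copied from Table 3.
fst = {
    frozenset(("BR", "KR")): 0.003, frozenset(("BR", "MR")): 0.005,
    frozenset(("BR", "NR")): 0.011, frozenset(("KR", "MR")): 0.000,
    frozenset(("KR", "NR")): 0.034, frozenset(("MR", "NR")): 0.035,
}
steps = upgma_merges(["BR", "KR", "MR", "NR"], fst)
for a, b, d in steps:
    print(a, b, round(d, 4))
```

With these inputs KR and MR join first, BR joins them next, and NR is the last to be added, which agrees with the description of NR as the most divergent sample.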

Figure 5

Discriminant analysis of principal components (DAPC) of Macrobrachium rosenbergii: (a) with prior information, (b) without prior information, (c) proportion of individuals in “a” from four locations (BR, KR, MR, and NR) and (d) proportion of individuals in “b” (clusters: 1, 2, and 3) sampled from four locations. Shadings (from black to white) represent four locations: BR, KR, MR, and NR, respectively

Figure 6

Assignment probabilities (Q‐values) of Macrobrachium rosenbergii individuals from the four sampling locations (BR, KR, MR, and NR; see Figure 1)

Figure 7

Differentiation among the four Macrobrachium rosenbergii populations along the Bangladesh coastline. The dendrogram is based on the F_ST values presented in Table 3, using the UPGMA method. Letters for sampling locations (BR, KR, MR, and NR) correspond to Figure 1

Discussion

Information about genetic population structure and connectivity of natural populations is important for sustainable harvest of populations and the management of diversity (Olsson et al., 2007). Natural populations can be affected by human activities such as aquaculture, exploitation for consumption, and environmental changes. Application of next‐generation sequencing (e.g. RAD sequencing) has proven successful in detecting population patterns, for example, in American lobster Homarus americanus (Benestan et al., 2015) and three spine stickleback Gasterosteus aculeatus (Baird et al., 2008; Hohenlohe et al., 2010; Jones et al., 2012), but application of such intensive methods has been limited in developing nations, particularly in South and South‐East Asia, where both biodiversity and threats to biodiversity are prominent (Willette et al., 2014). By using ddRADseq SNPs in addition to mitochondrial CO1 sequence variation, we revealed high genetic variation and two distinct populations of M. rosenbergii from different watersheds in Bangladesh: one in the Naf River, and the second in the Karnaphuli, the Meghna, and the Bishkhali rivers. Analyses of genome‐wide SNPs revealed a clear split between the population in the Naf River, at the boundary of Bangladesh and Myanmar, and the populations of the three other main rivers of Bangladesh, which appear to be more connected, with an indication of gene flow and admixture among the rivers. Despite the divergence of the population in the Naf River from the other locations, two individuals (of 20) from the Naf River showed the closest genetic similarity with individuals from the Bishkhali and the Meghna Rivers. This could possibly be explained by recent anthropogenic introductions, either through transportation for aquaculture or accidentally through the transfer of ballast water. 
The population sampled in Meghna River showed closest similarity with the Karnaphuli and the Bishkhali rivers, indicating connectivity and ongoing gene flow among these rivers either due to natural or human‐mediated mixing. Prawn can survive and grow in salinity up to 15 ppt (Chand et al., 2015), and possible migrations of larva, juveniles, and adults might have occurred among these rivers, through the Meghna and the Karnaphuli estuaries. Some extent of similarities was observed among all studied wild populations, which might have resulted from aquaculture practices as described earlier, or alternatively, due to shared ancestral polymorphism. Our findings were in line with the results from a previous genetic diversity study on M. rosenbergii sampled from the Pashur (36 ind.) and the Paira rivers (36 ind.), South‐West Bangladesh and the Naf River (36 ind.), South‐East Bangladesh, performed by Khan et al. (2014), based on seven variable microsatellites. Although their pairwise F ST values (0.012–0.021) were not significant, they were almost similar to the values observed in our study (0.011–0.035). The detection of significant differentiation between the populations in the present study might have been supported by the larger number of the nuclear markers. The result from the DAPC ordination method supported the differentiation of the Naf River population but indicated that there might also be barriers to admixture between the rivers. When considering the combination of base pairs at the different sites within the genome, the most individuals sampled in one river were more similar to the individuals from the same river than to the individuals in other river. To conclude, SNP variation revealed at least two distinct populations of M. rosenbergii sampled from Bangladesh: first, in the Naf River in between Bangladesh and Myanmar, and the second, in the Meghna, Bishkhali, and the Karnaphuli rivers. 
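To give a sense of the scale of the pairwise FST values discussed above, the following minimal sketch computes a simple multi‐locus FST between two populations from allele frequencies at biallelic SNPs. It rests on stated assumptions: it uses Nei's heterozygosity‐based form, FST = (HT − HS)/HT, with equal population weights and no sample‐size correction, which is not necessarily the estimator used in this study. The function name `nei_fst` and the allele frequencies are hypothetical.

```python
def nei_fst(freqs_pop1, freqs_pop2):
    """Simple two-population FST over biallelic loci.

    freqs_pop1, freqs_pop2: per-locus frequencies of the same allele in
    each population (values between 0 and 1, same locus order).
    Uses FST = (HT - HS) / HT, summing HS and HT over loci first
    (ratio of sums), with equal weight for both populations.
    """
    hs_sum = 0.0  # mean within-population expected heterozygosity
    ht_sum = 0.0  # expected heterozygosity of the pooled population
    for p1, p2 in zip(freqs_pop1, freqs_pop2):
        hs_sum += (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
        p_bar = (p1 + p2) / 2
        ht_sum += 2 * p_bar * (1 - p_bar)
    if ht_sum == 0:
        return 0.0  # no variation at any locus, so FST is undefined; report 0
    return (ht_sum - hs_sum) / ht_sum


# Hypothetical allele frequencies at three SNPs (NOT data from the study)
river_a = [0.20, 0.55, 0.70]
river_b = [0.40, 0.50, 0.65]
print(round(nei_fst(river_a, river_b), 4))
```

Under this estimator, even a frequency difference of 0.2 at a single locus yields an FST below 0.05, so values in the range of 0.01 to 0.04 correspond to modest shifts in allele frequencies that many loci are needed to detect reliably.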
The haplotypic variation revealed high genetic variability within populations. Mitochondrial CO1 sequence variation revealed high overall haplotype diversity in Bangladesh M. rosenbergii, and revealed some extent of population differentiation, indicating similar pattern of population structure to the nuclear data, although the support was weaker. Based on our findings of these genetically distinct populations, we suggest that they should be considered as separate management units for sustainable management, harvest, and conservation. Fishing efforts can be controlled based on the genetic patterns to avoid over or underexploitation of different populations. As Bangladesh M. rosenbergii has high genetic variation, prawn hatcheries could be more sensible when they use ovigerous females from the same area in order to reduce threats to the local population diversity due to accidental escape from aquaculture. Further information about the biology of the species, such as variation in time of reproduction, behavior and habitat, is warranted to evaluate whether the observed population structure can be explained by different sources rather than the geographical origins.

Conflict of Interest

None declared.

1. Historical demography and genetic differentiation of the giant freshwater prawn Macrobrachium rosenbergii in Bangladesh based on mitochondrial and ddRAD sequence variation.

Authors: M M Mahbub Alam; Kristen M Westfall; Snæbjörn Pálsson
Journal: Ecol Evol Date: 2017-05-09 Impact factor: 2.912
