| Literature DB >> 24896825 |
Bree K Yednock1, Joseph E Neigel1.
Abstract
The identification of genes involved in the adaptive evolution of non-model organisms with uncharacterized genomes constitutes a major challenge. This study employed a rigorous and targeted candidate gene approach to test for positive selection on protein-coding genes of the blue crab, Callinectes sapidus. Four genes with putative roles in physiological adaptation to environmental stress were chosen as candidates. A fifth gene not expected to play a role in environmental adaptation was used as a control. Large samples (n>800) of DNA sequences from C. sapidus were used in tests of selective neutrality based on sequence polymorphisms. In combination with these, sequences from the congener C. similis were used in neutrality tests based on interspecific divergence. In multiple tests, significant departures from neutral expectations and indicative of positive selection were found for the candidate gene trehalose 6-phosphate synthase (tps). These departures could not be explained by any of the historical population expansion or bottleneck scenarios that were evaluated in coalescent simulations. Evidence was also found for balancing selection at ATP-synthase subunit 9 (atps) using a maximum likelihood version of the Hudson, Kreitmen, and Aguadé test, and positive selection favoring amino acid replacements within ATP/ADP translocase (ant) was detected using the McDonald-Kreitman test. In contrast, test statistics for the control gene, ribosomal protein L12 (rpl), which presumably has experienced the same demographic effects as the candidate loci, were not significantly different from neutral expectations and could readily be explained by demographic effects. Together, these findings demonstrate the utility of the candidate gene approach for investigating adaptation at the molecular level in a marine invertebrate for which extensive genomic resources are not available.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24896825 PMCID: PMC4045945 DOI: 10.1371/journal.pone.0099081
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sampling locations and life stage totals for Callinectes sapidus.
| Country | State | Latitude (degrees) | Longitude (degrees) | Adult | Juveniles | Unknown | Megalopae | Totals |
| Mexico | Veracruz |
|
| 0 | 0 | 3 | 0 | 3 |
| USA | Louisiana | 29.8382 |
| 23 | 0 | 0 | 0 | 23 |
| 29.5520 |
| 47 | 0 | 0 | 24 | 71 | ||
| 29.2394 |
| 47 | 0 | 0 | 25 | 72 | ||
| 29.6202 |
| 24 | 0 | 0 | 0 | 24 | ||
| 30.3617 |
| 24 | 0 | 0 | 0 | 24 | ||
| 29.2540 |
| 47 | 0 | 0 | 24 | 71 | ||
| 29.5777 |
| 24 | 0 | 0 | 0 | 24 | ||
| 29.7121 |
| 33 | 8 | 0 | 25 | 66 | ||
| 29.7789 |
| 20 | 1 | 0 | 0 | 21 | ||
| Texas | 26.2336 |
| 4 | 11 | 0 | 0 | 15 | |
| Venezuela | Falcon |
|
| 0 | 1 | 1 | 0 | 2 |
| Zulia |
|
| 2 | 1 | 2 | 0 | 5 | |
|
|
| 0 | 1 | 0 | 0 | 1 | ||
|
|
| 0 | 0 | 2 | 0 | 2 | ||
| TOTALS | 295 | 23 | 8 | 98 | 424 |
Italics indicate approximate sampling coordinates based on the location name associated with the cataloged specimen from the University of Louisiana at Lafayette Zoological (ULLZ) Collection (exact coordinates were not available).
Figure 1Demographic models evaluated in simulations.
Demographic models and parameter values that were evaluated in coalescent simulations. t 1, time of expansion; t 2, length of bottleneck event; α, population growth parameter; F, bottleneck size defined as a proportion of the present day population (N e).
Sequence information and summary statistics for Callinectes sapidus.
| Locus | Length | % |
|
|
|
|
|
|
|
|
|
|
|
| 414 | 40 | 33 | 2 | 32 | 54 | 0.757 | 0.003 | 4.509 | 0.011 | 0.019 | 0.567 |
|
| 227 | 46 | 27 | 0 | 27 | 39 | 0.662 | 0.005 | 3.689 | 0.009 | 0.003 | 3.215 |
|
| 368 | 13 | 22 | 5 | 17 | 19 | 0.313 | 0.001 | 3.006 | 0.008 | 0.054 | 0.152 |
|
| 489 | 23 | 28 | 12 | 16 | 31 | 0.757 | 0.003 | 3.825 | 0.008 | 0.019 | 0.405 |
|
| 191 | 36 | 11 | 1 | 10 | 23 | 0.819 | 0.009 | 1.503 | 0.008 | 0.014 | 0.574 |
Length, trimmed length of analyzed sequence; %, percent coverage of the complete mRNA reference sequence available on GenBank (ant: EF077712, tps: EU679406, hsp: DQ663760, rpl: FJ774832) and the partial mRNA of atps (HM217802); S, total number of segregating sites; NS, number of nonsynonymous segregating sites; Syn, number of synonymous segregating sites; H; number of haplotypes; Hd, haplotype diversity; π, nucleotide diversity; θ, Watterson's theta; θ, theta pi, K, mean divergence between C. sapidus and C. similis, defined as the mean number of nucleotide differences per site between species.
Indicates one haplotype has a Syn and NS allele, therefore the value of S is 1 less than the sum of NS and Syn.
Figure 2Comparison of complete and truncated isoforms of the 70 kDa heat shock protein family.
(A) Protein structure of E.coli HSP70 (DNAK) chaperone (aa 1–605) complexed with ADP and substrate (PDB: 2KHO). (B) The truncated bovine HSC70 (aa 1–554) mutant E214A/D214B in post-ATP bound phase (PDB: 1YUW). The truncated form is missing a portion of the C-terminal α-helices, but retains the ATPase site within the nucleotide binding domain.
Linkage disequilibrium (LD) and recombination results for Callinectes sapidus.
| Locus |
|
|
|
|
|
|
|
| 0.002 | 9.829 | 4 | 0.024 | 0.001 | 0.643 |
|
| 0.006 | 0.000 | 3 | 0.000 | −0.006 | 0.401 |
|
| 0.060 | 0.000 | 0 | 0.000 | 0.071 | 0.897 |
|
| 0.011 | 0.000 | 2 | 0.000 | −0.009 | 0.438 |
|
| 0.030 | 1.160 | 2 | 0.006 | 0.026 | 0.757 |
r, mean pairwise LD; γ, population recombination rate; R, minimum number of recombination events; γ/bp, population recombination rate per base pair; ZZ, observed ZZ; p(ZZ), probability of simulated ZZ being as extreme or more extreme than ZZ.
*indicates p<0.005.
Tajima's D, Fay and Wu's H, Ewens-Watterson EW, and DHEW compound test results for Callinectes sapidus.
| Locus |
|
|
|
|
|
| −2.055 | −0.753 | 0.424 | 0.102 |
|
| −1.756 | −0.436 | 0.398 | 0.103 |
|
| −2.011** | −6.337** | 0.723 | 0.105 |
|
| −1.622 | 0.269 | 0.244 | 0.104 |
|
| 0.590 | −1.377 | 0.192 | 0.125 |
D, Tajima's D; H, Fay and Wu's H; EW, Ewens-Watterson EW; DHEW(P), critical P for DHEW test. Significance of the DHEW test at nominal α = 0.01 is based on the p-values for D and H being less than DHEW(P) and the p-value of EW being greater than 1-DHEW(P). Significance of the individual EW test is based on 1 minus the p-value of EW.
*p<0.01, **p<0.001.
McDonald-Kreitman test results for ant and tps.
| Locus | Mutation Class | Fixed | Polymorphic | Fisher's Exact (p) |
|
|
| 3 | 2 | |
|
| 2 | 34 | 0.0086 | |
|
|
| 2 | 4 | |
|
| 10 | 19 | 1.0000 |
McDonald-Kreitman tests for ant and tps using C. sapidus polymorphisms and fixed differences between C. sapidus and C. similis. Contingency tables for atps, hsp, and rpl could not be calculated because there were no fixed differences between C. sapidus and C. similis.
*p<0.025.
Figure 3Results of population expansion simulations.
P-values for Tajima's D and Fay and Wu's H tests from coalescent simulations of population expansion scenarios across a range of growth (α) and time of expansion (t 1) parameter values. The demographic model is shown in Figure 1.
Figure 4Results of population bottleneck simulations.
P-values for Tajima's D and Fay and Wu's H tests from coalesceant simulations of bottleneck scenarios run with a constant growth rate (α = 10) since the initial time of expansion (t 1). Bottleneck size (F) is defined as a proportion of the present day population (N e). The results of simulations with α = 50, 100, 150, and 200 show similar p-values for each locus. The demographic model is shown in Figure 1.