| Literature DB >> 22428050 |
Iderval S Sobrinho1, Reinaldo A de Brito.
Abstract
The gene doublesex (dsx) is considered to be under strong selective constraint along its evolutionary history because of its central role in somatic sex differentiation in insects. However, previous studies of dsx used global estimates of evolutionary rates to investigate its molecular evolution, which potentially miss signals of adaptive changes in generally conserved genes. In this work, we investigated the molecular evolution of dsx in the Anastrepha fraterculus species group (Diptera, Tephritidae), and test the hypothesis that this gene evolved solely by purifying selection using divergence-based and population-based methods. In the first approach, we compared sequences from Anastrepha and other Tephritidae with other Muscomorpha species, analyzed variation in nonsynonymous to synonymous rate ratios (dN/dS) in the Tephritidae, and investigated radical and conservative changes in amino acid physicochemical properties. We show a general selective constraint on dsx, but with signs of positive selection mainly in the common region. Such changes were localized in alpha-helices previously reported to be involved in dimer formation in the OD2 domain and near the C-terminal of the OD1 domain. In the population-based approach, we amplified a region of 540 bp that spanned almost all of the region common to both sexes from 32 different sites in Brazil. We investigated patterns of selection using neutrality tests based on the frequency spectrum and locations of synonymous and nonsynonymous mutations in a haplotype network. As in the divergence-based approach, these analyses showed that dsx has evolved under an overall selective constraint, but with some events of positive selection. In contrast to previous studies, our analyses indicate that even though dsx has indeed evolved as a conserved gene, the common region of dsx has also experienced bouts of positive selection, perhaps driven by sexual selection, during its evolution.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22428050 PMCID: PMC3302808 DOI: 10.1371/journal.pone.0033446
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Map of sampling sites in Brazil.
The haplotypes from each site are shown in the table at right.
Figure 2Unrooted phylogenetic trees estimated by maximum likelihood.
A) Male isoform. B) Female isoform. The foreground branch is marked by an asterisk (*). The branch lengths are as nucleotide substitutions per nucleotide site.
Parameters estimates and log-likelihood for models MA and MA null and contrast between models MA and MA null by likelihood ratio test (LRT).
| Data set | Model | Parameters estimates | log-likelihood | LRT | Positively selected sites |
| Male isoform | MA |
| −10154.136 | 8.872** |
|
| MA null |
| −10158.572 | Not allowed | ||
| Female isoform | MA |
| −7307.287 | 6.149** |
|
| MA null |
| −7310.361 | Not allowed |
ω 0 = dN/dS for sites with 0<ω<1 ; ω 1 = dN/dS for sites with ω = 1; ω 2 = dN/dS for sites with ω>1. p0 = proportion of sites with ω 0; p1 = proportion of sites with ω 1; (p2+p3) = proportion of sites with ω 2; All such parameters correspond only to foreground branches. The LRT critical values at 5% and 1% level are 2.71(*) and 5.41(**), respectively. Site numbers in boldface are at the common region.
Figure 3Bayes Empirical Bayes of male and female isoform.
A) Posterior probability for ω >1 in male isoform. B) Schematic representation of male and female transcripts. The dashed box stands for the amplified segment used in populational analyses. C) Posterior probability for ω>1 in female isoform. The dashed line represents the 0.95 limit of posterior probability for ω>1. Arrows indicate codon positions with posterior probability greater than 0.95 for ω >1. DM/OD1 – DNA-binding and dimerization domain 1. OD2 – dimerization domain 2.
Contrasts of evolutionary rates between different regions of doublesex by the fixed-sites test.
| Contrasted regions | Model contrast | LRT | D.F. | Probability (χ2) | Relative substitution rates |
|
|
| common region | A×B | 0.819 | 1 | 0.365 | rcommon = 1.0 |
|
|
| B×D |
| 2 |
| rmale = 0.9 |
|
| |
| C×E |
| 2 |
| ||||
| common region | A×B |
| 1 |
| rcommon = 1.0 |
|
|
| B×D | 4.296 | 2 | 0.117 | rfemale = 0.3 |
|
| |
| C×E | 2.521 | 2 | 0.284 | ||||
| female exon | A×B |
| 1 |
| rfemale = 1.0 |
|
|
| B×D | 1.808 | 2 | 0.405 | rmale = 3.4 |
|
| |
| C×E | 1.220 | 2 | 0.543 | ||||
| DM/OD1 vs variable common region | A×B |
| 1 |
| rDM/OD1 = 1.0 |
|
|
| B×D |
| 2 |
| rcommon = 1.3 |
|
| |
| C×E |
| 2 |
| ||||
| OD2 vs variable common region | A×B |
| 1 |
| rOD2 = 1.0 |
|
|
| B×D |
| 2 |
| rcommon = 2.05 |
|
| |
| C×E |
| 2 |
|
LRT – Likelihood Ratio Test; D.F. – degrees of freedom. Significant contrasts are in boldface.
Amino acid physicochemical properties under positive destabilizing and purifying selection in male and female isoforms of doublesex.
| Property | Goodness-of-fit neutral expectation | z-score in conservative change (category 1) | z-score in radical change (category 3) |
|
| |||
| Alpha-helix propensity derived from designed sequences (KOEP990101) | 7.004 | - | 2.354 |
| Average relative probability of beta-sheet (KANM800102) | 13.771 | - | 1.916 |
| Beta-sheet propensity derived from designed sequences (KOEP990102) | 27.322 | 3.100 | - |
| Beta-strand indices for beta-proteins (GEIM800106) | 31.892 | 2.141 | - |
| Hydrophilicity value (HOPT810101) | 102.508 | 4.165 | - |
| Hydrophobic parameter (LEVM760101) | 101.53 | 4.165 | - |
| Hydrophobicity (JOND750101) | 23.206 | 2.346 | - |
| Hydrophobicity (PRAM900101) | 50.67 | 2.556 | - |
| Hydrophobicity factor (GOLD730101) | 25.074 | 2.475 | - |
| Hydrophobicity index (ARGP820101) | 23.206 | 2.346 | - |
| Isoelectric point (ZIMJ680104) | 98.935 | 4.061 | - |
| Net charge (KLEP840101) | 96.412 | 3.890 |
|
| Normalized frequency of alpha-helix (BURA740101) | 8.490 | - | 2.350 |
| Normalized frequency of reverse turn, with weights (LEVM780103) | 13.843 | 1.906 |
|
| Normalized frequency of turn from LG (PALJ810105) | 9.347 | 1.811 |
|
| Normalized relative frequency of bend (ISOY800103) | 17.201 | 2.199 |
|
| Normalized relative frequency of double bend (ISOY800107) | 29.789 | 2.363 |
|
| pK (-COOH) (JOND750102) | 35.418 | 1.933 |
|
| Polarity (GRAR740102) | 55.985 | 3.251 |
|
| Polarity (ZIMJ680103) | 52.857 | 3.244 |
|
| Positive charge (FAUJ880111) | 29.904 | 2.179 |
|
|
| |||
| Beta-sheet propensity derived from designed sequences (KOEP990102) | 26.713 | 2.974 | - |
| Beta-strand indices for beta-proteins (GEIM800106) | 27.528 | 1.990 | - |
| Hydrophilicity value (HOPT810101) | 93.536 | 4.094 | - |
| Hydrophobic parameter (LEVM760101) | 92.808 | 4.094 | - |
| Hydrophobicity (JOND750101) | 14.690 | 1.858 | - |
| Hydrophobicity (PRAM900101) | 40.302 | 2.406 | - |
| Hydrophobicity factor (GOLD730101) | 17.573 | 2.062 | - |
| Hydrophobicity index (ARGP820101) | 14.690 | 1.858 | - |
| Isoelectric point (ZIMJ680104) | 89.787 | 3.998 | - |
| Negative charge (FAUJ880112) | 39.901 | 1.679 | - |
| Net charge (KLEP840101) | 86.841 | 3.804 | - |
| Normalized frequency of alpha-helix (BURA740101) | 6.760 | - | 2.249 |
| Normalized relative frequency of double bend (ISOY800107) | 30.651 | 2.460 | - |
| pK (-COOH) (JOND750102) | 29.895 | 1.827 | - |
| Polarity (GRAR740102) | 45.076 | 2.941 | - |
| Polarity (ZIMJ680103) | 45.213 | 3.107 | - |
| Positive charge (FAUJ880111) | 20.636 | 1.868 | - |
Properties under positive destabilizing selection are boldfaced.
- p<0.05;
- p<0.01;
- p<0.001.
Figure 4Sliding window plots of the z-scores of radically changed properties showing regions under positive-destabilizing selection in (A) female and (B) male isoform.
DM/OD1 – DNA-binding and dimerization domain. OD2 – dimerization domain 2. α1, α2 and α3 – alpha-helices that compose the predicted UBA-like domain. α* - disordered C-terminal tail, proposed to fold as an alpha-helix. Dashed horizontal line indicates the Bonferroni corrected significant limit (z-score = 3.07, p<0.05, male exon; z-score = 2.95, p<0.05, female exon).
Figure 5Haplotype network estimated by statistical parsimony (TCS ver 1.21).
Thicker lines represent nonsynonymous substitutions and small circles stand for inferred haplotypes. Haplotypes were labeled sequentially and the subsequent letters represent the respective identified species. O – A. obliqua, F – A. fraterculus, S – A. sororcula, G – A. grandis, T – A. striata, B – A. bistrigata, E– A. serpentina. Letters near lines represent internal mutations labels.
doublesex genetic diversity estimates of the common region and DM/OD1 domain.
| Domain/segment | species | N | h | Hd (SD) |
|
|
|
|
| full common region |
| 109 | 97 | 0.995 (0.003) | 0.016 (0.001) | 0.037 | 0.009 | 0.056 (0.014) |
|
| 39 | 35 | 0.993 (0.008) | 0.018 (0.001) | 0.039 | 0.010 | 0.035 (0.011) | |
|
| 44 | 40 | 0.995 (0.007) | 0.011 (0.001) | 0.022 | 0.007 | 0.033 (0.010) | |
|
| 18 | 18 | 1.000 (0.019) | 0.016 (0.003) | 0.038 | 0.009 | 0.025 (0.009) | |
| DM/OD1 |
| 109 | 58 | 0.936 (0.016) | 0.015 (0.001) | 0.032 | 0.010 | 0.054 (0.015) |
|
| 39 | 24 | 0.955 (0.018) | 0.017 (0.002) | 0.037 | 0.012 | 0.031 (0.011) | |
|
| 44 | 23 | 0.813 (0.060) | 0.008 (0.002) | 0.011 | 0.007 | 0.029 (0.010) | |
|
| 18 | 13 | 0.928 (0.052) | 0.018 (0.004) | 0.044 | 0.010 | 0.028 (0.011) | |
| common region without DM/OD1 |
| 109 | 79 | 0.986 (0.005) | 0.018 (0.001) | 0.042 | 0.010 | 0.056 (0.014) |
|
| 39 | 32 | 0.988 (0.009) | 0.018 (0.001) | 0.041 | 0.011 | 0.035 (0.011) | |
|
| 44 | 36 | 0.985 (0.010) | 0.014 (0.001) | 0.029 | 0.009 | 0.033 (0.010) | |
|
| 18 | 14 | 0.954 (0.039) | 0.017 (0.003) | 0.034 | 0.011 | 0.025 (0.010) |
N = number of sequences; h = number of haplotypes; Hd = haplotype diversity; S = number of polymorphic sites; Sy = number of synonymous changes; Nsy = number of nonsynonymous changes; π = nucleotide diversity; πs = synonymous nucleotide diversity; πa = nonsynonymous nucleotide diversity. SD = standard deviation.
Neutrality tests for dsx DM/OD1 and common region without DM/OD1 and OD2 domains.
| Domain/segment | Species | Tajima's | Fu and Li's | Fay and Wu's |
| full common region |
| − |
| −34.402 |
|
| −1.782 | −3.000a | −14.826 | |
|
|
|
| −7.904 | |
|
| −1.376 | −2.050a | −3.719 | |
| DM/OD1 |
|
|
| −9.657 |
|
| −1.475 |
| −4.920 | |
|
|
|
| −3.824 | |
|
| −1.399 | −1.665 | −4.760 | |
| common region without DM/OD1 |
|
|
| −17.489 |
|
| −1.849 |
| −9.905 | |
|
|
|
| −4.080 | |
|
| −1.34039 | −1.977 | 1.041 |
Significant tests after Dunn-Šidàk correction are given in bold.
p<0.01;
p<0.05; a 0.10>p>0.05.
Contingency analysis of synonymous and nonsynonymous vs tip or interior position in the haplotype network.
| Position in the network | synonymous | nonsynonymous |
| Tip | 71 | 95 |
| Interior | 34 | 9 |
Fisher's Exact Test probability under the null hypothesis of homogeneity: 2.7 10−6.
Permutation analysis of synonymous and nonsynonymous mutations versus position in the haplotype network (tip or interior).
| synonymous | nonsynonymous | |||
| Position in the network | DM/OD1 | common region | DM/OD1 | common region |
| Tip | 24 | 47 | 35 | 60 |
| Interior | 11 | 23 | 3 | 6 |
Permutational probability: 0.0004.
Contingency analyses of how synonymous and nonsynonymous mutations within DM/OD1 and common region are distributed across the haplotype network position (tip or interior).
| DM/OD1 | ||
| Position in the network | synonymous | nonsynonymous |
| Tip | 24 | 35 |
| Interior | 11 | 3 |
Fisher's Exact Test probability under the null hypothesis of homogeneity: 0.0008.