| Literature DB >> 34184267 |
Bahar Patlar1, Vivek Jayaswal2, José M Ranz3, Alberto Civetta1.
Abstract
Seminal fluid proteins (SFPs) are a group of reproductive proteins that are among the most evolutionarily divergent known. As SFPs can impact male and female fitness, these proteins have been proposed to evolve under postcopulatory sexual selection (PCSS). However, the fast change of the SFPs can also result from nonadaptive evolution, and the extent to which selective constraints prevent SFPs rapid evolution remains unknown. Using intra- and interspecific sequence information, along with genomics and functional data, we examine the molecular evolution of approximately 300 SFPs in Drosophila. We found that 50-57% of the SFP genes, depending on the population examined, are evolving under relaxed selection. Only 7-12% showed evidence of positive selection, with no evidence supporting other forms of PCSS, and 35-37% of the SFP genes were selectively constrained. Further, despite associations of positive selection with gene location on the X chromosome and protease activity, the analysis of additional genomic and functional features revealed their lack of influence on SFPs evolving under positive selection. Our results highlight a lack of sufficient evidence to claim that most SFPs are driven to evolve rapidly by PCSS while identifying genomic and functional attributes that influence different modes of SFPs evolution.Entities:
Keywords: Functional attributes; nonadaptive evolution; postcopulatory sexual selection; relaxed selection; selective constraints; seminal fluid proteins
Mesh:
Substances:
Year: 2021 PMID: 34184267 PMCID: PMC8457112 DOI: 10.1111/evo.14297
Source DB: PubMed Journal: Evolution ISSN: 0014-3820 Impact factor: 3.694
A comparison of D. melanogaster and D. simulans divergence between SFPs and the rest of the genome
| Pop. | Genome | SFPs | Comparisons | |||||
|---|---|---|---|---|---|---|---|---|
|
| Mean | 95% CI |
| Mean | 95% CI |
| ||
| Zambia | ||||||||
|
| 13,370 | 0.032 | 0.031–0.032 | 317 | 0.051 | 0.047–0.056 | 12.6 × 105 | <0.001 |
|
| 13,370 | 0.022 | 0.021–0.022 | 317 | 0.042 | 0.037–0.047 | 12.3 × 105 | <0.001 |
|
| 13,302 | 0.074 | 0.073–0.075 | 315 | 0.089 | 0.084–0.095 | 16.5 × 105 | <0.001 |
| 12,801 | 0.274 | 0.268–0.279 | 310 | 0.474 | 0.426–0.524 | 12.6 × 105 | <0.001 | |
| Raleigh | ||||||||
|
| 13,370 | 0.029 | 0.028–0.029 | 317 | 0.048 | 0.048–0.059 | 13.3 × 105 | <0.001 |
|
| 13,370 | 0.019 | 0.018–0.019 | 317 | 0.039 | 0.034–0.044 | 13.1 × 105 | <0.001 |
|
| 13,302 | 0.071 | 0.070–0.071 | 315 | 0.089 | 0.082–0.096 | 16.6 × 105 | <0.001 |
| 12,801 | 0.251 | 0.245–0.256 | 283 | 0.443 | 0.396–0.490 | 10.3 × 105 | <0.001 | |
Means and 95% confidence intervals (CI) from 10,000 bootstraps for D, K a, K s, and K a/K s for genome and SFP genes are shown. Wilcoxon Rank Sum (W) tests were applied to assess for differences in divergence between SFP genes and the rest of the genome. n is the number of proteins examined.
Figure 1Mode of evolution of SFP genes. Genes were divided into two groups based on their K a/K s ratios relative to the genome average. The eMKT, as well as comparisons of polymorphism (π a/π s) and divergence (K a/K s), and the frequency distribution of derived alleles (DAF) were used to group genes under three major groups (bold). Significance (sig.) indicates FDR corrected P < 0.05.
Figure 2Summary statistics for polymorphism and divergence, and modes of evolution, compared between the ancestral (ZI) and derived (RAL) D. melanogaster populations. (A) Average polymorphism (π a/π s) and divergence (K a/K s) for the genome and SFPs after correcting for covariance between the two estimates. Error bars are 95% confidence intervals. (B) The derived allele frequency spectrum of mean counts of nonsynonymous (Daf0f) and synonymous (Daf4f) polymorphisms in SFP genes with K a/K s higher than the genome average and nonsignificant α. (C) The ratio of nonadaptive (ω na) and adaptive (ω a) nonsynonymous to synonymous substitutions, respectively. Error bars represent standard error of the mean. Dark‐colored bar: Raleigh population, light‐colored bar: Zambia population. (D) SFP genes modes of evolution in ZI and RAL. Modes of evolution groups are positive selection (orange), relaxed selection (green), and selectively constrained (blue). Numbers are counts per group with colored lines showing movement of genes across group classifications.
Patterns of nonrandom association for six genomic or functional features and different gene categories based on their mode of molecular evolution
| Selection regime | |||||||
|---|---|---|---|---|---|---|---|
| Constrained | Positive | Relaxed | |||||
| Feature | Odds ratio | Odds ratio | Odds ratio | ||||
| Zambia | |||||||
| Transferred vs. nontransferred | 0.076 | 2.358 | 0.169 | 3.000 | 0.487 | 0.327 | 0.074 |
| Autosomes vs. X |
| 1.123 | 1.000 | 0.109 |
| 5.661 |
|
| Reproductive vs. nonreproductive |
| 0.251 |
| 1.174 | 0.833 | 3.813 |
|
| Post‐mating vs. unknown | 1.000 | 1.036 | 1.000 | 0.871 | 1.000 | 1.022 | 1.000 |
| Immunity vs. unknown | 0.178 | 1.581 | 0.413 | 2.152 | 0.331 | 0.400 | 0.331 |
| Proteases vs. nonproteases |
| 1.062 | 1.000 | 3.745 |
| 0.431 | 0.057 |
| Raleigh | |||||||
| Transferred vs. nontransferred | 0.076 | 2.358 | 0.169 | 3.000 | 0.487 | 0.327 | 0.074 |
| Autosomes vs. X |
| 0.779 | 0.626 | 0.096 |
| 4.126 |
|
| Reproductive vs. nonreproductive |
| 0.288 |
| 1.275 | 0.784 | 3.168 |
|
| Post‐mating vs. unknown | 0.583 | 0.776 | 0.686 | 0.655 | 0.768 | 1.396 | 0.686 |
| Immunity vs. unknown | 0.230 | 1.319 | 0.776 | 2.690 | 0.776 | 0.545 | 0.776 |
| Proteases vs. nonproteases |
| 0.803 | 0.693 | 5.112 |
| 0.671 | 0.503 |
Only the 254 SFPs common to the populations of Zambia and Raleigh are considered.
1For each feature, genes are split into two categories and differential association with the three modes of evolution is tested using a 2 × 3 Fisher exact test (FET). When significant, the P‐value is bolded.
2Post hoc 2 × 2 FETs to test for significant excess (odds ratio > 1) or deficit (odds ratio < 1) between any selective regime and the other two. P‐values are FDR corrected. When significant, the P‐value is bolded, and an arrow identifies the excess or deficit for the first category listed. For the alternative category, the pattern is the opposite (e.g., autosomal SFPs are underrepresented, whereas X‐linked SFPs are enriched, in the positive selection group).