| Literature DB >> 24489114 |
José L Campos1, Daniel L Halligan, Penelope R Haddrill, Brian Charlesworth.
Abstract
Genetic recombination associated with sexual reproduction increases the efficiency of natural selection by reducing the strength of Hill-Robertson interference. Such interference can be caused either by selective sweeps of positively selected alleles or by background selection (BGS) against deleterious mutations. Its consequences can be studied by comparing patterns of molecular evolution and variation in genomic regions with different rates of crossing over. We carried out a comprehensive study of the benefits of recombination in Drosophila melanogaster, both by contrasting five independent genomic regions that lack crossing over with the rest of the genome and by comparing regions with different rates of crossing over, using data on DNA sequence polymorphisms from an African population that is geographically close to the putatively ancestral population for the species, and on sequence divergence from a related species. We observed reductions in sequence diversity in noncrossover (NC) regions that are inconsistent with the effects of hard selective sweeps in the absence of recombination. Overall, the observed patterns suggest that the recombination rate experienced by a gene is positively related to an increase in the efficiency of both positive and purifying selection. The results are consistent with a BGS model with interference among selected sites in NC regions, and joint effects of BGS, selective sweeps, and a past population expansion on variability in regions of the genome that experience crossing over. In such crossover regions, the X chromosome exhibits a higher rate of adaptive protein sequence evolution than the autosomes, implying a Faster-X effect.Entities:
Keywords: Drosophila melanogaster; Hill–Robertson interference; background selection; crossing over; heterochromatin; recombination; selective sweeps
Mesh:
Substances:
Year: 2014 PMID: 24489114 PMCID: PMC3969569 DOI: 10.1093/molbev/msu056
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Summary Statistics for Autosomal Genes in Crossover Regions (AC), X Chromosome Genes in Crossover regions (XC), and All NC Genes.
| AC | XC | NC | |
|---|---|---|---|
| 7,099 | 1,319 | 268 | |
| 45,373 | 8,868 | 620 | |
| 144,370 | 34,812 | 777 | |
| 0.00143 (0.00139, 0.00146) | 0.00128 (0.00120, 0.00135) | 0.000537 (0.000313, 0.000761) | |
| 0.0141 (0.0139, 0.0144) | 0.0156 (0.0151, 0.0161) | 0.00218 (0.000990, 0.00338) | |
| 0.101 (0.098, 0.104) | 0.0818 (0.0765, 0.0875) | 0.268 (0.215, 0.321) | |
| 0.00179 (0.00175, 0.00184) | 0.00178 (0.00168, 0.00188) | 0.000620 (0.000381, 0.000859) | |
| 0.0147 (0.0145, 0.0150) | 0.0178 (0.0173, 0.0183) | 0.00230 (0.00124, 0.00337) | |
| 0.514 (0.492, 0.536) | 0.610 (0.549, 0.677) | 0.439 (0.345, 0.533) | |
| 0.354 (0.340, 0.369) | 0.427 (0.395, 0.465) | 0.393 (0.296, 0.491) | |
| −0.666 (−0.685, −0.646) | −0.953 (−0.996, −0.911) | −0.603 (−0.972, −0.234) | |
| −0.173 (−0.190, −0.157) | −0.532 (−0.563, −0.5014) | −0.354 (−0.778, 0.069) | |
| 0.0381 (0.0371, 0.0391) | 0.0404 (0.0381, 0.0427) | 0.0549 (0.0499, 0.0599) | |
| 0.262 (0.260, 0.264) | 0.258 (0.254, 0.262) | 0.273 (0.266, 0.279) | |
| 0.145 (0.141, 0.150) | 0.156 (0.148, 0.166) | 0.204 (0.184, 0.222) | |
| 0.000035 (−0.000003, 0.000071) | −0.00004 (−0.00014, 0.00006) | 0.000118 (0.000057, 0.000179) | |
| −0.00296 (−0.00319, −0.00274) | −0.00292 (−0.00356, −0.00231) | −0.000089 (−0.000714, 0.000537) |
Note.—N, number of genes analyzed; S, number of segregating sites (subscript A: nonsynonymous sites; subscript S: synonymous sites); π, mean number of nucleotide differences per site; θ, mean value of Watterson’s theta per gene; D, mean value of Tajima’s D per gene; K, mean value of divergence per nucleotide site from D. yakuba; Psing, proportion of segregating sites that are singletons; H, mean value of the Fay and Wu statistic. The quantities in parentheses are the 95% CIs of the means; for C regions, these were obtained by bootstrapping across genes and for NC regions by jackknifing across the five independent NC regions.
Summary Statistics for the Five NC Regions.
| N2 | N3 | N4 | NXc | NXt | |
|---|---|---|---|---|---|
| 59 | 99 | 67 | 19 | 23 | |
| 142 | 150 | 191 | 72 | 65 | |
| 222 | 197 | 176 | 104 | 78 | |
| 0.000455 (0.000234) | 0.000426 (0.000218) | 0.000279 (0.000143) | 0.000955 (0.000498) | 0.00057 (0.000299) | |
| 0.00221 (0.00113) | 0.00163 (0.000828) | 0.000807 (0.000413) | 0.004438 (0.002281) | 0.001829 (0.000953) | |
| 0.206 (0.148) | 0.262 (0.190) | 0.346 (0.251) | 0.215 (0.158) | 0.312 (0.230) | |
| 0.000564 (0.000215) | 0.000431 (0.000164) | 0.000384 (0.000146) | 0.00107 (0.000418) | 0.000651 (0.000256) | |
| 0.00254 (0.00096) | 0.00160 (0.000606) | 0.00102 (0.000387) | 0.00422 (0.00162) | 0.00215 (0.000838) | |
| 0.458 | 0.320 | 0.597 | 0.361 | 0.462 | |
| 0.374 | 0.279 | 0.528 | 0.298 | 0.487 | |
| −0.821 | −0.050 | −1.173 | −0.450 | −0.523 | |
| −0.551 | 0.083 | −0.890 | 0.224 | −0.639 | |
| 0.0603 (0.0496, 0.0698) | 0.0549 (0.0452, 0.0635) | 0.0556 (0.0467, 0.0643) | 0.0597 (0.0374, 0.0799) | 0.0349 (0.0258, 0.0445) | |
| 0.294 (0.278, 0.310) | 0.284 (0.273, 0.296) | 0.248 (0.238, 0.259) | 0.252 (0.226, 0.277) | 0.254 (0.234, 0.274) | |
| 0.205 (0.169, 0.244) | 0.193 (0.163, 0.226) | 0.224 (0.190, 0.258) | 0.237 (0.155, 0.336) | 0.137 (0.101, 0.175) | |
| 0.000105 | 0.000142 | 0.0000034 | 0.000161 | 0.000177 | |
| −0.00111 | 0.0000632 | 0.000283 | −0.000431 | 0.000754 |
Note.—The entries in the columns headed N2–N4 are the mean values for the NC regions of chromosomes 2−4; those under NXc are for the NC region of the X adjacent to the centromere, and those under NXt are for the NC region of the X adjacent to the telomere. The meaning of the other symbols is the same as for table 1, except that the quantities in brackets for the diversity statistics π and θ are the standard errors of the means obtained from the coalescent process formulae with no recombination; the standard errors for the corresponding ratios were obtained by the delta method formula for a ratio (see Materials and Methods).
aNo DS was significantly different from 0 when tested by 1,000 coalescent simulations with no recombination.
2 × 2 Contingency Tables Comparing the Numbers of Derived Mutations in Different Frequency Categories in C and NC regions for Nonsynonymous (A) and Synonymous (S) Variants.
| No. of Derived Mutations | Site | Region | ||
|---|---|---|---|---|
| AC | NA | |||
| 1 (singletons) | A | 18,070 | 135 | 2 × 10−10 |
| S | 37,810 | 126 | ||
| 2–7 (intermediate) | A | 12,914 | 127 | 2 × 10−13 |
| S | 48,427 | 190 | ||
| 8–16 (high) | A | 5,187 | 49 | 2 × 10−11 |
| S | 27,010 | 64 | ||
| 1–16 (all) | A | 36,171 | 311 | 1 × 10−32 |
| S | 113,247 | 380 | ||
| XC | NX | |||
| 1 (singletons) | A | 3,157 | 35 | 0.0023 |
| S | 8,531 | 46 | ||
| 2–7 (intermediate) | A | 1,455 | 43 | 5 × 10−11 |
| S | 7769 | 53 | ||
| 8–16 (high) | A | 709 | 16 | 0.00017 |
| S | 4097 | 25 | ||
| 1–16 (all) | A | 5321 | 94 | 1 × 10−13 |
| S | 20397 | 124 | ||
Note.—AC, autosomal C region; NA, autosomal NC regions; XC, X-chromosome C regions; NX, X chromosome NC regions. P value: Fisher's exact test probability for the corresponding 2 × 2 table.
FRatio of the number of derived nonsynonymous mutations per nonsynonymous site to the number of synonymous mutations per synonymous site, for three categories of frequencies of derived variants. AC, autosomal crossover regions; XC, X chromosome crossover regions; NA, autosomal NC regions; NX, X chromosome NC regions.
Estimates of the Proportions (α) and the Relative Rates (ω) of Adaptive Nonsynonymous Substitutions.
| N2 | 0.016 | 0.0030 |
| N3 | −0.337 | −0.0641 |
| N4 | −0.449 | −0.0998 |
| NXc | −0.039 | −0.0085 |
| NXt | −1.253 | −0.1762 |
| NC | −0.412 (−0.858, 0.034) | −0.069 (−0.133, −0.0051) |
| AC | 0.368 (0.339, 0.405) | 0.053 (0.049, 0.059) |
| XC | 0.569 (0.539, 0.597) | 0.089 (0.082, 0.096) |
| oAC | 0.401 (0.382, 0.419) | 0.058 (0.054–0.061) |
| oXC | 0.548 (0.496, 0.595) | 0.091 (0.079–0.103) |
Note.—The quantities in parentheses are the 95% CIs of the values obtained by the method of Fay et al. (2002); for C regions, these are obtained by bootstrapping across genes, and for NC by jackknifing across the five independent NC regions. oAC, overlap autosomal crossover regions; oXC, overlap X crossover region (“overlap” means that the X and autosomal genes in these regions have similar effective rates of recombination—see Materials and Methods for details).
Minimum Numbers of Recombinants (Rh) Detected in Each NC Region.
| N2 | 119 | 1.184 |
| N3 | 74 | 0.53 |
| N4 | 40 | 0.202 |
| NXc | 74 | 2.709 |
| NXt | 27 | 0.67 |
FCorrelations between diversity statistics and the numbers of sites in coding sequences in the five NC regions for nonsynonymous diversity πA, synonymous diversity πS, and the ratio πA/πS. ρ, Spearman’s rank correlation coefficient, with significance denoted by asterisks (***<0.001; *<0.05).
FB values for the five NC regions (red dots) against the number of coding sequence sites in each region. The blue line shows the effects of HRI on B due to BGS, predicted by Kaiser and Charlesworth (2009). The error bars are the standard errors of B obtained from the diversity statistics for the NC regions as described for table 2.
FRelations between the effective recombination rate and the means of several variables for genes in the C regions, after grouping genes into bins defined by rates of crossing over. The x axis gives the mean effective recombination rate (cM/Mb) for each bin. Autosomal genes (A) are shown in green and X-linked (X) genes in red. Values for NC regions are indicated by the filled points at the extreme left of each panel but are not included in the correlation or regression analyses (black: the five NR blocks; green: autosomal NC genes; red: X-linked NC genes). ρ: Spearman’s rank correlation coefficient, with significance denoted by asterisks (***<0.001; **<0.01; *<0.05). The lines are least-squares regressions but should be regarded only as indicative, in view of the binning of the data.