| Literature DB >> 29158556 |
Haoxuan Liu1, Ju Huang1, Xiaoguang Sun1, Jing Li2, Yingwen Hu1, Luyao Yu1, Gianni Liti2, Dacheng Tian1, Laurence D Hurst3, Sihai Yang4.
Abstract
GC-favouring gene conversion enables fixation of deleterious alleles, disturbs tests of natural selection and potentially explains both the evolution of recombination as well as the commonly reported intragenomic correlation between G+C content and recombination rate. In addition, gene conversion disturbs linkage disequilibrium, potentially affecting the ability to detect causative variants. However, the importance and generality of these effects is unresolved, not simply because direct analyses are technically challenging but also because previous within- and between-species discrepant results can be hard to appraise owing to methodological differences. Here we report results of methodologically uniform whole-genome sequencing of all tetrad products in Saccharomyces, Neurospora, Chlamydomonas and Arabidopsis. The proportion of polymorphic markers converted varies over three orders of magnitude between species (from 2% of markers converted in yeast to only ~0.005% in the two plants) with at least 87.5% of the variance in per tetrad conversion rates being between species. This is largely due to differences in recombination rate and median tract length. Despite three of the species showing a positive GC-recombination correlation, there is no significant net AT→GC conversion bias in any of the species, despite relatively high resolution in the two taxa (Saccharomyces and Neurospora) with relatively common gene conversion. The absence of a GC bias means that: (1) there should be no presumption that gene conversion is GC biased, or (2) that a GC-recombination correlation necessarily implies biased gene conversion, (3) K a/K s tests should be unaffected in these species and (4) it is unlikely that gene conversion explains the evolution of recombination.Entities:
Mesh:
Year: 2017 PMID: 29158556 PMCID: PMC5733138 DOI: 10.1038/s41559-017-0372-7
Source DB: PubMed Journal: Nat Ecol Evol ISSN: 2397-334X Impact factor: 15.460
Rate of COs and NCOs in each cross.
| Organisms | Parental strains | Marker density | Tetrads sequenced | CO | NCO-GC | ||||
|---|---|---|---|---|---|---|---|---|---|
| CO Events | cM/Mb | CO-GC | Events | Converted markers | |||||
| Events | Converted markers | ||||||||
| YJM789 × S96 (S1) | 0.44% | 14 | 94.4 | 391 | 67.5 | 1.3×10-2 | 47.8 (53.9) | 9.0×10-3 | |
| YJM789 × S96 | 0.44% | 46 | 90.5 | 375 | 62.7 | 1.1×10-2 | 46.2 | 8.4×10-3 | |
| YJM789 × YPS128 (S2) | 0.50% | 15 | 76.5 | 317 | 63.2 | 9.8×10-3 | 46.4 (49.9) | 7.4×10-3 | |
| FGSC2489 × FGSC3246 (N1) | 0.04% | 10 | 11.5 | 14 | 0.3 | 3.8×10-5 | 0.1 | 3.2×10-5 | |
| FGSC4200 × FGSC1363 (N2) | 0.66% | 20 | 16.1 | 20 | 8.1 | 2.9×10-4 | 6.0 (11.1) | 2.0×10-4 | |
| FGSC2225 × FGSC3246 (N3) | 1.06% | 18 | 15.7 | 19 | 8.7 | 1.2×10-4 | 7.3 (10.0) | 1.0×10-4 | |
| N3-14-2 × N3-14-4 (N4) | 0.61% | 9 | 17.1 | 21 | 7.7 | 6.0×10-4 | 4.3 (8.8) | 1.8×10-4 | |
| CC124 × CC1010 (C1) | 0.14% | 6 | 16.5 | 8 | 0.2 | 1.0×10-5 | 0.2 | 1.3×10-6 | |
| CC2935 × CC2936 (C2) | 1.53% | 12 | 25.3 | 12 | 12.3 | 4.6×10-5 | 3.1 (8.1) | 3.7×10-6 | |
| CC408 × CC2936 (C3) | 1.17% | 9 | 23.2 | 11 | 10.1 | 3.2×10-5 | 2.2 (7.3) | 3.2×10-6 | |
| Col × Ler (A) | 0.26% | 9 | 11.0 | 4.6 | 3.9 | 5.1×10-5 | 2.0 (16.7) | 1.1×10-5 | |
| Col × Ler | 0.12% | 5 | 10.4 | 4.3 | 2.0 | 3.5×10-5 | 1.4 | 1.0×10-5 | |
| - | 0.001%~0.05% | - | 43/23 | 1.5/0.8 | - | - | - | 6×10-6 | |
These numbers are counted as per tetrad per meiosis.
These numbers are proportion of markers converted.
Numbers in the parenthesis are corrected number based on simulated tract length and probability of detection.
These two strains are products of a single meiosis in cross N3 (FGSC2225 × FGSC3246).
43 COs per cell and 1.5 cM/Mb for oocytes and 23 COs per cell and 0.8 cM/Mb for sperms.
20, 21, 32, 64 The four numbers correspond to the reference numbers, the grey shaded data are from these references.
Figure 1Schematic illustration of experiment design.
After a cross is made between two strains, the meiotic products are dissected and whole-genome re-sequenced individually. The meiotic products in yeast, Chlamydomonas, and Arabidopsis are tetrads. Each tetrad can be dissected into four spores and each spore is sequenced after being grown into a clonal colony. The meiotic products in Neurospora are ascospores, which are products of a meiosis followed by a mitosis and thus have eight spores, every two spores originate from a mitosis. Four of the eight spores in each ascospore are selected for sequencing.
Figure 2Outcomes of DSBR and SDSA pathways in meiotic recombination.
Four kinds of outcomes of recombination events are shown in this figure: Type I: A single crossover event; Type II: A crossover and a non-crossover event occur at the same locus indicating that the two crossing over chromatids invaded a third chromatid during a recombination event65,66; Type III: Two non-crossover events occur at the same locus. This can be explained by resolution of a double Holiday junction in a NCO fashion65,66 or two chromatids breaks at the same locus. Type IV: A single non-crossover event. The rates (count per tetrad per meiosis) of each outcome in these four organisms are shown. As shown in black dotted frame, for type II events, the CO and the NCO event do not share the same genotype switching point, and for type III events, the two NCO events mostly do not share the same conversion tract, and they always lead to 2:2 or 4:0 segregation.
Figure 3Estimation of tract length for gene conversion events.
Two methods were used for the estimation. a. and b. show the violin plot for estimated tract length for NCO-GCs (a) and CO-GCs (b) by the midpoint method. c. is the simulation of average markers converted at different tract lengths in each cross. e.g. in cross A, the observed number of markers converted by CO-GC is 4, and in this simulation this averages at 700 bp tract length. d. Summary of tract lengths estimated from the two methods.
Nucleotide direction of gene conversion events.
The ratio (number) of AT->GC and GC->TA conversions were listed in each category, p-values are from binomial test. AT-biased trend is highlighted in green while GC-biased trend is highlighted in red.
| Organisms | GC-content of whole genome | Parental strains | GC-content at SNP sites | NCO-GC + CO-GC | NCO-GC | CO-GC | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AT->GC | GC->AT | p-value | AT->GC | GC->AT | p-value | AT->GC | GC->AT | p-value | ||||
| 38.30% | YJM789 | 48.79% | 49.47% (7153) | 50.53% (7307) | 0.2 | 50.63% (2914) | 49.37% (2842) | 0.34 | ||||
| 50.52% (15238) | 49.48% (14922) | 0.07 | ||||||||||
| YJM789 | 47.85% | 49.41% | 50.59% | 0.3 | ||||||||
| 48.50% | FGSC4200 | 49.59% | 50.13% | 49.87% | 0.92 | 49.46% | 50.54% | 0.77 | 50.60% | 49.40% | 0.68 | |
| FGSC2225 | 50.43% | 51.18% | 48.82% | 0.38 | 52.32% | 47.68% | 0.24 | 50.13% | 49.87% | 0.97 | ||
| C14-2 | 50.32% | 50.34% | 49.66% | 0.81 | 50.75% | 49.25% | 0.83 | 50.22% | 49.78% | 0.9 | ||
| 64.08% | CC2935 | 52.67% | 52.95% | 47.05% | 0.11 | 51.98% | 48.02% | 0.31 | ||||
| CC408 | 52.47% | 51.50% | 48.50% | 0.62 | 49.18% | 50.82% | 0.82 | |||||
| 36.03% | Col | 46.14% | 54.17% | 45.83% | 0.41 | 50.00% | 50.00% | 1 | 54.90% | 45.10% | 0.37 | |
| Col | 45.98% | 51.95% | 48.05% | 0.82 | 33.33% | 66.67% | 1 | 52.70% | 47.30% | 0.73 | ||
15, 20, 32 The three numbers correspond to the reference numbers, the grey shaded data are from these references.
Correlations between COs, NCOs and GC-content at different scales.
The genome is divided into 10 kb (20 kb, 100 kb, and 200 kb) non-overlapping windows. The number of COs, number of NCOs, and GC-content (percentage of identifiable bases that are G or C) are calculated for each window. The raw correlations between these variables at each window size are shown in table below. Significant correlations are highlighted in red.
| Variables | Block size | Yeast | Neurospora | Chlamydomonas | Arabidopsis | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Spearman's ρ | Spearman's ρ | Spearman's ρ | Spearman's ρ | |||||||
| 10k | 0.252 | 2.2E-16 | 0.077 | 8.4E-7 | 0.032 | 0.00077 | -0.0387 | 2.5E-5 | ||
| 20k | 0.207 | 2.5E-7 | 0.129 | 5.5E-9 | 0.035 | 0.0102 | -0.0458 | 0.0004 | ||
| 50k | 0.144 | 0.023 | 0.167 | 1.7E-6 | 0.048 | 0.026 | -0.0621 | 0.0024 | ||
| 100k | 0.0588 | 0.51 | 0.154 | 0.0019 | 0.053 | 0.078 | -0.117 | 5.1E-5 | ||
| 200k | -0.149 | 0.22 | 0.167 | 0.016 | 0.075 | 0.076 | -0.162 | 6.6E-5 | ||
| 10k | 0.195 | 6.8E-12 | 0.049 | 0.0018 | 0.036 | 0.00019 | -0.0129 | 0.16 | ||
| 20k | 0.167 | 3.4E-5 | 0.062 | 0.00499 | 0.044 | 0.0012 | -0.0194 | 0.13 | ||
| 50k | 0.167 | 0.0084 | 0.089 | 0.011 | 0.054 | 0.0114 | -0.0362 | 0.077 | ||
| 100k | 0.1 | 0.26 | 0.152 | 0.0021 | 0.087 | 0.0038 | -0.0368 | 0.2 | ||
| 200k | -0.05 | 0.68 | 0.209 | 0.0025 | 0.175 | 3.5E-5 | -0.0171 | 0.68 | ||
| 10k | 0.270 | <2.2E-16 | 0.0939 | 2.2E-9 | 0.0414 | 1.8E-5 | -0.0407 | 8.9E-6 | ||
| 20k | 0.229 | 1.1E-8 | 0.145 | 5.0E-11 | 0.0455 | 0.0008 | -0.0505 | 9.6E-5 | ||
| 50k | 0.201 | 0.0015 | 0.190 | 5.0E-8 | 0.0646 | 0.0025 | -0.0709 | 0.0005 | ||
| 100k | 0.151 | 0.0898 | 0.192 | 9.5E-5 | 0.0783 | 0.009 | -0.125 | 1.5E-5 | ||
| 200k | -0.100 | 0.42 | 0.206 | 0.0029 | 0.142 | 0.0008 | -0.157 | 0.0001 | ||