| Literature DB >> 27643539 |
Bjarni V Halldorsson1,2, Marteinn T Hardarson1, Birte Kehr1, Unnur Styrkarsdottir1, Arnaldur Gylfason1, Gudmar Thorleifsson1, Florian Zink1, Adalbjorg Jonasdottir1, Aslaug Jonasdottir1, Patrick Sulem1, Gisli Masson1, Unnur Thorsteinsdottir1,3, Agnar Helgason1,4, Augustine Kong1, Daniel F Gudbjartsson1,5, Kari Stefansson1,3.
Abstract
Meiotic recombination involves a combination of gene conversion and crossover events that, along with mutations, produce germline genetic diversity. Here we report the discovery of 3,176 SNP and 61 indel gene conversions. Our estimate of the non-crossover (NCO) gene conversion rate (G) is 7.0 for SNPs and 5.8 for indels per megabase per generation, and the GC bias is 67.6%. For indels, we demonstrate a 65.6% preference for the shorter allele. NCO gene conversions from mothers are longer than those from fathers, and G is 2.17 times greater in mothers. Notably, G increases with the age of mothers, but not the age of fathers. A disproportionate number of NCO gene conversions in older mothers occur outside double-strand break (DSB) regions and in regions with relatively low GC content. This points to age-related changes in the mechanisms of meiotic gene conversion in oocytes.Entities:
Mesh:
Year: 2016 PMID: 27643539 PMCID: PMC5083143 DOI: 10.1038/ng.3669
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1The study design and method used for detecting gene conversions. Including a) the family structure and b) the two sources of genotypes. c) The genotypes of the siblings are necessary to verify the haplotypes of both parents. We require at least one child of the proband to verify the gene conversion. The method is shown for NCO gene conversion.
Estimates of NCO gene conversion of SNPs
| Chip dataset | Sequencing dataset | |||||
|---|---|---|---|---|---|---|
|
| ||||||
| Paternal | Maternal | Total | Paternal | Maternal | Total | |
| 641 | 1,393 | 2,034 | 300 | 565 | 865 | |
| 106,740,626 | 106,761,366 | 213,501,992 | 73,919,056 | 73,016,289 | 146,935,345 | |
| 6.0 | 13.0 | 9.5 | 4.1 | 7.7 | 5.9 | |
| 3.9 | 10.0 | 7.0 | – | – | – | |
| 618 | 1,018 | 1,636 | 244 | 241 | 485 | |
| | 606 | 849 | 1,455 | 234 | 196 | 430 |
| | 12 | 169 | 181 | 10 | 45 | 55 |
| 1.04 (±0.01) | 1.37 (± 0.03) | 1.24 (±0.02) | 1.23 (±0.06) | 2.34(±0.28) | 1.78(±0.15) | |
| | 1.01 (±0.00) | 1.04 (±0.01) | 1.03 (±0.01) | 1.09 (± 0.02) | 1.17 (±0.04) | 1.13 (±0.02) |
| | – | 3.01(±0.13) | 2.96 (±0.12) | – | 7.44 (±1.15) | 6.93 (±0.99) |
| 64.4 | 69.1 | 67.6 | 63.1 | 72.5 | 69.3 | |
| | 64.7 | 69.5 | 67.5 | 65.3 | 73.0 | 68.9 |
| | – | 68.6 | 67.9 | – | 72.2 | 69.8 |
| 42.3 | 5.4 | 10.7 | 45.7 | 7.1 | 14.9 | |
| | 49.7 | 9.1 | 17.5 | 66.9 | 20.3 | 37.0 |
| | – | 1.1 | 1.1 | – | 1.6 | 2.0 |
| 8.2 | 4.6 | – | 12.7 | 8.0 | – | |
| | 8.8 | 6.5 | – | 14.8 | 14.1 | – |
| | – | 2.0 | – | – | 4.0 | – |
Corrected gene conversion rate is corrected for crossover recombination rate. This correction and the odds ratio for hotspots are computed using sex specific maps. Short events < 1,000 base-pairs, long events >= 1,000 base-pairs. Results for long NCO events are not presented in paternal transmissions due to small sample size. As crossover maps are sex specific no joint analysis is provided. NCO = non-crossover, OR = odds ratio, DSB = double strand break, mpp = marker proband pair, CI = confidence interval, SE = standard error
Estimates of CCO gene conversions of SNPs
| Chip dataset | Sequencing dataset | |||||
|---|---|---|---|---|---|---|
|
| ||||||
| Paternal | Maternal | Total | Paternal | Maternal | Total | |
| 8 | 150 | 158 | 5 | 157 | 162 | |
| 289,364 | 450,307 | 739,671 | 170,612 | 262,323 | 432,935 | |
| 28 | 333 | 214 | 29 | 598 | 374 | |
| 7 | 107 | 114 | 5 | 34 | 39 | |
| 1.14 | 1.40 (±0.08) | 1.39 (±0.07) | 1 | 4.76(±1.29) | 4.26(±1.16) | |
| 73.3 | 70.2 | 70.2 | 70.1 | |||
| 45,025 | 70,079 | 115,104 | 1,624 | 2,490 | 4,114 | |
| 0.16 | 1.5 | 1.0 | 3.1 | 13.3 | 9.2 | |
Confidence intervals for CCO gene conversions in paternal transmissions are not reported due to small number of events. CCO = complex crossover, OR = odds ratio, DSB = double strand break, mpp = marker proband pair, CI = confidence interval
Figure 2NCO gene conversion rate, G. As a function of a) age of parent, b) distance from telomere, c) age of mother stratified by presence in DSB region and d) age of mother stratified by presence in crossover recombination hotspots. e) percent GC content of neighboring 100 bases for gene converted markers outside of double strand break regions as a functionof age of mother. Blue represents paternal and red maternal transmission. Error bars represent 95% confidence intervals. a) Number of parents in each age bin are presented in top of figure. a,c,d,e) Individuals are grouped into 2 year age bins with all individuals younger than 20 grouped together and all individuals 40 years or older grouped together. b) mpps are grouped into 5Mb bins. Crossover recombination rate is computed as the average over all marker proband pairs in a bin of average crossover recombination rate in +/– 5kb interval around a marker. cM = centiMorgan, Mb = 1 million base pairs.
Figure 3NCO gene conversion rate G, as a function of GC content of neighboring 100bp. a) G outside of male DSB regions. b) G inside of male DSB regions. Error bars represent 95% confidence intervals. The dotted line represents a linear model. NCO = non-crossover, DSB = double strand break
Gene conversion of indels
| Sequencing dataset | |||
|---|---|---|---|
|
| |||
| Paternal | Maternal | Total | |
| 23 | 38 | 61 | |
| | 22 | 28 | 50 |
| | 1 | 10 | 11 |
| 4,391,571 | 4,324,302 | 8,715,873 | |
| | 4,382,511 | 4,310,148 | 8,692,659 |
| | 9,060 | 14,154 | 23,214 |
| 5.2 | 8.8 | 7.0 | |
| | 5.0 | 6.5 | 5.8 |
| | 110 | 707 | 474 |
| 55.5% | 71.1% | 65.6% | |
| | 54.5 | 67.8 | 62.0 |
| | – | ||
| 35.6 | 4.9 | 13.9 | |
| 10.4 | 8.4 | ||
OR = odds ratio, DSB = double strand break, mpp = marker proband pair, CI = confidence interval, CCO = complex crossover, NCO = non-crossover.
Figure 4Complex crossovers (CCO) and maternal age. a) Complex crossover (CCO) gene conversion rate by maternal age in the chip dataset. b) Fraction of maternally transmitted crossovers in the chip dataset that are complex, as a function of mother's age. Error bars represent 95% confidence intervals. The dotted line represents a linear model.