| Literature DB >> 30982471 |
Laura-Jayne Gardiner1,2, Luzie U Wingen3, Paul Bailey4, Ryan Joynson4, Thomas Brabbs4, Jonathan Wright4, James D Higgins5, Neil Hall4,6, Simon Griffiths3, Bernardo J Clavijo4, Anthony Hall7,8.
Abstract
BACKGROUND: Sequence exchange between homologous chromosomes through crossing over and gene conversion is highly conserved among eukaryotes, contributing to genome stability and genetic diversity. A lack of recombination limits breeding efforts in crops; therefore, increasing recombination rates can reduce linkage drag and generate new genetic combinations.Entities:
Keywords: Crossover; Gene conversion; QTL; Recombination; Wheat
Mesh:
Year: 2019 PMID: 30982471 PMCID: PMC6463664 DOI: 10.1186/s13059-019-1675-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Recombination landscape of wheat. a The number of COs recorded for each RIL in the Paragon × Chinese Spring population (CO frequency per sample) as a frequency histogram. b The number of RILs sharing each recorded CO (number of samples with each CO) as a frequency histogram for the Paragon × Chinese Spring population. c For all analyzed COs, the location of the CO (the start of the window that shows a different predominant parental allele compared to the previous window) is plotted on the x-axis with the number of samples in the population that share the CO on the y axis. d The intersection of two 20-Mbp windows defines a CO. Therefore, for all windows of 40 Mbp encompassing a central defined CO, the number of high-confidence genes that are found within each interval is plotted alongside the number of RILs within each population showing the CO. e Parents for the 13 populations clustered according to their representative alleles from the 35K SNP array. f The 13 populations clustered according to their individual CO profiles, i.e., number of RILs with each recorded CO in the population. The dendrograms in e and f were produced using the R package pvclust average linkage method with correlation-based dissimilarity matrix and the value of this distance metric between clusters is represented as height on the y-axis. AU (approximately unbiased) p values were computed by multiscale bootstrap resampling (bootstrap number of 1000). Landraces are highlighted with blue boxes and pure breeding lines are highlighted with pink boxes
Fig. 2Fine-scale analysis of sequence exchange events. a The number of COs and/or GCs recorded for each RIL in the Paragon × Chinese Spring population (GC/CO frequency per sample) as a frequency histogram. b Line plots separately for the number of COs (COs), GCs (GCs), and array SNPs per 20-Mbp window across the genome. All chromosomes are normalized to 500 Mbp in length to be displayed in a single plot. The moving average of each dataset is displayed (period = 15). c Schematic of methodology for calling gene conversions (GCs) and crossovers (COs) in the skim sequencing data using pre-defined Paragon and Chinese Spring-specific homozygous SNPs. d Immunolocalization of the chromosome axis protein ASY1 (blue) and yH2A.X (red) a marker for DNA DSB on hexaploid wheat leptotene male meiotic nuclei. Scale bar = 10 μM. e Original nuclei as per d; however, yH2A.X foci are marked that co-localize with ASY1. Mean number of yH2A.X foci across five replicates are shown from the displayed image n = 1673. f Line plots separately for the number of GCs 20 bp–2 kbp, 2–10 kbp, 10–500 kbp, and > 500 kbp in length per 20-Mbp window across the genome. Chromosomes are normalized as per b and the average frequency per window is displayed
QTLs identified for CO frequency. Detailing for the QTL analysis; the population under analysis, phenotype, cross type and generation, chromosome, position (cM), logarithm of odds (LOD) score, % variation explained, additive effect, flanking markers including QTL region length and gene number, top marker from QTL scan and annotation of identified candidate genes in the region
| Population/phenotype | Cross | BC.gen | F.gen | Chr | Pos cM | LOD | %var | Add eff | Flanking markers | Top marker | Candidate genes plus annotation | Parent with increasing effect |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CSxP | RIself | NA | NA | 6A | 11.84 | 3.637 | 6.037 | − 4.502 | AX-95155001 | AX-94901219 11.84 cM | TraesCS6A01G026300.2 Holliday junction ATP-dependent DNA helicase RuvB (13,063,092 bp) | Chinese Spring |
| CSxP | RIself | NA | NA | 2A | 161 | 2.513 | 4.212 | − 14.73 | AX-94616756 | AX-94419084 | TraesCS2A01G103300 (56,942,731 bp) | Chinese Spring |
| PxCIM47 | RIself | 0 | 5 | 2B | 0 | 6.041 | 6.623 | 0.519 | AX-94768203 | AX-9476820364,988,255 bp | TraesCS2B01G103300 (64,091,671 bp) | CIM47 |
| PxWat94 | RIself | 0 | 4 | 4B | 191 | 2.591 | 10.461 | − 0.838 | AX-94840592 | AX-94840592 | TraesCS4B01G006700 Protein HIRA (4,526,708 bp) | Paragon |
| PxBaj | RIself | 0 | 4 | 5A | 304.6 | 3.802 | 9.898 | 1.24 | AX-95240942 | AX-94758045472,344,060 bp | TraesCS5A01G257000 (472,343,789-472,347,557 bp) | Baj |
Sequence exchange events defined from whole genome skim sequencing. Detailing for the 12 Paragon × Chinese Spring samples skim sequenced; the sample number, CO and frequency defined using the 35K array, % of these events validated by skim sequencing, GCs identified with track lengths > 40,000,000 bp (representative of COs), > 20,000,000 bp, > 500,000 bp, 10,000–500,000 bp, 2000–10,000 bp, and 20–2000 bp. High-confidence GCs are defined as those where the specific array SNP allele that was used to make the call could be validated in the skim sequencing
| Sample | CO frequency (35K array) | % of array COs validated by skim seq | GC frequency (35K array) | % array GC positions validated with skim sequence | % GC-specific SNP alleles validated with skim sequence | Track length | Track length | Track length > 500 kbp | High-confidence track length | Track length 10–500 kbp | High-confidence track length | Track length 2–10 kbp | High-confidence track length | Track length 20 bp–2 kbp | High-confidence track length |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0055 | 73 | 86.3 | 429 | 96.0 | 83.9 | 56 | 94 | 186 | 177 | 2884 | 898 | 4230 | 1158 | 15,262 | 5676 |
| 0334 | 70 | 91.4 | 507 | 96.8 | 86.9 | 55 | 88 | 337 | 321 | 10,017 | 8155 | 7687 | 4828 | 18,913 | 9543 |
| 0200 | 54 | 87.0 | 461 | 92.8 | 82.6 | 53 | 81 | 200 | 184 | 3433 | 1322 | 5251 | 1691 | 19,602 | 8609 |
| 0270 | 54 | 81.5 | 483 | 96.7 | 50.9 | 79 | 122 | 278 | 257 | 2907 | 1694 | 3721 | 1853 | 17,651 | 11,383 |
| 0008 | 65 | 98.5 | 413 | 94.2 | 85.2 | 43 | 72 | 215 | 197 | 3588 | 1630 | 5954 | 2508 | 26,063 | 13,451 |
| 0119 | 69 | 91.3 | 409 | 94.4 | 82.8 | 43 | 75 | 330 | 317 | 5926 | 3995 | 6097 | 2898 | 20,600 | 9961 |
| 0005 | 55 | 94.5 | 363 | 96.7 | 84.6 | 51 | 79 | 238 | 213 | 4303 | 2228 | 6002 | 2402 | 22,553 | 10,738 |
| 0018 | 52 | 92.3 | 386 | 94.3 | 87.0 | 58 | 101 | 241 | 218 | 2531 | 1148 | 4323 | 1773 | 18,583 | 10,225 |
| 0097 | 53 | 98.1 | 322 | 96.3 | 86.9 | 30 | 56 | 177 | 163 | 4296 | 1717 | 7203 | 2694 | 28,299 | 13,193 |
| 0004 | 34 | 88.2 | 378 | 97.9 | 86.6 | 47 | 73 | 217 | 202 | 3377 | 1701 | 5242 | 2206 | 20,889 | 10,829 |
| 0010 | 37 | 100 | 319 | 94.7 | 89.1 | 44 | 85 | 309 | 291 | 3292 | 1383 | 4926 | 1860 | 22,063 | 11,234 |
| 0281 | 39 | 79.5 | 306 | 97.7 | 89.4 | 66 | 102 | 212 | 196 | 2642 | 1097 | 3426 | 1195 | 13,612 | 5930 |
Fig. 3Output from QTL analysis from the Paragon × Chinese Spring population. QTL analysis output for the Paragon × Chinese Spring population that yielded significant associations for either a CO-Phenotype or b GC-Phenotype (p < 0.05). Detailing LOD scores plotted over the respective linkage groups, i.e., chromosomes. Increased resolution of QTL peaks for c CO-Phenotype and d GC-Phenotype. e Finally, the locations of the array SNPs showing the peak associations are marked in red surrounded by a red box while also showing all other array SNP locations per chromosome
Fig. 4Examination of candidate genes from QTL analysis RecQ-7 and RuvB. a Box plot comparison of the knockout RuvB lines with the control lines, defining CO frequency using collapsed linkage windows as per CO-Phenotype. b Box plot comparison of the knockout RecQ-7 lines with the control lines, defining CO/GC frequency using GC-Phenotype. c Box plot comparison of the knockout RecQ-7 lines with the control lines, defining CO frequency using CO-Phenotype. d Phylogenetic tree of identified genes across multiple species (Arabidopsis, rice and wheat) with sequence similarity to the RecQ helicase family, including our wheat candidate RecQ-7 gene for comparison. Bootstrap values ≥ 90% are shown as green dots on the branches