| Literature DB >> 33737754 |
M Timothy Rabanus-Wallace1, Bernd Hackauf2, Martin Mascher1, Thomas Lux3, Thomas Wicker4, Heidrun Gundlach3, Mariana Baez5, Andreas Houben1, Klaus F X Mayer3,6, Liangliang Guo7, Jesse Poland7, Curtis J Pozniak8, Sean Walkowiak8,9, Joanna Melonek10, Coraline R Praz4, Mona Schreiber1, Hikmet Budak11, Matthias Heuberger12, Burkhard Steuernagel13, Brande Wulff13, Andreas Börner1, Brook Byrns8, Jana Čížková14, D Brian Fowler8, Allan Fritz7, Axel Himmelbach1, Gemy Kaithakottil15, Jens Keilwagen16, Beat Keller4, David Konkin17, Jamie Larsen18, Qiang Li19, Beata Myśków20, Sudharsan Padmarasu1, Nidhi Rawat21, Uğur Sesiz22, Sezgi Biyiklioglu-Kaya23, Andy Sharpe8, Hana Šimková14, Ian Small10, David Swarbreck15, Helena Toegelová14, Natalia Tsvetkova24, Anatoly V Voylokov25, Jan Vrána14, Eva Bauer26, Hanna Bolibok-Bragoszewska27, Jaroslav Doležel14, Anthony Hall15, Jizeng Jia28, Viktor Korzun29, André Laroche30, Xue-Feng Ma31, Frank Ordon32, Hakan Özkan22, Monika Rakoczy-Trojanowska27, Uwe Scholz1, Alan H Schulman33,34, Dörthe Siekmann35, Stefan Stojałowski20, Vijay K Tiwari21, Manuel Spannagl3, Nils Stein36,37.
Abstract
Rye (Secale cereale L.) is an exceptionally climate-resilient cereal crop, used extensively to produce improved wheat varieties via introgressive hybridization and possessing the entire repertoire of genes necessary to enable hybrid breeding. Rye is allogamous and only recently domesticated, thus giving cultivated ryes access to a diverse and exploitable wild gene pool. To further enhance the agronomic potential of rye, we produced a chromosome-scale annotated assembly of the 7.9-gigabase rye genome and extensively validated its quality by using a suite of molecular genetic resources. We demonstrate applications of this resource with a broad range of investigations. We present findings on cultivated rye's incomplete genetic isolation from wild relatives, mechanisms of genome structural evolution, pathogen resistance, low-temperature tolerance, fertility control systems for hybrid breeding and the yield benefits of rye-wheat introgressions.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33737754 PMCID: PMC8035072 DOI: 10.1038/s41588-021-00807-0
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Fig. 1Rye (‘Lo7’) genome composition and structure over chromosomes 1R to 7R.
Twin vertical gray lines in each chromosome denote the boundaries of the pericentromeric low-collinearity regions for each chromosome. a, Genetic map positions of markers used in assembly. Scaffold boundaries marked by gray vertical lines. b, Density of annotated gene models. c, Gene collinearity with barley (cv. ‘Morex’), with the position on the ‘Morex’ pseudomolecules on the vertical axis. Text and point colors represent barley chromosomes as labeled. d–g, Positions and ages of four LTR retrotransposon families RLG-Sabrina (d), RLG-WHAM (e), RLC-Angela (f) and RLG_Cereba (g) in the genome, represented as a heatmap. Binned ages are on the vertical axis (from 0 million years ago, Ma, at the bottom) and bin positions are across the horizontal. Heat represents the number of TEs in each age/position bin (see legend inset). Red arrows mark notable changes in LTR-RT profiles.
Fig. 2Dissecting the relationships among rye genotypes.
a, Hi-C asymmetry detects SVs between the reference genotype ‘Lo7’ and S. cereale ‘Lo225’. SVs result in discontinuities in r, the ratio of Hi-C links mapping left:right relative to ‘Lo7’. Large inversions (marked) typically produce clean, diagonal lines. Visually identified candidate SVs are shaded. b, Detail of 5R genetic map marker positions showing how recombination rate relates to candidate SVs. The rightmost inversion marked on 5R corresponds to a region of suppressed recombination on chromosome 5R. The effect of other ‘Lo7’ versus ’Lo225’ SVs on recombination was harder to confirm since they fall in already-low-recombining regions. c,d, PCA plots showing the relationships among genetically determined rye clusters for PCs 1 and 2 (c) and for PCs 2 and 3 (d). e,f, Binwise IBS, Fst and Pn/Ps statistics calculated across the chromosomes using the expanded Schreiber et al.[25] rye diversity panel data mapped to the ‘Lo7’ assembly. Exemplary instances are shown: chromosomes 1R (e) and 4R (f). The position of each bin on the genome is the mean pseudochromosome position of identified variable sites within that bin. Upper in each pane: binwise IBS scores of the panel genotypes compared with ‘Lo7’, with features discussed in the text marked with asterisks. Colors correspond to d. Middle in each pane: binwise Fst showing changes in genetic variance partitioning among and between subgroups across the chromosome. Line colors, Fst key. Lower in each pane: binwise Pn/Ps ratios (shown when Pn + Ps > = 10 for a given bin) for recently acquired rye polymorphisms (wheat outgroup). The values were calculated separately for different groups of ryes (‘domesticated’ (cluster 3) versus ‘wild’ (clusters 1,4–7)—see d) to allow detection of possible recent selective events affecting different rye groups. Point colors/sizes, Pn/Ps key.
Genome assembly and annotation statistics
| Assembly | Raw scaffolds (after chimera breaking) | In chromosome-scale pseudomolecules | |
|---|---|---|---|
| Scaffolds | 109,776 | 476 | |
| Total length (Mb) | 6,670.03 | 6,206.74 | |
| N50 length (Mb) | 15.16 | 29.44 | |
| Length with chromosome assignment (%) | 95.3% | 100% | |
| Maps | 5,601 | ||
| Total length (Mb) | 6,660.18 | ||
| N50 length (Mb) | 1.671 | ||
| Total aligned length (Mb) | 6,248.60 | ||
| Uniquely aligned length (Mb) | 6,029.11 | ||
| Number of genes | 34,441 | 22,781 | |
| Mean gene length | 2,892 | 946 | |
| Mean exons per gene | 4.42 | 1.79 | |
| Proportion of complete BUSCO set | 96.4% | 5.8% | |
| Copia | 11,128 | 0.53 | |
| Gypsy | 934 | 1.24 | |
| Gypsy | 3,996 | 2.10 | |
| Gypsy | 1,457 | 2.06 | |
| CACTA | 1,480 | NA | |
| CACTA | 516 | NA | |
| NA | 13,124 | NA | |
| NA | 1,973 | NA | |
| NA | 15,097 | NA | |
BUSCO, benchmarking universal single-copy orthologs (v.3; https://busco.ezlab.org/); NA, not applicable.
Fig. 3Combined reference mapping as a means to classify wheat and wheat–rye introgression karyotypes.
a, Color key for b and c. b, Normalized read mapping depths for 1-Mb bins of chromosomes 1A, 1B and 1R, for a selection of wheat lines (including also some Aegilops tauschii accessions which contain no A or B subgenome) with various chromosome complements and introgressions (rows). The value r denotes the difference between the log2 reads per million mapping to a bin, compared to T. aestivum cv. ‘Chinese Spring’. c, Visual representation of the SVM classifier, with the two selection features based on relative read mapping densities to ‘translocation-prone’ and ‘other’ chromosomal regions (Methods) shown on the x and y axes. Points represent training samples, with color corresponding to human-designated classification and size proportional to the total number of mapped reads for the sample. Black points are samples not classified by a human. Background colors represent the hypothetical classification that would be given to a sample at that position. d, Results of cross-validation testing the accuracy of the classifier and its relationship to the size of the training set. Box edges and whiskers represent quartiles and the center lines show the arithmetic means.
Summary of fixed effects estimates from linear mixed model estimating the influence of rye–wheat translocations upon yield, in two wheat diversity panels
| Panel | Introgression type | Estimated yield effect | s.e. | Degrees of freedom | ||
|---|---|---|---|---|---|---|
| KSU | 1AL.1RS | 4.06% | 0.54 | 7.54 | 1.89 × 104 | 4.95 × 10−14 |
| KSU | 1BL.1RS | 1.50% | 0.41 | 3.68 | 1.88 × 104 | 0.00023 |
| USDA | 1AL.1RS | 0.86% | 0.31 | 2.72 | 2.82 × 104 | 0.0064 |
| USDA | 1BL.1RS | 4.55% | 0.39 | 11.78 | 2.82 × 104 | < 2.0 × 10−16 |
P values are calculated using a one-sided Student’s t-test on the null hypothesis that the true yield effect is zero (Methods; Supplementary Table 25).
Fig. 4Comparative genomics of rye genes with agricultural importance.
a–e, Density (instances per Mb) of mTERFs (a), PPRs (d) as well as NLRs (e) across the pseudomolecules. For visualization, the y axis is transformed using x → x1/3. f, Genomic locations of genes and loci discussed in the text. Colored bars correspond and refer to the colors of the box outlines in g–k; g–j, physical organization of selected NLR gene clusters compared across cultivated Triticeae genomes: Pm2 (g), Lr10/RGA2 (h), Pm3 (i) and Mla (j). k, Organization of RFL genes at the ‘Lo7’ Rf locus compared to its wheat (‘Chinese Spring’) counterpart. Flanking markers are shown on either end of the rye sequence. Two full-length wheat RFLs and a putative rye ortholog are labeled. PPR genes are colored red. l, CNV between ‘PUMA-SK’ and ‘Lo7’ within the Fr2 interval revealed by 10x Genomics linked read sequencing. A (Dup)lication flagged by the Loupe analysis software is marked. The estimated copy number differences between ‘Lo7’ and ‘Puma’ are shown for CBF genes.
Fig. 5The cold tolerance associated region Fr2 in ‘Puma’ and ‘NorstarPuma5A:5R’ translocation line.
a, Chromosome labeling (top) using probes specific for ‘Norstar’ chromosome 5A (Afa) and ‘Puma’ 5R (pSc119.2) confirm the presence of a rye translocation in NorstarPuma5A:5R (red box), which also alters the binding of GAA. White bars, 10 µm. b, Combined reference read mapping of group 5 chromosomes confirms the balanced translocation event, gain of a large region of chromosome 5R from ‘Puma’ (rye, light read line) and loss of a large region on chromosome 5A of ‘Norstar’ (wheat, light blue line) in ‘NorstarPuma5A:5R’. Read depth is given in log2 reads per million versus ‘Chinese Spring’. c, Gene expression analysis of rye CBF genes with CNV in ‘Puma’ (blue line) and ‘NorstarPuma5A:5R’ (orange line). Plants were grown in a time series with decreasing day length and temperature over a 70-d period and the temperatures at which 50% lethality was observed (LT50) were recorded (heatmap).