| Literature DB >> 33244607 |
Kentaro K Shimizu1,2, Dario Copetti2,3, Moeko Okada2, Thomas Wicker4, Toshiaki Tameshige1,5, Masaomi Hatakeyama2,6, Rie Shimizu-Inatsugi2, Catharine Aquino6, Kazusa Nishimura7, Fuminori Kobayashi8, Kazuki Murata9, Tony Kuo10,11, Emily Delorean12, Jesse Poland12, Georg Haberer13, Manuel Spannagl13, Klaus F X Mayer13,14, Juan Gutierrez-Gonzalez15, Gary J Muehlbauer15, Cecile Monat16, Axel Himmelbach16, Sudharsan Padmarasu16, Martin Mascher16, Sean Walkowiak17,18, Tetsuya Nakazaki7, Tomohiro Ban1, Kanako Kawaura1, Hiroyuki Tsuji1, Curtis Pozniak17, Nils Stein16,19, Jun Sese10,20, Shuhei Nasuda9, Hirokazu Handa8,21.
Abstract
Bread wheat is a major crop that has long been the focus of basic and breeding research. Assembly of its genome has been difficult because of its large size and allohexaploid nature (AABBDD genome). Following the first reported assembly of the genome of the experimental strain Chinese Spring (CS), the 10+ Wheat Genomes Project was launched to produce multiple assemblies of worldwide modern cultivars. The only Asian cultivar in the project is Norin 61, a representative Japanese cultivar adapted to grow across a broad latitudinal range, mostly characterized by a wet climate and a short growing season. Here, we characterize the key aspects of its chromosome-scale genome assembly spanning 15 Gb with a raw scaffold N50 of 22 Mb. Analysis of the repetitive elements identified chromosomal regions unique to Norin 61 that encompass a tandem array of the pathogenesis-related 13 family. We report novel copy-number variations in the B homeolog of the florigen gene FT1/VRN3, pseudogenization of its D homeolog and the association of its A homeologous alleles with the spring/winter growth habit. Furthermore, the Norin 61 genome carries typical East Asian functional variants different from CS, ranging from a single nucleotide to multi-Mb scale. Examples of such variation are the Fhb1 locus, which confers Fusarium head-blight resistance, Ppd-D1a, which confers early flowering, Glu-D1f for Asian noodle quality and Rht-D1b, which introduced semi-dwarfism during the green revolution. The adoption of Norin 61 as a reference assembly for functional and evolutionary studies will enable comprehensive characterization of the underexploited Asian bread wheat diversity. � The Author(s) 2021. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.Entities:
Keywords: Adaptation; Asian germplasm; Bread wheat; Genome assembly; Norin 61; Polyploidy
Year: 2021 PMID: 33244607 PMCID: PMC7991897 DOI: 10.1093/pcp/pcaa152
Source DB: PubMed Journal: Plant Cell Physiol ISSN: 0032-0781 Impact factor: 4.927
Genomic DNA libraries and sequencing strategy for Norin 61
| Library type | Insert size | Sequencing length | Platform | # of flow cells | Approximate coverage (×) |
|---|---|---|---|---|---|
| PCR-free PE470 | 450–470 bp | 2 × 266 bp | HiSeq 2500v2 | 17 | 92 |
| PCR-free PE700 | 700–800 bp | 2 × 150 bp | HiSeq 2500v4 | 7 | 56 |
| Nextera MP3kb | 2–4 kb | 2 × 150 bp | HiSeq 4000 | 4 | 43 |
| Nextera MP6kb | 5–7 kb | 2 × 150 bp | HiSeq 4000 | 4 | 41 |
| Nextera MP9kb | 8–10 kb | 2 × 150 bp | HiSeq 4000 | 4 | 63 |
| 10× Genomics Chromium | N/A | 2 × 150 bp | HiSeq 4000 | 4 | 28 |
| Hi-C | 300 bp | 2 × 100 bp | HiSeq 2500v2 | 4 | 5.2 |
Fig. 1Norin 61 assembly validation. (A) KAT plot comparing the sequence content in the raw reads (curve profile) and in the assembly (colored areas). The peak at X ≈ 21× contains the single-copy regions that are assembled in one copy (red area). The shoulder at 40–60× contains duplicated genomic sequences that are mostly assembled (as expected) in two copies. (B) Presence and copy number of universal SCOs in the Norin 61 assembly and gene annotation.
Fig. 2Nondenaturing fluorescence in situ hybridization of a metaphase spread of Norin 61. A pseudocolored image of the stacked pictures taken with red (Oligo-pSc119.2-1; shown in blue), green (Oligo-pTa535; shown in red), near-infrared (Oligo-pTa713; shown in green) and blue (counterstaining with DAPI; shown in gray) filters.
Fig. 3Analysis of chromosomal introgression in Norin 61. (A) Dot plot comparison of the terminal 100 Mb of chromosome 3B from Norin 61 (horizontal) and CS. Norin 61 contains an introgression at approximately position 25–46 Mb (indicated with a blue bar). Discontinuity in the diagonal denotes lack of sequence conservation in that region, and Norin 61 contains a unique ≈3.9 Mb segment (shift to the right in the dashed blue rectangle). (B) Dot plot comparison of the terminal 100 Mb of chromosome 1A from Norin 61 (horizontal) and CS. The introgression was identified through the analysis of repetitive sequences and ranges between 0 and 47.7 Mb (indicated with a blue bar). Sequences are particularly diverse in the 5′ terminal 25 Mb. (C) Self dot plot alignment of the 3.9 Mb 3B segment that is unique to Norin 61. The alignment of the segment with itself identifies a large array of five tandem repeats (series of parallel lines). (D) Phylogenetic analysis of proteins encoded by gene family 3B_25-46-1 whose members are identified in the repeat array shown in (C). Note that the individual genes cluster in two clades that cannot be further resolved because of the high similarity of the protein sequences.
Flowering time-related gene homologs between Norin 61 and CS
| Accession | CS | Norin 61 | Notes on potential extra copies in Norin 61 | |||||
|---|---|---|---|---|---|---|---|---|
| Gene symbol | Reference | A | B | D | A | B | D | |
|
|
| TraesCS7A02G115400 | TraesCS7B02G013100 | TraesCS7D02G111600 | TraesNOR7A01G123700 | NA | TraesNOR7D01G129000 | See main text. Copy number variation of the B homoeologs in Norin 61 as well as in CS |
|
|
| TraesCS3A02G143100 | TraesCS3B02G162000 | TraesCS3D02G144500 | TraesNOR3A01G166100 | TraesNOR3B01G189800 | TraesNOR3D01G160200 | |
|
|
| TraesCS1A02G338600 | TraesCS1B02G351100 | TraesCS1D02G340800 | TraesNOR1A01G362800 | TraesNOR1B01G382200 | TraesNOR1D01G378400 | |
|
|
| TraesCS2A02G132300 | TraesCS2B02G154800 | TraesCS2D02G134200 | TraesNOR2A01G150300 | TraesNOR2B01G174600 | TraesNOR2D01G153400 | |
|
|
| TraesCS5A02G546900 | TraesCS4B02G379100 | TraesCSU02G130900 | TraesNOR5A01G583800 | TraesNOR4B01G414800 | TraesNORUn01G040600 | There might be multiple copies on the B genome of Norin 61 |
|
|
| TraesCS5A02G546800 | TraesCS4B02G380600 | TraesCSU02G130800 | TraesNOR5A01G583700 | TraesNOR4B01G416300 | TraesNORUn01G040500 | A second Norin 61D homoeolog was not annotated |
|
|
| NA | NA | TraesCSU02G130700 | NA | NA | TraesNORUn01G040400 | |
|
|
| TraesCS6A02G160200 | TraesCS6B02G191200 | TraesCS6D02G152500 | TraesNOR6A01G175800 | TraesNOR6B01G210900 | TraesNOR6D01G177500 | |
|
|
| TraesCS5A02G154600 | TraesCS5B02G152800 | TraesCS5D02G159500 | TraesNOR5A01G167400 | TraesNOR5B01G169100 | TraesNOR5D01G173200 | |
|
|
| TraesCS2A02G536600 | TraesCS2B02G511400 | TraesCS2D02G538000 | TraesNOR2A01G582200 | TraesNOR2B01G557400 | TraesNOR2D01G585300 | High copy number |
|
|
| TraesCS2A02G485000 | TraesCS2B02G567400 | TraesCS2D02G485100 | TraesNOR2A01G524400 | TraesNOR2B01G619000 | NA | High copy number |
|
|
| TraesCS2A02G536900 | TraesCS3B02G015800 | TraesCS2D02G538100 | NA | TraesNOR3B01G017500 | NA | |
|
|
| TraesCS2A02G536700 | TraesCSU02G045800 | TraesCS3D02G011600 | NA | TraesNOR3B01G017400 | NA | |
|
|
| TraesCS2A02G347000 | TraesCS2B02G365300 | TraesCS2D02G345700 | TraesNOR2A01G378700 | TraesNOR2B01G400600 | TraesNOR2D01G376200 | |
| New |
| TraesCS2A02G346900 | TraesCS2B02G365200 | TraesCS2D02G345600 | TraesNOR2A01G378600 | TraesNOR2B01G400500 | TraesNOR2D01G376100 | |
|
|
| TraesCS5A02G297300 | TraesCS5B02G296600 | TraesCS5D02G304400 | TraesNOR5A01G314900 | TraesNOR5B01G321800 | TraesNOR5D01G327800 | |
|
|
| NA | TraesCS4B02G073800 | TraesCS4D02G072300 | NA | TraesNOR4B01G083000 | TraesNOR4D01G076400 | |
|
|
| TraesCS6A02G214400 | TraesCS6B02G244400 | TraesCS6D02G197100 | TraesNOR6A01G234900 | TraesNOR6B01G268400 | TraesNOR6D01G227500 | |
|
|
| TraesCS7A02G229400 | TraesCS7B02G195200 | TraesCS7D02G230000 | TraesNOR7A01G248800 | TraesNOR7B01G154000 | TraesNOR7D01G258900 | |
|
|
| TraesCS3A02G006600 | TraesCS3B02G010100 | TraesCS3D02G004100 | TraesNOR3A01G008900 | TraesNOR3B01G009400 | TraesNOR3D01G000600 | |
|
|
| NA | TraesCS3B02G007400 | NA | NA | TraesNOR3B01G009500 | NA | |
|
|
| TraesCS5A02G128600 | TraesCS5B02G127600 | TraesCS5D02G136300 | TraesNOR5B01G141700 | TraesNOR5B01G141700 | TraesNOR5D01G148800 | |
|
|
| TraesCS4A02G409200 | TraesCS4B02G307600 | TraesCS4D02G305800 | TraesNOR4A01G399300 | TraesNOR4B01G335300 | TraesNOR4D01G327600 | |
|
|
| TraesCSU02G202000 | TraesCS2B02G310700 | TraesCS2D02G292000 | TraesNOR2A01G323700 | TraesNOR2B01G343700 | TraesNOR2D01G320200 | |
|
|
| TraesCS5A02G391700 | TraesCS5B02G396600 | TraesCS5D02G401500 | TraesNOR5A01G416600 | TraesNOR5B01G433300 | TraesNOR5D01G434800 | |
|
|
| TraesCS5A02G541300 | TraesCS4B02G372700 | TraesCS4D02G364500 | TraesNOR5A01G578600 | TraesNOR4B01G408300 | TraesNORUn01G034600 | |
|
|
| TraesCS5A02G541200 | NA | TraesCS4D02G364400 | TraesNOR5A01G578700 | NA | TraesNORUn01G034500 | |
|
|
| TraesCS4A02G293000 | TraesCS4B02G020300 | TraesCS4D02G018500 | TraesNOR4A01G307500 | TraesNOR4B01G023800 | TraesNOR4D01G020000 | |
|
|
| TraesCS7A02G175200 | TraesCS7B02G080300 | TraesCS7D02G176700 | TraesNOR7A01G188500 | TraesNOR7B01G090400 | TraesNOR7D01G200800 | |
|
|
| TraesCS3A02G116300 | TraesCS3B02G135400 | TraesCS3D02G118200 | TraesNOR3A01G128300 | TraesNOR3B01G162100 | TraesNOR3D01G131800 | |
|
|
| TraesCS4D02G341700 | TraesCS4B02G346700 | TraesCS5A02G515500 | TraesNOR5A01G549300 | TraesNOR4B01G377200 | TraesNOR4D01G366500 | A second Norin 61D homoeolog was not annotated |
|
|
| TraesCS3A02G526600 | TraesCS3B02G594300 | TraesCS3D02G531900 | TraesNOR3A01G575200 | TraesNOR3B01G660800 | TraesNOR3D01G579900 | |
|
|
| TraesCS1A02G220300 | TraesCS1B02G233600 | TraesCS1D02G221800 | TraesNOR1A01G240300 | TraesNOR1B01G254700 | TraesNOR1D01G249900 | |
|
|
| TraesCS6A02G289400 | TraesCS6B02G319500 | TraesCS6D02G269500 | TraesNOR6A01G314800 | TraesNOR6B01G348100 | TraesNOR6D01G304400 | |
|
|
| TraesCS5A02G391300 | TraesCS5B02G396200 | TraesCS5D02G401000 | TraesNOR5A01G416100 | TraesNOR5B01G432800 | TraesNOR5D01G434300 | |
|
|
| TraesCS2A02G081900 | NA | TraesCS2D02G079600 | TraesNOR2A01G096300 | NA | TraesNOR2D01G091900 | There might be multiple copies of the B homoeologs in Norin 61 as well as in CS |
|
|
| TraesCS5A02G391700 | TraesCS5B02G396600 | TraesCS5D02G401500 | TraesNOR5A01G416600 | TraesNOR5B01G433300 | TraesNOR5D01G434800 | Splicing variant in CS. Splicing variant in Norin 61 was not annotated |
|
|
| TraesCS2A02G514200 | TraesCS2B02G542400 | TraesCS2D02G515800 | TraesNOR2A01G556000 | TraesNOR2B01G592300 | TraesNOR2D01G559400 | |
|
|
| TraesCS7A02G260900 | TraesCS7B02G158900 | TraesCS7D02G261900 | TraesNOR7A01G280100 | TraesNOR7B01G185700 | TraesNOR7D01G293000 | A second Norin 61D homoeolog was not annotated |
|
|
| TraesCS3A02G434900 | NA | NA | TraesNOR3A01G466500 | TraesNOR3B01G517700 | TraesNOR3D01G458800 | |
|
|
| TraesCS3A02G435000 | TraesCS3B02G470000 | TraesCS3D02G428000 | TraesNOR3A01G466400 | TraesNOR3B01G517500 | TraesNOR3D01G458600 | |
|
| Sharma et al. (2016) | TraesCS3A02G434400 | TraesCS3D02G427700 | TraesCS3B02G469700 | TraesNOR3A01G465900 | TraesNOR3B01G517500 | TraesNOR3D01G458500 | |
|
|
| TraesCS3A02G432900 | NA | TraesCS3D02G427900 | TraesNOR3A01G466500 | NA | TraesNOR3D01G457000 | |
|
|
| TraesCS1A02G125800 | TraesCS1B02G144800 | TraesCS1D02G127700 | TraesNOR1A01G139600 | TraesNOR1B01G161700 | TraesNOR1D01G145300 | Potential another copies on chr Un |
|
|
| TraesCS3A02G314300 | TraesCS3B02G157500 | TraesCS3D02G140200 | TraesNOR3A01G153000 | NA | TraesNOR3D01G155900 | |
Fig. 4(A) Maximum-likelihood tree of 108 FT homologs based on amino acid sequences. The bootstrap values are shown on the branches. N61 and CS stand for Norin 61 and CS, respectively. (B) Multiple sequence alignment of FT1 genes on CS and N61. The red stars highlight the positions with the 1-bp deletion in FT-D1 of Norin 61 and the 1-bp insertion in FT-B1-3 of CS. The black boxes indicate the conserved sequences. The white regions show the SNPs or indels between sequences.
Resequencing coverage of Adh-1 and FT1 loci on CS and Norin 61
| Genotype | Gene | Homeolog coverage | ||
|---|---|---|---|---|
| A | B | D | ||
| CS |
| 7.8 | 7.4 | 9.4 |
|
| 9.7 |
| 8.9 | |
| Norin 61 |
| 9.1 | 9.7 | 8.0 |
|
| 7.6 |
| 11.5 | |
|
| ||||
Compared with the Adh-1 gene, the mean coverage of the FT1 copies is less homogeneous (higher coverages are highlighted in bold), hinting at a possible collapse of multiple (even more than two) copies in the B homeolog in both the Norin and CS assemblies. The coverage values of the two Norin 61 FT-B1 copies detected in the assembly are represented on two separate lines.
Variation at FT-A1 (also denoted as Vrn-A3) and the growth habit
| Cultivar |
| Habit | ||
|---|---|---|---|---|
| Size | Type | Allele | ||
| Norin 61 | 799 | del |
| Facultative spring |
| Arina | 830 | Ins |
| Winter |
| CS | 799 | Del |
| Spring |
| Jaggar | 830 | Ins |
| Winter |
| Julius | 830 | Ins |
| Winter |
| LongReach Lancer | 799 | Del |
| Spring |
| CDC Landmark | 799 | Del |
| Spring |
| Mace | 799 | Del |
| Spring |
| CDC Stanley | 799 | Del |
| Spring |
| SY Mattis | 799 | Del |
| Winter |
| PI190962 (spelt wheat) | 830 | Ins |
| Winter |
| Cadenza | 830 | Ins |
| Spring |
| Paragon | 799 | Del |
| Spring |
| Robigus | 830 | Ins |
| Winter |
| Claire | 830 | Ins |
| Winter |
| Weebill 1 | 799 | Del |
| Spring |
Data from Walkowiak et al. (2020).
Fig. 5Sequence features of Ppd-1 genes with the 5-kb upstream region extracted from the CS and Norin 61 pseudomolecules. (A) Both CS and Norin 61 have the photoperiod-sensitive Ppd-A1b allele. Three insertions/deletions were detected. (B) The red region spans the 2,089-bp deletion of the Ppd-D1a allele of Norin 61 compared with the Ppd-D1b allele of CS. The boxes indicate CDSs. The red lines show the sequence polymorphisms between CS and Norin 61 assemblies.
Fig. 6Characterization of the Fhb1 locus in CS and in FHB-resistant varieties. (A) Sequence dot plot of the region surrounding Fhb1 in CS (X-axis chromosome 3B: 7.99–8.88 Mb) and Norin 61 (Y-axis, chromosome 3B: 11.41–12.37 Mb). The gap in the bottom part of the diagonal denotes the region with different sequence composition (spanning about 340 kb). (B) Dot plot comparing CM-82036 derived from Sumai 3 (X-axis, from Schweiger et al. 2016; BAC assembly, GenBank accession: MK450312) and Norin 61 (Y-axis, chromosome 3B: 11.41–12.37 Mb) showing the almost complete identity of the sequence. (C) Annotation of the ≈390 kb Norin 61 region containing Fhb1 (chromosome 3B: 11.89–12.31 Mb). The newly annotated genes (red boxes) and the span of the Norin 61 region differing from CS are shown below the published annotation of Norin 61 (Walkowiak et al. 2020, first three tracks). The TaHRC gene is highlighted in purple. (D) Schematic structure of the deletion in the TaHRC gene. The boxes indicate the third exon. The red dashed line shows the sequence polymorphisms between CS and Norin 61 assemblies.
Fig. 7Schematic features of Tamyb10 sequences of CS and Norin 61. (A) Tamyb10-A1. CS has the recessive allele (Tamyb10-A1a), characterized by a large deletion, including the start codon. Norin 61 has the (wild type) dominant allele (Tamyb10-A1b). (B) Tamyb10-B1. The CS has the recessive allele (Tamyb10-B1a) has a 19-bp deletion and Norin 61 has the dominant Tamyb10-B1b allele. (C) Tamyb10-D1. Both CS and Norin 61 have the identical dominant allele (Tamyb10-D1b). The boxes indicate CDSs. The red dashed lines show the sequence polymorphisms between CS and Norin 61 assemblies.
Genotypes of Japanese cultivars at loci regulating flowering time and plant height
| Cultivar name | Status |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| Komugi Norin 61 | Breeders line |
|
|
|
|
|
|
|
|
| Shinchunaga | Pure selected line |
|
|
|
|
|
|
|
|
| Komugi Norin 26 | Breeders line |
|
|
|
|
|
|
|
|
| Shirasagi komugi | Breeders line |
|
|
|
| NA |
|
|
|
| CS | Standard line |
|
|
|
|
|
|
|
|
Ppd-D1 and ppd-D1 correspond to Ppd-D1a and Ppd-D1b in Fig. 5, respectively.
Data from Nishida et al. (2013).