| Literature DB >> 34254113 |
Kazuhiro Sato1, Fumitaka Abe2, Martin Mascher3,4, Georg Haberer5, Heidrun Gundlach5, Manuel Spannagl5, Kenta Shirasawa6, Sachiko Isobe6.
Abstract
We have established a high-quality, chromosome-level genome assembly for the hexaploid common wheat cultivar 'Fielder', an American, soft, white, pastry-type wheat released in 1974 and known for its amenability to Agrobacterium tumefaciens-mediated transformation and genome editing. Accurate, long-read sequences were obtained using PacBio circular consensus sequencing with the HiFi approach. Sequence reads from 16 SMRT cells assembled using the hifiasm assembler produced assemblies with N50 greater than 20 Mb. We used the Omni-C chromosome conformation capture technique to order contigs into chromosome-level assemblies, resulting in 21 pseudomolecules with a cumulative size of 14.7 and 0.3 Gb of unanchored contigs. Mapping of published short reads from a transgenic wheat plant with an edited seed-dormancy gene, TaQsd1, identified four positions of transgene insertion into wheat chromosomes. Detection of guide RNA sequences in pseudomolecules provided candidates for off-target mutation induction. These results demonstrate the efficiency of chromosome-scale assembly using PacBio HiFi reads and their application in wheat genome-editing studies.Entities:
Keywords: zzm321990 Triticum aestivumzzm321990 ; circular consensus sequencing; genome assembly; genome editing; pseudomolecules
Mesh:
Substances:
Year: 2021 PMID: 34254113 PMCID: PMC8320877 DOI: 10.1093/dnares/dsab008
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.477
Assembly statistics for each chromosome of the cultivar ‘Fielder’
| Chromosome | Length (Mb) | No. of contigs | N50 (Mb) | Max. length (Mb) | Min. length (kb) |
|---|---|---|---|---|---|
| chr1A | 609.0 | 28 | 42.6 | 97.6 | 685.3 |
| chr1B | 721.0 | 105 | 15.8 | 52.7 | 504.4 |
| chr1D | 501.3 | 40 | 28.4 | 63.8 | 622.2 |
| chr2A | 804.6 | 58 | 22.6 | 81.7 | 560.4 |
| chr2B | 808.1 | 92 | 15.3 | 89.0 | 502.3 |
| chr2D | 649.1 | 40 | 34.5 | 91.3 | 610.2 |
| chr3A | 758.9 | 42 | 64.5 | 171.2 | 560.6 |
| chr3B | 861.1 | 133 | 10.8 | 36.9 | 650.5 |
| chr3D | 642.4 | 48 | 24.4 | 105.4 | 592.5 |
| chr4A | 759.9 | 66 | 43.3 | 77.5 | 534.7 |
| chr4B | 689.8 | 108 | 11.1 | 38.2 | 558.0 |
| chr4D | 531.5 | 22 | 52.2 | 73.9 | 875.7 |
| chr5A | 714.5 | 46 | 26.3 | 81.4 | 930.5 |
| chr5B | 717.3 | 131 | 10.5 | 46.0 | 557.9 |
| chr5D | 586.3 | 41 | 30.5 | 88.3 | 563.8 |
| chr6A | 626.3 | 38 | 32.5 | 79.7 | 620.8 |
| chr6B | 738.1 | 128 | 8.7 | 43.6 | 507.7 |
| chr6D | 505.8 | 38 | 23.8 | 64.2 | 610.6 |
| chr7A | 759.1 | 53 | 35.2 | 87.9 | 504.0 |
| chr7B | 751.6 | 122 | 12.1 | 38.9 | 535.9 |
| chr7D | 653.1 | 49 | 36.4 | 85.0 | 889.0 |
| chr1A-7D | 1,4703.3 | 1428 | 21.0 | 171.2 | 502.3 |
| chrUn | 314.5 | 3774 | 0.1 | 0.5 | 3.5 |
Figure 1Intra-chromosomal contact matrices. Grey lines mark contig boundaries.
Figure 2Alignment of pseudomolecules of ‘Fielder’ and ‘Julius’ by single chromosomes.
Figure 3Example of a chimeric contig (ptg001173l). The location of the breakpoint at 1.84 Mb is indicated by a dotted blue line in (a) and (b). (a) Alignment of guide map markers to the contig. Single-copy sequences in the genome assembly of wheat cv. Julius were used as the guide map. Markers from two different chromosomes (3B and 6B) are aligned to the contig. (b) Physical coverage with Hi-C links along the contig. The y-axis shows the number of Hi-C read pairs spanning each 10-kb window along the length of the scaffold. A sharp drop in coverage is evident at the breakpoint.
Comparison of assembly statistics for ‘Fielder’ and two pan-genome assemblies.
| Cultivar | ‘Julius’ | ‘Norin 61’ | ‘Fielder’ |
|---|---|---|---|
| Pseudomolecule statistics | |||
| Total scaffolds in pseudomolecule | 981 | 1,394 | 1,428 |
| Pseudomolecule size (Gb) | 14.2 | 14.1 | 14.7 |
| N50 of corrected assembly (Mb) | 38.0 | 21.9 | 20.7 |
| N90 of corrected assembly (Mb) | 6.5 | 2.6 | 3.8 |
| Contig statistics | |||
| Total contigs | 620,670 | 987,560 | 5,200 |
| Assembly size (Gb) | 14.2 | 14.7 | 14.7 |
| N50 (bp) | 53,974 | 62,736 | – |
| N90 (bp) | 12,975 | 12,160 | – |
| BUSCO | |||
| Complete BUSCOs | 1,415 (98.3%) | 1,417 (98.4%) | 1,399 (97.1%) |
| Complete BUSCOs–Single copy | 97 (6.8%) | 96 (6.7%) | 85 (5.9%) |
| Complete BUSCOS–Duplicated | 1,318 (91.5%) | 1,321 (91.7%) | 1,314 (91.2%) |
| Fragmented BUSCOs | 4 (0.3%) | 2 (0.1%) | 6 (0.4%) |
| Missing BUSCOs | 21 (1.4%) | 21 (1.5%) | 35 (2.5%) |
| Total BUSCO groups searched | 1,440 | 1,440 | 1,440 |
Data from Walkowiak et al. (2020).