| Literature DB >> 26801360 |
Helena Staňková1, Alex R Hastie2, Saki Chan2, Jan Vrána1, Zuzana Tulpová1, Marie Kubaláková1, Paul Visendi3, Satomi Hayashi4, Mingcheng Luo5, Jacqueline Batley4,6, David Edwards6, Jaroslav Doležel1, Hana Šimková1.
Abstract
The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules.Entities:
Keywords: chromosomes; flow sorting; optical mapping; physical map; sequencing; wheat
Mesh:
Year: 2016 PMID: 26801360 PMCID: PMC5066648 DOI: 10.1111/pbi.12513
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Data collection and assembly statistics
| No. molecules/genome maps | Total length | 7DS arm coverage | Molecule/map N50 | Longest molecule/map (Mb) | |
|---|---|---|---|---|---|
| Single molecules (>150 kb) | 209 788 | 68.8 Gb | 180× | 354 kb | 2.1 |
| Map assembly | 371 | 350 Mb | 0.92× | 1.3 Mb | 4.6 |
Figure 1Genome map No. 350 comprising a long array of tandem repeats. The pile of single molecules, depicted as yellow lines with blue and green dots corresponding to mapped and unmapped labels, respectively, was a source for building the consensus genome map (blue bar). The regular labelling pattern indicates presence of tandem repeats.
Assignment of 30‐, 60‐, and 90‐kb sequences to 7DS genome maps
| Sequence length (kb) | No. sequences | No. correctly assigned | Percentage correctly assigned (%) | Lowest confidence Highest confidence | No. labels |
|---|---|---|---|---|---|
| 30 | 100 | 12 | 12 | 5.86 | 5 |
| 7.60 | 6 | ||||
| 60 | 70 | 57 | 81 | 6.07 | 5 |
| 13.87 | 10 | ||||
| 90 | 40 | 40 | 100 | 7.91 | 6 |
| 19.99 | 14 |
No. labels corresponds to number of distinguishable Nt.BspQI recognition sites in the sequence.
Figure 2Scaffolding and correcting physical map contigs based on the genome map No. 19. (a) In total, nine contigs of the physical map (black bars) could be anchored through sequences of constituting BAC clones (blue lines) to the genome map No. 19 (green bar). Short red bars indicate approximate positions of Aegilops tauschii SNP markers anchored to particular clones. The purple line and bar in ctg3865 represent clone TaaCsp067A19, which was incorrectly assigned to this contig. Detail is shown in (b). A cmap of the clone TaaCsp067A19 does not match the corresponding region in GM19. The green bar corresponds to GM19, while the blue bars represent in silico digested BAC sequences (cmaps).
Figure 4Merging genome maps. Three genome maps (green bars) could be merged together after aligning sequences of BAC clones (blue bars) from three contigs of the physical map (black bars).
Figure 3Local integration of three genetic maps through the genome map No. 15. Contigs of the physical map (black bars) were aligned to the genome map No. 15 (green bar) through sequences of constituting BAC clones (blue bars). The BAC contigs carry genetic markers (red) originating from three genetic maps, which could be integrated through the genome map.