| Literature DB >> 32759330 |
Natascha van Lieshout1, Ate van der Burgt2, Michiel E de Vries2, Menno Ter Maat2, David Eickholt3, Danny Esselink1, Martijn P W van Kaauwen1, Linda P Kodde1, Richard G F Visser1, Pim Lindhout2, Richard Finkers4.
Abstract
With the rapid expansion of the application of genomics and sequencing in plant breeding, there is a constant drive for better reference genomes. In potato (Solanum tuberosum), the third largest food crop in the world, the related species S. phureja, designated "DM", has been used as the most popular reference genome for the last 10 years. Here, we introduce the de novo sequenced genome of Solyntus as the next standard reference in potato genome studies. A true Solanum tuberosum made up of 116 contigs that is also highly homozygous, diploid, vigorous and self-compatible, Solyntus provides a more direct and contiguous reference then ever before available. It was constructed by sequencing with state-of-the-art long and short read technology and assembled with Canu. The 116 contigs were assembled into scaffolds to form each pseudochromosome, with three contigs to 17 contigs per chromosome. This assembly contains 93.7% of the single-copy gene orthologs from the Solanaceae set and has an N50 of 63.7 Mbp. The genome and related files can be found at https://www.plantbreeding.wur.nl/Solyntus/ With the release of this research line and its draft genome we anticipate many exciting developments in (diploid) potato research.Entities:
Keywords: Solanum tuberosum; diploid breeding; genome assembly; homozygosity; potato
Mesh:
Year: 2020 PMID: 32759330 PMCID: PMC7534448 DOI: 10.1534/g3.120.401550
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Pictures of Solyntus growing in the greenhouse.
Figure 2Picture of Solyntus tubers.
Figure 3A K-mer (K = 71) distribution based on the high coverage Illumina reads as modeled and visualized by genomescope, with a maximum read count cut-off of 2500.
Solyntus de Novo Genome Assembly Metrics Estimated Using QUAST
| Contigs | Pseudomolecules | |
|---|---|---|
| Number of Contigs/Scaffolds: | 116 | 12 |
| Largest Contig/Scaffold: | 44 448 130 | 72 008 707 |
| Total Length: | 716 161 047 | |
| N50: | 13 367 893 | 63 701 590 |
| N75: | 7 229 460 | 57 022 023 |
| L50: | 15 | 6 |
| L75: | 34 | 9 |
| Number of Ns per 100 Kbps: | 0 | 1.45 |
| GC Content (%): | 34.82 | |
Summary of the Number of Contigs Placed by RaGOO into Each Pseudochromosome
| Chromosome | Number of Contigs Placed | Total Length |
|---|---|---|
| StSOLv1.1ch01 | 16 | 59 557 243 |
| StSOLv1.1ch02 | 7 | 42 706 079 |
| StSOLv1.1ch03 | 8 | 63 701 590 |
| StSOLv1.1ch04 | 12 | 72 008 707 |
| StSOLv1.1ch05 | 8 | 58 542 029 |
| StSOLv1.1ch06 | 7 | 65 489 876 |
| StSOLv1.1ch07 | 10 | 41 124 029 |
| StSOLv1.1ch08 | 17 | 67 850 527 |
| StSOLv1.1ch09 | 12 | 70 978 473 |
| StSOLv1.1ch10 | 13 | 49 170 357 |
| StSOLv1.1ch11 | 3 | 57 022 023 |
| StSOLv1.1ch12 | 3 | 68 020 514 |
Output from BUSCO Analysis Pipelines to Assess Genome Completeness
| Solanaceae (odb10) | |
|---|---|
| Complete BUSCOs: | 2859 (93.7%) |
| Complete and Single Copy BUSCOs: | 2771 |
| Complete and Duplicated BUSCOs: | 88 |
| Fragmented BUSCOs: | 91 |
| Missing BUSCOs: | 102 |
| Total BUSCO groups searched: | 3052 |
Figure 4Circos plot that shows Solyntus assembly (outer ring), heterozygous SNP rate of the short reads (middle ring) and the coverage of the short reads per bin (innermost ring). The alternating green and gray blocks of the outer ring symbolize the contigs as they were placed into pseudochromosomes.
Figure 5Identity plot between Solyntus V1.1 and DM V4.03 pseudochromosomes using D-GENIES with noise filtered out.