| Literature DB >> 35775942 |
Awie J Hosaka1,2, Rena Sanetomo3, Kazuyoshi Hosaka3.
Abstract
There are over 100 known species of cultivated potatoes and their wild relatives. Many of these species, including cultivated potatoes, share the A genome; these species are mainly distributed in South America and are reproductively isolated from Mexican diploid species. The only diploid A-genome species distributed in Mexico is Solanum verrucosum Schlechtendal, which is also a maternal progenitor of Mexican polyploid species. In this study, we constructed a high-quality de novo assembly of the S. verrucosum genome using PacBio long-read sequencing and Hi-C scaffolding technologies. A monohaploid clone (2n = x = 12) of S. verrucosum was used to reduce assembly difficulty due to the heterozygous nature of the species. The final sequence assembly consisted of 780.2 Mb of sequence, 684.0 Mb of which were anchored to the 12 chromosomes, with a scaffold N50 of 55.2 Mb. Putative centromeres were identified using publicly available data obtained via chromatin immunoprecipitation sequencing against a centromere-specific histone 3 protein. Transposable elements accounted for approximately 61.8% (482.1 Mb) of the genome, and 46,904 genes were functionally annotated. High gene synteny and similarity were revealed among the genomes of S. verrucosum, Solanum commersonii, Solanum chacoense, Solanum phureja, Solanum tuberosum, and Solanum lycopersicum. The reference-quality S. verrucosum genome will provide new insights into the evolution of Mexican polyploid species and contribute to potato breeding programs.Entities:
Keywords: zzm321990 Solanum verrucosumzzm321990 ; A-genome species; Mexican wild potato; genome assembly
Mesh:
Year: 2022 PMID: 35775942 PMCID: PMC9339273 DOI: 10.1093/g3journal/jkac166
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Assembly statistics.
| Primary contigs with PacBio reads | Final scaffolded contigs after Hi-C sequencing | |
|---|---|---|
| Number of contigs | 1,437 | 1,547 |
| Total size, bp | 779,910,189 | 780,238,689 |
| Longest size, bp | 55,137,318 | 84,109,000 |
| Mean size, bp | 542,735 | 504,356 |
| N50 size, bp | 20,992,750 | 55,157,000 |
Fig. 1.Dot plot analysis between S. verrucosum and S. phureja using D-GENIES with the “hide noise” option.
Fig. 2.Putative centromeres. a) Distribution of CENH3 ChIP-seq signals in every 100 kb window in S. verrucosum and S. phureja. b) Dot plot between S. verrucosum and S. phureja for chromosome 12. ChIP-seq signals against the CENH3 proteins of S. verrucosum and S. phureja are shown on the right and at the top of the plot, respectively, and are highlighted on the plot.
Fig. 3.Chromosomal distribution of genes and transposons (a) and the correlations of their locations (b).
Fig. 4.Size and structural differences among 6 species genomes. a) Chromosome size in Mb. b) Synteny plot between S. verrucosum and the other species.
Fig. 5.Orthologous relationships among 6 species genomes. a) UpSet plot of the shared orthogroups among 38,937 orthogroups assigned from a total of 288,020 genes identified in the 6 species genomes. The top 20 orthogroup intersections are shown. b) OrthoFinder-generated phylogenetic tree constructed using 16,964 orthogroups present in all 6 species. S. lycopersicum was used as the outgroup species.