| Literature DB >> 34142133 |
Hideki Hirakawa1, Atsushi Toyoda2, Takehiko Itoh3, Yutaka Suzuki4, Atsushi J Nagano5, Suguru Sugiyama6, Yasuyuki Onodera7.
Abstract
Spinach (Spinacia oleracea) is grown as a nutritious leafy vegetable worldwide. To accelerate spinach breeding efficiency, a high-quality reference genome sequence with great completeness and continuity is needed as a basic infrastructure. Here, we used long-read and linked-read technologies to construct a de novo spinach genome assembly, designated SOL_r1.1, which was comprised of 287 scaffolds (total size: 935.7 Mb; N50 = 11.3 Mb) with a low proportion of undetermined nucleotides (Ns = 0.34%) and with high gene completeness (BUSCO complete 96.9%). A genome-wide survey of resistance gene analogues identified 695 genes encoding nucleotide-binding site domains, receptor-like protein kinases, receptor-like proteins and transmembrane-coiled coil domains. Based on a high-density double-digest restriction-site associated DNA sequencing-based linkage map, the genome assembly was anchored to six pseudomolecules representing ∼73.5% of the whole genome assembly. In addition, we used SOL_r1.1 to identify quantitative trait loci for bolting timing and fruit/seed shape, which harbour biologically plausible candidate genes, such as homologues of the FLOWERING LOCUS T and EPIDERMAL PATTERNING FACTOR-LIKE genes. The new genome assembly, SOL_r1.1, will serve as a useful resource for identifying loci associated with important agronomic traits and for developing molecular markers for spinach breeding/selection programs.Entities:
Keywords: bolting timing; fruit/seed shape; resistance gene analogues; spinach
Year: 2021 PMID: 34142133 PMCID: PMC8231376 DOI: 10.1093/dnares/dsab004
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1Molecular linkage map of spinach, based on the mapping population, 03-009 × 03-336 F2. Black and grey bars on the right side of the map represent QTLs for fruit/seed shape and bolting time, respectively, whose LOD values are given in parentheses. Arrows indicate the loci of protein-coding genes, which were mapped on the sex chromosomes in our previous study. Open triangles indicate the loci of Sol_r1.0_p041.1.g32544.t1, Sol_r1.0_p003.1.g5896.t1, and SoCOL1, with significant homology to BTC1, BvBBX19 and CONSTANTS, respectively.
Summary of the genetic linkage map and the pseudomolecules for spinach
| Linkage groups | Genetic length (cM) | Number of SNPs | Pseudomolecules | Physical length (bp) |
|---|---|---|---|---|
| LG1 | 60.5 | 460 | SOL_r1.0_chr01 | 146,198,663 |
| LG2 | 64.6 | 433 | SOL_r1.0_chr02 | 124,699,619 |
| LG3 | 94.7 | 390 | SOL_r1.0_chr03 | 133,520,741 |
| LG4 | 57.7 | 347 | SOL_r1.0_chr04 | 109,917,612 |
| LG5 | 73.9 | 281 | SOL_r1.0_chr05 | 88,246,263 |
| LG6 | 59.7 | 265 | SOL_r1.0_chr06 | 85,406,467 |
| Total | 411.1 | 2176 | — | 687,989,365 |
Figure 2Genome map of the SOL_r1.0_pseudomolecule. Identifiers of the six pseudomolecules and their associated linkage groups are displayed around the genome map. The locations of the RGAs, NBS-encoding proteins, RLKs, RLPs, and TM-CC proteins, are indicated on the outermost genome map. Red diamonds on the outermost circular map indicate the locations of the downy mildew resistance genes corresponding to Spo12729 (Sol_r1.0_p006.1.g25710.t1), Spo12784 (Sol_r1.0_p006.1.g25727.t7), and Spo12903 (Sol_r1.0_p006.1.g25767.t1). The inner circular maps show percentage of the total repeat sequences (the repeat sequences classified into Copia, Gypsy, SpRE1, and SpRE2), each of the repeat sequence and GC contents, and the number of genes calculated by using a window size of 50 kb with a sliding window size of 10 kb.