| Literature DB >> 35881674 |
David E Jarvis1, John S Sproul2, Beatriz Navarro-Domínguez3, Karol Krak4, Kate Jaggi1, Yung-Fen Huang5, Tzu-Yun Huang6, Tzu Che Lin7, Eric N Jellen1, Peter J Maughan1.
Abstract
Djulis (Chenopodium formosanum Koidz.) is a crop grown since antiquity in Taiwan. It is a BCD-genome hexaploid (2n = 6x = 54) domesticated form of lambsquarters (C. album L.) and a relative of the allotetraploid (AABB) C. quinoa. As with quinoa, djulis seed contains a complete protein profile and many nutritionally important vitamins and minerals. While still sold locally in Taiwanese markets, its traditional culinary uses are being lost as diets of younger generations change. Moreover, indigenous Taiwanese peoples who have long safeguarded djulis are losing their traditional farmlands. We used PacBio sequencing and Hi-C-based scaffolding to produce a chromosome-scale, reference-quality assembly of djulis. The final genome assembly spans 1.63 Gb in 798 scaffolds, with 97.8% of the sequence contained in 27 scaffolds representing the nine haploid chromosomes of each sub-genome of the species. Benchmarking of universal, single-copy orthologs indicated that 98.5% of the conserved orthologous genes for Viridiplantae are complete within the assembled genome, with 92.9% duplicated, as expected for a polyploid. A total of 67.8% of the assembly is repetitive, with the most common repeat being Gypsy long terminal repeat retrotransposons, which had significantly expanded in the B sub-genome. Gene annotation using Iso-Seq data from multiple tissues identified 75,056 putative gene models. Comparisons to quinoa showed strong patterns of synteny which allowed for the identification of homoeologous chromosomes, and sub-genome-specific sequences were used to assign homoeologs to each sub-genome. These results represent the first hexaploid genome assembly and the first assemblies of the C and D genomes of the Chenopodioideae subfamily.Entities:
Keywords: zzm321990 Gypsyzzm321990 ; LTR; chromosome-scale; long-read sequencing; polyploid evolution; retrotransposon
Mesh:
Year: 2022 PMID: 35881674 PMCID: PMC9356728 DOI: 10.1093/gbe/evac120
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 4.065
Fig. 1.(A) Circular representation of the nine chromosomes in each of the B (outer track, blue), C (green), and D (brown) sub-genomes of C. formosanum. Tick marks represent 10 Mb. From inside to outside, tracks represent the density of Gypsy LTR elements, mapped C. acuminatum reads, 18–24J repeats, telomeric repeats, and genes. (B) Dotplot visualization of syntenic genes between the genomes of quinoa (y-axis) and C. formosanum (x-axis). (C) Hive plot visualization of syntenic genes among the B, C, and D sub-genomes of C. formosanum. For the inside to the outside, chromosomes of each sub-genome are arranged in order from 1–9. (D) Detailed view of the syntenic relationships between chromosomes 1 and 2 of the B, C, and D sub-genomes of C. formosanum.
Fig. 2.(A) Abundance of repeats in the major repetitive element categories for the A, B, C, and D sub-genomes in C. quinoa (Cq), C. berlandieri (Cb), and C. formosanum (Cf). (B) Repetitive element landscape plots for C. formosanum sub-genomes. The y-axis shows TE abundance as a percentage of the genome for major repeat categories. The x-axis shows both sequence divergence (CpG adjusted Kimura distance) relative to consensus sequences for TE superfamilies (below) and an estimate of evolutionary time calculated using LTR sequence evolution rate estimates in rice. (C) Repeat landscapes for the B sub-genome in quinoa and C. berlandieri. (D) Repeat landscape isolating LTR Gypsy elements in C. formosanum, C. quinoa, and C. berlandieri. (E) Repeat landscape of satellite DNAs in sub-genomes of C. formosanum.