| Literature DB >> 33154426 |
Phuong T N Hoang1,2, Anne Fiebig1, Petr Novák3, Jiří Macas3, Hieu X Cao1,4, Anton Stepanenko5,6, Guimin Chen5,6, Nikolai Borisjuk5,6, Uwe Scholz1, Ingo Schubert7.
Abstract
Duckweeds are small, free-floating, morphologically highly reduced organisms belonging to the monocot order Alismatales. They display the most rapid growth among flowering plants, vary ~ 14-fold in genome size and comprise five genera. Spirodela is the phylogenetically oldest genus with only two mainly asexually propagating species: S. polyrhiza (2n = 40; 160 Mbp/1C) and S. intermedia (2n = 36; 160 Mbp/1C). This study combined comparative cytogenetics and de novo genome assembly based on PacBio, Illumina and Oxford Nanopore (ON) reads to obtain the first genome reference for S. intermedia and to compare its genomic features with those of the sister species S. polyrhiza. Both species' genomes revealed little more than 20,000 putative protein-coding genes, very low rDNA copy numbers and a low amount of repetitive sequences, mainly Ty3/gypsy retroelements. The detection of a few new small chromosome rearrangements between both Spirodela species refined the karyotype and the chromosomal sequence assignment for S. intermedia.Entities:
Mesh:
Year: 2020 PMID: 33154426 PMCID: PMC7645714 DOI: 10.1038/s41598-020-75728-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Six linkages due to chromosome rearrangements between S. polyrhiza and S. intermedia (clone 8410) are also present in the sequenced clone 7747. (a) ChrSp05–ChrSp06 = ChrSi06; (b) ChrSp08–ChrSp18 = ChrSi09; (c) ChrSp03–ChrSp17 = ChrSi04; (d) ChrSp10–ChrSp16 = ChrSi11; (e) ChrSp06–ChrSp07–ChrSp14 = ChrSi07; (f) ChrSp03–ChrSp06–ChrSp14 = ChrSi03. See also Fig. 3. Scale bars = 5 µm.
Figure 3Circos plot of genomes of S. polyrhiza 9505 (orange) and S. intermedia 8410 (blue). (A) tracks representing the size of the pseudomolecules with a corresponding scale in 1 Mbp steps, with highlights every 5 Mbps, (B) total length of repeat features (in kbps) (C) gene density and (D) pairwise sequence synteny. The synteny link between gi|13 and Si02 is based on an error in the ON assembly for clone 9509 (see Fig. 5). The region in question actually belongs to gi|2 as does the remaining part of Si02. Gene and repeat density are plotted in 0.5 Mbps bins. Data for S. polyrhiza 9505 are from Michael et al. 2017[25].
Assembly statistics of S. intermedia clones.
| Sequencing technique | PacBio (clone 7747) | ON/Illumina (clone 8410) |
|---|---|---|
| Read coveragea | 37.6× | 191× |
| Chromosome number | 18 | |
| Genome physical sizeb | ~ 160 Mbp | |
| Assembly length (Mbp) | 147.6 | 136.6 |
| Number of pseudomolecules | 18 | |
| Number of contigs | 584 | 86 |
| Number of scaffolds | 420 | 70 |
| Number of assigned scaffolds | 18 pseudomolecules (featuring 63 contigs) ~ 131.4 Mbp total length | 18 pseudomolecules (featuring 34 contigs) ~ 134 Mbp total length |
| Largest scaffold length (Mbp) | 12.5 (Si09) | 13.4 (Si09) |
| N50 scaffold length (Mbp) | 8.3 | 9.25 |
| G + C content (%) | 41.6 | 42.0 |
| Number of predicted gene models | 22,245 | 21,594 |
| Complete genes (C) | 1097 (79.8%) | 1280 (93.1%) |
| Complete and single-copy (S) | 1085 (78.9%) | 1266 (92.1%) |
| Complete and duplicated (D) | 12 (0.9%) | 14 (1.0%) |
| Fragmented genes (F) | 131 (9.5%) | 41 (3.0%) |
| Missing genes (M) | 147 (10.7%) | 54 (3.9%) |
| Total number of BUSCO genes used | 1375 | |
aBased on genome size measurements by FCM.
bMeasured by FCM.
cReference database odb10.
Figure 2Rearrangements between S. polyrhiza (n = 20) and S. intermedia (n = 18), confirmed for chromosomes of clones 8410 and 7747. Enframed: newly found rearrangement. Red boxes: present only in S. polyrhiza; green boxes: sequences present only in S. intermedia. Scale bar = 3 Mbp (based on PacBio assembly for clone 7747). Enumeration is as in Hoang & Schubert (2017)[29] and in Table S3 (Short sequences present in S. p., but not chromosomally assigned in S. i. correspond to “SiUn” and are not considered). 13 ChrSi of the 7747 assembly show telomeric sequences at one (*) or both ends (**), while all pseudomolecules of the 8410 assembly show them at both ends.
Figure 5Evidence for ON mis-assembly of chromosome 13 of S. polyrhiza 9509. The newly tested BACs 029A10, 028I16 and 037J09 (yellow) belong to ChrSp 02 (red) of S. polyrhiza clone 9509 (upper panel) and clone 7498 (lower panel), not to ChrSp13. Scale bars = 5 µm.
Figure 4New rearrangement between ChrSp 20 and ChrSp 15 in S. intermedia (a) The newly tested BAC 013O04 (yellow) belongs to ChrSp 20 (green) in S. polyrhiza; (b,c) BAC 013O04 was translocated to ChrSp 15 (red) forming ChrSi 16 in S. intermedia clones 8410 and 7747. Scale bars = 5 µm.
Repeat proportions [%] estimated for unassembled sequence reads and genome assemblies of S. intermedia clones 8410 and 7747.
| Repeat | 8410 | 8410 | 7747 |
|---|---|---|---|
| Illumina reads | Assembly | Assembly | |
| Athila | 7.44 | 7.23 | 8.57 |
| CRM | 1.09 | 1.02 | 1.13 |
| Reina | 0.36 | 0.32 | 0.30 |
| Galadriel | 0.05 | 0.04 | 0.05 |
| Tekay | 0.02 | 0.00 | 0.00 |
| Ale | 1.87 | 1.78 | 1.79 |
| Ivana | 0.94 | 0.91 | 0.97 |
| Tork | 0.69 | 0.71 | 0.70 |
| Ikeros | 0.40 | 0.37 | 0.39 |
| Unclassified | 0.23 | 0.21 | 0.27 |
| LTR unclass | 1.01 | 0.93 | 1.08 |
| LINE | 0.37 | 0.42 | 0.49 |
| Satellite | 0.17 | 0.06 | 0.09 |
| Microsat. (GA)n | 0.67 | 1.02 | 0.98 |
| rDNA | 0.47 | 0.34 | 0.06 |
| Unclassified | 4.21 | 3.11 | 3.35 |
| Total | 20.00 | 18.49 | 20.22 |
| RepeatScout | n.a | 23.11 | 25.58 |
Figure 6Schematic representation of rDNA loci in S. intermedia 8410. (a) Depiction of 45S rDNA loci at chromosomes ChrSi01 and ChrSi06; Tel = telomere. (b) Depiction of 5S rDNA loci at chromosome ChrSi15, and ChrSi14. The ChrSi15 locus contains 31 5S rDNA units with NTS of type-1; the ChrSi14 locus is composed of two clusters of 7 and 13 5S rDNA units of type-2 NTS, separated by a doubled 6 kb sequence of unknown function.