| Literature DB >> 32143301 |
Christian Siadjeu1,2, Boas Pucker2,3, Prisca Viehöver2, Dirk C Albach1, Bernd Weisshaar2.
Abstract
Trifoliate yam (Dioscorea dumetorum) is one example of an orphan crop, not traded internationally. Post-harvest hardening of the tubers of this species starts within 24 h after harvesting and renders the tubers inedible. Genomic resources are required for D. dumetorum to improve breeding for non-hardening varieties as well as for other traits. We sequenced the D. dumetorum genome and generated the corresponding annotation. The two haplophases of this highly heterozygous genome were separated to a large extent. The assembly represents 485 Mbp of the genome with an N50 of over 3.2 Mbp. A total of 35,269 protein-encoding gene models as well as 9941 non-coding RNA genes were predicted, and functional annotations were assigned.Entities:
Keywords: D. dumetorum; comparative genomics; genome assembly; nanopore sequencing; read depth; yam
Mesh:
Substances:
Year: 2020 PMID: 32143301 PMCID: PMC7140821 DOI: 10.3390/genes11030274
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Statistics of selected versions of the Dioscorea dumetorum genome assembly (see File S5 for a full table).
| Initial Assembly | Racon1 | Medaka2 | Pilon3 | Final | |
|---|---|---|---|---|---|
| Number of contigs | 1172 | 1172 | 1215 | 1215 | 924 |
| Max. contig length (bp) | 20,187,448 | 20,424,333 | 17,910,017 | 17,878,854 | 17,878,854 |
| Assembly size (bp) | 501,985,705 | 508,061,170 | 507,215,754 | 506,184,192 | 485,115,345 |
| Assembly size without N (bp) | 501,985,705 | 508,061,170 | 507,215,754 | 506,184,192 | 485,115,345 |
| GC content | 37.74% | 37.66% | 37.87% | 37.59% | 37.57% |
| N50 (bp) | 3,896,882 | 3,930,287 | 2,598,889 | 2,593,751 | 3,190,870 |
| N90 (bp) | 136,614 | 138,199 | 137,206 | 136,754 | 156,407 |
| BUSCO (complete) | 85.70% | 89.80% | 91.90% | 92.30% | 92.30% |
Figure 1(a) Distribution of the average sequencing read depth per gene model. Predicted gene models were classified into phase-separated and merged based on the average read depth value deduced from the analysis presented here. The haploid read depth with Illumina short reads ranges from 50-fold to 150-fold. (b) Number of heterozygous sequence variants in phase separated and merged genes. The high proportion of heterozygous variants in merged gene models is due to the mapping of reads originating from two different alleles to the same region of the assembly.