| Literature DB >> 30834381 |
Hana Lichancová1, Viktória Hodorová1, Karolina Sienkiewicz2, Sarah Mae U Penir3, Philipp Afanasyev4, Dominic Boceck5, Sarah Bonnin6, Siras Hakobyan7, Pawel S Krawczyk8, Urszula Smyczynska9, Erik Zhivkoplias10, Maryna Zlatohurska11, Adrian Odrzywolski12, Eugeniusz Tralle13, Alina Frolova14, Leszek P Pryszcz13, Broňa Brejová15, Tomáš Vinař16, Jozef Nosek1.
Abstract
Saprochaete suaveolens is an ascomycetous yeast that produces a range of fruity flavors and fragrances. Here, we report the high-contiguity genome sequence of the ex-holotype strain, NRRL Y-17571 (CBS 152.25). The nuclear genome sequence contains 24.4 Mbp and codes for 8,119 predicted proteins.Entities:
Year: 2019 PMID: 30834381 PMCID: PMC6395866 DOI: 10.1128/MRA.00094-19
Source DB: PubMed Journal: Microbiol Resour Announc ISSN: 2576-098X
Candidate genome assemblies
| Assembly software | Software version | Polishing procedure | Length of assembly (Mbp) | No. of contigs | No. of contigs >50 kbp | Longest contig | No. of mismatches per 100 kbp | No. of indels per 100 kbp | |
|---|---|---|---|---|---|---|---|---|---|
| SPAdes ( | 3.12.0 | 24.2 | 2,224 | 137 | 640 kbp | 173 kbp | |||
| Canu ( | 1.7.1 | Pilon (2×) | 24.9 | 26 | 24 | 4.2 Mbp | 1.7 Mbp | 41.7 | 13.9 |
| MaSuRCA ( | 3.2.8 | Pilon (1×) | 25.4 | 29 | 23 | 6.8 Mbp | 2.7 Mbp | 45.7 | 7.6 |
| Miniasm ( | 0.3/2.12 | Racon (2×) | 24.5 | 15 | 13 | 3.7 Mbp | 2.8 Mbp | 81.5 | 367.9 |
| DBG2OLC ( | 1.2.4 | Racon (1×), Pilon (1×) | 25.8 | 31 | 24 | 4.2 Mbp | 1.8 Mbp | 55.2 | 35.1 |
| Final | Racon (2×), Pilon (2×) | 24.4 | 12 | 11 | 3.8 Mbp | 2.8 Mbp | 45.6 | 11.0 |
Statistics were produced with Quast v. 4.5 (15). To estimate mismatches and indels, SPAdes assembly based on Illumina short reads was used as a reference. With SPAdes, the result was filtered for length >100 and coverage >10. Canu assembly used only reads overlapping SPAdes by >200 bp, and we filtered out contigs supported by fewer than 5 reads. All assemblies were polished with Pilon v. 1.21 (16) and Racon v. 1.3.1 (17). Most of the size differences between candidate assemblies can be accounted for by mtDNA and rRNA gene fragments as well as other repetitive sequences.