| Literature DB >> 35106561 |
Hugo Devillers1, Véronique Sarilar2,3, Cécile Grondin1,2, Lieven Sterck4,5, Diego Segond1, Noémie Jacques6, Delphine Sicard1, Serge Casaregola2, Colin Tinsley2.
Abstract
Recent studies have suggested that species of the Kazachstania genus may be interesting models of yeast domestication. Among these, Kazachstania barnettii has been isolated from various microbially transformed foodstuffs such as sourdough bread and kefir. In the present work, we sequence, assemble, and annotate the complete genomes of two K. barnettii strains: CLIB 433, being one of the two reference strains for K. barnettii that was isolated as a spoilage organism in soft drink, and CLIB 1767, recently isolated from artisan bread-making sourdough. Both assemblies are of high quality with N50 statistics greater than 1.3 Mb and BUSCO score greater than 99%. An extensive comparison of the two obtained genomes revealed very few differences between the two K. barnettii strains, considering both genome structure and gene content. The proposed genome assemblies will constitute valuable references for future comparative genomic, population genomic, or transcriptomic studies of the K. barnettii species.Entities:
Keywords: comparative genomics; sourdough bread; whole-genome sequencing
Mesh:
Year: 2022 PMID: 35106561 PMCID: PMC8825440 DOI: 10.1093/gbe/evac007
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Assembly and Annotation Statistics of the Two Kazachstania barnettii Strains CLIB 433 (Reference) and CLIB 1767 (Sourdough) and of the Kazachstania saulgeensis Strain CLIB 1764 (Sourdough)
| CLIB 433 | CLIB 1767 | CLIB 1764 | |
|---|---|---|---|
| Scaffold count | 15 | 14 | 17 |
| Overall size | 12,610,268 | 12,616,033 | 12,935,755 |
| Scaffold max. length | 2,518,272 | 1,873,880 | 2,959,652 |
| Scaffold min. length | 16,307 | 123,979 | 17,310 |
| Average G + C content (%) | 33.51 | 33.46 | 32.19 |
| N50/L50 | 1,360,346/4 | 1,404,614/4 | 1,371,409/4 |
| Assembly gap count | 89 | 94 | 77 |
| CDS count (pseudo) | 5,322 (48) | 5,316 (49) | 5,376 (67) |
| Intron count | 193 | 193 | 199 |
| tRNA count | 195 | 199 | 196 |
| BUSCO | C: 99.3 (S: 97.6, D: 1.7), F: 0.1, M: 0.6 | C: 99.2 (S: 97.5, D: 1.7), F: 0.1, M: 0.7 | C: 98.9 (S: 97.2, D: 1.7), F: 0.1, M: 1.0 |
Note.—C, complete copy; S, single copy; D, duplicated copy; F, fragmented copy; M, missing.
The cutoff threshold for definition of a scaffold was 10 kb.
Genome size and other lengths are given in bases.
BUSCO version 4.0.5, based on the saccharomycetes_odb10 data set (n = 2,137 proteins).
(A) Synteny blocks between the Kazachstania barnettii strains CLIB 433 (reference) and CLIB 1767 (sourdough) and the Kazachstania saulgeensis strain CLIB 1764 (sourdough). Each synteny block is colored according to the scaffolds of CLIB 433. Thus, for example, blocks from the scaffold 01 in CLIB 433 (in violet) are found in scaffolds 01 and 13 in the assembly of CLIB 1767 and in scaffolds 01, 14, and 16 in the assembly of K. saulgeensis CLIB 1764. (B) Dot plot between the two K. barnettii strains, CLIB 433 (reference) and CLIB 1767 (sourdough) based on MUMmer matches. To facilitate reading and interpretation of, scaffolds are reordered in accordance with the synteny block organization in (A). (C) Comparison of the structure of the scaffolds of the two K. barnettii assemblies. Red arrows represent possible inversion events between the two scaffolds in one possible sequence leading from one strain’s organization to the other. The central band represents the corresponding intermediate organization. Positions of mating type loci are indicated in green.