Genome sequence of the type strain CLIB 1764T (= CBS 14374T) of the yeast species Kazachstania saulgeensis isolated from French organic sourdough.

Véronique Sarilar1, Lieven Sterck2,3, Saki Matsumoto1, Noémie Jacques1, Cécile Neuvéglise4, Colin R Tinsley1, Delphine Sicard5, Serge Casaregola1.   

Abstract

Kazachstania saulgeensis is a recently described species isolated from French organic sourdough. Here, we report the high quality genome sequence of a monosporic segregant of the type strain of this species, CLIB 1764T (= CBS 14374T). The genome has a total length of 12.9 Mb and contains 5326 putative protein-coding genes, excluding pseudogenes and transposons. The nucleotide sequences were deposited into the European Nucleotide Archive under the genome assembly accession numbers FXLY01000001-FXLY01000017.

Keywords:  Genome; Kazachstania; Saccharomycotina; Sourdough; Yeast

Year:  2017        PMID: 28725555      PMCID: PMC5501885          DOI: 10.1016/j.gdata.2017.07.003

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

https://www.ncbi.nlm.nih.gov/bioproject/PRJEB20516.

Introduction

In bread making, yeasts leaven the dough by fermenting the carbon sources present in flour and contribute to aroma production. In addition to the baker's yeast Saccharomyces cerevisiae, a number of other yeast species can be found in dough, in particular Torulaspora delbrueckii, Wickerhamomyces anomalus and Pichia kudriavzevii, along with several members of the genus Kazachstania, such as Candida humilis (syn. Candida milleri, now Kazachstania humilis), Kazachstania exigua and, less frequently, Kazachstania bulderi and Kazachstania unispora [1], [2]. A recent analysis of French organic sourdough revealed the presence of a novel species, Kazachstania saulgeensis [2], [3]. Here we report a high-quality draft of the genome sequence of a monosporic segregant of the type strain of this species. The availability of the genome of K. saulgeensis will facilitate studies on the role of nonconventional yeasts in dough and the search for alternative baker's yeasts with interesting properties such as novel natural aromas.

Experimental design, materials and methods, results

Spores were isolated from strain CLIB 1764T grown on malt agar as described in [4]. DNA from a single spore grown on YPD medium was prepared as previously described [4]. Mate-pair library preparation from the purified DNA and sequencing (Illumina HiSeq 2500 platform) were performed by BGI Genomics, Shenzhen, China. Two mate-pair libraries of 6-kbp insert size were sequenced, generating 6,055,467 read pairs of 100 bp and 5,496,657 read pairs of 125 bp. After trimming according to quality criteria with Trimmomatic [5], 21,095,636 reads were retained, leading to an apparent 190-fold coverage. The reads were assembled using Platanus v1.2.1 [6] with default parameters. GapCloser v1.12 [7] was used to fill gaps where possible. The resulting assembly consisted of 3748 scaffolds with a maximum length of 2.96 Mb and an N50 length of 1.37 Mb. The cumulative size was 13.99 Mb. The rDNA unit was assembled separately and manually integrated between the two scaffolds identified as flanking the rDNA after mate-pair read mapping using BWA [8]. The resulting scaffold containing the rRNA locus was 0.89 Mb in size. Annotation was performed on the 17 scaffolds larger than 10 kb (cumulative size of 12,935,755 bp, 32.5% GC content), whose sizes ranged from 17.3 kb to 2.95 Mb (Table 1).
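The reported coverage can be sanity-checked with simple arithmetic. The sketch below computes only an upper bound from the raw read counts given above, because the post-trimming read lengths are not reported; dividing by the size of the 17 annotated scaffolds is an assumption, as the text does not say which genome size the authors used.

```python
# Raw sequencing output as reported in the text (read pairs, read length in bp).
libraries = [
    {"read_pairs": 6_055_467, "read_length": 100},
    {"read_pairs": 5_496_657, "read_length": 125},
]
# Assumption: use the cumulative size of the 17 annotated scaffolds as genome size.
ANNOTATED_ASSEMBLY_BP = 12_935_755


def raw_bases(libs):
    """Total bases sequenced, counting both reads of every pair."""
    return sum(2 * lib["read_pairs"] * lib["read_length"] for lib in libs)


def fold_coverage(total_bases, genome_size_bp):
    """Average sequencing depth: sequenced bases divided by genome size."""
    return total_bases / genome_size_bp


total = raw_bases(libraries)
upper_bound = fold_coverage(total, ANNOTATED_ASSEMBLY_BP)
print(f"raw bases: {total:,}")
print(f"upper-bound coverage before trimming: {upper_bound:.1f}x")
```

The untrimmed data give just under 200-fold depth, so an apparent 190-fold coverage after quality trimming is consistent with the read counts.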
Table 1. Genome statistics for the strain CLIB 1764T.

Attribute                                     CLIB 1764T
Genome size (bp)                              12,935,755
Scaffolds > 10 kb                             17
N50                                           1.37 Mb
G + C content                                 33%
Protein coding genes                          5326
Pseudogenes                                   38
tRNA genes                                    197
LTR-retrotransposons (including pseudogenes)  15
Solo Long Terminal Repeats                    278
DNA transposons (including pseudogenes)       6
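
For readers unfamiliar with the N50 statistic reported in Table 1, a minimal sketch of its usual definition follows. The scaffold lengths are hypothetical toy values, not the K. saulgeensis scaffolds, whose individual sizes are not listed here.

```python
def n50(lengths):
    """Return the N50: the length L such that scaffolds of length >= L
    together contain at least half of the total assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("empty list of lengths")


# Hypothetical scaffold lengths in bp (illustration only).
toy_scaffolds = [900, 700, 400, 300, 200, 100]
print(n50(toy_scaffolds))
```

Because the statistic is weighted by length, the thousands of short scaffolds produced by an assembler have little effect on it, which is why an assembly of 3748 scaffolds can still have an N50 above 1 Mb.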
Based on the reference genomes of two related and well-annotated species belonging to the Saccharomycetaceae, Saccharomyces cerevisiae (http://www.yeastgenome.org/) and Lachancea kluyveri [9], a total of 5326 putative protein-coding genes (CDS) and 38 pseudogenes were found using the Amadea Annotation transfer tool (Isoft, France). Functional annotation was performed based on protein similarity with S. cerevisiae. Coding sequences with no similarity to those in S. cerevisiae were annotated using the RefSeq and nr databases at NCBI. Further putative CDS were added after prediction of CDS longer than 150 aa with ORF Finder (http://www.ncbi.nlm.nih.gov/orffinder/) and BLAST analysis against the NCBI non-redundant database, to yield a total of 5326 CDS (Table 1). Some of the gene models were manually curated on the ORCAE platform (http://bioinformatics.psb.ugent.be/orcae/; [10]) and visualized in GenomeView (http://genomeview.org; [11]). Interestingly, an arginase gene with no equivalent in Saccharomycotina yeasts, but with strong sequence similarity to those of Penicillium, is very likely the result of a horizontal gene transfer event.

One entire and one partial Ty3/gypsy retrotransposon were identified, together with 13 Ty-like pseudogenes. A total of 278 Long Terminal Repeats from retrotransposons were identified, belonging to at least 10 subfamilies. One of these subfamilies displays an unusual size of 714 bp, reminiscent of the long LTR found in Kazachstania exigua [12]. Members of two families of hAT DNA transposons, Roamer and Rover [13], [14], [15], with four and two elements respectively, were also identified; all were pseudogenes. A total of 197 tRNA genes were identified using tRNAscan-SE v1.3.1 [16] (Table 1).

We used the available genomes of the type strains of two Kazachstania species, Kazachstania africana and Kazachstania naganishii, to investigate chromosome colinearity between K. saulgeensis and these species [17]. We examined synteny based on the presence and order of orthologous genes using SynChro [18], with Delta = 4 to minimize artifactual synteny breaks. This showed that the rearrangements that have occurred since the last common ancestor of K. saulgeensis, K. africana and K. naganishii are numerous and affect each scaffold equally (Fig. 1).
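
The length filter applied during annotation (predicted CDS longer than 150 aa) can be illustrated with a naive open-reading-frame scan. This is a simplified sketch and not the algorithm used by NCBI ORF Finder: it assumes the standard genetic code stop codons, ATG-only starts and no introns, and the function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_aa=150):
    """Yield (strand, start, end, aa_length) for ATG..stop ORFs longer than min_aa.

    Coordinates are 0-based and end-exclusive on the scanned strand; the stop
    codon is included in the span but not counted in aa_length.
    """
    seq = seq.upper()
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    aa_length = (i - start) // 3
                    if aa_length > min_aa:
                        yield strand, start, i + 3, aa_length
                    start = None  # look for the next ORF in this frame


# Toy sequence (illustration only): Met followed by 200 Ala codons and a stop.
toy = "ATG" + "GCT" * 200 + "TAA"
print(list(find_orfs(toy)))
```

In a real annotation workflow the candidates passing this filter would still need similarity evidence, as in the BLAST step described above, before being retained as CDS.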
Fig. 1

Synteny blocks between the genomes of K. saulgeensis and two other Kazachstania species. Orthology relationships between genes from K. africana, K. naganishii and K. saulgeensis were defined on the basis of bidirectional hits in a blastp comparison (reciprocal best hits) computed by SynChro [18]. The color attributed to the genes of a given K. saulgeensis scaffold is conserved for their counterparts in K. africana and K. naganishii.

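The reciprocal-best-hit criterion mentioned in the caption of Fig. 1 can be sketched as follows. This is an illustration of the general idea only, not SynChro's implementation, which applies its own filters; the gene identifiers and bit scores are hypothetical.

```python
def best_hits(hits):
    """Map each query to its highest-scoring subject.

    `hits` is an iterable of (query, subject, bit_score) tuples, for example
    parsed from tabular blastp output.
    """
    best = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (a, b) pairs where a's best hit is b and b's best hit is a."""
    forward = best_hits(a_vs_b)
    backward = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if backward.get(b) == a)


# Hypothetical blastp hits between two proteomes (illustration only).
saul_vs_afr = [
    ("KASA_g1", "KAFR_g7", 310.0),
    ("KASA_g1", "KAFR_g9", 120.0),
    ("KASA_g2", "KAFR_g9", 250.0),
    ("KASA_g3", "KAFR_g7", 90.0),
]
afr_vs_saul = [
    ("KAFR_g7", "KASA_g1", 305.0),
    ("KAFR_g9", "KASA_g2", 248.0),
    ("KAFR_g9", "KASA_g1", 118.0),
]
print(reciprocal_best_hits(saul_vs_afr, afr_vs_saul))
```

Here KASA_g3 is excluded because its best hit, KAFR_g7, points back to KASA_g1 rather than to KASA_g3.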

Nucleotide accession number

The genome sequences generated in this study are available from the European Nucleotide Archive under the genome assembly accession number GCA_900180425 and the scaffold accession range FXLY01000001–FXLY01000017. The genome can be browsed and searched at http://bioinformatics.psb.ugent.be/orcae/overview/Kasa.
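
To retrieve the scaffolds individually, the accession range above can be expanded into a list of identifiers. A minimal sketch, assuming each accession is an upper-case letter prefix followed by a zero-padded number; the helper names are hypothetical.

```python
import re


def split_accession(accession):
    """Split an accession such as 'FXLY01000001' into ('FXLY', '01000001')."""
    match = re.fullmatch(r"([A-Z]+)(\d+)", accession)
    if match is None:
        raise ValueError(f"unexpected accession format: {accession!r}")
    return match.group(1), match.group(2)


def expand_accession_range(first, last):
    """List every accession from `first` to `last`, inclusive."""
    prefix, first_digits = split_accession(first)
    last_prefix, last_digits = split_accession(last)
    if prefix != last_prefix or len(first_digits) != len(last_digits):
        raise ValueError("accessions do not belong to the same series")
    width = len(first_digits)
    return [
        f"{prefix}{number:0{width}d}"
        for number in range(int(first_digits), int(last_digits) + 1)
    ]


scaffold_accessions = expand_accession_range("FXLY01000001", "FXLY01000017")
print(len(scaffold_accessions), scaffold_accessions[0], scaffold_accessions[-1])
```

The range expands to 17 identifiers, one per annotated scaffold.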

Conflict of interest statement

The authors declare no conflict of interest.

Specifications

Organism/strain          Kazachstania saulgeensis strain CLIB 1764T
Sex                      N/A
Sequencer or array type  Illumina HiSeq 2500, mate pair libraries
Data format              Processed data: genome assembly and annotated embl files
Experimental factors     N/A
Experimental features    Genomic DNA extracted from pure yeast
Consent                  N/A
Sample source location   Sourdough samples obtained from baker Michel Perrin at Ferme des plants, Saulgé, France (46° 20′ 27.08″ N 0° 53′ 12.21″ E)

References

1.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

2.  Domesticated transposase Kat1 and its fossil imprints induce sexual differentiation in yeast.

Authors:  Naghmeh Rajaei; Kishore K Chiruvella; Feng Lin; Stefan U Aström
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-13       Impact factor: 11.205

3.  Three novel ascomycetous yeast species of the Kazachstania clade, Kazachstania saulgeensis sp. nov., Kazachstania serrabonitensis sp. nov. and Kazachstania australis sp. nov. Reassignment of Candida humilis to Kazachstania humilis f.a. comb. nov. and Candida pseudohumilis to Kazachstania pseudohumilis f.a. comb. nov.

Authors:  Noémie Jacques; Véronique Sarilar; Charlotte Urien; Mariana R Lopes; Camila G Morais; Ana Paula T Uetanabaro; Colin R Tinsley; Carlos A Rosa; Delphine Sicard; Serge Casaregola
Journal:  Int J Syst Evol Microbiol       Date:  2016-10-12       Impact factor: 2.747

4.  Comparative genomics of protoploid Saccharomycetaceae.

Authors:  Jean-Luc Souciet; Bernard Dujon; Claude Gaillardin; Mark Johnston; Philippe V Baret; Paul Cliften; David J Sherman; Jean Weissenbach; Eric Westhof; Patrick Wincker; Claire Jubin; Julie Poulain; Valérie Barbe; Béatrice Ségurens; François Artiguenave; Véronique Anthouard; Benoit Vacherie; Marie-Eve Val; Robert S Fulton; Patrick Minx; Richard Wilson; Pascal Durrens; Géraldine Jean; Christian Marck; Tiphaine Martin; Macha Nikolski; Thomas Rolland; Marie-Line Seret; Serge Casarégola; Laurence Despons; Cécile Fairhead; Gilles Fischer; Ingrid Lafontaine; Véronique Leh; Marc Lemaire; Jacky de Montigny; Cécile Neuvéglise; Agnès Thierry; Isabelle Blanc-Lenfle; Claudine Bleykasten; Julie Diffels; Emilie Fritsch; Lionel Frangeul; Adrien Goëffon; Nicolas Jauniaux; Rym Kachouri-Lafond; Célia Payen; Serge Potier; Lenka Pribylova; Christophe Ozanne; Guy-Franck Richard; Christine Sacerdot; Marie-Laure Straub; Emmanuel Talla
Journal:  Genome Res       Date:  2009-06-12       Impact factor: 9.043

5.  GenomeView: a next-generation genome browser.

Authors:  Thomas Abeel; Thomas Van Parys; Yvan Saeys; James Galagan; Yves Van de Peer
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

6.  Evolutionary dynamics of hAT DNA transposon families in Saccharomycetaceae.

Authors:  Véronique Sarilar; Claudine Bleykasten-Grosshans; Cécile Neuvéglise
Journal:  Genome Biol Evol       Date:  2014-12-21       Impact factor: 3.416

7.  SynChro: a fast and easy tool to reconstruct and visualize synteny blocks along eukaryotic chromosomes.

Authors:  Guénola Drillon; Alessandra Carbone; Gilles Fischer
Journal:  PLoS One       Date:  2014-03-20       Impact factor: 3.240

8.  Reconstruction of ancestral chromosome architecture and gene repertoire reveals principles of genome evolution in a model yeast genus.

Authors:  Nikolaos Vakirlis; Véronique Sarilar; Guénola Drillon; Aubin Fleiss; Nicolas Agier; Jean-Philippe Meyniel; Lou Blanpain; Alessandra Carbone; Hugo Devillers; Kenny Dubois; Alexandre Gillet-Markowska; Stéphane Graziani; Nguyen Huu-Vang; Marion Poirel; Cyrielle Reisser; Jonathan Schott; Joseph Schacherer; Ingrid Lafontaine; Bertrand Llorente; Cécile Neuvéglise; Gilles Fischer
Journal:  Genome Res       Date:  2016-05-31       Impact factor: 9.043

9.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937
