| Literature DB >> 33920164 |
Carolina Santiago1,2, Teresa Rito1,2,3, Daniel Vieira1,2, Ticiana Fernandes1,2, Célia Pais1,2, Maria João Sousa1,2, Pedro Soares1,2, Ricardo Franco-Duarte1,2.
Abstract
Saccharomyces cerevisiae is the most commonly used yeast in wine, beer, and bread fermentations. However, Torulaspora delbrueckii has attracted interest in recent years due to its properties, ranging from its ability to produce flavor- and aroma-enhanced wine to its ability to survive longer in frozen dough. In this work, publicly available genomes of T. delbrueckii were explored and their annotation was improved. A total of 32 proteins were additionally annotated for the first time in the type strain CBS1146, in comparison with the previous annotation available. In addition, the annotation of the remaining three T. delbrueckii strains was performed for the first time. eggNOG-mapper was used to perform the functional annotation of the deduced T. delbrueckii coding genes, offering insights into its biological significance, and revealing 24 clusters of orthologous groups (COGs), which were gathered in three main functional categories: information storage and processing (28% of the proteins), cellular processing and signaling (27%), and metabolism (23%). Small intraspecies variability was found when considering the functional annotation of the four available T. delbrueckii genomes. A comparative study was also conducted between the T. delbrueckii genome and those from 386 fungal species, revealing a high number of homologous genes with species from the Zygotorulaspora and Zygosaccharomyces genera, but also with Lachancea and S. cerevisiae. Lastly, the phylogenetic placement of T. delbrueckii was clarified using the core homologs that were found across 204 common protein sequences of 386 fungal species and strains.Entities:
Keywords: Torulaspora delbrueckii; fermentation; genome annotation; genomics; non-Saccharomyces yeasts
Year: 2021 PMID: 33920164 PMCID: PMC8070057 DOI: 10.3390/jof7040287
Source DB: PubMed Journal: J Fungi (Basel) ISSN: 2309-608X
Torulaspora delbrueckii genomes used in this study and the corresponding number of protein-coding sequence (CDS) transposable elements and transfer RNA (tRNA) genes predicted by Yeast Genome Annotation Pipeline (YGAP).
| Strain | Source | Reference | Genome Size | BUSCO | Coding | Transposable | tRNA | Homologies | Unidentified |
|---|---|---|---|---|---|---|---|---|---|
| CBS1146 | Unknown; type strain | [ | 9.22 Mb | 98.5 | 4978 | 5 | 191 | 4514 | 464 |
| COFT1 | Wine fermentations | [ | 9.36 Mb | 98.1 | 5009 | 5 | 223 | 4506 | 503 |
| NRRL Y-50541 | Mezcal-fermentations | [ | 11.53 Mb | 80.2 | 4361 | 6 | 180 | 3875 | 486 |
| SRCM101298 | Fermented food | - | 9.68 Mb | 98.2 | 5016 | 7 | 274 | 4513 | 503 |
1 Corresponding to no homologies with S. cerevisiae S288c detected by YGAP.
Nucleotide identity matrix. The values represent the average amounts of pairwise differences (as a percentage) obtained by comparing the four T. delbrueckii genomes assessed in this work. The standard deviations are indicated between brackets.
| CBS1146 | COFT1 | NRRL Y-50541 | |
|---|---|---|---|
| CBS1146 | |||
| COFT1 | 99.54 (±0.71) | ||
| NRRL Y-50541 | 97.98 (±2.28) | 97.89 (±2.29) | |
| SCRM101298 | 99.54 (±0.74) | 99.63 (±0.66) | 97.62 (±2.29) |
Figure 1Total number of tRNA gene annotations obtained with the YGAP pipeline for the four Torulaspora delbrueckii strains.
Figure 2Basic Local Alignment Search Tool (BLAST) top hits distributed by species based on the best sequence alignments and lowest E-values by considering the proteins that were not identified by YGAP in the four T. delbrueckii strains and the five top hits for each protein. Only species with more than 20 top hits are shown.
Figure 3eggNOG classifications of the annotated T. delbrueckii genes. The functional annotations were divided into 24 categories, corresponding to clusters of orthologous groups (COGs). (A) Number of genes grouped in each of the 24 COG categories for the T. delbrueckii COFT1 genome. The colors are indicative of the functional categories used in panel B. (B) Classification of the genes into functional categories, comparing T. delbrueckii COFT1 with the reference S. cerevisiae S288c. (C) Comparison between the four available genomes of T. delbrueckii in terms of the number of grouped genes (as a percentage) in each COG category.
Figure 4Homology comparison between the protein-coding genes of Torulaspora delbrueckii COFT1 genome (used as the reference) and 56 related yeast species/strains. The protein-coding regions of the COFT1 genome were detected using YGAP and the homology was determined using a BLAST analysis. Only species/strains with more than 4600 homologs are presented (with the exception of some S. cerevisiae strains and T. delbrueckii NRRL Y-50541, which were included for comparison); the results for the complete set of 386 yeasts are available in Supplementary Data S8.
Figure 5Phylogeny of fungi regarding 386 fungal core genomes (alignment of 204 common proteins). The phyla are highlighted using dashed lines and the subphyla are identified using colored boxes. The placements of the Torulaspora delbrueckii strains are shown in detail relative to the closely related species inside the Saccharomycotina subphylum. The concatenated alignment was used for the phylogenetic reconstruction by considering the maximum likelihood and 500 bootstrap replicates. The bootstrap values (in percentages) are shown for the highlighted branches. A detailed phylogenetic tree and the full bootstrap values are available in Supplementary Data S10.