| Literature DB >> 32457706 |
Vincent Perkins1, Stéphanie Vignola1, Marie-Hélène Lessard1, Pier-Luc Plante2, Jacques Corbeil2, Eric Dugat-Bony1,3, Michel Frenette4,5, Steve Labrie1.
Abstract
The yeast Geotrichum candidum (teleomorph Galactomyces candidus) is inoculated onto mold- and smear-ripened cheeses and plays several roles during cheese ripening. Its ability to metabolize proteins, lipids, and organic acids enables its growth on the cheese surface and promotes the development of organoleptic properties. Recent multilocus sequence typing (MLST) and phylogenetic analyses of G. candidum isolates revealed substantial genetic diversity, which may explain its strain-dependant technological capabilities. Here, we aimed to shed light on the phenotypic and genetic diversity among eight G. candidum and three Galactomyces spp. strains of environmental and dairy origin. Phenotypic tests such as carbon assimilation profiles, the ability to grow at 35°C and morphological traits on agar plates allowed us to discriminate G. candidum from Galactomyces spp. The genomes of these isolates were sequenced and assembled; whole genome comparison clustered the G. candidum strains into three subgroups and provided a reliable reference for MLST scheme optimization. Using the whole genome sequence as a reference, we optimized an MLST scheme using six loci that were proposed in two previous MLST schemes. This new MLST scheme allowed us to identify 15 sequence types (STs) out of 41 strains and revealed three major complexes named GeoA, GeoB, and GeoC. The population structure of these 41 strains was evaluated with STRUCTURE and a NeighborNet analysis of the combined six loci, which revealed recombination events between and within the complexes. These results hint that the allele variation conferring the different STs arose from recombination events. Recombination occurred for the six housekeeping genes studied, but most likely occurred throughout the genome. These recombination events may have induced an adaptive divergence between the wild strains and the cheesemaking strains, as observed for other cheese ripening fungi. Further comparative genomic studies are needed to confirm this phenomenon in G. candidum. In conclusion, the draft assembly of 11 G. candidum/Galactomyces spp. genomes allowed us to optimize a genotyping MLST scheme and, combined with the assessment of their ability to grow under different conditions, provides a reliable tool to cluster and eventually improves the selection of G. candidum strains.Entities:
Keywords: Galactomyces candidus; Geotrichum candidum; assimilation of carbon compounds; cheese; genome assembly; multilocus sequence typing; yeast
Year: 2020 PMID: 32457706 PMCID: PMC7220993 DOI: 10.3389/fmicb.2020.00737
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Origins and allelic profiles of the 41 G. candidum strains analyzed.
| Strain | Geo complex | Lineage | ST | Allele | Origina | References | |||||
| 15 | A | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | Mold-ripened cheese, Canada | |
| 20 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Mold-ripened cheese, Canada | |
| 21 | C | 1 | 3 | 2 | 3 | 2 | 2 | 2 | 3 | Clotted carrot, Japan | |
| 28 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Mold-ripened cheese, Canada | |
| 34 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Mold-ripened cheese, Spain | |
| 37 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Smear cheese, France | |
| 38 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Mold-ripened cheese, France | |
| 39 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Smear cheese, France | |
| 48 | B | 4 | 5 | 1 | 1 | 2 | 3 | 2 | 4 | Bioreactor contaminant | |
| 73 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Smear cheese, France | |
| 74 | B | 4 | 6 | 1 | 4 | 2 | 1 | 3 | 5 | Grass, France | |
| 75 | B | 4 | 7 | 1 | 5 | 2 | 1 | 4 | 4 | Corn silage, France | |
| 76 | A | 5 | 8 | 1 | 1 | 1 | 1 | 2 | 4 | Milk, France | |
| 436 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Industrial strain–company A | |
| 562 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Milk, MPP 1, Canada | |
| 645 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Smear cheese, Canada | |
| 655 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Mold-ripened cheese, Canada | |
| 664 | B | 4 | 11 | 4 | 4 | 2 | 1 | 2 | 4 | Milk, MPP 1, Canada | |
| 690 | B | 4 | 12 | 5 | 6 | 2 | 1 | 4 | 7 | Milk, MPP 3, Canada | |
| 1024 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company B | |
| 1025 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company B | |
| 1026 | A | 4 | 14 | 1 | 1 | 1 | 1 | 2 | 8 | Industrial strain–company B | |
| 1031 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company A | |
| 1032 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company A | |
| 1033 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company A | |
| 1034 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Industrial strain–company A | |
| 1035 | C | 1 | 15 | 3 | 4 | 2 | 2 | 2 | 3 | Industrial strain–company A | |
| 1036 | A | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | Industrial strain–company A | |
| 1037 | A | 3 | 13 | 5 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company A | |
| 1038 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Industrial strain–company A | |
| 1039 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Industrial strain–company A | |
| 1040 | A | 5 | 4 | 1 | 1 | 1 | 1 | 1 | 4 | Industrial strain–company A | |
| 1041 | A | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | Industrial strain–company A | |
Identification and origins of the Galactomyces spp. strains.
| Species | Strain | Origin | References |
| 1147 | Soil extract, Puerto Rico | CBS 772.71 | |
| 1148 | Cold water retting, Indonesia | CBS 179.60 | |
| 1150 | Orange orchard soil extract, California | CBS 175.89 |
Primer sequences of 11 MLST target loci.
| Locus | Primer sequence 5′→ 3′ | Locus size (bp) | References |
| Fwd-GCTCTTCGTGAGGTTCTTGG | 424 | ||
| Rev-ACCTCGTAGGCATCAGTGCT | |||
| Fwd-CGCCAGTCAGAGAAGGAATA | 355 | ||
| Rev-GTCGACCTGGTTCTTGACAC | |||
| Fwd-AACACAACATTTCCCGTGAG | 652 | ||
| Rev-AGAGCTTAGCGTCAGCACTGA | |||
| Fwd-TGTTCTCAGAGGGTTTCCTG | 558 | ||
| Rev-CCACATCTGAGGATTGTCGT | |||
| Fwd-ACCGCTGAGACTCTTCGCAA | 472 | ||
| Rev-CTCCATGGAAAGCTGCTGGA | |||
| Fwd-GAACGGTGTCTACGGTCTTG | 548 | ||
| Rev-TCAATGTACAGACGGATGGTC | |||
| For-ACCGCTACAACTGGATTTGG | 425 | ||
| Rev-GAGACCTGTTTGAGGGCTTG | |||
| For-CAAGCCAATTGTGCTGAGAA | 465 | ||
| Rev-GGTGTCGTAGGGCAGTTGAT | |||
| For-GCCAAAAAGACCAACCTGTG | 470 | ||
| Rev-CCTCATCCATACGGTTCTGC | |||
| For-ATCATTAACACCCCGGCATA | 501 | ||
| Rev-GTGTCACCAAGCAGAGCAAA | |||
| For-AAGAATATCTGGGATCTTTC | 393 | ||
| Rev-TGAAGAAGAAGTACCAAGAA |
FIGURE 1Morphology of Geotrichum candidum and Galactomyces spp. strains on YEG culture media. (Left) Macroscopic observation of colonies. (Right) Microscopic observation of arthrospores. (A) Mold-like (example: LMA-1028). (B) Intermediate (example: LMA-244). (C) Yeast-like (example: LMA-1146).
Carbon assimilation profile for the G. candidum/Galactomyces spp. strains.
| Genus | |||||||||||
| Strain (LMA) | 77 | 1146 | 1028 | 40 | 563 | 317 | 70 | 244 | 1147 | 1148 | 1150 |
| Geo complex | A | A | A | A | A | B | C | C | |||
| ST | ST1 | ST1 | ST2 | ST4 | ST4 | ST10 | ST3 | ST9 | |||
| W | + | − | + | − | + | − | + | − | − | − | |
| α-ketoglutarate | W | + | W | + | W | + | − | − | − | − | − |
| Malate | + | W | W | W | W | + | W | + | − | − | − |
| Glucose | + | + | + | + | + | + | + | + | + | + | + |
| Inuline | − | − | − | − | − | − | − | − | − | − | − |
| Sucrose | − | − | − | − | − | − | − | − | − | − | − |
| Raffinose | − | − | − | − | − | − | − | − | − | − | − |
| Melibiose | − | − | − | − | − | − | − | − | − | − | − |
| Galactose | + | + | + | + | + | + | + | + | + | + | + |
| Lactose | − | − | − | − | − | − | − | − | − | − | − |
| Trehalose | − | − | − | − | − | − | − | − | − | − | − |
| Maltose | − | − | − | − | − | − | − | − | − | − | − |
| Methyl-glucoside | − | − | − | − | − | − | − | − | − | − | − |
| Cellobiose | − | − | − | − | − | − | − | − | − | − | − |
| Salicin | − | − | − | − | − | − | − | − | − | − | − |
| Sorbose | + | + | + | + | + | + | + | + | + | + | + |
| Rhamnose | − | − | − | − | − | − | − | − | − | − | − |
| Xylose | + | + | + | + | + | + | + | + | + | + | + |
| − | − | − | − | − | − | − | − | − | − | W | |
| Ribose | − | − | − | − | − | − | − | − | − | − | − |
| Glycerol | + | + | + | + | + | + | + | + | + | + | + |
| Adonitol | − | − | + | + | − | W | − | − | − | − | + |
| Mannitol | + | + | + | + | + | + | + | + | + | − | + |
| Succinate | + | + | + | + | + | + | + | + | + | + | + |
| Citrate | − | − | − | − | W | W | W | W | W | − | + |
| Gluconate | − | − | − | − | − | − | − | − | − | − | − |
| Glucosamine | − | − | − | − | − | − | − | − | − | − | − |
| Xylitol | − | − | − | − | − | − | − | − | − | − | − |
| Gluconolactone | + | + | + | + | + | + | + | + | + | − | W |
| Lactate | + | + | + | + | + | + | + | + | W | + | + |
| Bromosuccinate | + | + | − | + | + | + | + | + | N/A | N/A | N/A |
| Amygdalin | W | W | W | W | W | W | W | W | N/A | N/A | N/A |
| Glutamate | + | + | + | + | + | + | + | + | N/A | N/A | N/A |
FIGURE 2Genomic similarity of three Galactomyces spp. and eight G. candidum strains represented by a heatmap obtained from the Euclidean distance between the normalized set of k-mers (31 bp fragments) shared between the partial genomes of Galactomyces spp. and G. candidum strains. Phenetic tree analyses were conducted in MEGA6. (Right panel) Ability of G. candidum/Galactomyces spp. strains to grow on carbon compounds (+ for positive assimilation, − for negative assimilation, and W for weak assimilation).
Assembly statistics for the genome of Geotrichum candidum and Galactomyces spp. strains.
| LMA strain | Mean coverage | Assembly size (Mb) | Total no. of scaffolds | Scaffold N50 (bp) | Longest scaffold (bp) | GC content (%) | BUSCO Completeness (%) |
| 40 | 63× | 23.10 | 1,268 | 32,639 | 167,475 | 41.8 | 90.3 |
| 70 | 70× | 24.17 | 1,450 | 44,909 | 195,322 | 41.4 | 90.2 |
| 77 | 58× | 23.32 | 1,488 | 29,977 | 158,856 | 41.7 | 90.5 |
| 244 | 114× | 23.20 | 836 | 137,000 | 210,021 | 41.5 | 90.3 |
| 317 | 19× | 23.36 | 1,370 | 35,529 | 209,353 | 41.8 | 90.3 |
| 563 | 49× | 23.30 | 1,427 | 31,326 | 124,845 | 41.7 | 89.9 |
| 1028 | 34× | 23.42 | 1,332 | 36,124 | 191,607 | 41.7 | 90.5 |
| 1146 | 54× | 23.35 | 1,363 | 32,892 | 152,162 | 41.7 | 90.3 |
| 1147 | 64× | 21.96 | 3,121 | 13,090 | 102,582 | 40.4 | 89.8 |
| 1148 | 105× | 22.56 | 2,775 | 13,420 | 64,496 | 36.6 | 89.5 |
| 1150 | 55× | 25.98 | 2,668 | 16,289 | 69,389 | 39 | 90.9 |
FIGURE 3Phylogram of the MLST2019 scheme (ALA1, CDC19, SAPT4, GLN4, PGI1, and PGM2) constructed with the PhyML Maximum Likelihood method with 1,000 bootstraps (bootstrap values are shown next to the main nodes) on 41 isolates of Geotrichum candidum. Phylogenetic analyses were performed with Unipro UGENE v1.31.1.
Loci characteristics targeted in the consensus MLST scheme used to genotype 41 isolates of G. candidum.
| Locus | Locus size (bp) | Number of polymorphic sites* | Number of alleles | Proportion of variable sites (%) | Number of alleles per variable site | dN/dS |
| 424 | 8 (1) | 5 | 1.89 | 0.63 | 0.000 | |
| 355 | 5 (1) | 6 | 1.41 | 1.20 | 0.171 | |
| 501 | 16 (2) | 2 | 3.19 | 0.13 | 0.079 | |
| 558 | 18 (2) | 4 | 3.23 | 0.23 | 0.028 | |
| 472 | 6 (4) | 4 | 1.27 | 0.67 | 0.616 | |
| 548 | 11 (1) | 8 | 2.01 | 0.73 | 0.048 |
FIGURE 4Estimation of the population structure of 41 G. candidum strains based on their sequence type profile. For each of the strains, on the abscissa, the estimated contribution, on the ordinate, to its genotype from each of five hypothetical ancestral genotypes is represented by bar plot of different colors. The source of isolation of each strain is shown above the histogram: cheese strain (C), environmental strain (E), industrial strain (I), milk (M). Each color represents a lineage in the G. candidum population. Lineage 1 (60% red and 40% yellow), lineage 2 (green), lineage 3 (blue), lineage 4 (close to 60% yellow and 40% red), and lineage 5 (purple).
FIGURE 5NeighborNet analysis of the combined six loci (ALA1, CDC19, SAPT4, GLN4, PGI1, and PGM2) obtained from 41 G. candidum isolates. Numbering in the figure corresponds to sequence type. Parallelogram formation indicate recombination events. NeighborNet analysis was performed with SplitsTree version 4.
Recombination and linkage disequilibrium statistics.
| Geo group | I | I | ||
| A | 1.0 | 0.21 | 0.4 | 0.5 |
| B | 0.03 | –0.33 | –0.07 | 0.9 |
| C | NA | –0.63 | –0.13 | 1 |
| A–B | ≤0.001 | 0.32 | 0.06 | 0.05 |
| B–C | 0.3 | –0.2 | –0.04 | 0.8 |
| A–C | 0.2 | 1.9 | 0.37 | 0.0001 |
| A–B–C | ≤0.001 | 0.36 | 0.07 | 0.03 |