| Literature DB >> 29249661 |
Laia Carreté1, Ewa Ksiezopolska1, Cinta Pegueroles1, Emilia Gómez-Molero2, Ester Saus1, Susana Iraola-Guzmán1, Damian Loska1, Oliver Bader2, Cecile Fairhead3, Toni Gabaldón4.
Abstract
Candida glabrata is an opportunistic fungal pathogen that ranks as the second most common cause of systemic candidiasis. Despite its genus name, this yeast is more closely related to the model yeast Saccharomyces cerevisiae than to other Candida pathogens, and hence its ability to infect humans is thought to have emerged independently. Moreover, C. glabrata has all the necessary genes to undergo a sexual cycle but is considered an asexual organism due to the lack of direct evidence of sexual reproduction. To reconstruct the recent evolution of this pathogen and find footprints of sexual reproduction, we assessed genomic and phenotypic variation across 33 globally distributed C. glabrata isolates. We cataloged extensive copy-number variation, which particularly affects genes encoding cell-wall-associated proteins, including adhesins. The observed level of genetic variation in C. glabrata is significantly higher than that found in Candida albicans. This variation is structured into seven deeply divergent clades, which show recent geographical dispersion and large within-clade genomic and phenotypic differences. We show compelling evidence of recent admixture between differentiated lineages and of purifying selection on mating genes, which provides the first evidence for the existence of an active sexual cycle in this yeast. Altogether, our data point to a recent global spread of previously genetically isolated populations and suggest that humans are only a secondary niche for this yeast.Entities:
Keywords: Candida glabrata; adhesion; evolution; human fungal pathogens; mating; population genomics
Mesh:
Year: 2017 PMID: 29249661 PMCID: PMC5772174 DOI: 10.1016/j.cub.2017.11.027
Source DB: PubMed Journal: Curr Biol ISSN: 0960-9822 Impact factor: 10.900
Information About the 33 C. glabrata Isolates Analyzed in the Present Study, Including the Reference CBS138
| Sample ID | Synonymous ID | Mean Coverage | Site | Country | Mating Type | CC | RT | Source Data |
|---|---|---|---|---|---|---|---|---|
| BG2 | US01BG2Blo | 84.192 | blood | USA | 15 | 15 | [ | |
| CST34 | US000NY034 | 476.944 | blood | USA | alpha | 64 | 64 | [ |
| CST35 | US003NY035 | 489.168 | blood | USA | alpha | 77 | 77 | [ |
| E1114 | EB1114Mou | 120.349 | mouth | Belgium | 15 | 15 | [ | |
| EB0911Sto | − | 339.156 | stool | Belgium | alpha | 77 | 94 | [ |
| EF0616Blo1 | − | 261.098 | blood | France | 52 | 52 | [ | |
| EF1237Blo1 | − | 301.835 | blood | France | 52 | 52 | [ | |
| EF1620Sto | − | 285.374 | stool | France | 52 | 98 | [ | |
| EI1815Blo1 | − | 301.624 | blood | Italy | alpha | 52 | 52 | [ |
| EG01004Sto | − | 262.568 | stool | Germany | 15 | 17 | [ | |
| F03013 | EF0313Blo1 | 340.021 | blood | France | 15 | 13 | [ | |
| F11 | F11017, EF1117Blo1 | 69.306 | blood | France | NA | 88 | [ | |
| F15021 | EF1521Blo1 | 116.500 | blood | France | 15 | 15 | [ | |
| F15 | F15035, EF1535Blo1 | 80.737 | blood | France | 41 | 41 | [ | |
| M17 | US02Bal017 | 121.681 | blood | USA | 6 | 6 | [ | |
| P35_2 | P35-2 | 285.029 | mouth | Taiwan | NA | 106 | [ | |
| P35_3 | P35-3 | 245.637 | mouth | Taiwan | alpha | NA | 106 | [ |
| B1012M | EB1012MouC | 302.046 | mouth | Belgium | alpha | NA | 103 | [ |
| B1012S | EB1012StoC | 240.391 | stool | Belgium | alpha | 64 | 102 | [ |
| BO101S | EB0101StoC | 312.022 | stool | Belgium | alpha | 64 | 104 | [ |
| CST109 | US003NY109 | 294.590 | blood | USA | alpha | 64 | 66 | [ |
| CST110 | US003NY110 | 235.403 | blood | USA | 15 | 15 | [ | |
| CST78 | US003NY078 | 266.426 | blood | USA | 6 | 8 | [ | |
| CST80 | US003NY080 | 241.972 | blood | USA | alpha | 64 | 64 | [ |
| EB101M | EB0101MouC | 300.099 | mouth | Belgium | alpha | 64 | 104 | [ |
| F1019 | EF1019Blo1 | 274.760 | blood | France | 6 | 6 | [ | |
| F1822 | EF1822Blo1 | 291.361 | blood | France | 6 | 10 | [ | |
| F2229 | EF2229Blo1 | 652.126 | blood | France | 6 | 7 | [ | |
| I1718 | EI1718Blo1 | 217.526 | blood | Italy | 6 | 5 | [ | |
| M12 | US02Bal012 | 268.030 | blood | USA | alpha | 6 | 11 | [ |
| M6 | US02Bal006 | 247.431 | blood | USA | 15 | 15 | [ | |
| M7 | US02Bal007 | 314.610 | blood | USA | alpha | 64 | 65 | [ |
| CBS138 | ATCC 2001 | NA | stool | Belgium | alpha | NA | 62 | [ |
Columns indicate, in this order: strain name or ID; synonym (if any); mean sequencing coverage (if sequenced in this study); body site of isolation; country of isolation; mating type; CC (clonal complex); RT (repeat type); publication describing the source. NA, not assigned.
Commensal strain.
Figure 1Population Structure of the 33 Strains of C. glabrata
Distribution and population structure of the 33 strains of C. glabrata based on SNP data analysis.
(A) 3D scatterplot of the multiple correspondence analysis (MCA), in which the different colors designate the seven clades detected.
(B) Phylogenetic tree computed using a ML approach. Super-indices indicate pairs of strains in which the two originate from the same patient (different body site or different isolation date; see Table 1 for more details). Clades from I to VII were designated using the same colors as in (A).
(C) Population admixture using STRUCTURE software with K = 7 using the same colors as in (A).
(D) Mean FST for all pairwise comparisons between the seven clades. Fisher test was used to analyze the association with geographical structure (p(country-clade) = 0.006), body site of isolation (p(site-clade) = 0.157), and mating type (p(mating-clade) = 6.064e-05).
See also Figures S1 and S2.
Figure 2Structural Variations in the Analyzed Strains of C. glabrata
Heatmap showing the deletions, duplications, and aneuploidies (Anpl.) detected in the analyzed strains of C. glabrata sorted by clade. Reference (CBS138) and chromosomes with aneuploidies in affected strains (see below) or genomes with unstable coverage are not shown. The heatmap at the top of the figure designates gene information: light gray, gray, and black represent genes in a tandem duplication (T), orphan genes (O), and genes encoding GPI-anchored adhesin-like proteins (A), respectively. The heatmap colored in green designates the 46 genes affected by deletions, and the heatmap colored in red designates the 62 genes affected by duplications. Aneuploidies are indicated with a light gray background with the letter of the chromosome affected. Fisher test was used to test the significant enrichment in genes encoding GPI-anchored adhesin-like proteins (p < 1.4e-26 and 5.2e-31 in deletions and duplications, respectively), orphan proteins (p < 0.051 and 0.436 in deletions and duplications, respectively), and genes in a tandem duplication (p < 1.5e-09 and 3.0e-05 in deletions and duplications, respectively). See Data S1 for the complete list of genes affected.
See also Figures S3–S5.
Figure 3Chromosomal Re-arrangements
(A) Diagram showing the 20 different large re-arrangements found in C. glabrata. Re-arrangements are grouped in 17 different conformations, including 14 translocations and 3 inversions. Chromosomes are indicated in letters from A to M and each one in a different color. Re-arrangement in a chromosome is indicated with the colors of the chromosomes affected. Arrows near the inversion plots indicate the relative orientation of the indicated fragments.
(B) Heatmap showing the distribution of annotated re-arrangements (1–20) and the 26 affected strains. Asterisks indicate strains not included in the analysis due to high fragmentation of the assembly.
See also Figure S4.
Figure 4Recombination Analyses
(A) Profile of SNP densities obtained when comparing the genomes of strains in clade I and clade II using non-overlapping 10-kb windows along the entire genome. The bar at the top indicates the order and relative length of C. glabrata chromosomes in CBS138. The first profile indicates SNP density between the two strains from clade I (M7 versus B1012M). Second and third profiles indicate SNP density between EB0911Sto and CST35 from clade II versus clade I (using B1012M as a reference for this clade). The fourth profile indicates SNP density between the two strains of clade II. Boxes indicate regions without SNPs, which is indicative of recombination.
(B) Distribution of the recombination rate (rho) across chromosomes, estimated from SNP data using the interval program implemented in the LDhat v2.2 package. For this figure, we selected chromosome A and chromosome B as an example (recombination rate 0.008 and 0.004, respectively). A complete illustration including all chromosomes is found in Figure S6.
(C) Visual representation of the population genetic structure and recombination events inferred by fastGEAR. The bar at the top indicates the order and relative length of C. glabrata chromosomes. We provide two panels corresponding to ancestral recombinations (occurred before the most recent common ancestor of both clusters) and recent recombinations (occurred after the diversification of the clusters). In each panel, the different colors designate each lineage; rows correspond to sequences and columns to positions.
(D) Analysis of the region surrounding the deletion in CAGL0C00847 g gene is shown as an example. First, we selected regions containing the gene of interest and 1-kb flanking regions in the 33 strains. Second, we estimated recombination rates (rho/bp) in the selected regions and computed phylogenetic networks (see STAR Methods). Bottom left: plot showing recombination rates along the genomic region. Values higher than 0.05 indicate a recombination hotspot. Bottom right: NeighborNet splits network showing gene flow between strains and the phylogenetic signal in the region. Clades are indicated as dots of different colors, and the length of the edges is proportional to the weight of the associated split. Strains of different clades cluster together, suggesting a much closer genetic relationship than expected from the genome-wide analysis, which is indicative of recombination between different clades.
Figure 5Ratio of Non-synonymous and Synonymous Nucleotide Diversity and Mating-type Switching in C. glabrata
(A) Ratio of non-synonymous and synonymous nucleotide diversity (πN/πS) in genes involved in mating and recombination in C. glabrata, and in their one-to-one orthologs in C. albicans and S. cerevisiae. Dark blue plots show overall πN/πS values in each category, and light blue plots show specific groups of genes included in each category. The most distant outliers are not shown, as the length of the y axis was limited to 2. NHEJ, non-homologous end joining.
(B) Organization of mating-type loci in C. glabrata: MTL1 in white, MTL2 in green, and MTL3 in blue. MTL1 is shown enlarged on the right, encoding either a- or alpha-type genes.
(C) Diagram of the four cases of mating-type switching events likely to have occurred in sequenced strains. BS, before switching; AS, after switching; MMR, mismatch repair; NER, nucleotide excision repair.
See also Figure S7 and Tables S1 and S2.
Figure 6Phenotype Analysis Testing the Growth Rate Using Seven Different Conditions
(A) Growth curves for the 7 different conditions. The first condition was YPD as a normal medium for growth. Following conditions were H2O2, NaCl, DTT, high temperature (41.5 T°), basic pH (pH 9), and acid (pH 2). Unless indicated otherwise, all growth curves were carried out at 37°C. The y axis shows the optical density (OD) for each clade, and the x axis shows time (in hours).
(B) Principal component analysis (PCA) showing the relationship between the statistics values for the growth curves and the distribution of the strains.
(C) Heatmap with growth rate value (r) for all strains and normalized values with the growth rate from the reference strain.
(D) Results represent the averages of three independent replicas of four technical repeats each. Positive controls are well-characterized clinical isolates from urine and respiratory material, respectively, with known high-adherence phenotypes [30]. Error bars indicate SD from mean values.
See also Tables S3, S4, and S5 and Data S2.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Penicillin / Streptomycin solution | THERMO FISHER SCIENTIFIC, S.L.U. | 15070063 |
| Chloroform-isoamylalcohol (24:1) | Sigma-Aldrich | C0549-1PT |
| Isopropanol | Merck | LOT0476145 |
| Ethanol | SIGMA-ALDRICH QUIMICA S.L | 51976-500ML-F |
| Ribonuclease A from bovine pancreas | SIGMA-ALDRICH QUIMICA S.L | R6513-10MG |
| T4 DNA polymerase | New England Biolabs | M0201L |
| dATP | New England Biolabs | N0440S |
| 3′-5′-exo- Klenow fragment | New England Biolabs | M0212L |
| T4 DNA ligase | New England Biolabs | M0202L |
| Phusion DNA polymerase | Finnzymes | F530S |
| dNTPs | This study | R0181 |
| Hydrogen Peroxide | SIGMA-ALDRICH QUIMICA S.L. | 16911-250ML-F |
| Dithiothreitol | LIFE TECHNOLOGIES S.A. | R0861 |
| 0.1% (w/v) crystal violet | Fisher Scientific | 2479-4 |
| Sodium dodecyl sulfate 10% SDS, 100 mL | SIGMA-ALDRICH QUIMICA S.L. | 71736-100ML |
| Sabouraud agar plates | Oxoid | PO0410 |
| Fluconazole | SIGMA-ALDRICH QUIMICA S.L. | F8929-100MG |
| Isavuconazole | CLINISCIENCES SL | A15783-2 |
| Posaconazole | SIGMA-ALDRICH QUIMICA S.L. | 32103-25MG |
| Voriconazole | SIGMA-ALDRICH QUIMICA S.L. | PZ0005-5MG |
| Micafungin | MOLPORT | MolPort-035-789-689 |
| Caspofungin | SIGMA-ALDRICH QUIMICA S.L. | SML0425-5MG |
| 5-Fluorcytosine | SIGMA-ALDRICH QUIMICA S.L. | F7129-1G |
| Amphotericin B | SIGMA-ALDRICH QUIMICA S.L | A4888-100MG |
| Methanol (Reag. Ph. Eur.) for analysis, ACS, ISO | PANREAC QUIMICA SLU | 1310911211 |
| D-(+)-Glucose anhydrous, free-flowing, Redi-Dri, ≥ 99.5% | SIGMA-ALDRICH QUIMICA S.L. | RDD016-1KG |
| MOPS ≥ 99.5% (titration), 250 g | SIGMA-ALDRICH QUIMICA S.L. | M1254-250G |
| Antibiotic Broth for microbiology (AM 3) | SIGMA-ALDRICH QUIMICA S.L. | 70184-500G |
| RPMI-1640 Medium With L-glutamine, without sodium bicarbonate, powder, suitable for cell culture | SIGMA-ALDRICH QUIMICA S.L. | R6504-10L |
| Agarose | Cultek, SL | H350000 |
| Ethidium Bromide | ThermoFisher Scientific | 15585011 |
| Glass beads | SIGMA-ALDRICH QUIMICA S.L. | G8772-100G |
| QIAquick PCR purification kit | QIAGEN | 50928106 |
| MinElute spin columns | QIAGEN | 28004 |
| MasterPure Yeast DNA Purification Kit | EPICENTRE | MPY80200 |
| Pfu DNA polymerase | PROMEGA | M7745 |
| Sequence data | This study | PRJNA361477 |
| Sequence data | [ | PRJNA222546 |
| Sequence data | [ | PRJNA310957 |
| Sequence data | [ | PRJNA297263 |
| This study | BG2 | |
| This study | CST34 | |
| This study | CST35 | |
| This study | E1114 | |
| This study | EB0911Sto | |
| This study | EF0616Blo1 | |
| This study | EF1237Blo1 | |
| This study | EF1620Sto | |
| This study | EI1815Blo1 | |
| This study | EG01004Sto | |
| This study | F03013 | |
| This study | F11 | |
| This study | F15021 | |
| This study | F15 | |
| This study | M17 | |
| This study | P35_2 | |
| This study | P35_3 | |
| This study | B1012M | |
| This study | B1012S | |
| This study | BO101S | |
| This study | CST109 | |
| This study | CST110 | |
| This study | CST78 | |
| This study | CST80 | |
| This study | EB101M | |
| This study | F1019 | |
| This study | F1822 | |
| This study | F2229 | |
| This study | I1718 | |
| This study | M12 | |
| This study | M6 | |
| This study | M7 | |
| [ | CBS138 | |
| FWD1: TTGGTCTGTTCCTGAGCCGG | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| FWD2: ACGAACTGGATAGCACCTCC | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| FWD3: ATACTGTGACCTTCCCTGTT | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| REV1: CTCAGCATTGGCAGTAGTGG | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| REV2: CTTCGCTCCGTGGGTAAACA | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| REV3: CTTCAGATTGGCAGTGTCGG | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL1_Forward: CGGTCTGATGGTGCAATTGT | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL1_Reverse: TTGAGTCAAGTGTCGAGGCT | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL2_Forward: GCTCTTCACTCAACGTACTCC | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL2_Reverse: TTTACAAACCCACACCGAGG | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL3_Forward: GTGAGCACTTTGGACCTTCA | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| MTL3_reverse: ACCATAGTCAGACCACCGAC | SIGMA-ALDRICH QUIMICA S.L. | N/A |
| Trimmomatic v0.36 | [ | |
| SOAPdenovo2 r240 | [ | |
| SPAdes v3.1.1 | [ | |
| AUGUSTUS v3.2.3 | [ | |
| OrthoMCL v2.0.9 | [ | |
| BWA 0.7.12 | [ | |
| Wgsim, v0.3.1 | N/A | |
| GATK v3.3 | [ | |
| SAMtools | [ | |
| LDhat v2.2 | [ | |
| fastGEAR | [ | |
| RDP4 v4.15 | [ | |
| SplitTree v4 | [ | |
| Mauve v2.4.0 | [ | |
| BlastN | [ | |
| Mugsy v1.2.3 | [ | |
| TrimAl v.1.4 | [ | |
| RAxML v7.3.5 | [ | |
| STRUCTURE v2.3.4 | [ | |
| PopGenome, R package | [ | |
| Growthcurver v0.2.1, R package | [ | |
| ggplot2, R package | [ | |
| ade4, R package | [ | |
| dbSNP | NCBI | |
| MICROPLATE, 96 WELL, PS, F-BOTTOM, CLEAR, STERILE, 2 PCS./BAG | Greiner Bio-One North America | 655161 |
| LID, PS, HIGH PROFILE (9 MM), CLEAR, STERILE | Greiner Bio-One North America | 656161 |