| Literature DB >> 34909054 |
A Arastehfar1, M Marcet-Houben2,3,4, F Daneshnia1, S J Taj-Aldeen5, D Batra6, S R Lockhart6, E Shor1,7, T Gabaldón2,3,4, D S Perlin1,7,8.
Abstract
Candida glabrata is the second leading cause of candidemia in many countries and is one of the most concerning yeast species of nosocomial importance due to its increasing rate of antifungal drug resistance and emerging multidrug-resistant isolates. Application of multilocus sequence typing (MLST) to clinical C. glabrata isolates revealed an association of certain sequence types (STs) with drug resistance and mortality. The current C. glabrata MLST scheme is based on single nucleotide polymorphisms (SNPs) at six loci and is therefore relatively laborious and costly. Furthermore, only a few high-quality C. glabrata reference genomes are available, limiting rapid analysis of clinical isolates by whole genome sequencing. In this study we provide long-read based assemblies for seven additional clinical strains belonging to three different STs and use this information to simplify the C. glabrata MLST scheme. Specifically, a comparison of these genomes identified highly polymorphic loci (HPL) defined by frequent insertions and deletions (indels), two of which proved to be highly resolutive for ST. When challenged with 53 additional isolates, a combination of TRP1 (a component of the current MLST scheme) with either of the two HPL fully recapitulated ST identification. Therefore, our comparative genomic analysis identified a new typing approach combining SNPs and indels and based on only two loci, thus significantly simplifying ST identification in C. glabrata. Because typing tools are instrumental in addressing numerous clinical and biological questions, our new MLST scheme can be used for high throughput typing of C. glabrata in clinical and research settings.Entities:
Keywords: Candida glabrata; MLST; Sequence type; Whole-genome sequencing
Year: 2021 PMID: 34909054 PMCID: PMC8640552 DOI: 10.1016/j.simyco.2021.100133
Source DB: PubMed Journal: Stud Mycol ISSN: 0166-0616 Impact factor: 16.097
List of clinical Candida glabrata isolates used in this study.
| Isolate # | Original # | Sequence type | Minimum inhibitory concentration (μg/ml) | Susceptibility profiles | Comments | |||
|---|---|---|---|---|---|---|---|---|
| Fluconazole | Micafungin | Anidulafungin | AMB | |||||
| 1 | CAS08-0293 | ST3 | 64 | 4 | 4 | 1 | MDR | |
| 2 | CAS08-0725 (CMD00311) | ST3 | 64 | 0.5 | 2 | 1 | MDR | |
| 3 | CAS09-0869 (CMD00373) | ST3 | 64 | 0.5 | 2 | 1 | MDR | |
| 4 | BG2 | ST3 | 4 | 0.015 | 0.125 | 1 | S | |
| 5 | Qatar 36 | ST3 | 4 | 0.015 | 0.125 | 0.5 | S | |
| 6 | DPK 159 | ST3 | 4 | 0.015 | 0.125 | 1 | S | |
| 7 | DPL27 (5962) | ST3 | 4 | 2 | 4 or 2 | 1 | ECR | |
| 8 | ATCC 66032 | ST6 | 4 | 0.015 | 0.06 | 0.5 | S | |
| 9 | CAS09-1225 (CGA00908) | ST6 | > 64 | 4 | 2 | 2 | MDR | |
| 10 | CAS09-1786 (CGA01258) | ST6 | 64 | 4 | 2 | 1 | MDR | |
| 11 | DPL155 (M234) | ST6 | 1 | 0.25 | 1 | 1 | ECR | |
| 12 | DPL157 (17351, CGC2) | ST6 | 64 | 4 | 4 | 1 | MDR | |
| 16 | CAS08-0089 | ST8 | 4 | 0.015 | 0.125 | 1 | S | |
| 17 | CAS08-494 | ST8 | 4 | 0.015 | 0.125 | 1 | S | |
| 18 | CAS08-528 | ST8 | 4 | 0.015 | 0.125 | 1 | S | |
| 19 | CAS11-3112 (CGA02083) | ST8 | 64 | 0.25 | 0.5 | 1 | MDR | |
| 20 | DPL209 (M2952) | STX | 64 | 4 | 4 | 1 | MDR | |
| 25 | CAS08-0205 | ST10 | 4 | 0.015 | 0.125 | 0.5 | S | |
| 26 | CAS09-1083 (CGA00822) | ST10 | 64 | 0.5 | 1 | 0.5 | MDR | |
| 27 | CAS09-1437 (CGA01045) | ST10 | 64 | 2 | 1 or 0.5 | 1 | MDR | |
| 28 | DPK 305 | ST10 | 4 | 0.015 | 0.125 | 1 | S | WGS |
| 29 | CAS08-0425 (CMD00008) | ST10 | 64 | 4 | 4 | 1 | MDR | WGS |
| 35 | DPL1021 (ATCC 90030) | ST10 | 4 | 0.015 | 0.125 | 1 | S | WGS |
| 21 | Qatar 38 | ST15 | 4 | 0.015 | 0.25 | 1 | S | |
| 22 | Qatar 57 | ST15 | 4 | 0.015 | 0.125 | 1 | S | |
| 23 | Qatar 71 | ST15 | > 64 | 0.06 | 0.25 | 1 | FLZR | |
| 24 | ATCC 2001 (CBS 138) | ST15 | 4 | 0.015 | 0.125 | 0.5 | S | |
| 30 | CAS08-0027 | ST15 | 64 | 0.015 | 0.125 | 1 | FLZR | WGS |
| 31 | CAS08-485 | ST15 | 4 | 0.015 | 0.125 | 1 | S | |
| 32 | CAS09-755 | ST15 | 4 | 0.015 | 0.125 | 1 | S | |
| 33 | DPK 762 | ST15 | 4 | 0.015 | 0.125 | 1 | S | WGS |
| 34 | DPL274 (3-CPH-W20800) | ST15 | 1 | 1 | 4 | 4 | ECR | |
| 36 | CAS08-0092 | STY | 2 | 0.015 | 0.125 | 1 | S | |
| 37 | CAS11-3129 (CMD01408 | STY | 64 | 4 | 4 | 0.5 | MDR | |
| 38 | DPL245 (1611) | STY | 16 | 4 | 4 | 0.5 | ECR | WGS |
| 39 | DPL38 (42997) | STY | 32 | 1 | 2 | 0.25 | ECR | |
| 40 | CAS08-0016 (CGA00019) | STY | 64 | 2 | 1 | 1 | MDR | WGS |
| 41 | DPL217 | ST17 | 2 | 0.5 | 1 | 0.5 | ECR | |
| 42 | DPL219 | ST17 | 2 | 0.5 | 1 | 0.5 | ECR | |
| 44 | LB599-02 | ST46 | 4 | 0.015 | 0.125 | 1 | S | |
| 45 | LB906-05 | ST46 | 64 | 0.015 | 0.125 | 1 | FLZR | |
| 46 | Qatar 51 | ST46 | 2 | 0.015 | 0.125 | 1 | S | |
| 48 | Qatar 59 | ST46 | 2 | 0.015 | 0.06 | 1 | S | |
| 50 | Qatar 62 | ST46 | 2 | 0.015 | 0.06 | 1 | S | |
| 53 | Qatar 69 | ST46 | 4 | 0.03 | 0.25 | 1 | S | |
| 54 | Qatar 72 | ST46 | 64 | 0.015 | 0.125 | 1 | FLZR | |
| 56 | Qatar 76 | ST46 | 4 | 0.015 | 0.125 | 1 | S | |
| 47 | Qatar 53 | ST7 | 2 | 0.015 | 0.06 | 1 | S | |
| 49 | Qatar 61 | ST7 | 4 | 0.015 | 0.06 | 1 | S | |
| 51 | Qatar 63 | ST7 | 4 | 0.015 | 0.06 | 1 | S | |
| 52 | Qatar 64 | ST7 | 2 | 0.015 | 0.06 | 1 | S | |
| 55 | Qatar 73 | ST7 | 4 | 0.015 | 0.125 | 1 | S | |
| 57 | CAS08-0094 | ST76 | > 64 | 2 | 4 | 1 | MDR | |
AMB: Amphotericin B, MDR: Multidrug-resistant, S: Susceptible, ECR: Echinocandin-resistant, FLZR: Fluconazole-resistant, WGS: Whole-genome sequenced.
Oligos used in this study, their functions, and their PCR conditions.
| Oligo name | Function | Oligo sequence (5'--3') |
|---|---|---|
| PCR/Sequencing | CCTGGAGAAGATGTATGTGTT | |
| PCR/Sequencing | TCTTCATGGTCGTGCTGAT | |
| PCR/Sequencing | TATCAAGTGTGTGGTGCC | |
| PCR/Sequencing | CGTGTTCGACAGATTGTCC | |
| PCR/Sequencing | GATGCTACTTATACTGGCGG | |
| PCR/Sequencing | TCAATCGGTTTCGTCTGG | |
| PCR/Sequencing | GAGAAAGCTATTGCTTTTGGT | |
| PCR/Sequencing | TCAATAGATGATGGCTGCAC | |
| PCR/Sequencing | AATACGAAAGCCACGACG | |
| PCR/Sequencing | ATATCTGGAATGCACTACCTG | |
| PCR/Sequencing | TGGATTCGAATGAGGGACT | |
| PCR/Sequencing | CGAGTTCACACCGATTATCC | |
| PCR/Sequencing | TCAAATGCTCCTCCTGGC | |
| Sequencing | CTCTACAGGCGGCAAAA | |
| Sequencing | GAATAATTAGAGTCGCCTCCG | |
| Sequencing | TGG CAG AAA TAA ACG CCA G | |
| PCR/Sequencing | CCAACATCAATTTCAGGAGC | |
| PCR/Sequencing | GAGCTGGAACTCGATCCG | |
| PCR/Sequencing | AGGAAGGGGAGTAATGATGG | |
| PCR/Sequencing | ATGCAATACAAAAAGACTCTAGC | |
| PCR/Sequencing | TGGATCATTTGGTGCCTTAC | |
| PCR/Sequencing | CTTCTTCCTCTGTCGCTAAG | |
| PCR/Sequencing | CAATTTGAGAAGCAGCGC | |
| PCR/Sequencing | AGCCTCTGTCCCTACTTTATC | |
| PCR/Sequencing | GTCGCTGTTGGTGTGTAA | |
| PCR/Sequencing | CCCAATCCCTTTCTCTGCT | |
| PCR/Sequencing | TATCCTCTTCTCATCGCTCG | |
| PCR/Sequencing | GATGTTATGGTCTAGCTTTGC | |
| PCR/Sequencing | CTGCCTATCTTAGATTGCTAGA |
Note that except for G00825 and PIRI, the rest of the loci had the same PCR program (94 °C 5 mins, 35 cycles of (94 °C 30 s, 60 °C 30 s, 72 °C 40 s), 72 °C 8 mins). PCR programs for G00825 was 94 °C 5 mins, 35 cycles of (94 °C 30 s, 58 °C 30 s, 72 °C 2 mins), 72 °C 8 mins and for PIRI was 94 °C 5 mins, 35 cycles of (94 °C 30 s, 58 °C 30 s, 72 °C 40 s), 72 °C 8 mins.
Fig. 1The workflow of finding hyperpolymorphic loci (HPL) in this study.
Fig. 2Circos plots representing inversions (red) and translocations (blue) between the genomes of the C. glabrata reference strain (dark yellow on the right of each circle) and each individual strain (light yellow on the left of each circle). Chromosomes are ordered as mirror images of each other, starting with chromosome A at the top and ending with chromosome M at the bottom of the image.
Polymorphic loci of interest identified through comparative whole-genome sequencing.
| Polymorphic loci | Standard names | Systematic names |
|---|---|---|
| CHS5/ CAGL0G00814g | YLR330W | |
| G00715/ CAGL0G00858g | YLR332W | |
| G00825/ CAGL0G00968g | YLR337C | |
| SRP40/ CAGL0G02409g | YKR092C | |
| G03839/ CAGL0G03993g | Not available | |
| G06281/ CAGL0G06446g | Not available | |
| G07117/ CAGL0G07293g | YML029W | |
| SSR1/ CAGL0H06413g | YLR390W-A | |
| H09053/ CAGL0H09152g | YKL198C | |
| PIR2/ CAGL0I06182g | YJL159W | |
| I07645/ CAGL0I07821g | YOL087C | |
| K01793/ CAGL0K01947g | YER137C | |
| K06501/ CAGL0K06655g | YBR212W | |
| K09845/ CAGL0K10010g | YDR083W | |
| PIR3/ CAGL0M08492g | YKL164C | |
| M09053/ CAGL0M09086g | YJR092W | |
| M09075/ CAGL0M09108g | YJR091C | |
| GVI51_A02255/ CAGL0A02486g | YDR351W | |
| A00825/ CAGL0A01001g | YLR326W | |
| A02431/ CAGL0A02651g | YDR359C | |
| C01617/ CAGL0C01859g | YCL001W | |
| ARB1/ CAGL0C02343g | YER036C | |
| C02321/ CAGL0C02541g | YLR399C | |
| POP2/ CAGL0C03399g | YNR052C | |
| ABP1/ CAGL0C03597g | YCR088W | |
| D02871/ CAGL0D02926g | YLR015W | |
| D05049/ CAGL0D05082g | YLL039C | |
| E00341/ CAGL0E00561g | YCR084C | |
| SWI5/ CAGL0E01331g | YDR146C | |
| STP8/ CAGL0F01837g | YLR055C | |
| SLG1 | YOR008C | |
| HKR1/ CGAL0F03003g | YDR420W | |
| F03333/ CAGL0F03641g | YML018C |
These were the 13 polymorphic loci that were selected for MLST analysis in this study.
Fig. 3The gene ontology enrichment to select the polymorphic loci (PL) of interest among a pool of candidate loci. Eleven loci involved in cellular integrity and stress response pathways and two loci lacking orthologs in Saccharomyces cerevisiae were selected for further analysis.
Fig. 4A. Tree resulting from the concatenation of six traditional MLST genes. Colours represent the ST group each strain belongs to. Bootstraps below 90 % are shown on the tree. B. Tree resulting from the concatenation of 13 hypervariable regions.
Statistics for trees based on individual genes. MLST refers to the conventional MLST approach employing six loci to type C. glabrata and PLOI refers to the approach proposed in this study. The resolution of each approach and locus is evaluated by three factors, namely precision, recall, and F1. The higher the overall score, the more resolutive is the approach/locus.
| Loci | Approach | Precision | Recall | F1 |
|---|---|---|---|---|
| MLST | 0.96 | 0.87 | 0.88 | |
| MLST | 0.87 | 0.75 | 0.74 | |
| MLST | 0.93 | 0.93 | 0.93 | |
| MLST | 0.96 | 0.81 | 0.83 | |
| MLST | 0.79 | 0.85 | 0.77 | |
| MLST | 0.92 | 0.78 | 0.79 | |
| PLOI | 0.95 | 0.83 | 0.85 | |
| PLOI | 0.88 | 0.87 | 0.84 | |
| PLOI | 0.83 | 0.91 | 0.86 | |
| PLOI | 0.99 | 1.0 | 0.99 | |
| PLOI | 0.92 | 0.82 | 0.82 | |
| PLOI | 0.92 | 0.81 | 0.82 | |
| PLOI | 0.92 | 0.81 | 0.83 | |
| PLOI | 0.97 | 0.90 | 0.91 | |
| PLOI | 0.90 | 0.86 | 0.85 | |
| PLOI | 0.94 | 0.88 | 0.90 | |
| PLOI | 0.99 | 1.0 | 0.99 | |
| PLOI | 0.96 | 0.95 | 0.95 | |
| PLOI | 0.90 | 0.93 | 0.90 |
PLOI: Polymorphic loci of interest.
Fig. 5Single gene trees belonging to A. NMT1, B. G00825 and C. SLG1. Clades are colored according to the ST group. In NMT1 ST10 and ST15 are colored in the same color as they are identical in all strains.
Summary statistics for each group of trees
| Statistics | Concatenated tree of MLST | Concatenated tree of PLOIs | 1 gene | 2 genes | 3 genes | 4 genes | 5 genes | 6 genes |
|---|---|---|---|---|---|---|---|---|
| - | - | 0.93 | 0.97 | 0.98 | 0.98 | 0.98 | 0.98 | |
| - | - | 0.89 | 0.94 | 0.97 | 0.98 | 0.99 | 0.995 | |
| - | - | 0.89 | 0.95 | 0.97 | 0.98 | 0.98 | 0.99 | |
| 1.0 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |
PLOI: Polymorphic loci of interest.
Best combinations of two polymorphic loci.
| Polymorphic loci combination | Alignment length | Precision |
|---|---|---|
| 1 549 | 0.99 | |
| 2 120 | 0.99 | |
| 1 573 | 0.99 | |
| 2 280 | 0.99 | |
| 2 229 | 0.99 | |
| 2 874 | 0.99 | |
| 3 165 | 0.99 | |
| 2 519 | 0.99 | |
| 1 036 | 0.99 | |
| 1 252 | 0.99 | |
| 1 543 | 0.99 | |
| 2 035 | 0.99 | |
| 2 327 | 0.99 | |
| 1 681 | 0.99 | |
| 1 558 | 0.99 | |
| 2 326 | 0.99 | |
| 1 680 | 0.99 | |
| 1 557 | 0.99 | |
| 1 972 | 0.99 | |
| 1 849 | 0.99 | |
| 1 203 | 0.99 |
Fig. 6Trees based on the concatenation of two genes. A.G00825 + TRP1. B.SLG1 + TRP1.