| Literature DB >> 30463870 |
Douda Bensasson1,2, Jo Dicks3, John M Ludwig2, Christopher J Bond3, Adam Elliston3, Ian N Roberts3, Stephen A James3.
Abstract
The human pathogen Candida albicans is considered an obligate commensal of animals, yet it is occasionally isolated from trees, shrubs, and grass. We generated genome sequence data for three strains of C. albicans that we isolated from oak trees in an ancient wood pasture, and compared these to the genomes of over 200 clinical strains. C. albicans strains from oak are similar to clinical C. albicans in that they are predominantly diploid and can become homozygous at the mating locus through whole-chromosome loss of heterozygosity. Oak strains differed from clinical strains in showing slightly higher levels of heterozygosity genome-wide. Using phylogenomic analyses and in silico chromosome painting, we show that each oak strain is more closely related to strains from humans and other animals than to strains from other oaks. The high genetic diversity of C. albicans from old oaks shows that they can live in this environment for extended periods of time.Entities:
Keywords: Population genomics; asexual; clonality; environmental reservoir; loss of heterozygosity (LOH); molecular evolution; yeast ecology
Mesh:
Year: 2018 PMID: 30463870 PMCID: PMC6325710 DOI: 10.1534/genetics.118.301482
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Three C. albicans isolates from English and sessile oaks in the new forest in the United Kingdom
| Strain | Alternate name | Latitude and longitude | Trunk girth (m) | Other yeast species from the same tree |
|---|---|---|---|---|
| NCYC 4144 | FRI10b.1 | 50.92785 −1.657083 | 4.12 | |
| NCYC 4145 | FR11a.1 | 50.928483 −1.655183 | 3.88 | |
| NCYC 4146 | FRI5d.SM.1 | 50.928067 −1.656 | 2.83 | None |
| FRI and OCK sites | 27 oaks, New Forest | 0.65–3.79 |
Information from Robinson . NCYC 4144 and NCYC 4145 were isolated from sessile oaks (Quercus petraea) and NCYC 4146 was isolated from English oak (Quercus robur).
Assuming average United Kingdom woodland boundary conditions for sessile and English oaks, these trunk girths suggest tree ages of ∼220 years old (FRI10), 200 years old (FRI11), and 130 years old (FRI5) according to the guidelines at http://www.wdvta.org.uk/pdf/Estimating-the-age-of-trees.pdf.
Twenty-five trees had uncoppiced trunk girth estimates. These were mostly smaller than those with C. albicans (Wilcoxon test, ).
C. albicans from oak are more heterozygous than most clinical strains from Ropars
| Strain or clade | MTL | Mean heterozygosity (maximum–minimum) | Filtered length (Mbp) | Mean filtered heterozygosity |
|---|---|---|---|---|
| NCYC 4146, clade 4 | 0.0062 | 11.1 | 0.0066 | |
| NCYC 4144, clade18 | 0.0061 | 9.7 | 0.0068 | |
| NCYC 4145, unknown | 0.0077 | 11.4 | 0.0078 | |
| 10.7 | ||||
| 176 | 9.9 | |||
| 3 | ||||
| 1 | ||||
| 27 clade 4 strains | 0.0053 (0.0045–0.0060) | 10.2 | 0.0062 (0.0060–0064) | |
| 4 clade 18 strains | 0.0059 (0.0053–0.0065) | 10.7 | 0.0062 (0.0060–0066) | |
| 10 unknown strains | 0.0057 (0.0048–0.0070) | 10.5 | 0.0063 (0.0054–0071) |
Rows in bold show data summarized for all 3 oak strains and 180 clinical strains from Ropars .
Heterozygosity was estimated genome-wide (from ∼14 million high-quality sites) for each strain as the proportion of high-quality sites where 20–80% of reads differed from the reference sequence. Where multiple strains are considered, we show the mean, maximum, and minimum levels of genome-wide heterozygosity.
The length of genome sequence after excluding loss of heterozygosity (LOH) regions, known repeats, putatively repetitive regions, and centromeres.
The proportion of heterozygous sites in filtered regions.
Rows in bold show data summarized for all 3 oak strains and 180 clinical strains from Ropars et al (2018).
Clinical strains are less heterozygous than oak strains (Wilcoxon test, ).
Wilcoxon test,
Figure 1 C. albicans from oak are mostly diploid with some loss of heterozygosity (LOH). A. The proportion of base calls differing from the reference strain (allele ratios) are mostly 1.0 or 0.5 for oak strains (NCYC 4144-6), suggesting diploidy. Regions that recently homozygozed are shaded light blue. The points that occur in these LOH regions often correspond to the locations of known repeats where short reads are probably mismapped. The oak strain with at its mating locus (NCYC 4144) arrived at this state by LOH for the whole of chromosome 5. B. The proportion of heterozygous sites in 100 kb nonoverlapping windows are either high or low for oak strains (NCYC 4144, NCYC 4145, and NCYC 4146). For oak strains (shown here) and clinical strains (Figure S2) we see two modes; heterozygosity is either low (below the red line at 0.1%), or high (with a mean > 0.4%). Regions with >0.1% heterozygous sites in a 100 kb window were classed as LOH regions and are shown in light blue in A.
Figure 2 C. albicans from oak are more similar to clinical strains than to each other. Phylogenetic and pairwise sequence comparisons show that the oak strain NCYC 4146 is similar to clade 4 clinical strains, NCYC 4144 is similar to clade 18 clinical strains, and NCYC 4145 is diverged from the 17 clades sampled and 10 further strains of unknown clade. A. Genome-wide phylogeny. Maximum likelihood phylogenetic analysis of a whole-genome alignment (14,282,655 bp) shows that oak strains (green) are more closely related to clinical strains than they are too each other. Bootstrap support is estimated from 100 replicates and only values >70% are shown. The alignment contained 313,250 variable sites with at least one homozygous difference between strains. B. Painted chromosomes of oak strains. Most parts of the genomes of oak strains are more similar to clinical strains than to other oak strains (green). The genome of each oak strain was colored according to the clade assignment of the most similar strain for each 100 kb window in the genome using the colours for each clade shown in A. Regions are colored white if a strain sequence is diverged from all the other oak or clinical strains that we sampled (the proportion of sites differing is >0.028%). From left to right, NCYC 4146 in light blue is most similar to Clade 4 strains, NCYC 4144 in blue is most similar to Clade 18, and NCYC 4145 is mostly white and therefore diverged from all known clades.