| Literature DB >> 24747362 |
Leszek P Pryszcz1, Tibor Németh, Attila Gácser, Toni Gabaldón.
Abstract
The Candida parapsilosis species complex comprises a group of emerging human pathogens of varying virulence. This complex was recently subdivided into three different species: C. parapsilosis sensu stricto, C. metapsilosis, and C. orthopsilosis. Within the latter, at least two clearly distinct subspecies seem to be present among clinical isolates (Type 1 and Type 2). To gain insight into the genomic differences between these subspecies, we undertook the sequencing of a clinical isolate classified as Type 1 and compared it with the available sequence of a Type 2 clinical strain. Unexpectedly, the analysis of the newly sequenced strain revealed a highly heterozygous genome, which we show to be the consequence of a hybridization event between both identified subspecies. This implicitly suggests that C. orthopsilosis is able to mate, a so-far unanswered question. The resulting hybrid shows a chimeric genome that maintains a similar gene dosage from both parental lineages and displays ongoing loss of heterozygosity. Several of the differences found between the gene content in both strains relate to virulent-related families, with the hybrid strain presenting a higher copy number of genes coding for efflux pumps or secreted lipases. Remarkably, two clinical strains isolated from distant geographical locations (Texas and Singapore) are descendants of the same hybrid line, raising the intriguing possibility of a relationship between the hybridization event and the global spread of a virulent clone.Entities:
Keywords: Candida orthopsilosis; fungi; genome sequencing; hybridization; pathogens
Mesh:
Year: 2014 PMID: 24747362 PMCID: PMC4040990 DOI: 10.1093/gbe/evu082
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Genome Sequencing Stats
| Sample | Origin | No. Reads (Millions) | Depth of Coverage | Scaffolds | Assembly Size (kb) | GC (%) | Ns | Genes | Homo SNPs | Hetero SNPs |
|---|---|---|---|---|---|---|---|---|---|---|
| 90-125 | San Francisco, USA | 35.32 | 235 | 8 | 12,659 | 37.46 | 177,268 | 5,700 | 872 | 1,184 |
| MCO456 | San Antonio, USA | 90.22 | 544 | 116 | 13,272 | 34.42 | 1,453 | 5,756 | 92,497 | 197,531 |
| AY2 | Singapore | — | — | 4,152 | 14,511 | 37.31 | 0 | — | — | — |
Note.—Basic strain and assembly statistics for the genome obtained within this work, the reference strain, and an assembly available in GenBank (AMDC00000000). For each strain the table provides, in this order: Strain name, geographical origin, number of reads obtained, average coverage (× fold), number of scaffolds, total assembly size, GC content, number of bases in gaps, number of predicted genes, and number of homozygous and heterozygous sites. Note that the highly fragmented nature of the AY2 assembly is likely due to the fact that high heterozygosity was not specifically taken care of as we do here.
FCopy number differences between Candida orthopsilosis strains. Two duplications, DUP28 (A) and DUP4 (B), were analyzed in detail (marked with red rectangles). For each duplication, ten panels are given in this order, from top to down: Genomic coordinates; genomic coverage, genomic reads alignments, and split-read mapping for 90-125; genomic coverage, genomic reads alignments, and split-read mapping for MCO456; transcriptomic coverage and transcriptomic reads alignments from MCO456; and predicted loci. Both regions, DUP28 and DUP4, are fully homozygous and present in 12 copies in the genome when compared with two in C. orthopsilosis 90-125 reference genome. All copies of DUP4 are most likely in tandem, whereas copies of DUP28 are spread through at least five chromosomes. A genes encoded within DUP28 (CORT_0H02780, which encodes a putative outer membrane Na/K efflux pump) and DUP4 (CORT_0A11440, which encodes a vacuolar transporter) are expressed at higher levels, when compared with their neighboring genes.
FCandida orthopsilosis MCO456 is a hybrid of two subspecies. Distribution of number of SNPs between 1 kb homozygous regions of MCO456 and 90-125 is bimodal, with peaks around 0.3% and above 3% interstrain divergence (A). The genome of MCO456 can be assigned into four categories: Haplotype A (close in sequence to 90-125, therefore Type 2), haplotype B (Type 1), heterozygous, and unknown (B). The distance between the two haplotypes in heterozygous regions of MCO456 and AY2 is larger than distances between strains but smaller than distance to the closest species; therefore, we hypothesize that MCO456 is hybrid of two C. orthopsilosis subspecies (C).