| Literature DB >> 23341551 |
Anastasia Gioti1, Björn Nystedt, Wenjun Li, Jun Xu, Anna Andersson, Anna F Averette, Karin Münch, Xuying Wang, Catharine Kappauf, Joanne M Kingsbury, Bart Kraak, Louise A Walker, Henrik J Johansson, Tina Holm, Janne Lehtiö, Jason E Stajich, Piotr Mieczkowski, Regine Kahmann, John C Kennell, Maria E Cardenas, Joakim Lundeberg, Charles W Saunders, Teun Boekhout, Thomas L Dawson, Carol A Munro, Piet W J de Groot, Geraldine Butler, Joseph Heitman, Annika Scheynius.
Abstract
UNLABELLED: Malassezia commensal yeasts are associated with a number of skin disorders, such as atopic eczema/dermatitis and dandruff, and they also can cause systemic infections. Here we describe the 7.67-Mbp genome of Malassezia sympodialis, a species associated with atopic eczema, and contrast its genome repertoire with that of Malassezia globosa, associated with dandruff, as well as those of other closely related fungi. Ninety percent of the predicted M. sympodialis protein coding genes were experimentally verified by mass spectrometry at the protein level. We identified a relatively limited number of genes related to lipid biosynthesis, and both species lack the fatty acid synthase gene, in line with the known requirement of these yeasts to assimilate lipids from the host. Malassezia species do not appear to have many cell wall-localized glycosylphosphatidylinositol (GPI) proteins and lack other cell wall proteins previously identified in other fungi. This is surprising given that in other fungi these proteins have been shown to mediate interactions (e.g., adhesion and biofilm formation) with the host. The genome revealed a complex evolutionary history for an allergen of unknown function, Mala s 7, shown to be encoded by a member of an amplified gene family of secreted proteins. Based on genetic and biochemical studies with the basidiomycete human fungal pathogen Cryptococcus neoformans, we characterized the allergen Mala s 6 as the cytoplasmic cyclophilin A. We further present evidence that M. sympodialis may have the capacity to undergo sexual reproduction and present a model for a pseudobipolar mating system that allows limited recombination between two linked MAT loci. IMPORTANCE: Malassezia commensal yeasts are associated with a number of skin disorders. The previously published genome of M. globosa provided some of the first insights into Malassezia biology and its involvement in dandruff. Here, we present the genome of M. sympodialis, frequently isolated from patients with atopic eczema and healthy individuals. We combined comparative genomics with sequencing and functional characterization of specific genes in a population of clinical isolates and in closely related model systems. Our analyses provide insights into the evolution of allergens related to atopic eczema and the evolutionary trajectory of the machinery for sexual reproduction and meiosis. We hypothesize that M. sympodialis may undergo sexual reproduction, which has important implications for the understanding of the life cycle and virulence potential of this medically important yeast. Our findings provide a foundation for the development of genetic and genomic tools to elucidate host-microbe interactions that occur on the skin and to identify potential therapeutic targets.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23341551 PMCID: PMC3560662 DOI: 10.1128/mBio.00572-12
Source DB: PubMed Journal: MBio Impact factor: 7.867
Data and assembly and annotation statistics for the M. sympodialis genome
| Data | 454 Titanium (SE)[ | Illumina HiSeq (MP)[ | Contigs | Scaffolds | |
|---|---|---|---|---|---|
| Nuclear | Mitochondrial | ||||
| No. | 1,278,053 (reads) | 32 × 106 (reads) | 156 | 65 | 1 |
| Read length (bp) | 433 (avg) | 50 | |||
| Read coverage (fold) | 60 | 200 | |||
| Total assembly | 7,682,651 | 7,669,689 | 38,622 | ||
| 186,342 | 513,493 | ||||
| GC content (%) | 59 | 32 | |||
| No. of protein- | 3,517 | 19 | |||
| No. of rRNA genes | NE | 2 | |||
| No. of tRNA genes | NE | 25 | |||
SE, single-end reads.
MP, 3-kb mate-pair reads.
Contigs come from assembly of the 454 data.
Scaffolds come from assembly of the 454 and Illumina data. NE, not estimated.
N50, weighted median statistic such that 50% of the entire assembly is contained in contigs or scaffolds equal to or larger than this value.
FIG 1 Nuclear genome and proteomics analyses of M. sympodialis ATCC 42132. (A) BLASTN alignment between assembly scaffolds from M. globosa and contigs of M. sympodialis (454 data assembly) indicates a globally conserved synteny. Red and blue bands indicate syntenic and inverted syntenic regions. Note that the contigs of M. sympodialis were ordered according to the M. globosa scaffold configuration, but their true order in both species is unknown. The alignment was visualized with the ACT tool (100). Mass spectrometry based proteomics (B to D). (B) Boxplot of the number of unique peptides per protein and the number of peptide spectrum matches (PSMs) per protein. (C) Boxplot presenting protein sequence coverage of identified peptides per protein. (D) Venn diagram showing overlap (30,559; 86%) of unique peptides identified by mass spectrometry both from predicted protein coding genes and peptides generated by searching the 6-reading-frame (6RF) translation of M. sympodialis.
FIG 2 Physical map of the mitochondrial genome of M. sympodialis ATCC 42132. The 38,622-bp mtDNA maps as a circular molecule and is displayed in a linear form beginning with the RNL gene. Black bars represent genes or exons of highly conserved protein-encoding regions, with the orientation indicated by the pointed end. Other bars represent rRNAs (blue), tRNAs (green), introns (gray), and a large inverted repeat (purple). The RNL, COB, and COX1 genes are interrupted by group I introns (GI), with three of these introns containing open reading frames belonging to the LAGLI-DADG family of homing endonuclease genes (GI + HEG). The intron-located HEG of the second intron of COB is immediately adjacent to and in-frame with the upstream exon, while the HEGs in the first and third introns of COX1 are free standing.
Main lipid-hydrolyzing enzymes in M. globosa and M. sympodialis
| Gene family[ | No. of enzymes in: | |
|---|---|---|
| 6 | 4 | |
| 6 | 5 | |
| Acid sphingomyelinase | 4 | 4 |
| Phospholipase C | 6 | 4 |
From reference 6.
Cell wall genes
| Gene class[ | No. of genes in | ||||
|---|---|---|---|---|---|
| Msym | Mglo | Umay | Cneo | Scer | |
| Chitin synthase | 6 | 7 | 8 | 9 | 3 |
| Chitin deacetylase | 6[ | 4 | 6 | 4 | 2 |
| Chitinase (class IV) | 1 | 1 | 2 | 4 | 2 |
| Catalytic subunit of β-1,3-glucan synthase ( | 1 | 1 | 1 | 1 | 3 |
| Exo-β-1,3-glucanase ( | 8 | 6 | 8 | 8 | 3 |
| Transglycosylase (GH16, | 1 | 1 | 2 | 2 | 3 |
| Transglucosylase (GH72, | 0 | 0 | 1 | 1 | 5 |
| Mixed-linked glucanase ( | 2 | 2 | 4 | 5 | 0 |
| Putative β-1,6-glucan transglycosylase ( | 4[ | 4 | 8 | 6 | 2 |
| ER chaperone involved in protein N- and O-glycosylation ( | 1 | 2 | 4 | 1 | 1 |
| Predicted GPI proteins | 10 | 20 | 55e | 63 | 59 |
Homologous gene or gene families in S. cerevisiae (except for MLG1, which is from Cochliobolus carbonum).
Msym, M. sympodialis; Mglo, M. globosa; Umay, U. maydis; Cneo, C. neoformans; Scer, S. cerevisiae.
One of the gene models seems incorrect and comprises two paralogous genes.
Second paralog identified by tBLASTn but no gene model available.
Data from reference 101.
Data from reference 102.
Data from reference 30.
FIG 3 M sympodialis cell wall architecture revealed by HPF-TEM (high-pressure freezing--transmission electron microscopy). M. sympodialis was grown on mDixon agar at 32°C for 4 days. (A) Transmission electron micrograph of a budding M. sympodialis yeast cell. (B) Ultrastructure of the M. sympodialis cell wall. Bars: 0.5 µm (magnification, ×25,000) (A) and 100 nm (magnification, ×130,000) (B).
Allergens encoded in the M. sympodialis ATCC 42132 genome and putative orthologs in M. globosa
| Allergen | Accession no. | Predicted function | Prediction for | % identity | |||||
|---|---|---|---|---|---|---|---|---|---|
| Amino | Nucleotide | ||||||||
| Mala s 1 | Unknown; | Secreted | MSY001_0607 | MGL_1303 | 69 | 66 | 0.23 | 4.09 | |
| Mala f 2 | Peroxisomal protein | No | MSY001_2163 | MGL_4042 | 55 | 61 | 0.39 | 3.45 | |
| Mala f 3 | Peroxisomal protein | No | MSY001_2163 | MGL_4042 | 55 | 61 | 0.39 | 3.45 | |
| Mala f 4 | Malate | No | MSY001_0149 | MGL_2703 | 84 | 77 | 0.10 | 2.74 | |
| Mala s 5 | Peroxisomal protein | No | MSY001_2163 | MGL_4042 | 55 | 61 | 0.39 | 3.45 | |
| Mala s 6 | Cytoplasmic | No | MSY001_1373 | MGL_3612 | 93 | 85 | 0.04 | 1.61 | |
| Mala s 7 | Unknown | Secreted | MSY001_3348 | MGL_0968 | NA | NA | NA | NA | |
| Mala s 8 | Unknown | Secreted | MSY001_0606 | MGL_1304 | 71 | 68 | 0.22 | 4.52 | |
| Mala s 9 | Unknown | No | MSY001_1912 | MGL_2179 | 77 | 74 | 0.19 | 1.95 | |
| Mala s 10 | Heat shock | No | MSY001_0570 | MGL_0201 | 89 | 78 | 0.07 | 4.60 | |
| Mala s 11 | AJ548421 ( | Manganese | No | MSY001_2804 | MGL_3190 | 71 | 73 | 0.18 | 3.23 |
| Mala s 12 | AJ871960 ( | GMC | Secreted | MSY001_2108 | MGL_0750 | 64 | 65 | 0.29 | 2.52 |
| Mala s 13 | AJ937746 ( | Thioredoxin | No | MSY001_0904 | MGL_1781 | 85 | 81 | 0.10 | 1.76 |
Isolated from ATCC 42132 except for Mala f 2, 3, and 4, which come from isolate 2782 (Teikyo Institute for Medical Mycology, Tokyo, Japan).
Evidence for no secretion is absence of signal peptides, transmembrane domains, and GPI-anchoring peptides.
Single-copy orthologs between the two species were identified with a bidirectional best-hit BLASTP approach (E value = 1E – 50).
The gene shows weak similarity to the gene encoding Mala s 7, and due to gene family amplification [see the text], it cannot be safely assigned as its ortholog; therefore, dN/dS analysis between the two copies is not applicable (NA).
FIG 4 Functional characterization of Mala s 6. (A) Western blot detection of Mala s 6 antigen using a polyclonal antiserum against C. neoformans cyclophilin A. Total protein extracts from C. neoformans strains, including wild-type H99 and cpa1 and cpa1 cpa2 mutants, were separated in parallel with a protein extract from M. sympodialis and recombinant Mala s 6 (rMala s 6). (B) Mala s 6 catalyzes cis-trans peptidyl-prolyl isomerization, as shown by a chymotrypsin-coupled assay. x axis, time (in minutes); y axis, net absorbance measured in the spectrophotometer. Curve A, rMala s 6; curve B, C. albicans cyclophilin A (Cyp1); curve C, rMala s 6 + 1 µM cyclosporine A; curve D, C. albicans cyclophilin A (Cyp1) +1 µM cyclosporine A; curve E, control reaction mixture without enzyme.
FIG 5 Organization of the MAT locus in Malassezia. (A) Comparison of the MAT locus of M. globosa and M. sympodialis. As seen in the lower comparison, the MAT locus of M. globosa CBS 7966 (accession no. AYY01000003.1) maps to five scaffolds of the M. sympodialis isolate ATCC 42132. Scaffolds 12 and 4 correspond to the A (pheromone/receptor [P/R]) and B (homeodomain [HD]) loci, comprising the pheromone and the pheromone receptor gene and the transcription factor genes bW and bE, respectively. The A and B loci are ~167.4 kb apart in M. globosa and ~141 kb apart in M. sympodialis, with scaffolds 22, 34, and 38 linked to scaffolds 4 and 12, based on analysis of Illumina reads and PCR and sequence analysis spanning each gap (see the supplemental methods). Alignments between the two species were done with tBLASTx and visualized with ACT (100). (B) Dot plot comparison of the two alleles of the MAT locus between M. sympodialis isolates ATCC 42132 (sequenced isolate) and M. sympodialis ATCC 44340 (sequence determined by PCR and sequencing; see Materials and Methods). Sequences were aligned using Dnadot (http://www.vivo.colostate.edu/molkit/dnadot/index.html) with a window size of 15. The pheromone and pheromone receptor genes in the A locus (right) have different sequences and orientations in M. sympodialis ATCC 44340 (accession no. JX964849) and ATCC 42132 (accession no. JX964848), and the flanking regions are highly conserved. The bW and bE genes in the B (HD) locus (left) share high similarity between these two isolates (ATCC 44340, accession no. JX964801; ATCC 42132, accession no. JX964802), and the flanking regions are highly conserved.
Genes involved in mating signaling and in basidiomycetes
| Process | Gene | Presence | ||||
|---|---|---|---|---|---|---|
| Msym | Mglo | Umay | Sreil | Cneo | ||
| Mating signaling | 1 | 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 0 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 0 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| ?[ | ?[ | 1 | 1 | 1 | ||
| Recombination and crossing over | 0 | 0 | 0 | 0 | 1 | |
| 0 | 0 | 0 | 0 | 1 | ||
| 0 | 0 | 0 | 0 | 1 | ||
| 0 | 0 | 0 | 0 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| Crossover resolution | 0 | 0 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| Cohesin complex | 1 | 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 1 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
| Mismatch repair | 1 | 1 | 1 | 1 | 1 | |
| 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 1 | 1 | 1 | ||
| 1 | 1 | 1 | 1 | 1 | ||
Core meiosis genes are in bold.
1, presence; 0, absence. Msym, M. sympodialis; Mglo, M. globosa; Umay, U. maydis; Sreil, S. reilianum; Cneo, C. neoformans.
The gene is present in the genome based on tBLASTn, but there is no model available.
STE18 homologs were not found, but this might be due to the fact that they are small and poorly conserved.
The RAD50 homolog in M. globosa is incorrectly split into two genes, MGL_0431 and MGL_0432.
The PDS5 homolog is split into MGL_3630 and MGL_3631.
The M. sympodialis gene model (MSY001_1319) might be an incorrect fusion of two genes, corresponding to M. globosa MGL_0016 and MGL_0017.