| Literature DB >> 22951933 |
Diego A Martinez1, Brian G Oliver, Yvonne Gräser, Jonathan M Goldberg, Wenjun Li, Nilce M Martinez-Rossi, Michel Monod, Ekaterina Shelest, Richard C Barton, Elizabeth Birch, Axel A Brakhage, Zehua Chen, Sarah J Gurr, David Heiman, Joseph Heitman, Idit Kosti, Antonio Rossi, Sakina Saif, Marketa Samalova, Charles W Saunders, Terrance Shea, Richard C Summerbell, Jun Xu, Sarah Young, Qiandong Zeng, Bruce W Birren, Christina A Cuomo, Theodore C White.
Abstract
The major cause of athlete's foot is Trichophyton rubrum, a dermatophyte or fungal pathogen of human skin. To facilitate molecular analyses of the dermatophytes, we sequenced T. rubrum and four related species, Trichophyton tonsurans, Trichophyton equinum, Microsporum canis, and Microsporum gypseum. These species differ in host range, mating, and disease progression. The dermatophyte genomes are highly colinear yet contain gene family expansions not found in other human-associated fungi. Dermatophyte genomes are enriched for gene families containing the LysM domain, which binds chitin and potentially related carbohydrates. These LysM domains differ in sequence from those in other species in regions of the peptide that could affect substrate binding. The dermatophytes also encode novel sets of fungus-specific kinases with unknown specificity, including nonfunctional pseudokinases, which may inhibit phosphorylation by competing for kinase sites within substrates, acting as allosteric effectors, or acting as scaffolds for signaling. The dermatophytes are also enriched for a large number of enzymes that synthesize secondary metabolites, including dermatophyte-specific genes that could synthesize novel compounds. Finally, dermatophytes are enriched in several classes of proteases that are necessary for fungal growth and nutrient acquisition on keratinized tissues. Despite differences in mating ability, genes involved in mating and meiosis are conserved across species, suggesting the possibility of cryptic mating in species where it has not been previously detected. These genome analyses identify gene families that are important to our understanding of how dermatophytes cause chronic infections, how they interact with epithelial cells, and how they respond to the host immune response.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22951933 PMCID: PMC3445971 DOI: 10.1128/mBio.00259-12
Source DB: PubMed Journal: MBio Impact factor: 7.867
Dermatophyte genome statistics
| Characteristic | Result for: | ||||||
|---|---|---|---|---|---|---|---|
| CBS no. | 118892 | 112818 | 127.97 | 113480 | 118893 | 44 | 112371 |
| Isolation location | Germany | Quebec | Finland | Germany | Germany | Germany | Switzerland |
| Site of infection | Nail | Cheek | Nail | Head | Skin | Genitals | Face |
| Yr of isolation | 2004 | 2003 | 1996 | 2004 | 2004 | 2002 | |
| Mating type | |||||||
| Gene ID prefix | TERG | TESG | TEQG | MCYG | MGYG | TRV | ARB |
| Assembly size (Mb) | 22.5 | 23.0 | 24.1 | 23.1 | 23.2 | 22.5 | 22.2 |
| Repeat (%) | 1.73 | 3.70 | 7.20 | 2.43 | 1.54 | 2.13 | 1.34 |
| GC (%) | 48.31 | 48.15 | 47.39 | 47.5 | 48.5 | 48.24 | 48.75 |
| No. of predicted protein-coding genes | 8,707 | 8,523 | 8,679 | 8,915 | 8,907 | 8,024 | 7,980 |
| Mean coding sequence length (nt) | 1,393 | 1,409 | 1,371 | 1,459 | 1,436 | 1,468 | 1,483 |
| Mean intron length (nt) | 76 | 89 | 96 | 80 | 85 | 84 | 83 |
| Mean exon no. per gene | 3.1 | 3.2 | 3.2 | 3.21 | 3.15 | 2.6 | 2.6 |
| Mean intergenic length (nt) | 995 | 1,078 | 1,174 | 970 | 990 | 1,136 | 1,160 |
| No. of tRNAs | 82 | 82 | 85 | 82 | 83 | 81 | 85 |
| No. of transmembrane proteins | 1,601 | 1,532 | 1,531 | 1,605 | 1,636 | 1,640 | 1,679 |
| No. of secreted proteins | 545 | 526 | 523 | 546 | 589 | 428 | 469 |
| No. of GPI-anchored proteins | 40 | 35 | 42 | 40 | 43 | 26 | 30 |
T. verrucosum strain is designated 44 but is not currently deposited at CBS.
FIG 1 (A) Phylogenetic relationship and gene conservation of the dermatophytes and other species compared in this study; (B) section of the phylogeny in Fig. 1A comprising the dermatophytes; (C) shared and unique ortholog groups (see Materials and Methods) for the seven dermatophyte genomes grouped by host or ecological niche preference. Ortholog groups in this figure include paralogous duplications; therefore, total gene counts including paralogs are higher. The number of genes unique to each species (not contained in any other species in panel A) is shown in parentheses after the species name.
FIG 2 IPR domains most enriched in dermatophytes compared to all nondermatophytes in the study. Values are colored along a blue (low) to red (high) color scale, with color scaling relative to the low and high values of each row. IPR categories are sorted by P value within each supercategory. IPR domain enrichments shown have P values of at least 3e−8.
FIG 3 Phylogenetic tree of all fungal proteins containing LysM domains. Phylogeny was estimated using RAxML (see Materials and Methods). Two major clades were identified and labeled clade A (black gene names) and clade B (green gene names). Clade B may have arisen differently, as the majority of proteins in clade B do not have chitinase domains. Clade C contains LysM domains that have evidence of recent intergene duplication of the LysM domain, accounting for the high number of LysM domains in this group. Genes in clade D contain polysaccharide deacetylase domains in combination with the LysM domain. This combination is found only in dermatophytes. Genes in clade E contain the WA/GW signature in the LysM domain, typically the domain closest to the chitinase domain. Clade E.1 (labeled in the phylogeny tree) is a recently expanded group of genes that do not contain the WA/GW phylogenetically embedded within the WA/GW signature-containing group.
Common and unique nonribosomal peptide synthase (NRPS) and polyketide synthase (PKS) gene clusters in dermatophytes
| Type of cluster | Total no. | No. of clusters common | No. of clusters unique to less than three dermatophytes | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Nonreducing PKS | 8 | 3 | 0 | 1 | 1 | 0 | 0 | 4 | 0 |
| Reducing PKS | 13 | 5 | 0 | 1 | 1 | 1 | 1 | 4 | 6 |
| NRPS | 14 | 12 | 0 | 1 | 1 | 1 | 0 | 0 | 0 |
| Reducing PKS/NRPS hybrids | 8 | 3 | 0 | 0 | 0 | 0 | 0 | 3 | 2 |
| Species totals, including common | 43 | 23 | 23 | 22 | 23 | 25 | 24 | 32 | 27 |
In these clusters, some of the genes in a given species may be missing because of gaps in the sequence.
Total numbers within a species include those clusters in common as well as those that are unique. See Table S3 in the supplemental material for all details. For a comparison, C. immitis has 16 total clusters (5).
FIG 4 Eukaryotic protein kinase superfamily members (kinomes) of the dermatophytes compared with the kinomes of other fungi. Kinases from established families are shown on the left of the black line, while novel or recently discovered kinases are shown on the right. Kinases are classified into major groups shown as colored blocks (76). Abbreviations: Aben, A. benhamie; Tver, T. verrucosum; Trub, T. rubrum; Tton, T. tonsurans; Tequ, T. equinum; Mgyp, M. gypseum; Mcan, M. canis; Cimi, C. immitis; Anid, A. nidulans; Afum, A. fumigatus; Scer, S. cerevisiae; AGC, protein kinases A, G, and C; CAMK, calcium/calmodulin-dependent kinases; CK1, casein kinase 1; CMGC, cyclin-dependent kinases (CDK), mitogen-activated, glycogen-synthase, and CDK-like kinases; STE, sterile phenotype kinases; FunK1, fungal-specific kinase 1; SRPKL, serine/arginine-rich protein-specific kinase-like; other, kinases not falling into major groups.