| Literature DB >> 25015739 |
Jeremy Semeiks1, Dominika Borek, Zbyszek Otwinowski, Nick V Grishin.
Abstract
BACKGROUND: The fungal genus Stachybotrys produces several diverse toxins that affect human health. Its strains comprise two mutually-exclusive toxin chemotypes, one producing satratoxins, which are a subclass of trichothecenes, and the other producing the less-toxic atranones. To determine the genetic basis for chemotype-specific differences in toxin production, the genomes of four Stachybotrys strains were sequenced and assembled de novo. Two of these strains produce atranones and two produce satratoxins.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25015739 PMCID: PMC4117958 DOI: 10.1186/1471-2164-15-590
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The two toxin chemotypes of Both atranones and satratoxins are terpenoid secondary metabolites thought to derive from the primary metabolite farnesyl pyrophosphate (FPP). Box colors indicate each class of molecule and its specific secondary metabolite precursors: blue for atranones, green for simple trichothecenes, and pink for macrocyclic trichothecenes, which include satratoxins. Atranones are diterpenoids thought to originate from cyclization of geranylgeranyl pyrophosphate to form dolabellane, which has an eleven-membered ring [11]. Shown are the structures of all atranones solved by Hinkley et al.[11], as well as types of enzymes capable of catalyzing the two postulated reactions in the pathway. Trichothecenes are sesquiterpenoids that are products of FPP cyclization. The pathway of trichodermol biosynthesis from FPP is known experimentally [12, 13], but there are no experimental data regarding biosynthesis pathways of satratoxins or other trichodermol derivatives. Shown is a conceptual pathway adapted from [14] and references therein. It integrates results from several trichothecene producers. Enzymes shown have been functionally characterized from Fusarium (Tri5) or Trichoderma (Tri4 and Tri11). Trichodiol is shown to represent several intermediates that undergo both enzymatic hydroxylation and spontaneous rearrangement to form trichodermol, which is the first molecule shown that contains the trichothecene skeleton, i.e., the tricyclic ring 12,13-epoxytrichothec-9-ene (EPT). In Fusarium, trichodermol is not observed. Instead, the pathway after trichodiol diverges into a series of products substituted at C-3 of EPT. There are two known trichoverrols (A and B) and two known trichoverrins (A and B), but the respective pairs differ only in the stereochemistry of the C-4 side chain. The satratoxin F/G skeleton is shown as representative of satratoxins, and roridin E as representative of roridins. Omitted for brevity are the verrucarins (double arrow between roridins and satratoxins).
Figure 2Conceptual and ortholog-based maximum likelihood phylogeny of and other fungi. A. The conceptual phylogeny shows the toxin chemotypes of the four sequenced Stachybotrys strains in relation to other trichothecene-producing fungi of order Hypocreales. S. cerevisiae is only distantly related to Hypocreales and is shown for context. Topology adapted from [18]. B. Phylogeny was constructed from alignment of 2,177 proper protein orthologs identified by OrthoMCL. Scale bar shows number of substitutions per site. All branches have 100% support.
Features of genome and proteome assemblies
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| APWP00000000 | AQPQ00000000 | ASEQ00000000 | APIU00000000 | AACM00000000 |
|
| 66.4 | 58.6 | 68.8 | 71.4 | NA |
|
| 1246 | 957 | 826 | 897 | 36 |
|
| 34.2 | 36.5 | 36.1 | 36.2 | 36.2 |
|
| 196 | 162 | 192 | 199 | 10 |
|
| 116 | 130 | 214 | 177 | 5350 |
|
| 0.25 | 0.08 | 0.16 | 0.13 | 0.22 |
|
| 1.62 | 0.93 | 0.93 | 1.01 | 0.66 |
|
| 51.75 | 53.42 | 53.19 | 53.31 | 57.18 |
|
| 10866 | 11719 | 11532 | 11543 | 13332 |
|
| 1357/403 | 1377/411 | 1380/412 | 1379/413 | 1259/375 |
|
| 2.8 | 2.8 | 2.8 | 2.8 | 2.8 |
|
| 293/59 | 296/59 | 297/59 | 296/59 | 255/55 |
|
| 65.87 | 65.84 | 66.29 | 65.94 | 61.43 |
Stachybotrys assemblies include all contigs and scaffolds of at least 1-kbp. N50 is the sequence that includes the middle nucleotide of the assembly when the sequences are ordered by length.
Ortholog-based pairwise proteome identities of and other fungi
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|
|
| 100 | 99.830 | 99.746 | 97.701 | 73.668 | 73.646 | 72.995 | 54.834 | 39.231 |
|
| 100 | 99.742 | 97.707 | 73.663 | 73.644 | 72.998 | 54.836 | 39.231 | |
|
| 100 | 97.673 | 73.663 | 73.649 | 73.000 | 54.836 | 39.237 | ||
|
| 100 | 73.667 | 73.638 | 73.011 | 54.832 | 39.240 | |||
|
| 100 | 97.174 | 89.068 | 55.506 | 39.796 | ||||
|
| 100 | 89.380 | 55.452 | 39.742 | |||||
|
| 100 | 54.934 | 39.373 | ||||||
|
| 100 | 39.740 | |||||||
|
| 100 |
The proteome abbreviations in the table represent four organisms sequenced in this study: 7711 – Stachybotrys chartarum 7711, 40293 – Stachybotrys chartarum 40293, 40288 – Stachybotrys chartarum 40288, and 40285 – Stachybotrys chlorohalonata. Proteomes of other fungi included in the analysis are: Fve, Fusarium verticillioides; Fox, Fusarium oxysporum; Fgr, Fusarium graminearum; Ani, Aspergillus nidulans; Sce, Saccharomyces cerevisiae.
Figure 3Distribution of orthologs of and This Venn diagram shows the number of protein homolog groups, out of a 16,311 total, in each combination of three sets: (1) groups with a homolog in any Stachybotrys genome; (2) groups with a homolog in any Fusarium genome; and (3) groups with a homolog in A. nidulans or S. cerevisiae which for simplicity are pooled as a single outgroup.
Figure 4The core trichothecene clusters of and and satratoxin chemotype-specific clusters SC1, SC2, and SC3 of A. The core trichothecene cluster (CTC). For all genomes an arrow indicates a gene and its transcriptional sense. The core trichothecene clusters of Stachybotrys are shown in the green box, and the adjacent satratoxin cluster SC3 is shown in the pink box. The other genes that are shown outside the boxes lack similarity to known trichothecene synthesis genes, so they are assumed to be in flanking regions outside these two clusters. A black, dotted arrow indicates that a scaffold extends to include other genes beyond the region shown, whereas lack of such an arrow indicates a scaffold border. The color indicates orthology with respect to Trichoderma and F. graminearum trichothecene clusters (shown in the gray, dotted box). TRI18, which is a paralog of TRI3, is colored as TRI3, but the arrow is dotted. Note that Trichoderma TRI5 is known to exist outside of the CTC [12]. The ruler at the top indicates length in kbp. Trichoderma and F. graminearum CTCs were redrawn from prior work [12, 29]. B. The satratoxin-specific clusters are shown in the pink boxes. The other genes shown are chemotype-independent. Other figure conventions follow those described for the CTC.
Figure 5The core atranone clusters of the atranone-producing strains. The core atranone clusters are shown in the blue box. The other genes shown are chemotype-independent. ATR12 of strain 40288 is gray to indicate that it is a possible pseudogene, because despite its translation having ~90% identity to 40285 Atr12, in the present assembly its exon 1 contains an internal stop codon.
Summary of functions putatively encoded by genes in satratoxin clusters SC1, SC2, and SC3
| Symbol | Exons | Putative functions | Closest homolog | E-value/Identity [%] | Conserved domain database |
|---|---|---|---|---|---|
| SAT1-7711 SAT1-40293 | 1 | Membrane protein, FAD binding protein, contains domain found in fungal squalene epoxidases and monoxygenases | Putative FAD binding domain-containing protein from | 5e-58/49 | cl19134 cl17314 |
| SAT2-7711 SAT2-40293 | 2 | NADPH-dependent short-chain dehydrogenase/reductase | Hypothetical protein from | 5e-124/53 | cd05327 |
| SAT3-7711 SAT3-40293 | 2 | Classical short-chain reductase | Putative short chain dehydrogenase reductase protein from | 5e-124/67 | cd05233 |
| SAT4-7711 SAT4-40293 | 3 | Putative integral membrane protein | Putative integral membrane protein from | 7e-89/54 | No conserved domains detected |
| SAT5-7711 SAT5-40293 | 1 | Putative acetyltransferase | Putative trichothecene 3-o- protein | 93-165/54 | cl19241 |
| SAT6-7711 SAT6-40293 | 1 | A secretory lipase domain | Related to lipase 1 from | 1e-166/53 | cl14925 |
| SAT7-7711 SAT7-40293 | 1 | Squalene epoxidase | Putative salicylate hydroxylase from | 5e-132/46 | cl17314 |
| SAT8-7711 SAT8-40293 | 3 | Putative polyketide synthase with a conventional non-reducing architecture | Putative polyketide synthase from | 0.0/48 | cd00833 |
| SAT9-7711 SAT9-40293 | 2 | Putative Cys6 transcriptional factor | Hypothetical protein from | 2e-25/33 | No conserved domains detected |
| SAT10-7711 SAT10-40293 | 6 | Putative protein containing four ankyrin repeats | Multiple ankyrin repeats single kh domain protein from | 5e-78/25 | cd00204 |
| 7 | |||||
| SAT11-7711 SAT11-40293 | 5 | Putative cytochrome P450 monooxygenase and a Tri4 paralog | Cytochrome P450 from | 0.0/78 | cl12078 |
| SAT12-7711 SAT12-40293 | 5 | Putative 15-O-acetyltransferase Tri3 | Hypothetical protein from | 1e-91/36 | cl06457 |
| SAT13-7711 SAT13-40293 | 2 | Putative reducing polyketide synthase | Putative polyketide synthase protein from | 0.0/57 | cd00833 |
| SAT14-7711 SAT14-40293 | 1 | Sat14 and Sat16 are complete and truncated paralogs of the acetyltransferase Tri3 | Predicted protein from | 0.0/67 | pfam13523 |
| SAT15-7711 SAT15-40293 | 2 | The zinc finger protein Sat15 | LolU from | 8e-25/38 | No conserved domains detected |
| SAT16-7711 SAT16-40293 | 4 | Sat14 and Sat16 are complete and truncated paralogs of the acetyltransferase Tri3 | Trichothecene 15-O-acetyltransferase from | 9e-36/45 | cl06457 |
| 3 | |||||
| SAT17-7711 SAT17-40293 | 5 | TauD hydroxylase | Hypothetical protein from | 5e-67/39 | pfam02668 |
| SAT18-7711 SAT18-40293 | 4 | Methyltransferase | Hypothetical protein from | 4e-35/28 | cl16913 |
| SAT19-7711 SAT19-40293 | 2 | N-acetyltransferase | Hypothetical protein from | 4e-25/30 | cd04301 |
| SAT20-7711 SAT20-40293 | 3 | Cys6-type zinc finger | Hypothetical protein from | 3e-43/30 | No conserved domains detected |
| 4 | |||||
| SAT21-7711 SAT21-40293 | 6 | MFS (Major Facilitator Superfamily)-type transporter | Hypothetical protein | 2e-94/37 | pfam07690 |
For brevity the closest homologs, their E-values and levels of identity are identified only for Stachybotrys chartarum 7711.
Figure 6Maximum likelihood phylogenies of selected Tri homologs. A. Reference phylogeny made from partial 18S rRNA sequences. B. Tri3, including all four Stachybotrys paralogs from CTC and SC2. C. Tri4, including Stachybotrys paralog from SC2. D. Tri5, including the paralog Tri19 from strain 40293. E. Tri11. Each phylogeny is rooted at the midpoint. Branches in each tree are colored by gene using the same scheme as in Figure 4. Each organism is labelled with a separate color. Branches are labelled with support values of 100 total bootstrap replicates. Scale bars show the number of substitutions per site.
Figure 7Unified genetic model for atranone and satratoxin biosynthesis. Molecules are color-coded per Figure 1. The gray box indicates trichothecenes whose catalysis is uncertain; they may be synthesized by enzyme products of the core trichothecene cluster, by products of satratoxin chemotype-specific clusters, or by a mix of both types.