| Literature DB >> 26537223 |
Grzegorz Koczyk1, Adam Dawidziuk2, Delfina Popiel2.
Abstract
In recent years, the influx of newly sequenced fungal genomes has enabled sampling of secondary metabolite biosynthesis on an unprecedented scale. However, explanations of extant diversity which take into account both large-scale phylogeny reconstructions and knowledge gained from multiple genome projects are still lacking. We analyzed the evolutionary sources of genetic diversity in aromatic polyketide biosynthesis in over 100 model fungal genomes. By reconciling the history of over 400 nonreducing polyketide synthases (NR-PKSs) with corresponding species history, we demonstrate that extant fungal NR-PKSs are clades of distant siblings, originating from a burst of duplications in early Pezizomycotina and thinned by extensive losses. The capability of higher fungi to biosynthesize the simplest precursor molecule (orsellinic acid) is highlighted as an ancestral trait underlying biosynthesis of aromatic compounds. This base activity was modified during early evolution of filamentous fungi, toward divergent reaction schemes associated with biosynthesis of, for example, aflatoxins and fusarubins (C4-C9 cyclization) or various anthraquinone derivatives (C6-C11 cyclization). The functional plasticity is further shown to have been supplemented by modularization of domain architecture into discrete pieces (conserved splice junctions within product template domain), as well as tight linkage of key accessory enzyme families and divergence in employed transcriptional factors. Although the majority of discord between species and gene history is explained by ancient duplications, this landscape has been altered by more recent duplications, as well as multiple horizontal gene transfers. The 25 detected transfers include previously undescribed events leading to emergence of, for example, fusarubin biosynthesis in Fusarium genus. Both the underlying data and the results of present analysis (including alternative scenarios revealed by sampling multiple reconciliation optima) are maintained as a freely available web-based resource: http://cropnet.pl/metasites/sekmet/nrpks_2014.Entities:
Keywords: duplication; horizontal transfer; loss; polyketide; secondary metabolism; sources of diversity
Mesh:
Substances:
Year: 2015 PMID: 26537223 PMCID: PMC5635595 DOI: 10.1093/gbe/evv204
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FThe workflow of phylogenomic analysis of NR-PKSs.
Summary of Transfers Predicted by ALE and the Consistency of Their Support Based on DTL-RANGER Sampling
| Clade | No. | Donor Gene Tree Node | Top Transfer Cost (at | Consistent | Description of Affected Genes |
|---|---|---|---|---|---|
| β (azaphilones, meroterpenoids) | 1 | g109 (go109) | 15 (15) |
| Origin of |
| 2 | g110 (go110) | 12 (12) |
| Origin of | |
| 3 | g166 (go167) | 22 (10) | Origin of | ||
| 4 | g229 (go226) | 27 (12) |
| Origin of | |
| γ.C2–C7a monocyclic (orsellinic acid, resorcylic acid lactones) | 5 | g352 (go352) | 12 (11) | Origin of | |
| 6 | g355 (go372) | 12 (11) | Origin of | ||
| 7 | g361 (go367) | 15 (15) |
| Origin of | |
| 8 | g362 (go368) | 12 (11) | Origin of | ||
| 9 | g369 (NA) | 12 (NA) | Origin of | ||
| 10 | g370 (go359) | 12 (11) | Origin of | ||
| 11 | g371 (go360) | 12 (11) | Origin of | ||
| 12 | g383 (go381) | 14 (13) |
| Origins of | |
| γ.C6–C11 (emodins, atrochyrsone, asperthecin) | 13 | g418 (go419) | 17 (10) | Origin of | |
| 14 | g454 (go510) | 7 (8) | Origin of | ||
| 15 | g462 (go517) | 17 (13) |
| Origin of | |
| 16 | g491 (go547) | 12 (12) | Origin of | ||
| 17 | g490 (go546) | 12 (12) |
| Origin of | |
| 18 | g511 (go433) | 21 (15) |
| Origin of | |
| 19 | g542 (go468) | 29 (20) |
| Origin of | |
| γ.C2–C7c multicyclic (naphtopyrone, melanins, aurofusarin, bikaverin) | 20 | g664 (go754) | >40 (>40) |
| Origin of |
| γ.C4–C9 (aflatoxins, sterigmatocystin, fusarubins) | 21 | g731 (go684) | 15 (16) |
| Origin of |
| 22 | g740 (go647) | 14 (NA) | Origin of | ||
| 23 | g766 (go667) | 26 (27) |
| Transfer of sterigmatocystin biosynthetic cluster to | |
| 24 | g791 (go704) | 17 (16) |
| Origin of | |
| 25 | g807 (go719) | 17 (16) |
| Transfer of |
aFirst number denotes node in gene tree post-ALE reconciliation; if bipartition is present in original BI tree, the number is given in parentheses.
bTransfer cost at which transfer event is predicted for majority of sampled scenarios; second number given in parentheses, if bipartition is present in the original BI tree.
cMost robust predictions where detected transfer is shown across multiple transfer cost thresholds (for L = 1, 2, 3; on both the amalgamated and the original gene tree) are indicated with plus sign.
dIn nonamalgamated tree, the different parent bipartition results in two inferred transfers at T = 12.0 (second involves TRIREDRAFT_73621).
Summary of Strongest Associations between Syntenic Homologs Present in the Genomic Neighborhood (of the core NR-PKS) and Monophyletic Clades of NR-PKSs
| Clade | Gnode | # | CHG | Description—CHG (Candidate Homolog Group) |
| Example Accessory Genes |
|---|---|---|---|---|---|---|
| β (azaphilones, meroterpenoids) |
| 1 | ch109 | Terpene cyclase | 4E-06 |
|
|
| 2 | ch71 | Mitochondrial carrier protein (Pfam: Mito_carr) | 3E-05 | — | |
|
| 3 | ch73 | YCII superfamily protein (Pfam: YCII) | 8E-07 |
| |
|
| 4 | ch7 | Accessory HR-PKS | 1E-06 |
| |
|
| 5 | ch9 | FAD-binding oxidoreductase | 3E-25 |
| |
| 6 | ch12 | Zinc-binding oxidoreductase (alcohol dehydrogenase) | 1E-06 |
| ||
| 7 | ch48 | Acetyltransferase | 2E-09 |
| ||
|
| 8 | ch20 | FSH1 (serine hydrolase) | 4E-19 |
| |
| 9 | ch26 | 2-oxoglutarate/Fe(II)-dependent dioxygenase | 7E-11 |
| ||
|
| 10 | ch17 | Zinc-finger transcription factor | 3E-24 |
| |
| γ.C2–C7a monocyclic (orsellinic acid, resorcylic acid lactones) |
| 11 | ch7 | Accessory HR-PKS | 3E-08 |
|
| γ.C6–C11 (emodins, atrochrysone, asperthecin) |
| 12 | ch46 | Short-chain dehydrogenase (classical SDR subtype) | 3E-05 |
|
|
| 13 | ch68 | Policyclic prenyltransferase (pcPTase) | 6E-10 |
| |
|
| 14 | ch23 | Zinc-finger transcription factor | 8E-11 |
| |
|
| 15 | ch60 | FAD-binding oxidase (Pfam: FAD_binding_3) | 1E-22 |
| |
|
| 16 | ch32 | FAD-binding oxidase (Pfam: FAD_binding_4) | 8E-07 | — | |
|
| 17 | ch36 | Scytalone dehydratase | 5E-08 |
| |
|
| 18 | ch18 | Dehydratase (EthD domain) | 1E-09 |
| |
| 19 | ch51 | Domain of unknown function (Pfam: DUF1772) | 1E-05 |
| ||
| 20 | ch131 | Methyltransferase | 5E-09 |
| ||
|
| 21 | ch31 | NAD-binding oxidase (Pfam: NAD_binding_10) | 5E-22 |
| |
| 22 | ch41 | Baeyer–Villiger oxidase (Pfam: DUF4243) | 3E-07 |
| ||
| 23 | ch43 | Zinc-finger transcription factor (Pfam: AflR) | 5E-09 |
| ||
|
| 24 | ch14 | β-lactamase | 3E-42 |
| |
| γ.C2–C7b bicyclic (THN, melanins) |
| 25 | ch81 | Prefoldin (chaperone) | 4E-19 | — (housekeeping gene) |
|
| 26 | ch125 | Pre-mRNA-splicing factor | 8E-09 | — (housekeeping gene) | |
|
| 27 | ch94 | Imide hydrolase (Pfam: Hydantoinase) | 9E-08 | — | |
|
| 28 | ch33 | Zinc-finger transcription factor | 1E-29 | — | |
| γ.C2–C7c multicyclic (naphtopyrone, melanins, aurofusarin, bikaverin) |
| 29 | ch130 | Maleylacetate reductase homolog (iron-containing alcohol dehydrogenase) | 2E-10 | — |
| 30 | ch132 | Zinc-finger transcription factor | 7E-08 | — | ||
|
| 31 | ch37 | FAD-binding oxidase (Pfam: FAD_binding_3) | 1E-06 |
| |
|
| 32 | ch101 | Multidomain protein (multiple zinc-finger sites, jumonji, ARID domains)—distant homology to histone demethylases ( | 4E-09 | — | |
| 33 | ch116 | GIY-YIG nuclease superfamily protein (structure-specific endonuclease subunit | 2E-07 | — (housekeeping gene) | ||
| 34 | ch126 | Glycosyl hydrolase family 31 proteins (∼50% sequence identity to model α-glycoside hydrolases) | 2E-10 | — | ||
| γ.C4–C9 (aflatoxins, sterigmatocystin, fusarubins) |
| 35 | ch43 | Zinc-finger transcription factors (Pfam: AflR) | 3E-06 |
|
| 36 | ch49 | Accessory FAS—type α | 8E-09 |
| ||
| 37 | ch54 | Accessory FAS—type β | 1E-07 |
| ||
|
| 38 | ch58 | 3-keto-ACP reductase (classical SDR) | 6E-05 | — | |
| 39 | ch85 | Flavin-binding monooxygenase (Pfam: FMO_like) | 6E-06 |
| ||
|
| 40 | ch1 | Methyltransferase | 7E-05 |
| |
| γ.C2–C7b/c + γ.C4–C9 |
| 41 | ch6 | Laccase (Cu-dependent oxidoreductase) | 1E-06 |
|
| γ.C6–C11 + γ.C2–C7b/c + γ.C4–C9 |
| 42 | ch8 | Enoyl-CoA reductase | 2E-07 |
|
aPost-Bonferroni correction.
bModel accessory genes from species not covered by the phylogeny reconstruction.
FThe ancestral duplications, subsequent transfers, and gene losses in the NR-PKS complement—annotated on the simplified species tree of higher fungi. The events were annotated based on the results of ALE reconciliation. Deletions are indicated by violet bubbles, duplications by red bubbles. Transfer events are marked through their respective numbers from table 1 (filled outlines indicate donors, hollow outlines indicate transfer acceptors). Salient broad taxa within higher fungi are highlighted by colored backgrounds (in clockwise direction: Basidiomycota, Leotiomycetes, Sordariomycetes, Dothideomycetes, Eurotiomycetes). Subscript on branches denotes predicted numbers of, respectively, duplications/deletions/genes for respective branch. For ease of reference, some nodes were collapsed (thickened branches, proportional to the number of species/strains). Duplications within the Agaricomycetes clade are not shown due to space constraints (see fig. 4).
FThe phylogeny of ancestral clade α, which predates basidiomycete–ascomycete split, contains conserved NR-PKSs biosynthesizing orsellinic acid (pks14, pks1). Support noted below branches in form BI/ML/C (BI, support from BI; ML, support from ML ultrafast bootstrap; C, whether bipartition is present in Bayesian consensus tree; * denotes full support, % denotes support below 1% level). Domain architecture and gene structure are visualized on the right side (colors: red, SAT; blue, KS; green, AT; purple, PT; orange, ACP; pink, TE; brown, R). Color of the tree edges and letters denote predicted events (D, duplication; TD, transfer donated; TA, transfer accepted). Tree is not drawn to scale.
FThe divergence of major NR-PKS clades, which enabled diversification of biosynthesized aromatic compounds, was facilitated by close linkage with key accessory enzymes and transcriptional factors. The simplified phylogeny (based on amalgamated gene tree) shows selected groups of syntenic homologs as colored shapes (also indicated by numbers in box shape, which reference descriptions in table 2).
FSelected examples of predicted horizontal transfers, acting as additional source of diversity: (A) Origin of sterigmatocystin biosynthesis in P. anserina (predicted donor in Rhytidhysteron rufulum lineage), (B) origin of fusarubin biosynthesis in Fusarium sp. (HGT from Pleosporales), (C) origin of putative pksP pigment biosynthetic gene into A. terreus. Visual conventions (branch support, exon, domains) analogous to figure 4, see also table 1 (summary of predicted HGT events). Trees are not drawn to scale.
FThe fragmentation of PT domain into pieces encoded on different exons facilitates gated access to cyclization chamber (clade γ). The structural relationships are visualized on A. parasiticus PksA model (PDB: 3HRQ, chain A). Different colors correspond to fragments encoded on separate exons. The large, filled shape corresponds to the inner surface of cyclization chamber, where all exons contribute to the chamber entrance area. Views from the front (A) and back (B) are both shown.
FThe phylogeny of tetrahydroxynaphthalene synthases (the core enzyme of melanin biosynthesis) mirrors speciation explicitly in the majority of Dothideomycetes, Leotiomycetes, and Sordariomycetes. Visual conventions (branch support, exon, domains) analogous to figure 4. Gene models for MBM_00260 and M7I_3853 were truncated from 5′ and 3′ sides, respectively.
FThe parology/orthology relationships of bikaverin, aurofusarin, and Trichoderma conidial pigment core polyketide synthases are due to an ancestral duplication in filamentous fungi (g723), followed by an additional duplication in Hypocreales (g721). Visual conventions (branch support, exon, domains) are analogous to figure 4. Tree fragment is not drawn to scale.