| Literature DB >> 31014248 |
Alessandra Villani1, Robert H Proctor2, Hye-Seon Kim2, Daren W Brown2, Antonio F Logrieco1, Maria Teresa Amatulli1,3, Antonio Moretti4, Antonia Susca1.
Abstract
BACKGROUND: The Fusarium incarnatum-equiseti species complex (FIESC) comprises 33 phylogenetically distinct species that have been recovered from diverse biological sources, but have been most often isolated from agricultural plants and soils. Collectively, members of FIESC can produce diverse mycotoxins. However, because the species diversity of FIESC has been recognized only recently, the potential of species to cause mycotoxin contamination of crop plants is unclear. In this study, therefore, we used comparative genomics to investigate the distribution of and variation in genes and gene clusters responsible for the synthesis of mycotoxins and other secondary metabolites (SMs) in FIESC.Entities:
Keywords: Comparative genome analyses; Fusarium incarnatum-equiseti species complex; Horizontal gene transfer; Phylogeny; Secondary metabolite genes
Mesh:
Year: 2019 PMID: 31014248 PMCID: PMC6480918 DOI: 10.1186/s12864-019-5567-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Assembly statistics for draft genome sequences of the 13 FIESC strains analyzed in this study
| Species | Strain designation | Alternative Designation | GenBank Accession No.a | Genome Size (Mb) | No. Contigs | N50 | GC (%) |
|---|---|---|---|---|---|---|---|
|
| NRRL 13381 | FRC R-5200 | QGED00000000 | 36.6 | 601 | 209,627 | 48.44 |
|
| ITEM 11363 | NRRL 66338 | QGEB00000000 | 40.0 | 669 | 233,726 | 48.46 |
|
| NRRL 66328 | FRC R-06979 | QHHJ00000000 | 39.5 | 1035 | 168,857 | 48.42 |
| FIESC 5 | ITEM 11348 | NRRL 66337 | QGEC00000000 | 38.6 | 880 | 157,961 | 48.58 |
| FIESC 5 | CS3069 | NRRL 62617 | CBMI000000000 | 38.0 | 5111 | – | – |
| FIESC 12 | ITEM 11294 | NRRL 66336 | QHHI00000000 | 39.6 | 1502 | 75,753 | 48.57 |
| FIESC 15 | NRRL 31160 | – | QGEA00000000 | 37.4 | 634 | 153,141 | 48.64 |
| FIESC 23 | ITEM 7155 | NRRL 66325 | QGDZ00000000 | 37.5 | 421 | 291,968 | 48.66 |
| FIESC 25 | ITEM 6748 | NRRL 66324 | QGDY00000000 | 37.2 | 560 | 213,588 | 48.62 |
| FIESC 28 | ITEM 1616 | NRRL 66322 | QGDX00000000 | 37.0 | 562 | 199,056 | 48.62 |
| FIESC 29 | ITEM 10392 | NRRL 66334 | QHHH00000000 | 37.9 | 471 | 225,592 | 48.53 |
| FIESC 33 | ITEM 10395 | NRRL 66335 | QHHG00000000 | 39.0 | 593 | 183,362 | 48.56 |
| FIESC 33 | ITEM 11401 | NRRL 66339 | QHKN00000000 | 39.2 | 806 | 208,659 | 48.55 |
aWith the exception of FIESC 5 strain CS3069, genome sequences were generated during the course of this study using a MiSeq Illumina platform. The sequence for CS3069 was generated previously [28] and was downloaded from the National Center for Biotechnology Information website
Fig. 1Fusarium species tree inferred using the extended majority rule consensus method from individual maximum likelihood trees of 30 housekeeping genes (Additional file 1). Values above branches are internode certainty values, and values below branches are bootstrap values based on 1000 replicates generated during maximum likelihood analysis of concatenated sequences of the 30 housekeeping genes. Colored boxes demarcate species complexes and the names of the complexes are indicated to the right using unitalicized specific epithets of the species after which each complex is named [3]
Fig. 2Distribution of NRPS and PKS genes among Fusarium species based on BLASTn analysis of genome sequences. A gray box indicates that an apparently functional copy of the gene was present in the genome of species/strain indicated in the species tree to the left. A white box indicates that the gene was absent. A white box with the Greek letter Ψ indicates that only a pseudogene was detected in the genome. A black box indicates that an apparently functional gene was present in one or more strains of a species but was absent (or present as a pseudogene) in one or more other strains of the same species. Note that we analyzed multiple strains of only a small number of species; in addition to the two strains of F. oxysporum, FIESC 5 and FIESC 33 included in the species tree, we analyzed publicly available genomes of two or more strains of F. avenaceum, F. fujikuroi, F. graminearum, F. oxysporum, F. pseudograminearum and F. proliferatum. The species tree shown to the left is derived from (i.e., includes the same species and has the same topology) as the species tree in Fig. 1
antiSMASH-based estimates of secondary metabolite (SM) biosynthetic gene clusters and proportion of genome involved in SM biosynthesis
| Species | Strain No. | No. SM Clusters | Genome Size (Mb) | Mb of SM Clusters | SM Clusters as % of Genome | No. Predicted Genes | No. Predicted SM Genes | SM Genes as % of Genome |
|---|---|---|---|---|---|---|---|---|
| Species from current study | ||||||||
| | NRRL 13381 | 39 | 36.8 | 1.6 | 4.3 | 12,018 | 512 | 4.3 |
| | 11,363 | 40 | 40.0 | 1.6 | 4.1 | 12,905 | 521 | 4.0 |
| | NRRL 66328 | 43 | 39.5 | 1.7 | 4.4 | 13,071 | 560 | 4.3 |
| FIESC5 | 11,348 | 36 | 38.6 | 1.4 | 3.6 | 12,723 | 459 | 3.6 |
| FIESC12 | 11,294 | 42 | 39.6 | 1.4 | 3.5 | 13,219 | 433 | 3.3 |
| FIESC15 | NRRL 31160 | 36 | 37.4 | 1.4 | 3.7 | 12,421 | 442 | 3.6 |
| FIESC23 | 7155 | 34 | 37.5 | 1.4 | 3.7 | 12,325 | 446 | 3.6 |
| FIESC25 | 6748 | 35 | 37.2 | 1.4 | 3.8 | 12,175 | 449 | 3.7 |
| FIESC28 | 1616 | 37 | 37.0 | 1.3 | 3.7 | 12,168 | 456 | 3.7 |
| FIESC29 | 10,392 | 33 | 37.9 | 1.3 | 3.3 | 12,469 | 417 | 3.3 |
| FIESC33 | 10,395 | 39 | 39.0 | 1.6 | 4.1 | 12,810 | 522 | 4.1 |
| FIESC33 | 11,401 | 38 | 39.2 | 1.6 | 4.1 | 12,819 | 506 | 3.9 |
| Species from Hansen et al. [30] | ||||||||
| FIESC 5 | CS3069 | 37 | 38.1 | 0.7 | 1.9 | 13,047 | 215 | 1.6 |
| | CS5907 | 55 | 44.0 | 1.6 | 3.6 | 14,516 | 461 | 3.2 |
| | Fa05001 | 63 | 41.5 | 2.8 | 6.8 | 13,214 | 831 | 6.3 |
| | CS7071 | 42 | 37.7 | 1.3 | 3.5 | 11,922 | 393 | 3.3 |
| | IMI58289 | 50 | 43.8 | 2.2 | 4.9 | 13,692 | 607 | 4.4 |
| | PH-1 | 42 | 36.4 | 1.9 | 5.2 | 11,683 | 604 | 5.2 |
| | 4287 | 50 | 60.2 | 2.0 | 3.4 | 18,382 | 664 | 3.6 |
| | CS3096 | 38 | 36.3 | 1.7 | 4.5 | 11,721 | 511 | 4.4 |
| | 77–13-4 | 38 | 51.2 | 1.6 | 3.1 | 16,410 | 523 | 3.2 |
| | 7600 | 49 | 41.1 | 2.1 | 5.1 | 13,701 | 931 | 6.8 |
Fig. 3Organization of the ZEA gene cluster and flanking genes. The arrows represent the indicated genes while the direction of the arrow shows direction of transcription: Genes A, B, C, D, E, F, G, H, I, J, K, L, M, and N correspond to FGSG_02392, FGSG_02393, FGSG_02394, FGSG_12124, FGSG_12125, FGSG_02399, FGSG_02400, FGSG 11341, FGSG_07712, FGSG_04615, FGSG_11645, FPSE_12200, FVEG_13785, and FGSG 02196, respectively. FIESC homologs A, B, C, D, E, F, G, H, J, M, N, and I share > 70% identity while FIESC homologs of J and K share < 50% identity
Fig. 4a. Organization of homologs of the endocrocin biosynthetic gene cluster in Aspergillus species, FIESC 28 and Pestalotiopsis fici. Blue arrows represent homologous genes present in at least two fungi. Direction of arrows indicate direction of transcription. Abbreviations for gene/protein functions are as follows: AO, anthrone oxidase; TE, metallo-β-lactamase type thioesterase (MβL-TE); PKS, polyketide synthase; TF, = transcription factor, UN, gene of unknown function. The numbers in blue arrows indicate homologs. Black arrows indicate genes that are considered part of the biosynthetic cluster in the respective fungi but that were not present in FIESC 28. b. Proposed biosynthetic pathway for endocrocin anthrone in FIESC 28 and for endocrocin in the other fungi
Fig. 5Phylogeny of PKS genes in FIESC. The condensed phylogenetic tree was generated by maximum likelihood analysis of the entire amino acid sequences of the coding regions predicted from 146 PKS enzymes identified in 13 FIESC genomes. All the homologous PKSs from Brown and Proctor 2016 were included in the analysis. The Gallus gallus fatty acid synthase (FAS) gene was used as outgroup. Clades corresponding to the three previously described major clades of reducing PKSs (R-PKS I, R-PKS II and R-PKS III) and the one major clade of non-reducing PKSs were resolved in this analysis with high levels of bootstrap support (in red type) and are delineated with horizontal lines
Fig. 6ML phylogenetic tree of adenylation domains from NRPSs and hybrid PKS-NRPS. Numbers at nodes indicate bootstrap value greater than 70%, performed with 1000 replications. Name of subfamilies are shown to the left [46]. Known products and domain structures [30] are shown on the right
Putative horizontal gene transfer (HGT) events of NRPS and PKS genes between FIESC and other lineages of Fusarium. Grey highlighting indicates that results from all analyses were consistent with HGT
| Gene | HGT Donor | HGT Recipient | Identificationa | Additional Evidence for HGTb | |||
|---|---|---|---|---|---|---|---|
| Manual Tree Comparison | NOTUNG | Bootstrap | SH-AU |
| |||
|
| |||||||
| | Tricinctum complex | FIESC (Incarnatum clade) | + | + | 100 | + | + |
| | Sambucinum complex (or close relative) | FIESC | – | + | – | – | – |
| | Sambucinum complex (or close relative) | FIESC | – | + | 100c | + | – |
| | FIESC | + | + | 77 | – | ± | |
| | Fujikuroi complex (African clade) | FIESC (Incarnatum clade) | + | + | 100 | + | + |
| | Tricinctum complex | + | + | 100 | + | + | |
| | Sambucinum complex | FIESC (Incarnatum clade) | + | + | 100 | + | – |
| | FIESC (Equiseti clade) | – | + | 100 | + | – | |
| | Fujikuroi complex | FIESC (Incarnatum clade) | + | + | 100d | + | + |
| | Tricinctum complex ( | FIESC (Incarnatum clade) | + | + | 100 | + | – |
| | Fujikuroi complex | FIESC | NA | NA | – | NA | + |
| | Fujikuroi complex | FIESC | +f | NA | 90 | + | + |
|
| |||||||
| | FIESC | Sambucinum complex | + | + | 74 | + | ± |
| | FIESC (Equiseti clade) |
| – | + | – | – | ± |
| | FIESC (Equiseti clade) | Tricinctum complex | + | + | 100 | + | – |
| | FIESC (Equiseti clade) | Ancestor of Sambucinum and Chlamydosporum complexes | – | + | – | – | ± |
| | FIESC (Incarnatum clade) | Sambucinum complex ( | – | + | – | – | ± |
| | FIESC (Equiseti clade) | – | + | – | – | ± | |
| | FIESC5 (or close relative) | – | + | – | – | ± | |
aPutative HGT events were identified by manual comparison of individual NRPS and PKS gene trees to the species tree (Fig. 1) and by using the gene tree reconciliation program NOTUNG [76]. + indicates the method identified the putative HGT event, and – indicates the method did not identify the HGT event. For the Manual comparison column, ± indicates a branch conflict was identified, but that we considered an alternative hypothesis (i.e., an hypothesis that did not involved HGT to FIESC) was also plausible. NA indicates not applicable
bThree analyses were done to further assess evidence for HGT: bootstrap analysis, SH- AU tests, and estimates of number of synonymous substitutions per synonymous site (d). In the Bootstrap column, numerical values correspond to the bootstrap values for the branch in the NRPS/PKS gene tree that conflicted with the species tree; and – indicates that the bootstrap value for the conflicting branch was < 70%, or that in our estimation the branch indicative of HGT in NOTUNG analysis did not conflict with the species tree. In the SH-AU column, + indicates that the constrained tree was significantly worse than the unconstrained tree; and – indicates that the constrained tree was not significantly worse than the unconstrained tree. In the d column, + indicates d ratio < 1; indicates d ratio > 1; and ± indicates comparisons for which d ratios < 1 may not be evidence of HGT, because over 50% of comparisons for the gene yielded d ratios < 1. NA indicates not applicable
cThe bootstrap value of 100 was for a FIESC and Sambucinum complex branch that excluded F. aywerte (i.e., Chlamydosporum complex). From our manual comparison of the NRPS14 tree with the species tree, we considered that the topology of the NRPS14 tree could have resulted if the relationship of FIESC and Sambucinum complex sequences were concordant with the species tree, but the relationship of F. aywerte sequences to Sambucinum complex sequences was not concordant with the species tree
dThis bootstrap value is for a clade that includes members of FIESC and the Fujikuroi and Nisikadoi complexes (See Additional file 5). The bootstrap value for the clade consisting of only FIESC and the Fujikuroi complex was < 70%, and therefore not significant. This in turn suggests that the donor of HGT of PKS23 may have been an ancestor or other relative of the Fujikuroi and Nisikadoi complexes
eSome non-FIESC homologs used in the PKS62 and PKS69 analyses were not species included in the species tree inferred the current study. Fusarium agapanthi and F. dlaminii are members of the Fujikuroi complex [80], and Fusarium sp. NRRL 25184 (25184) is a member of the F. newnesense species complex, a lineage that is closely related to the Fujikuroi, Nisikadoi and Oxysporum complexes [81]
fThis branch conflict was inferred by comparison of the PKS69 tree to previously reported species trees showing the relationships of FIESC and F. dlaminii to one another and/or other lineages of Fusarium [3, 29]. The previous studies indicate that F. dlaminii and F. fujikuroi are both members of the Fujikuroi complex. In the PKS69 tree, the F. dlaminii homolog is more closely related to FIESC homologs than to the F. fujikuroi homolog. Thus, the PKS69 tree conflicts with the relationships of species
Fig. 7Results of phylogenetic analyses of the beauvericin/enniatin NRPS gene NRPS22 (ESYN1) as an example of analyses used to assess HGT of NRPS and PKS genes. a. ML tree of NRPS22 sequences from all species included in this study that have the gene. The colored boxes demarcate species complexes. Numerical values near branches are bootstrap values based on 1000 replications. b. Phylogenetic tree from NOTUNG reconciliation analysis using the species tree shown in Fig. 1 and default cost settings specified by NOTUNG. Yellow arrows indicate NOTUNG-inferred HGT events. c. Mean dS values for NRPS22 and HK genes from pairwise comparisons of members of FIESC and the Sambucinum and Fujikuroi species complexes shown in A. Plot of ratio of dS values for NRPS22:HK genes (i.e., dS ratio) for the 153 pairwise comparisons of taxa shown in A. The ratios at the bottom right of the plot that are highlighted with yellow are the only ratios that were less than 1 and correspond to comparison involving FIESC 23 or FIESC 25 with members of the Fujikuroi, Nisikadoi and Oxysporum complexes. The specific pairwise comparisons corresponding to the numbers along the X-axis are shown in the NRPS22 tab of Additional file 7
Fig. 8Left: FIESC species tree derived from ML tree in Fig. 1. Right: variation in trichothecene biosynthetic cluster homologs in FIESC. Arrows indicate genes and direction of transcription. White arrows labelled with numbers indicate known TRI genes. Gray arrows indicate genes flanking TRI cluster . *indicates that TRI16 is present in the genome