| Literature DB >> 28818040 |
Abstract
BACKGROUND: Fungal pathogens of plants produce diverse repertoires of secondary metabolites, which have functions ranging from iron acquisition, defense against immune perturbation, to toxic assaults on the host. The wheat pathogen Zymoseptoria tritici causes Septoria tritici blotch, a foliar disease which is a significant threat to global food security. Currently, there is limited knowledge of the secondary metabolite arsenal produced by Z. tritici, which significantly restricts mechanistic understanding of infection. In this study, we analyzed the genome of Z. tritici isolate IP0323 to identify putative secondary metabolite biosynthetic gene clusters, and used comparative genomics to predict their encoded products.Entities:
Keywords: Gene cluster; Mycosphaerella graminicola; Secondary metabolism; Siderophore; Toxin; Zymoseptoria tritici
Mesh:
Substances:
Year: 2017 PMID: 28818040 PMCID: PMC5561558 DOI: 10.1186/s12864-017-3969-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Predicted secondary metabolite loci in Z. tritici
| Cluster number | Secondary metabolite class | Predicted key biosynthetic gene(s) | Predicted transporter | Resident transcription factor | Subtelomeric | Evidence of transcriptional co-expression? | AntiSMASH | SMURF | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Cluster boundary | No. of genes | Cluster boundary | No. of genes | |||||||
| 1 | NRPS-Like | Mycgr3G107072 | YES | NO | NO | NO | 1:2,172,553:2,212,738 | 12 | 1:2,189,682:2,216,244 | 8 |
| 2 | Transporter | Na | YES | NO | NO | NO | 1:2,306,586:2,310,597 | 2 | Na | Na |
| 3 | Terpene | Mycgr3G34236 | NO | NO | NO | YES | 1:2,366,583:2,438,578 | 28 | Na | Na |
| 4 | Terpene | Mycgr3G33174 | YES | NO | NO | NO | 1:3,518,416:3,535,837 | 9 | Na | Na |
| 5 | PKS | Mycgr3G83965 | NO | NO | NO | NO | 1:3,821,874:3,860,702 | 14 | Na | Na |
| 6 | NRPS | Mycgr3G16590 | YES | YES | NO | NO | 1:5,128,643:5,192,491 | 23 | Na | Na |
| 7 | PKS | Mycgr3G67477 | YES | YES | NO | YES | 1:5,513,274:5,566,181 | 17 | 1:5,524,934:5,578,932 | 20 |
| 8 | PKS | Mycgr3G9788 | YES | YES | YES | YES | 2:31,258:149,728 | 37 | 2:123,001:157,676 | 11 |
| 9 | NRPS | Mycgr3G19958 | NO | NO | YES | NO | 2:222,377:310,014 | 19 | 2:260,654:284,183 | 5 |
| 10 | NRPS | Mycgr3G36951 | YES | NO | NO | NO | 2:420,337:471,819 | 14 | 2:433,742:453,243 | 2 |
| 11 | NRPS | Mycgr3G90558 | YES | NO | NO | NO | 2:1,008,626:1,058,615 | 13 | 2:1,026,842:1,052,480 | 6 |
| 12 | Terpene | Mycgr3G99148 | NO | YES | NO | YES | 2:1,934,751:2,046,004 | 31 | Na | Na |
| 13 | PKS-Like | Mycgr3G39149 | YES | NO | YES | YES | Na | Na | 3:3,181,477:3,213,160 | 13 |
| 14 | NRPS | Mycgr3G39931, Mycgr3G40534 | YES | NO | YES | NO | 4:12,877:43,949 | 7 | Na | Na |
| 15 | PKS | Mycgr3G100089 | YES | YES | YES | YES | 5:85,463:149,465 | 22 | 5:94,206:129,292 | 12 |
| 16 | NRPS-Like | Mycgr3G93235, Mycgr3G100227 | NO | YES | NO | YES | 5:1,291,545:1,380,182 | 31 | 5:1,309,417:1,324,422 | 4 |
| 17 | PKS | Mycgr3G72709 | YES | YES | YES | YES | 6:72,339:170,251 | 36 | 6:106,974:121,853 | 4 |
| 18 | NRPS | Mycgr3G72768 | YES | NO | YES | YES | 6:283,399:352,537 | 24 | 6:310,922:346,260 | 11 |
| 19 | NRPS | Mycgr3G44313 | NO | NO | NO | YES | 6:1,808,394:1,845,239 | 16 | 6:1,821,703:1,841,259 | 9 |
| 20 | NRPS | Mycgr3G109989 | YES | YES | NO | NO | 6:2,243,873:2,278,779 | 9 | Na | Na |
| 21 | PKS | Mycgr3G45348 | YES | NO | YES | NO | 7:2,566,479:2,608,068 | 12 | 7:2,577,096:2,606,813 | 8 |
| 22 | NRPS | Mycgr3G110642 | YES | NO | YES | NO | 8:2,346,378:2,374,211 | 8 | 8:2,352,913:2,362,665 | 2 |
| 23 | NRPS-Like | Mycgr3G75370 | YES | YES | NO | YES | 9:483,924:714,190 | 65 | Na | Na |
| 24 | PKS | Mycgr3G47832 | YES | YES | NO | NO | 9:1,474,679:1,516,611 | 12 | 9:1,494,054:1,506,383 | 3 |
| 25 | Terpene | Mycgr3G76129 | YES | YES | NO | NO | 10:341,276:419,193 | 29 | Na | Na |
| 26 | Hybrid PKS- NRPS | Mycgr3G62978 | YES | NO | NO | YES | 10:425,702:485,734 | 20 | 10:439,622:460,865 | 6 |
| 27 | PKS | Mycgr3G101493 | YES | NO | YES | YES | 10:1,274,562:1,480,511 | 54 | 10:1,436,111:1,437,155 | 7 |
| 28 | NRPS-Like | Mycgr3G49555 | NO | NO | NO | YES | 11:393,046:429,601 | 12 | 11:409,058:423,584 | 5 |
| 29 | PKS | Mycgr3G96592 | NO | YES | NO | NO | 11:566,694:611,143 | 14 | Na | Na |
| 30 | NRPS-Like | Mycgr3G50095 | YES | YES | NO | NO | 11:823,532:917,882 | 30 | 11:849,800:868,675 | 7 |
| 31 | NRPS | Mycgr3G96900 | YES | NO | YES | YES | 12:157,368:195,467 | 15 | Na | Na |
| 32 | NRPS-Like | Mycgr3G77312 | NO | NO | NO | NO | 12:434,096:471,899 | 9 | Na | Na |
| 33 | PKS | Mycgr3G51018 | NO | NO | NO | YES | 13:656,107:712,077 | 17 | Na | Na |
| 34 | Terpene | Mycgr3G101921, Mycgr3G12838 | YES | NO | YES | NO | 13:1,116,277:1,139,970 | 8 | Na | Na |
AntiSMASH and SMURF analysis pipelines were used to predict secondary metabolite clusters in Z. tritici isolate IP0323, identifying a total of 34 putative loci. Ensembl gene identifiers for predicted key biosynthetic genes identified by these analyses are given. Clusters were numbered in ascending numerical order based on their respective location in the Z. tritici genome. In order to identify genes encoding ABC or MFS transporters at each locus, genes were interrogated based on GO-terms GO:0055052 (ATP-binding cassette transporter complex) and GO:0055085 (transmembrane transporter). Transcription factors were identified by screening genes for GO-terms GO:0003677 (DNA binding) and GO:0003700 (transcription factor activity). Subtelomeric clusters were defined as any loci with predicted genes residing within 300 kb of the chromosome end. Interrogation of each individual cluster suggests the key gene of cluster 13 encodes a ketoacyl synthase domain-containing protein, and consequently is likely involved in fatty acid synthesis. Cluster 2 consists of two transport proteins, and lacks a gene encoding a putative key biosynthetic enzyme
Fig. 1Physical distribution of predicted secondary metabolite clusters on the Z. tritici genome. Putative loci on the 13 Z. tritici core chromosomes are shown, and no clusters were predicted on the 8 dispensable chromosomes. Manual interrogation of clusters 2 and 13 demonstrated they are unlikely to biosynthesize a secondary metabolite (see main text), resulting in a total of 32 putative secondary metabolite clusters. Line boxes indicate boundaries of putative secondary metabolite cluster as predicted by AntiSMASH. Numbers in parentheses are the cluster number (Table 1). Shaded regions are loci that demonstrate evidence of gene co-regulation in transcriptional profiling. Red lines below secondary metabolite loci indicate regions containing genes that have one or more paralogues that also reside within a predicted biosynthetic gene cluster
Fig. 2Schematic representation of conserved gene architecture between predicted Z. tritici loci and clusters from other fungi (a-f). Predicted gene boundaries are depicted by arrows and assigned Ensemble identifiers (Z. tritici) or gene names (other species). Identical colors between top and bottom loci are indicative of significant similarity at the level of encoded amino acid sequence (see Table 2). Non-colored arrows are predicted genes that lack an orthologue at the other respective locus. Numbers in parentheses are Z. tritici cluster number (see Table 1)
BLAST analyses of predicted Z. tritici secondary metabolite loci across the Genbank sequence archive
| Putative cluster in | Hit cluster with experimentally defined biosynthetic product | BLAST Results | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Putative product (cluster number) | Ensembl Gene ID | GenBank ID | Annotation in | Species Name | Ensembl Gene ID | GenBank ID | Annotation | % similarity | Sequence coverage | E value |
| Siderophore (14) | Mycgr3G40534 | EGP88586 | Non-ribosomal peptide synthetase |
| UMAG_01433 | 23562457 | Ferrichrome siderophore peptide synthetase | 48% | 82% | 8,00E-61 |
| Mycgr3G85486 | EGP87766 | putative siderophore biosynthesis protein | UMAG_01432 | 23562456 | Putative lysine N-acyltransferase | 43% | 89% | 8,00E-103 | ||
| Mycgr3G41235 | EGP87768 | putative ABC transporter | UMAG_01431 | 23562455 | Multidrug resistance-associated protein 1 | 42% | 100% | 0,00E + 00 | ||
| Mycgr3G70577 | EGP87767 | putative siderophore-dependent iron transporter | UMAG_01439 | 23562463 | Siderophore iron transporter 3 | 39% | 93% | 3,00E-114 | ||
| (Mycgr3G76805) | Hypothetical protein | UMAG_01433 | putative enoyl-CoA hydratase/isomerase | 39% | 83% | 2e-46 | ||||
| (Mycgr3G5470) | HMG-CoA synthase | (UMAG_05362) | HMG-CoA synthase | 54% | 98% | 3,00E-180 | ||||
| DHN melanin (29) | Mycgr3G87993 | EGP83310 | Hypothetical protein |
| MGG_07215 | EHA55622 | Transcription factor | 40% | 100% | 0,00E + 00 |
| Mycgr3G87994 | EGP83311 | Hypothetical protein | MGG_07216 | EHA55623 | Versicolorin reductase 4HNR | 48% | 97% | 2,00E-73 | ||
| Mycgr3G96592 | EGP83620 | Polyketide synthase | MGG_07219 | EHA55627 | Conidial pigment polyketide synthase | 45% | 100% | 0,00E + 00 | ||
| Carotenoid (34) | Mycgr3G97527 | EGP82655 | Hypothetical protein |
| FFUJ_11801 | CCT75764 | related to lignostilbene alphabeta-dioxygenase I | 57% | 90% | 0,00E + 00 |
| Mycgr3G51545 | EGP82649 | Hypothetical protein | FFUJ_11802 | CCT76070 | probable geranylgeranyl-diphosphate geranylgeranyltransferase | 53% | 99% | 0,00E + 00 | ||
| Mycgr3G101921 | EGP82650 | Hypothetical protein | FFUJ_11803 | CCT75765 | probable phytoene dehydrogenase AL-1 | 58% | 96% | 0,00E + 00 | ||
| Mycgr3G106573 | EGP82651 | Hypothetical protein | FFUJ_11804 | CCT75766 | related to HSP30 heat shock protein Yro1p | 68% | 97% | 0,00E + 00 | ||
| (Mycgr3G72713) | Hypothetical protein | (FFUJ_07962) | AM418467.1 | Torulene oxygenase | 41% | 95% | 2,00E-130 | |||
| ETP (9) | Mycgr3G19958 | EGP89696 | Hypothetical protein |
| AFUA_6G09660 | EAL88817 | NRPS GliP | 28% | 100% | 1,00E-146 |
| Mycgr3G37004 | EGP89694 | aminocyclopropane-1-carboxylate synthase-like protein | AFUA_6G09640 | EAL88815 | Aminotransferase gliI | 34% | 96% | 4,00E-61 | ||
| Mycgr3G37024 | EGP90779 | putative P450 monooxygenase | AFUA_6G09730 | EAL88824 | cytochrome P450 oxidoreductase GliF | 51% | 97% | 3,00E-168 | ||
| Mycgr3G35526 | EGP90781 | putative O-methyltransferase | AFUA_6G09680 | EAL88819 | O-methyltransferase GliM | 42% | 90% | 2,00E-102 | ||
| Mycgr3G35698 | EGP90782 | putative glutathione S-transferase | AFUA_6G09690 | EAL88820 | glutathione S-transferase GliG | 41% | 87% | 7,00E-55 | ||
| Mycgr3G35967 | EGP89693 | gliotoxin biosynthesis protein | AFUA_6G09700 | EAL88821 | gliotoxin biosynthesis protein GliK | 35% | 96% | 1,00E-41 | ||
| Mycgr3G36631 | EGP90783 | putative P450 monooxygenase | AFUA_6G09670 | EAL88818 | Cytochrome P450 oxidoreductase gliC | 34% | 96% | 7,00E-94 | ||
| Mycgr3G84377 | EGP89692 | putative pyridine nucleotide-disulfide oxidoreductase | AFUA_6G09740 | EAL88825 | Thioredoxin reductase gliT | 44% | 99% | 3,00E-82 | ||
| Fumonisin (27) | Mycgr3G101493 | EGP83950 | Polyketide synthase |
| ACB12550 | Fum1 polyketide synthase | 36% | 87% | 0,00E + 00 | |
| Mycgr3G63387 | EGP84006 | Hypothetical protein | ACB12552 | Fum3 cytochrome p450 | 52% | 90% | 8,00E-110 | |||
| Mycgr3G48687 | EGP84002 | putative P450 monooxygenase | ACB12551 | Fum2 cytochrome p450 | 36% | 97% | 2,00E-79 | |||
| Mycgr3G48855 | EGP84005 | putative P450 monooxygenase | ACB12560 | Fum15 cytochrome p450 | 28% | 100% | 4,00E-59 | |||
| Mycgr3G48855 | EGP84005 | putative P450 monooxygenase | ACB12565 | Cpm1 cytochrome p450 | 38% | 98% | 3,00E-118 | |||
| AM-toxin (18) | Mycgr3G72768 | EGP86944 | Hypothetical protein |
| BAI44759 | nonribosomal peptide synthetase AMT1 | 40% | 84% | 0,00E + 00 | |
| Mycgr3G43805 | EGP86331 | Hypothetical protein | BAI44762 | similar to branched-chain-amino-acid AMT5-2aminotransferase | 55% | 98% | 5,00E-31 | |||
| Mycgr3G43729 | EGP86946 | Hypothetical protein | BAI44741 | similar to 3-isopropylmalate dehydrogenase AMT6 | 60% | 95% | 3,00E-150 | |||
| Mycgr3G12048 | EGP86333 | Hypothetical protein | BAI44764 | similar to 2-isopropylmalate synthase AMT7 | 58% | 95% | 0,00E + 00 | |||
| Mycgr3G86363 | EGP86330 | Hypothetical protein | BAI44765 | aconitase family protein | 58% | 100% | 0,00E + 00 | |||
| Mycgr3G43304 | EGP86943 | Hypothetical protein | BAI44766 | thioesterase domain containing protein AMT4 | 38% | 89% | 2,00E-43 | |||
| Mycgr3G43786 | EGP86329 | Hypothetical protein | BAI44767 | similar to methyltransferase AMT9 | 28% | 100% | 2,00E-13 | |||
| Mycgr3G11027 | EGP86945 | Hypothetical protein | BAI44768 | nonribosomal peptide synthetase AMT10–2 | 44% | 100% | 0,00E + 00 | |||
MultiGeneBLAST analyses of predicted cluster loci (Table 1) were conducted across all available sequences in the Genbank archive (Clark et al. 2016). Loci with statistically significant similarity to Z. tritici query sequences were manually interrogated to identify biosynthetic gene clusters for which the secondary metabolite product has been experimentally confirmed. Results for statistically significant BLAST alignment between predicted amino acid sequences encoded by genes at each locus are reported. Gene identifiers in parentheses are not contiguously located at each respective cluster, but are necessary for biosynthesis of the experimentally confirmed product (see main text for details). These genes were identified by individual protein BLAST searches between respective genomes using the NCBI BLAST tool. Both Genbank and Ensembl gene identifiers are given where available. Gene annotations were retrieved from Ensembl, or where unavailable from Genbank
Fig. 3Transcriptional analysis reveals stage specific expression of numerous predicted secondary metabolite loci during infection. Numbers in parentheses indicate cluster number from this study (Table 1). Transcriptional values (y-axis) are average fragments per kilobase for each gene per million fragments (FPKM), with the Ensemble gene identifier numbers given. Data were taken from Rudd and colleagues Additional file 3: Table S3 ([6]). In this study, RNA samples were isolated from Czapek-Dox broth (CDB), potato dextrose broth (PDB), and from various days post inoculation in a virulence assay of IP0323 on ‘Riband’ wheat cultivar. We identified co-expressed contiguous loci using the FunGeneClusterS web interface [26]