| Literature DB >> 28949419 |
Frances M Van Dolah1,2, Gurjeet S Kohli3,4, Jeanine S Morey2,5, Shauna A Murray3.
Abstract
Dinoflagellates are prolific producers of polyketide compounds, many of which are potent toxins with adverse impacts on human and marine animal health. To identify polyketide synthase (PKS) genes in the brevetoxin-producing dinoflagellate, Karenia brevis, we assembled a transcriptome from 595 million Illumina reads, sampled under different growth conditions. The assembly included 125,687 transcripts greater than 300 nt in length, with over half having >100× coverage. We found 121 transcripts encoding Type I ketosynthase (KS) domains, of which 99 encoded single KS domains, while 22 contained multiple KS domains arranged in 1-3 protein modules. Phylogenetic analysis placed all single domain and a majority of multidomain KSs within a monophyletic clade of protist PKSs. In contrast with the highly amplified single-domain KSs, only eight single-domain ketoreductase transcripts were found in the assembly, suggesting that they are more evolutionarily conserved. The multidomain PKSs were dominated by trans-acyltransferase architectures, which were recently shown to be prevalent in other algal protists. Karenia brevis also expressed several hybrid nonribosomal peptide synthetase (NRPS)/PKS sequences, including a burA-like sequence previously reported in a wide variety of dinoflagellates. This contrasts with a similarly deep transcriptome of Gambierdiscus polynesiensis, which lacked NRPS/PKS other than the burA-like transcript, and may reflect the presence of amide-containing polyketides in K. brevis and their absence from G. polynesiensis. In concert with other recent transcriptome analyses, this study provides evidence for both single domain and multidomain PKSs in the synthesis of polyketide compounds in dinoflagellates.Entities:
Keywords: zzm321990Karenia breviszzm321990; algal toxin; dinoflagellate; polyketide synthase; toxin biosynthesis
Mesh:
Substances:
Year: 2017 PMID: 28949419 PMCID: PMC5725682 DOI: 10.1111/jpy.12586
Source DB: PubMed Journal: J Phycol ISSN: 0022-3646 Impact factor: 2.923
Figure 1Diversity of polyketide compounds produced by Karenia brevis.
Coverage and annotation statistics of Karenia brevis gene catalogue. An e‐value cut‐off of 10−3 was applied during BLASTx analysis
| Coverage | No. of contigs | Length (mean) | BLASTx analysis | PKS sequences | ||
|---|---|---|---|---|---|---|
| Annotated match | Non‐annotated match | No match | ||||
| 1×–5.00× | 2,819 | 300–1,401 (409.3) | 167 | 310 | 2,342 | 7 |
| 5.01×–20× | 15,033 | 300–10,974 (817.5) | 2,455 | 3,046 | 9,532 | 14 |
| 20.01×–50× | 19,906 | 300–24,286 (1,292.2) | 5,278 | 5,425 | 9,203 | 5 |
| 50.01×–100.00× | 20,907 | 300–36,109 (1,536.5) | 6,804 | 5,396 | 8,707 | 10 |
| 100.01×–1,000× | 61,676 | 300–34,047 (1,641.8) | 20,692 | 16,265 | 24,719 | 91 |
| 1,000.00×–10,000× | 5,272 | 300–6,989 (956.6) | 2,291 | 11,09 | 1,872 | – |
| 10,000.01× and above | 74 | 311–2,254 (543.9) | 58 | 5 | 11 | – |
| Total number of contigs (percentage) | 125,687 | 300–36,109 (1,413.3) | 37,745 (30%) | 31,556 (25.1%) | 56,386 (44.9%) | 127 |
Figure 2Phylogenetic analysis of ketoacyl synthase (KS) domains from prokaryotic and eukaryotic Type I and II polyketide synthases (PKS) and Type I FAS. The alignment included 121 KS domains from Karenia brevis sequences encoding single and multidomain PKS and NRPS/PKSs and 154 KS domains from Gambierdscus polynesiensis and other prokaryotic and eukaryotic taxa. Analysis was conducted using RAxML, using the LG model of rate heterogeneity, with 1,000 bootstraps. Bootstrap values ≥50% shown.
Figure 3Consensus sequences found in the conserved active sites within single domain KS sequences. Sequences in Clades 1 and 2 possess the highly conserved cysteine and histidines (starred), whereas sequences in Clade 3 are divergent, most lacking the conserved C and Hs. Consensus sequences were generated using Weblogo (http://weblogo.berkeley.edu/logo.cgi).
Predicted signal peptides and cleavage sites in known plastid‐localized proteins and Type II FAS sequences in Karenia brevis. Signal peptide prediction and score generated by signalP. The D‐score (discrimination score) is used to discriminate signal peptides from non‐signal peptides, using a threshold of 0.5. Cleavage site sequences were screened for the conserved FVAP sequence found in peridinin dinoflagellates. In some sequences a plausible FVAP‐like sequences is present, but most possess the RRV(Q/K) also present in Karlodinium. No signal peptides were present in any single domain PKS KS or KR sequences
| GenBank accession no. or contig no. | CD and blast ID | Prediction | D score | Cleavage site |
|---|---|---|---|---|
|
| Cytochrome b6f FeS subunit | No | 0.131 | – |
| Contig 15936 | Cytochrome b6‐f | Yes | 0.55 | G‐QSPREQ |
|
| Ferredoxin | Yes | 0.613 | R‐RRVRD |
| Contig 15936 | Multimeric flavodoxin | Yes | 0.763 | A‐ASLAS |
|
| Flavodoxin nadph reductase | Yes | 0.673 | G‐FRVQ |
|
| Oxygen enhancer | No | 0.419 | G‐RFQQK |
|
| psII 12kD | Yes | 0.864 | A‐FSPA |
|
| GapDH C1 | Yes | 0.606 | A‐(+9)FEEQ |
|
| GapDH C1 | No | 0.323 | A‐FIAPA |
| kbrevis_combined_contig_386 | KASII 3 oxoacyl ACP synthase | Yes | 0.7 | G‐RRVQ or G‐(+9)FKPA |
| kbrevis_combined_contig_387 | KASII 3 oxoacyl ACP synthase | Yes | 0.681 | G‐RRVQ or G‐(+9)FKPA |
| kbrevis_combined_contig_46229 | KASII 3 oxoacyl ACP synthase | Yes | 0.723 | S‐DYGR or G‐RRVK |
| kbrevis_combined_contig_37304 | KASII 3 oxoacyl ACP synthase | Yes | 0.607 | G‐RRVQ or A‐FNPA or G‐(+9)FNPA |
| kbrevis_combined_contig_49622 | KASIII 3 oxoacyl ACPsynthase | Yes | 0.816 | G‐RRIL |
| kbrevis_combined_contig_275 | KASII 3 oxoacyl ACP synthase | Yes | 0.572 | G‐RRLN |
| kbrevis_combined_contig_97889 | KR | Yes | 0.837 | A‐QTPT or A‐(+9)FPHA |
| kbrevis_combined_contig_64411 | KR | Yes | 0.479 | G‐RPMQ |
| kbrevis_combined_contig_86499 | KR | Yes | 0.798 | A‐YGEF or C‐SSRE |
| kbrevis_combined_contig_28593 | ER | Yes | 0.646 | G‐KRVQ |
| kbrevis_combined_contig_47965 | ER | Yes | 0.69 | G‐KRVK |
| kbrevis_combined_contig_44484 | DH | No | 0.148 | – |
| kbrevis_combined_contig_6314 | AT | Yes | 0.659 | G‐RTLQ |
| kbrevis_combined_contig_101182 | AT | No | 0.427 | G‐RRLQ |
Domain structure of multidomain PKS, NRPS/PKS hybrids, and NRPS contigs in the Karenia brevis assembly
| Contig number | Length (nt) | Classification | Cis/Trans AT | No. of modules | Domain structure |
|---|---|---|---|---|---|
| Contig 1930 | 19,025 | NRPS/PKS | trans | 3 | TE‐A‐DH‐PP‐C‐A‐PP‐ |
| Contig 10563 | 18,781 | NRPS/PKS | trans | 3 | TE‐A‐DH‐PP‐C‐A‐PP‐ |
| Contig 5155 | 11,047 | PKS | trans | 2 | PP‐ |
| Contig 78360 | 2,221 | PKS | trans | 3 | PP‐ |
| Contig 10709 | 14,778 | PKS | trans | 3 | PP‐ |
| Contig 15957 | 3,031 | PKS | trans | 1 | KR‐PP‐ |
| Contig 81604 | 7,799 | PKS | trans | 3 |
|
| Contig 28414 | 12,319 | PKS | cis/trans | 2 | PP‐ |
| Contig 54805 | 8,035 | PKS | cis | 1 |
|
| Contig 75628 | 10,131 | PKS | cis | 1 |
|
| Contig 57200 | 3,254 | PKS | ? | 1 | PP‐ |
| Contig 89014 | 1,027 | PKS | ? | 1 | PP‐ |
| Contig 99638 | 3,763 | PKS | ? | 1 | PP‐ |
| Contig 113789 | 2,654 | PKS | ? | 1 | (KS)‐KR |
| Contig 114143 | 2,119 | PKS | ? | 1 | PP‐ |
| Contig 124885 | 3,014 | PKS | ? | 1 |
|
| Contig 134145 | 2,544 | PKS | ? | 1 | PP‐ |
| Contig 34829 | 18,811 | NRPS/PKS | trans | 2 | tpp‐A‐KR‐PP‐ |
| Contig 10632 | 9,386 | NRPS/PKS | cis | 1 | A‐ |
| Contig 3318 | 8,238 | NRPS/PKS | cis | 1 | TE‐A‐PP‐ |
| Contig 77766 | 4,288 | NRPS | ? | 2 | PP‐ |
| Contig 4898 | 2,638 | NRPS | ? | 2 | TE‐A‐DH‐PP‐ |
KS, ketosynthase; KR, ketoreductase; DH, dehydratase; ER, enoyl reductase; KR(ER)K, KR with embedded ER domain; AT, acyl transferase; TE, thioesterase; A, adenylation domain; C, condensation domain; PP, phosphopantetheine binding site of ACP or PCP domains; (MT), possible methyl transferase; tpp, thiamine pyrophosphate binding omain; LbH, left‐handed beta helix; ( ), partial domain; [ER], embedded within KR domain.
Figure 4Details of clade membership of Karenia brevis KS domains in Figure 2. Multidomain PKS sequences were found primarily in three clades. Inset shows locations of clades detailed in this figure. (A) Dinoflagellate multidomain Clade 1 within the protist clade that included apicomplexans. This clade contains KS domains from K. brevis and Gambierdscus polynesiensis trans‐AT PKSs. (B) Dinoflagellate multidomain Clade 2 within the protist clade includes two cis‐AT and one trans‐AT PKS in K. brevis and several trans‐AT G. polynesiensis KSs. (C) Clade 3 includes bacterial and dinoflagellate BurA‐like NRPS/PKS, cis‐AT NRPS/PKS sequences and cis‐AT bacterial Type I PKS. S. atroolivaceus, Streptococcus atroolivaceus.
Figure 5Phylogenetic analysis of ketoacyl reductase (KR) domains from prokaryotic and eukaryotic Type I and II PKS and Type I and II FAS. The alignment included 193 KR domain sequences, including 33 encoding single and multidomain PKS and NRPS/PKSs from Karenia brevis, and 160 from other dinoflagellates and other prokaryotic and eukaryotic taxa. Analysis was conducted using RAxML using the LG model of rate heterogeneity, with 1,000 bootstraps. Bootstrap values ≥50% shown.
Figure 6Details of clade membership of Karenia brevis KR domains summarized in Figure 5. KR domains from multidomain PKS sequences were found in subclades that correspond more closely to module architecture than to contig membership. Sub‐clade 1: KS‐KR‐ACP; Sub‐clade 2: KS‐DH‐KR; Sub‐clade 3 KS‐DH‐KR[ER]KR‐ACP: Sub‐clade 4: burA‐like NRPS/PKS; Sub‐clade 5: trans‐AT modules from NRPS/PKS with architecture KS‐DH‐KR in sequences with highly amplified ACP domains at their c‐terminal ends. G. australis, Gambierdscus australis; G. excentricus, Gambierdiscus excenctricus; H. triquestra, Heterocapse triquestra; A. carterae, Amphidinium carterae.