| Literature DB >> 35628705 |
Julia V Gerasimova1,2, Andreas Beck1,2, Silke Werth1, Philipp Resl1,3.
Abstract
Fungi involved in lichen symbioses produce a large array of secondary metabolites that are often diagnostic in the taxonomic delimitation of lichens. The most common lichen secondary metabolites-polyketides-are synthesized by polyketide synthases, particularly by Type I PKS (TI-PKS). Here, we present a comparative genomic analysis of the TI-PKS gene content of 23 lichen-forming fungal genomes from Ascomycota, including the de novo sequenced genome of Bacidia rubella. Firstly, we identify a putative atranorin cluster in B. rubella. Secondly, we provide an overview of TI-PKS gene diversity in lichen-forming fungi, and the most comprehensive Type I PKS phylogeny of lichen-forming fungi to date, including 624 sequences. We reveal a high number of biosynthetic gene clusters and examine their domain composition in the context of previously characterized genes, confirming that PKS genes outnumber known secondary substances. Moreover, two novel groups of reducing PKSs were identified. Although many PKSs remain without functional assignments, our findings highlight that genes from lichen-forming fungi represent an untapped source of novel polyketide compounds.Entities:
Keywords: Type I PKS; comparative genomics; fungi; lichen; polyketide synthases (PKS); secondary compounds
Year: 2022 PMID: 35628705 PMCID: PMC9146135 DOI: 10.3390/jof8050449
Source DB: PubMed Journal: J Fungi (Basel) ISSN: 2309-608X
Starting material, sequencing strategy and genome quality statistics of Ramalinaceae genomes analyzed in this study.
| Nuclear Genome |
|
|
|
|
|---|---|---|---|---|
|
| bark | bark | rock | twig |
|
| crustose | crustose | fruticose | fruticose |
|
| Axenic culture | Whole thallus | Whole thallus | Axenic culture |
|
| Illumina NovaSeq SP; MinION | PromethION 24 | Illumina MiSeq | Illumina MiSeq |
|
| 17 Gbp; 22 Gbp (roughly) | 32 Gbp | 13.33 Gbp | - |
|
| 500× | 500× | 290× | 2.0× |
|
| 33.52 | 33.11 | 26.19 | 25.53 |
|
| 2,353,056 | 3,530,911 | 898,913 | 694,821 |
|
| 657,358 | 1,379,912 | 148,821 | 25,764 |
|
| 51 | 24 | 176 | 991 |
|
| 1,771,855 | 1,807,239 | 282,362 | 43,940 |
|
| 45.28 | 44.67 | 51.90 | 50.58 |
|
| 8773 | 8451 | 7405 | 6756 |
|
| 8728 | 8400 | 7355 | 6706 |
|
| 2514 | 2343 | 1099 | 1088 |
|
| 45 | 51 | 50 | 50 |
|
| 5860 | 5711 | 6111 | 5467 |
|
| 2345 | 2345 | 2345 | 2345 |
|
| 678 | 612 | 464 | 384 |
Figure 1Two alternative topologies used for comparison in the topology test. Topology 1, recovering Bacidia + Cladoniaceae as monophyletic; Topology 2, recovering Cladoniaceae + Parmeliaceae as monophyletic.
Overview of biosynthetic gene clusters and polyketide synthase families found in the twenty-three studied fungal genomes. The occurrence of the major secondary substance of B. rubella, atranorin, is highlighted. All sequences except the de novo sequenced B. rubella genome are from NCBI, if not otherwise specified in the first column.
| Species | Species Tag | Genome Size (Mb) | No of Clusters | Type I PKS (Total) | Type I NR-PKS | Type I | PR-PKS | Hybrid PKS-NRPS | NRPS/ | Metabolites Reported |
|---|---|---|---|---|---|---|---|---|---|---|
| alesarpred | 39.5 | 36 | 12 | 5 | 7 | 0 | 1 | 2/12 | Usnic acid, alectoronic acid (major), thamnolic, squamatic and barbatic acids | |
| bacgigpred | 33.1 | 31 | 11 | 7 | 4 | 0 | 1 | 5/10 | Homosekikaic acid | |
| bacrubpred | 33.5 | 31 | 10 | 6 | 4 | 0 | 0 | 3/7 |
| |
| clagrapred | 34.4 | 48 | 21 | 8 | 12 | 1 | 0 | 2/11 | 4-O-demethylgrayanic acid, colensoic acid, confumarprotocetraric acid, divaronic acid, fumarprotocetraric acid, grayanic acid, protocetraric acid, stenosporonic acid | |
| clamacpred | 36.8 | 55 | 25 | 15 | 10 | 0 | 2 | 3/9 | Thamnolic acid, barbatic acid, didymic acid, squamatic acid, usnic acid, rhodocladonic acid | |
| clametpred | 36.6 | 51 | 24 | 10 | 14 | 0 | 1 | 1/11 | Usnic acid, didymic acid, squamatic acid, rhodocladonic acid | |
| claranpred | 34.5 | 68 | 34 | 14 | 20 | 0 | 2 | 3/12 | ||
| clauncpred | 30.7 | 61 | 30 | 15 | 14 | 1 | 2 | 1/11 | Usnic acid, squamatic acid | |
| cyaastpred | 28.6 | 35 | 11 | 5 | 5 | 1 | 1 | 3/10 | Astin, skyrin | |
| dibbaepred | 34.2 | 55 | 27 | 11 | 15 | 1 | 0 | 4/14 | Baeomycic acid, squamatic acid | |
| endpuspred | 36.2 | 32 | 14 | 4 | 9 | 1 | 1 | 3/4 | Not reported | |
| eveprupred | 40.2 | 86 | 36 | 16 | 19 | 1 | 4 | 3/20 | Usnic acid, | |
| grascrpred | 34.8 | 54 | 21 | 6 | 15 | 0 | 1 | 6/14 | Not reported | |
| gyaflapred | 34.4 | 41 | 16 | 8 | 8 | 0 | 1 | 3/8 | Parietin, emodin, fallacinal, fragilin | |
| lashispred | 39.7 | 28 | 15 | 8 | 6 | 1 | 1 | 0/3 | Gyrophoric acid, lecanoric acid, umbilicaric acid, skyrin | |
| laspuspred | 32.9 | 26 | 17 | 9 | 7 | 1 | 0 | 0/4 | Gyrophoric acid, lecanoric acid, hiascinic acid, skyrin | |
| letcolpred | 52.2 | 43 | 14 | 7 | 7 | 0 | 2 | 3/6 | Vulpinic acid and | |
| letluppred | 49.2 | 48 | 18 | 11 | 7 | 0 | 2 | 3/11 | Vulpinic acid and | |
| psefurpred | 37.8 | 48 | 26 | 8 | 18 | 0 | 3 | 3/11 | ||
| ramintpred | 26.2 | 54 | 31 | 13 | 17 | 1 | 3 | 5/9 | Usnic acid (major); medulla with homosekikaic acid (major), sekikaic acid (major), 4’-O-methylnorhomosekikaic acid (minor) | |
| ramperpred | 25.5 | 43 | 17 | 9 | 7 | 1 | 1 | 4/10 | Usnic acid, homosekikaic acid (major), sekikaic acid (major), and 4’-O-methylnorhomosekikaic acid and 4’-O-methylnorsekikaic acid (minor) | |
| usnhakpred | 40.4 | 70 | 23 | 10 | 12 | 1 | 3 | 5/22 | Usnic and norstictic acids | |
| xanelepred | 44.2 | 63 | 25 | 7 | 18 | 0 | 1 | 7/17 | Parietin (major), fallacinal, emodin, teloschistin and parietinic acid |
Figure 2Organization of putative atranorin BGCs in Bacidia rubella and Cladonia rangiferina. The genes PKS23 (atr1), Cytochrome 450 (atr2), O-methyltransferase (atr3), and transporter gene (atr4) are present on the left side of the gene arrow plot. The BLAST percent similarity between two genes is 73% for atr1, 74% for atr2, 78% for atr3, and 91.5% for atr4. APH, phosphotransferase enzyme family (PF01636); DUF, domain of unknown function; FMO-like, flavin-containing monooxygenase family (PF00743); HCP60, HSP60 chaperone family (PF00118); HP, hypothetical protein; MFS, Major Facilitator Superfamily, transporter gene (PF07690); R-PKS, reducing PKS (in this case from R-VIII group); OrsD, Orsellinic acid/F9775 biosynthesis cluster protein D family (PF12013); TFIIH, Ssl1-like protein, subunit of the transcription factor II H complex (PF04056); Sugar_tr, Sugar transporter family (PF00083); WD-40, WD40 repeat (PF00400).
Figure 3Maximum-likelihood phylogeny of Type I PKS genes inferred by IQ-TREE using Type II Bacterial PKSs as outgroup. Clades containing lichen-forming fungi are highlighted and the corresponding Orthogroups (1 to 9, respectively) are indicated by different colors. PR-PKS corresponds to Orthogroup 1 (green); the nine NR-PKS groups (NR-I to NR-IX) belong to Orthogroups 2, 3 and 4 (dark blue, pink and light green, respectively); the ten R-PKS groups (R-I to R-X) belong to Orthogroups 5, 6, 7, 8, and 9 (light blue, red, peach, orange, and lilac, respectively). Characteristic secondary substances for the groupings are given in the corresponding colored boxes. Groups not containing lichen-forming fungal genes are indicated by grey boxes. For each group, the domain arrangement of PKS is highlighted with distinct colors: SAT—starter unit-ACP transacylase; KS—ketoacyl synthase; AT—acyltransferase; ACP—acyl carrier protein; KR—ketoreductase; DH—dehydratase; ER—enoyl reductase; CMeT—methyltransferase; TE—thioesterase; HTH—helix-to-helix; ADH—adhydrolase; NAD—NAD-binding; Carn—Choline/Carnitine O-acyltransferase domain.
Parameters for the topology test. Topology 1, recovering Bacidia + Cladoniaceae as monophyletic; Topology 2, recovering Cladoniaceae + Parmeliaceae as monophyletic. Value for the Kishino–Hasegawa test (Kishino and Hasegawa, 1989), Shimodaira and Hasegawa, 1989), expected likelihood weights (Strimmer and Rambaut, 2002) and approximately unbiased (AU) test (Shimodaira, 2002) are given.
| Tree | logL | Deltal | p-KH | p-SH | c-ELW | p-AU |
|---|---|---|---|---|---|---|
|
| −28,054.52811 | 2.062 × 10−8 | 0.495+ | 0.814+ | 0.496+ | 0.566+ |
|
| −28,108.54142 | 54.013 | 0.0126− | 0.0126− | 0.00853− | 0.0081− |
deltaL: logL difference from the maximal logl in the set. p-KH: p-value of one-sided Kishino–Hasegawa test (1989). p-SH: p-value of Shimodaira–Hasegawa test (2000). c-ELW: Expected Likelihood Weight (Strimmer and Rambaut 2002). p-AU: p-value of approximately unbiased (AU) test (Shimodaira, 2002).
Genome basics for twenty-three studied fungal genomes.
| Assembly | No. of Contigs | Largest Contig (bp) | Total Length (Mb) | GC (%) | N50 (bp) | N75 (bp) | BUSCO Completeness (%) | Genes Predicted | Proteins Predicted |
|---|---|---|---|---|---|---|---|---|---|
|
| 915 | 400,628 | 39.9 | 40.22 | 93,085 | 44,808 | 75.8 | 8440 | 8406 |
|
| 24 | 3,530,911 | 33.1 | 44.67 |
|
|
| 8451 | 8400 |
|
| 246 | 2,353,056 | 33.7 | 45.25 |
|
|
| 8773 | 8728 |
|
| 414 | 958,967 | 34.6 | 44.44 | 243,412 | 104,892 |
| 9215 | 9168 |
|
| 240 | 2,265,542 | 37.1 | 44.68 |
|
|
| 8183 | 8135 |
|
| 30 | 2,400,105 | 36.6 | 44.91 |
|
|
| 8357 | 8313 |
|
| 1008 | 751,829 | 35.6 | 45.46 | 273,041 | 142,056 |
| 9264 | 9218 |
|
| 2124 | 143,175 | 32.8 | 46.38 | 34,871 | 18,367 |
| 8706 | 8645 |
|
| 37 | 3,440,352 | 28.6 | 53.29 |
|
|
| 7946 | 7851 |
|
| 1369 | 352,342 | 35.2 | 47.02 | 70,496 | 37,098 |
| 9799 | 9756 |
|
| 908 | 803,103 | 37.1 | 46.00 | 178,225 | 78,254 |
| 8446 | 8392 |
|
| 277 | 732,541 | 40.3 | 48.97 | 264,454 | 154,311 |
| 11,072 | 10,979 |
|
| 1453 | 383,549 | 36.2 | 46.66 | 78,723 | 38,837 | 88.6 | 9808 | 9744 |
|
| 36 | 2,816,824 | 34.4 | 41.89 |
|
|
| 8062 | 8008 |
|
| 1619 | 615,827 | 41.2 | 51.28 | 145,035 | 51,438 |
| 8218 | 8162 |
|
| 43 | 3,307,933 | 32.9 | 51.67 |
|
|
| 6973 | 6936 |
|
| 161 | 2,188,364 | 52.2 | 39.57 | 666,803 | 377,091 |
| 9966 | 9890 |
|
| 31 | 3,031,725 | 49.2 | 38.73 |
|
|
| 9266 | 9206 |
|
| 46 | 3,053,396 | 37.7 | 47.86 |
| 859,355 |
| 9148 | 9082 |
|
| 196 | 898,913 | 26.2 | 51.89 | 273,318 | 142,876 |
| 7405 | 7355 |
|
| 1657 | 694,821 | 26.9 | 50.75 | 40,431 | 15,829 | 90.0 | 6756 | 6706 |
|
| 879 | 624,317 | 41.1 | 45.57 | 166,123 | 71,534 |
| 10,700 | 10,641 |
|
| 261 | 990,773 | 44.3 | 40.70 | 385,707 | 188,494 |
| 9033 | 8911 |