| Literature DB >> 31572496 |
Stephen J Mondo1,2, Diego Javier Jiménez3, Ronald E Hector4, Anna Lipzen1, Mi Yan1, Kurt LaButti1, Kerrie Barry1, Jan Dirk van Elsas5, Igor V Grigoriev1,6, Nancy N Nichols4.
Abstract
BACKGROUND: Particular species of the genus Coniochaeta (Sordariomycetes) exhibit great potential for bioabatement of furanic compounds and have been identified as an underexplored source of novel lignocellulolytic enzymes, especially Coniochaeta ligniaria. However, there is a lack of information about their genomic features and metabolic capabilities. Here, we report the first in-depth genome/transcriptome survey of a Coniochaeta species (strain 2T2.1).Entities:
Keywords: Allopolyploidization; Coniochaeta; Fungal genomics; Lignocellulolytic enzymes; Lytic polysaccharide monoxygenases; Wheat straw
Year: 2019 PMID: 31572496 PMCID: PMC6757388 DOI: 10.1186/s13068-019-1569-6
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1Source of isolation of Coniochaeta sp. 2T2.1 [26] and growth on Potato Dextrose Agar (PDA) (left) and in liquid medium using wheat straw as the sole carbon source (micrograph on the right)
Fig. 2a Self-synteny dot plots showing (left) first 30 scaffolds of Coniochaeta sp. 2T2.1 and (right) zoom in on two example syntenic regions (scaffold_7:scaffold_11 and scaffold_4:scaffold_14). b Length (x-axis) and percent identity at the nucleic acid level (y-axis) between duplicated regions in Coniochaeta sp. 2T2.1 (red) and representative haploid (C. lignaria, grey), dikaryotic (P. striiformis f. sp. tritici, blue) [36] and diploid (R. globosum, purple) fungi [37]. Each dot represents a single duplicated region
Fig. 3Unique pattern of sequence divergence between duplicates is observed in Coniochaeta sp. 2T2.1 (red) compared to haploid (black) and diploid/dikaryotic (blue) fungi. For each genome, a self-BLASTp was conducted to identify duplicates by reciprocal best blast hits (BBHs; min e value 1e−5). The fraction of bidirectional best blast hits (BBHs) at varying identity levels (steps = 0.25%) are then plotted (y-axis, grey = 0) for each lineage (x-axis). Only published PacBio genomes and close relatives of 2T2.1 were included. Despite being dispersed across most of the fungal kingdom, a consistent pattern is observed based on ploidy regardless of phylogenetic neighborhood
Fig. 4Genome-wide d/d distribution across homeologs in 2T2.1 shows the same distribution as orthologues across species, indicating that the source of this duplication was likely a hybridization event (allopolyploidization). Histograms (green) show d/d distribution across duplicated single-copy genes from Coniochaeta sp. 2T2.1 (top left) and single-copy orthologues across: Coniochaeta sp. PMI 546 and C. lignaria CBS111746 (top right), Coniochaeta sp. PMI546 and C. lignaria NRRL30616 (middle right), and Lecythophora sp. AK0013 and Coniochaeta sp. PMI546 (bottom right). Quantile–Quantile plots were then generated to compare d/d distribution in 2T2.1 homeologs with orthologues between species, revealing that distributions are highly correlated (R2 ≥ 0.945)
Fig. 5Analysis of orthologous genes. a Phylogenetic tree based on 2522 conserved genes (across all genomes) constructed using RAxML. Topology is fully consistent with results from FastTree (Additional file 3: Fig. S3). Fully supported branches are thickened. b Number of clusters of orthologous genes within Lecythophora/Coniochaeta genomes (strains 2T2.1, CBS111746, NRRL30616, PMI546 and AK0013) and T. reesei genome. Unique proteins in Coniochaeta sp. 2T2.1 (2199) were annotated using the CAZy database
Comparison of number of CAZymes encoding genes that were differentially abundant (e.g., expansions/contractions) between Coniochaeta sp. 2T2.1 (after remove the duplicated content) against other fungal genomes
| CAZy familya | # of genes in 2T2.1b | Mean # of genes in | Mean # of genes in other fungal genomes (SD)d | |
|---|---|---|---|---|
| CBM24 | 22 | X | 3.7 (3.4) | |
| GH43 | H | 22 | 17.5 (1.5) | X |
| GH16 | H | 20 | 17.0 (0.7) | X |
| AA3_2 | 19 | 10 (0.7) | X | |
| GH76 | 15 | 10.25 (0.4) | 7.0 (3.7) | |
| CE1 | H | 13 | 8.5 (0.8) | 4.5 (3.7) |
| GH47 | 13 | X | 7.1 (1.9) | |
| GH31 | S | 12 | X | 6 (1.8) |
| GH71 | S | 11 | X | 4.2 (1.5) |
| AA8 | 10 | X | 3.6 (2.2) | |
| GH55 | 10 | X | 4.7 (2.6) | |
| AA3 | 8 | 4.25 (0.4) | 2.1 (2.4) | |
| CE3 | H | 8 | 5 (1.2) | X |
| GH11 | H | 8 | 5.75 (0.4) | 2.9 (2.3) |
| AA4 | L | 7 | 3.75 (0.4) | 1.2 (1.5) |
| GH32 | 7 | 4.5 (0.5) | X | |
| GH78 | 7 | 3.75 (0.8) | X | |
| GH7 | H | 6 | 8.5 (0.5) | X |
| AA1_2 | L | 5 | 4 (0) | 1.5 (1) |
| AA12 | 5 | X | 1.8 (1.3) | |
| AA3_3 | 5 | 3 (0.7) | 1.6 (0.9) | |
| GH13_40 | S | 5 | X | 2.4 (1.2) |
| GH45 | H | 5 | X | 1.2 (0.8) |
| GH5_5 | 5 | X | 2 (1) | |
| CE15 | H | 4 | X | 1 (1) |
| GH128 | H | 4 | 5.5 (0.5) | X |
| GH88 | 4 | 1.5 (0.5) | 0.8 (1.1) | |
| AA7_dist | 3 | 1 (0) | X | |
| CBM35 | H | 3 | 4 (0) | X |
| CBM52 | H | 3 | X | 0.4 (0.6) |
| GH127 | H | 3 | X | 0.4 (0.6) |
| GH39 | H | 3 | 0 | 0.5 (1.2) |
| GH5_7 | 3 | 4 (0) | X | |
| AA2_dist | 2 | X | 0.4 (0.4) | |
| CE16 | H | 2 | 4 (0) | X |
| GH1 | H | 2 | 3 (0) | X |
| GH114 | 2 | 1 (0) | X | |
| GH13 | S | 2 | 1 (0) | 0.1 (0.3) |
| GH130 | 2 | 0.5 (0.5) | 0.1 (0.3) | |
| PL4_1 | 2 | X | 0.7 (0.6) |
SD standard deviation, X represented families from 2T2.1 where we not found two standard deviations above or below of the mean counts, H predicted (hemi)cellulose-degrading enzymes, L predicted lignin-degrading enzymes, S predicted starch-degrading enzymes
aOnly AA, GH, CBM, CE, and PL
bGenes with more than two copies after remove duplicated content
cFungal genomes form C. ligniaria CBS111746, C. ligniaria NRRL30616, Coniochaeta sp. PMI546, and Lecythophora sp. AK0013
dFungal genomes Trire2, Neucr_trp3_1, Fusoxy1, Spoth2, Podans1, Aspacri1, Penex1, Acrchr1, Phchr2, and Triol1 (see Additional file 1: Table S1)
Significantly and highly upregulated CAZymes (padj-value ≤ 0.05 and Log2 FC ≥ 10) from Coniochaeta sp. 2T2.1 on wheat straw (WS) and pretreated wheat straw (PTWS) compared with glucose (Glu) cultures
| JGI-IDs | CAZyme domains | Type of enzymeb | FPKM_WS/FPKM_Glu | FPKM_PTWS/FPKM_Glu | TPMd in WS | TPMd in PTWS | TPMd in Glu |
|---|---|---|---|---|---|---|---|
| 1061794 | GH51 | α- | 71,701 | 799 | 1068.13 | 11.79 | 0.01 |
| 961618 | GH62 | α- | 35,028 | 262 | 2435.17 | 18.27 | 0.07 |
| 1069155 | CE1-CBM1 | Feruloyl esterase | 23,908 | 1390 | 1424.70 | 82.70 | 0.05 |
| 1273638 | CE5-CBM1 | Acetyl xylan esterase | 16,257 | 1807 | 1856.79 | 207.17 | 0.11 |
| 1273701a | CE5-CBM1 | Acetyl xylan esterase | 51,046 | 10,010 | 1013.94 | 199.35 | 0.02 |
| 1196733a | GH11 | 32,085 | 7111 | 3186.54 | 705.95 | 0.10 | |
| 1172553 | GH11-CBM1 | 28,718 | 2526 | 1853.9 | 163.06 | 0.06 | |
| 1242067 | GH62 | α- | 638.69c | 2.32c | 951.47 | 3.46 | 0 |
| 955194 | GH10-CBM1 | 19,212 | 1272 | 572.4 | 38.06 | 0.02 | |
| 1005138 | GH10-CBM1 | 14,003 | 1698 | 1390.73 | 168.76 | 0.09 | |
| 1231977 | GH11-CBM1 | 9612 | 1190 | 1241.06 | 153.69 | 0.12 | |
| 344640 | GH51 | α- | 5111 | 75 | 812.27 | 11.94 | 0.15 |
| 1206532 | GH62-CBM1 | α- | 13,418 | 583 | 1066.11 | 46.29 | 0.07 |
| 539071 | CE1-CBM1 | Feruloyl esterase | 2770 | 65 | 935.49 | 22.03 | 0.33 |
| 1096633a | GH11 | 10,340 | 8725 | 616.15 | 520.15 | 0.06 | |
| 953908 | GH12 | Endoglucanase | 9423 | 48 | 327.55 | 1.69 | 0.03 |
| 970254 | GH12 | Endoglucanase | 3973 | 135 | 1282.40 | 43.43 | 0.31 |
| 382788 | GH93 | 2440 | 19 | 1175.6 | 9.48 | 0.47 | |
| 980755 | AA9 | Lytic polysaccharide monoxygenases | 102.06c | 33.52c | 152.04 | 50.20 | 0 |
| 969860 | CBM1-CE15 | 4- | 3212 | 519 | 143.54 | 23.22 | 0.04 |
| 645961 | GH10 | 2013 | 160 | 549.92 | 43.82 | 0.27 | |
| 1207935 | GH127 | β- | 2109 | 62 | 115.2 | 3.42 | 0.05 |
| 1265978 | GH27-CBM35 | α-Galactosidase | 32.82c | 0.85c | 48.89 | 1.26 | 0 |
| 1186025 | GH30_5 | 60.49c | 8.71c | 90.11 | 12.94 | 0 | |
| 1273538 | GH74-CBM1 | Xylo-endoglucanase | 25.48c | 8.43 | 37.97 | 12.60 | 0 |
| 646743 | GH7-CBM1 | Reducing end-acting cellobiohydrolase | 2035 | 1322 | 1779.11 | 1154.39 | 0.85 |
| 1054649a | GH7-CBM1 | Reducing end-acting cellobiohydrolase | 2332 | 2006 | 787.48 | 678.04 | 0.33 |
aTranscripts that were significantly and highly upregulated in PTWS compared with Glu. Transcripts in this table were significantly and highly upregulated in WS compared with Glu
bPutative activity deduced from top-ten hits in BLASTp search against NCBI-nr database
cTranscripts that showed values equal to zero in glucose. Here, we showed the FPMK average values in WS or PTWS
dAverage of transcripts per kilobase million
Fig. 6a Expression profile (normalized TPM values) of AA9 genes from Coniochaeta sp. 2T2.1 after growth (0.1 OD, 600 nm) on wheat straw (WS), and dilute-acid-pretreated wheat straw solids (PTWS). Asterisks represent putative secreted enzymes that were significantly upregulated (padj-value ≤ 0.05 and Log2 FC ≥ 8) in WS and PTWS compared with glucose (Glu) cultures; s, d and t letters represent single, duplicate and triplicate genes within the 2T2.1 genome. b Structural 3D modeling of five selected AA9 proteins that were significantly and highly upregulated (padj-value ≤ 0.05 and Log2 FC ≥ 8) on wheat straw (WS) compared with glucose (Glu) cultures. Phyre2 [41] and EZmol [42] web portals were used to predict the putative 3D structural conformation. The molecular size of these proteins (JGI-IDs 1170506, 980755, 1220247, 1175568, and 1230134) ranged between 22 and 29 kDa with different isoelectric points (from 4.56 to 7.51). We identified predicted metal-binding and histidine brace sites based on the structural position and comparison with the best protein for modeling (Additional file 7: Table S6). In the five AA9 proteins, these sites were identified and contain generally two to three histidines (green), one to two tyrosines (red) and one residue of glutamine (blue)
Log2 FC and normalized expression values (FPKM) of the significantly and highly expressed proteins from CAZy family AA9 (LPMOs) in WS and PTWS compared with Glu
| JGI-IDs | Log2 FC (WS vs Glu) | Log2 FC (PTWS vs Glu) | FPKM_WS/FPKM_Glu | FPKM_PTWS/FPKM_Glu |
|---|---|---|---|---|
| 1170506a | 9.9 | 9.4 | 4980 | 3189 |
| 1175568a | 9.9 | 7.7 | 4670 | 1019 |
| 1179874 | 8.3 | 6.4 | 538 | 135 |
| 1216758 | 8.2 | 9 | 23.47b | 39.48 f |
| 1220247ac | 9 | 7.1 | 786 | 199 |
| 1230134 | 8.5 | 5.6 | 966 | 116 |
| 1231383 | 7.1 | 8.2 | 7.43b | 16.44 f |
| 1232676a | 9.1 | 5.6 | 1046 | 87 |
| 1245155c | 8 | 5.7 | 360 | 75 |
| 510059c | 8.5 | 6.8 | 535 | 164 |
| 980755a | 10 | 8.4 | 102.06b | 33.52 f |
| 980894c | 8.5 | 6.6 | 577 | 148 |
aAre the top-five upregulated proteins in WS and PTWS compared with Glu
bProteins that showed values equal to zero in glucose. Here, we showed the FPMK average values in WS or PTWS
cLPMOs with CBM1 domains