| Literature DB >> 34523973 |
Andrei Stecca Steindorff1, Luana Assis Serra1,2, Eduardo Fernandes Formighieri1, Fabrícia Paula de Faria3, Marcio José Poças-Fonseca4, João Ricardo Moreira de Almeida1,2.
Abstract
Humicola grisea var. thermoidea is a thermophilic ascomycete and important enzyme producer that has an efficient enzymatic system with a broad spectrum of thermostable carbohydrate-active (CAZy) enzymes. These enzymes can be employed in lignocellulose biomass deconstruction and other industrial applications. In this work, the genome of H. grisea var. thermoidea was sequenced. The acquired sequence reads were assembled into a total length of 28.75 Mbp. Genome features correlate with what was expected for thermophilic Sordariomycetes. The transcriptomic data showed that sugarcane bagasse significantly upregulated genes related to primary metabolism and polysaccharide deconstruction, especially hydrolases, at both pH 5 and pH 8. However, a number of exclusive and shared genes between the pH values were found, especially at pH 8. H. grisea expresses an average of 211 CAZy enzymes (CAZymes), which are capable of acting in different substrates. The top upregulated genes at both pH values represent CAZyme-encoding genes from different classes, including acetylxylan esterase, endo-1,4-β-mannosidase, exoglucanase, and endoglucanase genes. For the first time, the arsenal that the thermophilic fungus H. grisea var. thermoidea possesses to degrade the lignocellulosic biomass is shown. Carbon source and pH are of pivotal importance in regulating gene expression in this organism, and alkaline pH is a key regulatory factor for sugarcane bagasse hydrolysis. This work paves the way for the genetic manipulation and robust biotechnological applications of this fungus. IMPORTANCE Most studies regarding the use of fungi as enzyme producers for biomass deconstruction have focused on mesophile species, whereas the potential of thermophiles has been evaluated less. This study revealed, through genome and transcriptome analyses, the genetic repertoire of the biotechnological relevant thermophile fungus Humicola grisea. Comparative genomics helped us to further understand the biology and biotechnological potential of H. grisea. The results demonstrate that this fungus possesses an arsenal of carbohydrate-active (CAZy) enzymes to degrade the lignocellulosic biomass. Indeed, it expresses more than 200 genes encoding CAZy enzymes when cultivated in sugarcane bagasse. Carbon source and pH are key factors for regulating the gene expression in this organism. This work shows, for the first time, the great potential of H. grisea as an enzyme producer and a gene donor for biotechnological applications and provides the base for the genetic manipulation and robust biotechnological applications of this fungus.Entities:
Keywords: CAZy enzymes; Humicola grisea; genome sequencing; pH regulation; sugarcane bagasse; transcriptome
Mesh:
Substances:
Year: 2021 PMID: 34523973 PMCID: PMC8557918 DOI: 10.1128/Spectrum.01088-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
Genome features of H. grisea var. thermoidea and genome comparison with other filamentous fungi
| Feature |
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|
| Genome size | 28.75 Mb | 36.91 Mb | 34.9 Mb | 39.9 Mb | 34.1 Mb | 31.67 Mb | 23.3 Mb |
| Scaffolds | 33 | 6 | 37 | 21 | 89 | 91 | 6 |
|
| 1.805 Mb | 9.477 Mb | 4.721 Mb | 6.00 Mb | 2.44 Mb | 4.00 Mb | |
| GC content | 55.9% | 54.8% | 55.4% | 48.6% | 51.0% | 50.3% | 52.2% |
| Genes | 8,736 | 9,813 | 11,232 | 10,812 | 9,143 | 9,541 | 5,105 |
| Secreted proteins | 781 | 789 | 862 | 592 | 704 | ||
| Completeness | 98.20% | 97% | 99% | 95% | 98% | ||
| References | This work |
|
|
|
|
|
|
Data from Lum and Min, 2011 (62).
FIG 1Genome features of 13 Ascomycetes genomes. (A) The RAxML maximum-likelihood phylogenetic tree using 200 single-copy genes shows the three orders Sordariales, Hypocreales, and Eurotiales. All nodes have maximum support value. Thermophile genomes are shown in red. (B) Genome size and percentage of GC of each genome. (C) Gene count and percentage of genes with the presence of at least one Pfam domain. (D) Distribution of CAZyme families among the genomes. CAZymes: auxiliary activities (AA), carbohydrate-binding module (CBM), carbohydrate esterases (CE), glycoside hydrolases (GH), glycosyl transferases (GT), and polysaccharide lyases (PL).
Number of genes related to degradation of plant-based polysaccharides
| Genome | Cellulose | Xylan | Galactomanan | Xyloglucan | Pectin | Starch | Inulin |
|---|---|---|---|---|---|---|---|
|
| 4 | 8 | 1 | 4 | 5 | 12 | 1 |
|
| 12 | 9 | 11 | 5 | 28 | 28 | 6 |
|
| 18 | 15 | 11 | 8 | 50 | 24 | 5 |
|
| 14 | 10 | 9 | 8 | 48 | 19 | 5 |
|
| 21 | 29 | 6 | 9 | 39 | 18 | 6 |
|
| 3 | 16 | 3 | 5 | 3 | 12 | 1 |
|
| 9 | 10 | 11 | 7 | 6 | 10 | 0 |
|
| 24 | 15 | 3 | 3 | 15 | 15 | 2 |
|
| 49 | 54 | 5 | 4 | 12 | 14 | 0 |
|
| 39 | 36 | 2 | 4 | 7 | 12 | 0 |
|
| 33 | 28 | 3 | 5 | 20 | 14 | 1 |
|
| 34 | 31 | 4 | 4 | 16 | 14 | 0 |
|
| 59 | 47 | 5 | 7 | 22 | 14 | 1 |
Gene numbers related to degradation of different plant-based polysaccharides detected in the selected genomes according to De Vries et al., 2017 (63). Cellulose: GH5_4, GH5_5, GH5_22, GH6, GH7, GH45; xylan: GH10, GH11, GH62, GH67, GH115, CE1, CE15; galactomannan: GH5_7, GH26, GH27, GH36, GH134; xyloglucan: GH12, GH29, GH74, GH95; pectin: GH28, GH53, GH78, GH88, GH93, GH105, PL1, PL3, PL4, PL9, PL11, PL22, CE8, CE12; starch: GH13_1, GH13_5, GH13_40, GH15, GH31, GH133; inulin: GH32.
FIG 2Distribution of gene expression of H. grisea grown in sugarcane bagasse normalized with growth on glucose as the sole carbon source. The two pH values, pH 5 (A) and pH 8 (B), exhibited differentially expressed genes (P value adjusted of <0.05) showed in red when normalized with glucose.
FIG 3Differentially expressed genes at pH 5, pH 8, or both values. (A) Scatterplot of differentially expressed genes at pH 5 or pH 8. The colors denote genes expressed at pH 5 (red), pH 8 (green), and both conditions (blue). The Venn diagram shows the number of exclusive and shared genes between the pH values. (B) Fisher enrichment analysis of GO terms on each condition compared to the whole genome as background. No category exclusively enriched at pH 5 was found.
Expression of top 10 genes differentially expressed and upregulated at pH 5 and pH 8
| Putative genes (blast best-hit) | log2FC pH 5 | log2FC pH 8 | CAZy annotation |
|---|---|---|---|
| Top 10 genes upregulated at pH 5 | |||
| | 7.39 | 9.22 | CE5-CBM1 |
| | 6.97 | 10.27 | GH26-CBM35 |
| cel1 cellulose-growth-specific protein | 6.57 | 6.63 | AA9 |
| | 6.40 | 5.46 | GH3 |
| | 6.29 | 7.32 | CBM1-AA9 |
| Pectin lyase-like protein | 6.15 | 6.28 | |
| | 6.10 | 8.32 | GH7 |
| | 6.06 | 5.58 | CE8 |
| Hypothetical protein | 5.97 | 1.19 | |
| FSH1 serine hydrolase | 5.90 | 3.46 | |
| Top 10 genes upregulated at pH 8 | |||
| | 6.97 | 10.27 | GH26-CBM35 |
| | 7.39 | 9.22 | CE5-CBM1 |
| | 5.16 | 9.11 | AA9 |
| CUTI cutinase | 4.65 | 8.55 | CE5 |
| Hypothetical protein | 2.64 | 8.35 | AA9 |
| | 6.10 | 8.32 | GH7 |
| LIP3 secreted lipase | 5.37 | 8.19 | CE10 |
| | 4.73 | 8.14 | GH11 |
| | 2.78 | 8.08 | GH53 |
| | 4.93 | 8.05 | GH5 |
FIG 4Expression of putative genes encoding glycoside hydrolases (GH) separated by the likely substrate. The asterisk (*) means the gene is potentially regulated by the PacC regulator.