| Literature DB >> 22750088 |
Abstract
Recent progress and improvement in "-omics" technologies has made it possible to study the physiology of organisms by integrated and genome-wide approaches. This bears the advantage that the global response, rather than isolated pathways and circuits within an organism, can be investigated ("systems biology"). The sequencing of the genome of Trichoderma reesei (teleomorph Hypocrea jecorina), a fungus that serves as a major producer of biomass-degrading enzymes for the use of renewable lignocellulosic material towards production of biofuels and biorefineries, has offered the possibility to study this organism and its enzyme production on a genome wide scale. In this review, I will highlight the use of genomics, transcriptomics, proteomics and metabolomics towards an improved and novel understanding of the biochemical processes that involve in the massive overproduction of secreted proteins.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22750088 PMCID: PMC3568919 DOI: 10.1016/j.jbiotec.2012.05.020
Source DB: PubMed Journal: J Biotechnol ISSN: 0168-1656 Impact factor: 3.307
Fig. 1CAZys, indicated by GH-number, that are amplified in Trichoderma. Numbers on the vertical axes of the plots indicate number of gene paralogues. Abbreviations: J, T. reesei; V, T. virens; T, T. atroviride; A, Aspergillus nidulans; N, Aspergillus niger; G, Gibberella zeae; M, Magnaporthe grisea; P, Podospora anserina; B, Botryodiplodia fuckeliana; S, Sclerotium sclerotiorum; E, Neurospora crassa; O, Rhizopus nigricans; D, Batrachochytrium dendrobatidis. Data were compiled from Martinez et al. (2008), Eastwood et al. (2011), Goodwin et al. (2011), and Kubicek et al. (2011, 2012).
Inventory of genes encoding cellulolytic and xylanolytic in Trichoderma spp. and other fungi.a
| Cellulases | β-Glucosidases | PMO | Xylanases | β-Xylosidase | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GH6 | GH7 | GH12 | GH45 | GH1 | GH3 | GH61 | GH10 | GH11 | GH30 | GH54 | ||
| 4 | 5 | 4 | 3 | NI | 16 | 30 | 7 | 7 | NI | 1 | 77 | |
| 1 | 3 | 6 | 1 | 5 | 38 | 12 | 3 | 3 | 0 | 1 | 73 | |
| 4 | 6 | 2 | 2 | 1 | 11 | 33 | 8 | 6 | 0 | 0 | 73 | |
| 3 | 6 | 3 | 1 | 2 | 19 | 17 | 5 | 5 | 1 | 1 | 63 | |
| 1 | 2 | 4 | 1 | 3 | 22 | 15 | 5 | 3 | 0 | 1 | 57 | |
| 2 | 3 | 1 | 1 | 3 | 20 | 9 | 3 | 2 | 0 | 1 | 45 | |
| 2 | 2 | 4 | 0 | 3 | 17 | 7 | 1 | 4 | 1 | 1 | 42 | |
| 3 | 5 | 1 | 1 | 1 | 9 | 14 | 4 | 2 | 1 | 1 | 42 | |
| 1 | 3 | 1 | 2 | 5 | 14 | 10 | 2 | 2 | NI | 1 | 41 | |
| 1 | 2 | 4 | 2 | 2 | 17 | 3 | 1 | 4 | 2 | 2 | 40 | |
| 1 | 2 | 3 | 1 | 4 | 14 | 3 | 1 | 4 | 2 | 2 | 37 | |
| 1 | 2 | 3 | 0 | 3 | 17 | 4 | 3 | 1 | 1 | 1 | 36 | |
| 1 | 3 | 2 | 2 | 1 | 12 | 9 | 2 | 3 | NI | 1 | 36 | |
| 1 | 2 | 2 | 1 | 2 | 13 | 3 | 2 | 4 | 2 | 2 | 34 | |
| 0 | 1 | 1 | 1 | NI | NI | 2 | 2 | 1 | NI | 1 | 9 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | NI | 0 | 2 | |
Data taken from Martinez et al. (2008), Kubicek et al. (2011, 2012), Eastwood et al. (2011) and Goodwin et al. (2011); GH numbers specify the CAZy glycosyl hydrolase classification. Note that GH5 is not listed, because it includes – in addition to endo-β-1,4-glucanases – also many other β-glycanases. GH1 and GH3 are given, though, although it must be born in mind that not all of them are indeed β-glucosidases. NI, no information available.
PMO, polysaccharide monooxygenases; former termed GH61 “endo-β-1,4-glucanases”.
Analysis of potential clustering of CAZyme encoding genes in the T. reesei genome.a
| Scaffold | Total scaffold length (Mbp) | Position of cluster (Mbp) | Distance | Clustered genes | Cellulose | Lactose | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2.75 | 2.3–2.47 | 0.45 | 4 | 3.30E−03 | 4 | <0.05 | 3 | <0.05 |
| 2.4–2.5 | 0.25 | 3 | 3.10E−03 | 3 | <0.05 | ||||
| 2 | 2 | 1.56–1.69 | 0.44 | 3 | 1.90E−03 | ||||
| 3 | 1.91 | 1.74–1.86 | 0.17 | 3 | 2.50E−03 | ||||
| 5 | 1.72 | 0–0.27 | 0.27 | 7 | 1.20E−03 | ||||
| 0.95–1.08 | 0.55 | 3 | 3.30E−03 | ||||||
| 6 | 1.45 | 0.01–0.12 | 0.14 | 3 | 1.30E−03 | ||||
| 0.04–0.21 | 0.12 | 4 | 2.30E−03 | 3 | <0.05 | ||||
| 0.89–1.06 | 0.56 | 3 | 1.50E−03 | ||||||
| 7 | 1.42 | 1.21–1.35 | 0.21 | 3 | 3.00E−03 | ||||
| 15 | 0.84 | 0–0.14 | 0.14 | 3 | 2.30E−03 | ||||
| 30 | 0.24 | 0–0.21 | 0.03 | 7 | 2.50E−03 | 3 | <0.05 | 3 | <0.05 |
| 33 | 0.2 | 0.06–0.2 | 0.06 | 3 | 1.20E−03 | ||||
| Total gene number | 49 | 13 | 6 | ||||||
Clustering of genes was tested by REEF, a software that applies a statistic test based on the hypergeometric distribution by using a sliding window approach and adopting the false discovery rate for controlling multiplicity (http://telethon.bio.unipd.it/bioinfo/reef/). To use it for T. reesei, the scaffolds were treated as chromosomes. A window width of 100 kb, a shift of 10 kb and an FDR-corrected p-value of <0.05 were used. A minimum number of 3 clustered genes was used as a threshold for the analysis. CAZyme genes expressed during cultivation on cellulose or lactose were taken from Tisch et al. (2012) and Ivanova et al. (2012).
Distance to either end of the scaffold.
Fig. 2Genomic synteny of areas containing cellulase clusters, as determined by REEF. Synteny data were retrieved from the synteny browser at http://genome.jgi-psf.org/Trire2/Trire2.home.html, using the genome of T. atroviride as a comparison. Interrupted bars indicate lack of synteny. The different degrees of fillings of the bars (from black to grey) indicate occurrence of the respective areas on different scaffolds in T. atroviride. Numbers indicate the respective position on the scaffold in Mbp's. Areas containing cellulase genes are indicated by red boxes.
Fig. 3Pedigree of H. jecorina mutant strains whose genome has been sequenced. The parent strain, whose genome sequence is already available is boxed; the cellulase negative mutants are located within the blue box.
Proteins, whose genes show a non-silent mutation in T. reesei RUT C30, NG14, QM 9414 and QM 9123.a
| Protein ID | Annotation |
|---|---|
| Pso2 (Snm1) protein family involved in DNA interstrand crosslink repair | |
| 67658 | A/G-specific adenine DNA glycosylase |
| 60243 | AMP1, medium subunit of the adaptor protein complex AP-1 of clathrin-coated vesicles |
| 69437 | B-type cyclin |
| C2H2 transcriptional regulator | |
| 64866 | DSBA-like thioredoxin domain-containing protein |
| 105874 | FAD-dependent (isoamyl?) oxidase |
| 55358 | Flavin-containing monooxygenase, putative |
| 79405 | HET-containing protein, unknown |
| 40758 | Homocysteine methyltransferase |
| 104161 | Importin beta-5 subunit, putative |
| 104599 | Mandelate racemase/muconate lactonizing enzyme |
| 28409 | MFS permease |
| 58561 | MFS permease |
| 64882 | MFS permease |
| 109320 | Orphan protein |
| 109432 | Orphan protein |
| 45456 | Protein kinase |
| 124172 | Protein kinase NPKA [ |
| 75072 | Snf7 family protein, possibly involved in vesicular trafficking |
| 78320 | Sulfatase |
| Unknown protein | |
| Unknown protein | |
| 22841 | Unknown protein |
| 53492 | Unknown protein |
| 67030 | Unknown protein |
| 69181 | Unknown protein |
| 74570 | Unknown protein |
| 109285 | Unknown protein |
| 74765 | Unknown protein with C-terminal bromodomain |
| 65104 | Vacuolar protein sorting-associated protein Vps13 |
| 70071 | Zn2Cys6 transcriptional regulator |
| 77513 | Zn2Cys6 transcriptional regulator |
| 107601 | Zn-finger protein, Tim8, probable subunit of mitochondrial import inner membrane translocase |
| FAD monooxygenase | |
| Imidazoleglycerol-phosphate synthase subunit H | |
| Enoyl-CoA hydratase/isomerase | |
| 22294 | Nuclear transport factor 2 |
| 22994 | AAA-ATPase Cdc48, ER-resident protein degradation |
| 26255 | Zn2Cys6 transcriptional regulator |
| 54157 | Unknown protein |
| 54511 | Unknown protein, contains BTB/POZ domain (=protein binding) |
| 55887 | Unknown protein, secreted |
| 56077 | Zn2Cys6 transcriptional regulator |
| 56726 | Branched chain alpha-keto acid dehydrogenase complex, alpha subunit |
| 58073 | Uroporphyrinogen decarboxylase |
| 58790 | Glycerol-3-phosphate phosphatase, putative |
| 59146 | Polyketide synthase |
| 59388 | MSF permease |
| 59801 | Unknown protein |
| 60458 | NRPS |
| 63702 | Unknown protein |
| 63935 | Isoleucyl-tRNA synthetase, class Ia. |
| 65106 | CN_hydrolase |
| 65773 | Unknown protein |
| 73912 | Unknown protein |
| 75105 | Unknown protein |
| 76515 | Vacuolar protein sorting-associated protein Vps33 |
| 78158 | Importin subunit beta-3 |
| 78268 | Unknown protein |
| 79014 | Vacuolar ATP synthase 16 kDa proteolipid subunit |
| 81149 | Aquaglyceroporin |
| 82499 | Integral membrane protein, putative |
| 102776 | Unknown protein |
| 103061 | Unknown protein |
| 105391 | Unknown protein |
| 106009 | Zn2Cys6 transcriptional regulator |
| 107078 | Unknown protein |
| 108133 | Unknown protein |
| 110882 | Unknown protein |
| 111109 | Unique, no reference in JGI |
| 119768 | Myb transcriptional regulator |
| 120117 | C2H2 transcriptional regulator |
| 122689 | Unknown protein |
| 123786 | NRPS |
| Arc15 ( | |
| G_glu_transpeptGamma-glutamyltranspeptidase | |
| G_glu_transpeptGamma-glutamyltranspeptidase | |
| 35386 | Actin-interacting protein AIP3 |
| 43191 | Unknown protein |
| 54781 | Peptidyl-tRNA-hydrolase |
| 57940 | Alternative oxidase aox1 |
| 58910 | Unknown protein |
| 68956 | |
| 68956 | |
| 70546 | Poly(A) polymerase, RNA-binding region |
| 76018 | CysK Cysteine synthase |
| 81136 | Serine/threonine phosphatase 2C, PTC2 |
| 104335 | Unknown protein |
| 107460 | Unknown protein |
| 108645 | Unknown protein |
| 108962 | Unknown protein |
| 111849 | GH30 |
| 120127 | Transcriptional regulator GATA-type zinc finger protein ASD-4 (ascospore development protein 4) |
| 124022 | Zn2Cys6 transcriptional regulator |
| QM 9414 | |
| 23028 | Ca2+ permeable channel, related to |
| 33895 | Peptidylprolyl isomerase, FKBP-type |
| 52499 | C2H2 transcriptional regulator |
| 60508 | Unknown protein |
| 62633 | Unknown protein |
| 65021 | Oxidoreductase, short-chain dehydrogenase/reductase family |
| 65141 | Cytochrome P450 monooxygenase |
| 65232 | Short chain oxidoreductase/dehydrogenase, putative |
| 77661 | Oxysterol binding protein-like protein |
| 78306 | Rhodanese-like domain-containing protein |
| 79813 | Flavoprotein monooxygenases |
| 102776 | Unknown protein |
| 103044 | Unique, no reference in JGI |
| 104175 | Unique protein |
| 104333 | Unknown protein |
| 108540 | Unknown protein |
| 109278 | GH24 |
| 109313 | NmrA-like family protein |
| 111645 | Unique protein |
| 122036 | Unknown protein |
| 124295 | Cell wall protein with a CFEM domain. |
Only mutations that arose specifically in the strain listed, are given; i.e. mutations already present in NG14 are not again listed in RUT C30, and the same accounts for QM 9123 and QM 9414.
Novel genes affecting cellulase and xylanase production.a
| Cellulase | Xylanase | Notes | ||
|---|---|---|---|---|
| 77513 | Zn2Cys6 transcriptional regulator | Up | Up | Lower biomass formation |
| 80291 | Zn2Cys6 transcriptional regulator | Up | Up | |
| 41573 | bZIP-transcriptional regulator | Up | (Up) | |
| 74765 | Unknown bromodomain-containing protein | (Up) | (Up) | |
| 64608 | Unknown protein with WD 40 domains | Up | (Up) | |
| 123668 | GCN5-related N-acetyltransferase | Up | (Up) | |
| 66966 | Unknown protein with WD 40 repeats | Up | (Up) | |
| 122523 | Zn2Cys6 transcriptional regulator | Up | No effect | |
| 120120 | GCN5-related N-acetyltransferase | Up | (Up) | |
| 112524 | Zn2Cys6 transcriptional regulator | No effect | Up | |
| 108381 | Zn2Cys6 transcriptional regulator | Down | Down | |
| 81972 | Cellular retinaldehyde-binding/triple function | Down | Down | |
| 62244 | Zn2Cys6 transcriptional regulator | Down | Down | |
| 76677 | Zn2Cys6 transcriptional regulator | Down | Down | |
| 47317 | Zn2Cys6 transcriptional regulator | Down | No effect | |
Extracted from Pakula et al. (2011a,b,c). “up” means upregulation in a strain overexpressing the gene, “down” means downregulation in the overexpressing strain. Up in brackets indicates only a weak effect (<30%).
Fig. 4Extracellular CAZymes identified by proteomic approaches by Jun et al., 2011 (A), Herpoël-Gimbert et al., 2009 (B) and Adav et al., 2011 (C). Black boxes indicate that the protein was detected, numbers are the protein IDs in the T. reesei genome database. Note that the endoglucanase marked with an * is actually a cellulose monooxygenase.