| Literature DB >> 25550711 |
Dante Poggi-Parodi1,2, Frédérique Bidard1, Aurélie Pirayre1, Thomas Portnoy1,3,4,5, Corinne Blugeon3,4,5, Bernhard Seiboth6, Christian P Kubicek7, Stéphane Le Crom2,3,4,5, Antoine Margeot1.
Abstract
BACKGROUND: The filamentous fungus Trichoderma reesei is the main industrial cellulolytic enzyme producer. Several strains have been developed in the past using random mutagenesis, and despite impressive performance enhancements, the pressure for low-cost cellulases has stimulated continuous research in the field. In this context, comparative study of the lower and higher producer strains obtained through random mutagenesis using systems biology tools (genome and transcriptome sequencing) can shed light on the mechanisms of cellulase production and help identify genes linked to performance. Previously, our group published comparative genome sequencing of the lower and higher producer strains NG 14 and RUT C30. In this follow-up work, we examine how these mutations affect phenotype as regards the transcriptome and cultivation behaviour.Entities:
Keywords: Biofuels; Cellulase; Fed-batch; Systems biology; Transcriptome; Trichoderma reesei
Year: 2014 PMID: 25550711 PMCID: PMC4279801 DOI: 10.1186/s13068-014-0173-z
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Figure 1RUT C30 and NG 14 bioreactor culture profiles. Growth curves (dry biomass concentration g.L-1) and protein production levels (extracellular protein concentration g.L-1 (EC Prot. Conc.), as assayed by Bradford method (see Results), are displayed for NG 14 and RUT C30 T. reesei strains. Time 0 h marks the start of lactose feeding. Negative values represent the 24 h of batch culture and positive values the first 48 h of fed-batch culture with lactose. Each curve represents the average measure of two independent cultures; error bars show average standard deviation to give an estimate of replicates’ quality (excluding technical replicates). One NG 14 replicate is lacking after 24 h lactose induction, so dispersion data is not available. Bradford method allows accurate assessment of production start and comparison between strains but underestimates actual values and is therefore not appropriate for carbon balancing (see Results for detailed explanations).
Figure 2Differentially expressed genes of NG 14 and RUT C30 during lactose induction. (A) Venn diagram indicates the number of genes specific to each strain and the overlap between them. (B) The number of induced or repressed genes during induction by lactose is depicted in a bar chart. It displays the number of induced (red) and repressed (green) genes for the four time points (1, 3, 6 and 24 h) found during the induction in the RUT C30 and NG 14 strains compared to time 0. The differentially expressed genes have been selected using a 5% false discovery rate cut-off and with an absolute log2 fold change greater than 1.
Figure 3Clustering of differentially expressed genes during induction by lactose in both studied strains. From 568 genes identified as significantly regulated among the several expression experiments performed, 532 were gathered in 9 clusters according to their changes during the NG 14 and RUT C30 lactose induction. The average profile of each cluster is shown next to its heat map.
Main functional categories from transcriptome analysis in NG 14 and RUT C30 strains
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ribosome biogenesis | 12.01 |
| 1 (2%) | 1 (2%) |
| ||||||
| Metabolism | 01.00 | 3 (7%) | 4 (9%) | 3 (7%) |
| 1 (2%) | 7 (15%) | 4 (9%) | 2 (4%) |
| |
| Extracellular metabolism (CAZymes) | 01.25 | 1 (2%) | 2 (4%) | 3 (7%) | 5 (11%) | 5 (11%) | 7 (15%) | 2 (4%) | 9 (20%) |
|
|
| Transported compounds | 20.01 | 4 (9%) | 6 (14%) | (7%) | 8 (19%) | 4 (9%) | 1 (2%) |
| 7 (16%) |
| |
| Amino acid metabolism | 01.01 |
| 7 (17%) | 6 (14%) | 7 (17%) | 7 (17%) | 3 (7%) | 1 (2%) | 1 (2%) |
| |
| Lipid, fatty acid and isoprenoid metabolism | 01.06 | 8 (22%) |
| 2 (6%) | 9 (25%) | 1 (3%) | 1 (3%) | 2 (6%) |
| ||
| Transcriptional control | 11.2.3.4 | 2 (10%) | 1 (5%) | 5 (25%) |
| 4 (20%) | 1 (5%) |
| |||
| Extracellular protein degradation | 01.25.03 | 1 (6%) | 4 (24%) | 2 (12%) |
| 2 (12%) |
| ||||
| Energy metabolism | 02.00 | 3 (19%) | 4 (25%) | 1 (6%) |
| 1 (6%) |
| ||||
| Stress response | 32.01 | 1 (7%) | 1 (7%) | 4 (29%) | 3 (21%) | 5 (36%) |
| ||||
| Vesicular transport (secretion) | 20.09.07 | 1 (8%) | 1 (8%) | 1 (8%) |
| 3 (23%) |
| ||||
| Cellular communication, signal transduction | 30.00 | 3 (23%) | 2 (15%) | 2 (15%) | 4 (31%) | 1 (8%) | 1 (8%) |
| |||
| Secondary metabolism | 01.20 | 2 (17%) | 1 (8%) | 2 (17%) | 3 (25%) | 1 (8%) | 1 (8%) | 2 (17%) |
| ||
| Detoxification | 32.07 | 1 (9%) | 3 (27%) | 2 (18%) | 4 (36%) | 1 (9%) |
| ||||
| Other categories |
| 4 (10%) | 4 (10%) | 7 (18%) | 5 (13%) | 3 (8%) | 4 (10%) | 2 (5%) |
| ||
| Unclassified | 4 (13%) | 1 (3%) | 5 (16%) |
| 2 (6%) | 5 (16%) | 1 (3%) |
| |||
| Unknown protein | 7 (9%) | 14 (18%) | 1 (1%) |
| 16 (20%) | 14 (18%) | 6 (8%) | 3 (4%) |
| ||
|
|
|
|
|
|
|
|
|
|
|
|
Only the FunCat categories with more than ten genes in the whole dataset are considered main functional categories. The number of genes belonging to categories is provided for each cluster. Category-associated genes found more abundantly (at least a 2-gene of difference between clusters) are indicated in bold. Percentages in parentheses indicate the rounded percentage of genes from each category found in a cluster. Genes whose function could not be associated with a category were annotated as Unclassified. Genes without a predicted function were annotated as Unknown.
Genes regulated after lactose induction with different basal expression between NG 14 and RUT C30
|
|
|
|
|
|
|---|---|---|---|---|
| 59272 | 1 | Putative MFS transporter | -4.78 | 17462 |
| 82032 | 1 | Putative protein of unknown function | 4.46 | 32445 |
| 121491 | 1 | Putative glycosyltransferase family 4 | 4.21 | 38826 |
| 61374 | 3 | Putative MFS transporter | -5.99 | 7537 |
| 74156 | 3 | Putative secreted pepsin PROA | -4.92 | 7152 |
| 3717 | 4 | Putative 2-keto-3-deoxy-L-galactonate aldolase | -4.86 | 7427 |
| 76155 | 4 | Putative acid phosphatase | -4.85 | 23883 |
| 122582 | 4 | Putative mitochondrial dihydrodipicolinate synthase | -4.38 | 8170 |
| 69115 | 7 | Putative dienelactone hydrolase | -5.11 | 27147 |
| 69944 | 8 | Putative glycoside hydrolase family 31 | 7.01 | 1685 |
| 73643 | 8 | Glycoside hydrolase family 61 EGL4/CEL61A | 4.34 | 4641 |
| 121127 | 8 | Glycoside hydrolase family 3 BXL1 | 5.65 | 6623 |
| 123967 | 8 | Hydrophobin HFB3 | 6.8 | 2548 |
| 69276 | 9 | Putative glycoside hydrolase family 30 | 4.15 | 556 |
| 72567 | 9 | Glycoside hydrolase family 6 CBH2/CEL6A | 4.64 | 10892 |
| 73638 | 9 | Secreted cellulose induced protein CIP1 | 6.37 | 1326 |
| 76210 | 9 | Glycoside hydrolase family 62 ABF2 | 7.65 | 653 |
| 120312 | 9 | Glycoside hydrolase family 5 EGL2/CEL5A | 7.41 | 2331 |
| 120961 | 9 | Glycoside hydrolase family 61 CEL61B | 4.36 | 434 |
| 121418 | 9 | Carbohydrate esterase family AES1 | 6.76 | 1027 |
| 60945 | Not clustered | Putative MFS transporter | -4.51 | 6194 |
| 79816 | Not clustered | Putative transcriptional regulator GAL80 | -9.6 | 59020 |
| 123251 | Not clustered | Putative NADH-quinone oxidoreductase | 4.67 | 16873 |
Samples from T0 (before lactose induction) from NG 14 and RUT C30 were compared by means of RNA-seq experiments. The relative transcript abundance was measured in reads per kilobase of exon per million mapped sequence reads (RPKM). The log2 ratio of the RPKM values between RUT C30 and NG 14 were used to identify differentially expressed genes.
Figure 4Differences in gene expression before induction of protein production (T0) in NG 14 and RUT C30 identified by RNA-seq. The percentage of genes with superior basal level (red bars), inferior basal level (green bars) or equal basal level (grey) is shown for each cluster.
Inventory of the mutations affecting genes regulated during induction by lactose
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| ||||||
| 1751 | 1 | Putative FAD monooxygenase | RUT C30 | SNV | Promoter | [ |
| 54511 | 5 | Putative POZ domain protein | RUT C30 | SNV | Promoter | [ |
| 56077 | 5 | Putative transcription factor | RUT C30 | SNV | Promoter | [ |
| 77513 | 5 | Putative fungal C6 transcription factor | RUT C30 | SNV | Exon | [ |
| 121087 | Not clustered | Putative 4Fe-4S ferredoxin domain protein | NG 14 & RUT C30 | SNV | Promoter | [ |
|
| ||||||
| 54736 | 1 | Putative ATP-dependent RNA helicase DED1 | KDG-12 PC-3-7 | SNV | Promoter | [ |
| 58427 | 1 | Putative ATP-dependent RNA helicase DBP2 | PC-3-7 | SNV | Exon | [ |
| 71380 | 1 | Putative 3-hydroxy-3-methylglutaryl-coenzyme A reductase | PC-3-7 | SNV | Exon | [ |
| 78836 | 1 | Putative mitochondrial phosphate carrier | PC-3-7 | SNV | Exon | [ |
| 82512 | 1 | Putative 26S proteasome transcription factor RPN4 | PC-3-7 | SNV | Promoter | [ |
| 122036 | 1 | Putative 40S ribosomal protein S2 RPS2 | QM9414 KDG-12 PC-3-7 | InDel & SNV | Promoter & Terminator | [ |
| 63882 | 4 | Putative mitochondrial 3-hydroxyisobutyryl-CoA hydrolase | PC-3-7 | SNV | Terminator | [ |
| 124031 | 4 | Putative 20S proteasome alpha 3 subunit PRE9 | PC-3-7 | SNV | Promoter | [ |
| 6108 | 5 | Putative SAM binding domain protein | QM9414 KDG-12 PC-3-7 | InDel | Intron | Unpublished data |
| 112390 | 5 | Putative WD40 repeat domain protein | QM9414 KDG-12 PC-3-7 | InDel | Terminator | Unpublished data |
| 120688 | 5 | Putative protein of unknown function | QM9414 KDG-12 PC-3-7 | InDel | Exon | Unpublished data |
| 110853 | 6 | Putative glutathione S-transferase | PC-3-7 | SNV | Terminator | [ |
| 77481 | 8 | Putative D-xylulose 5-phosphate/D-fructose 6-phosphate phosphoketolase | PC-3-7 | SNV | Exon | [ |
| 120749 | 8 | Glycoside hydrolase family 1 BGL2 | KDG-12 PC-3-7 | SNV | Exon | [ |
We differentiate common mutations to RUT C30 and NG 14 strains from the mutations specific for RUT C30 strain. Mutations in regulated genes described in other cellulase improved strains are shown. Mutations are described as single nucleotide variation (SNV) and deletions or insertions (InDel). Affected gene region is also provided.
Figure 5Schematic view of gene expression kinetics in NG 14 and RUT C30. Approximate regulation timing for each cluster is shown, and the number of each cluster is indicated inside each arrow. Red up-arrows and green down-arrows represent up- and down-regulated clusters respectively. Black bold line indicates the start time of protein induction and production in each strain. Main functional categories found in each cluster are indicated as Neo (gluconeogenesis), Rib (ribosome biogenesis), AA (amino acid metabolism), Lip (lipid metabolism), Caz (CAZymes), TP (transporter), Met (metabolism), Pro (extracellular protein degradation), Sec (secretion) and Ctrl (transcriptional control).