| Literature DB >> 24295562 |
Charlotte M Wilson, Miguel Rodriguez, Courtney M Johnson, Stanton L Martin, Tzu Ming Chu, Russ D Wolfinger, Loren J Hauser, Miriam L Land, Dawn M Klingeman, Mustafa H Syed, Arthur J Ragauskas, Timothy J Tschaplinski, Jonathan R Mielenz, Steven D Brown1.
Abstract
BACKGROUND: The thermophilic anaerobe Clostridium thermocellum is a candidate consolidated bioprocessing (CBP) biocatalyst for cellulosic ethanol production. The aim of this study was to investigate C. thermocellum genes required to ferment biomass substrates and to conduct a robust comparison of DNA microarray and RNA sequencing (RNA-seq) analytical platforms.Entities:
Year: 2013 PMID: 24295562 PMCID: PMC3880215 DOI: 10.1186/1754-6834-6-179
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Major fermentation products and residual biomass sugars
| 0 | 0.05 (± 0.04) | 0.02 (± 0.01) | 522 (± 2) | 0.05 (± 0.04) | 0.03 (± 0.01) | 584 (± 15) |
| 12 | 0.3 (± 0.09) | 0.1 (± 0.02) | 423 (± 41) | 0.4 (± 0.01) | 0.2 (± 0.01) | 368 (± 13) |
| 37 | 0.5 (± 0.01) | 0.2 (± 0.001) | 281 (± 22) | 0.8 (± 0.08) | 0.3 (± 0.03) | 220 (± 32) |
aData reported is the average from duplicate fermentations on each substrate. Fermentation products and carbohydrates of residual soluble biomass sugars determined by HPLC and quantitative saccharification, respectively.
Figure 1Two-way clustering of normalized RNA-seq and microarray logtransformed read counts or probe fluorescent intensities for all genes, respectively, for grown on switchgrass or 12 hours and 37 hours postinoculation. RNA-seq read counts normalized by RPM, RPKM, KDMM, UQS, or TMM in the JMP Genomics 6 software suite were plotted with the microarray probe fluorescent intensities normalized by the LOESS method. Genes were clustered into a default of ten clusters based on similarity of expression patterns across all transcriptomic platforms and normalization techniques. KDMM, kernel density mean of M component; RNA-seq, RNA sequencing; RPKM, reads per kilobase per million; RPM, reads per million; TMM, trimmed mean of M component; UQS, upper quartile scaling.
Subset of relative expression values for cellulosome-related genes
| Cthe_0109 | Dockerin | 78.7 | 89.3 | 73.0 | 85.8 | 69.2 | 82.4 | 66.2 | 81.1 |
| Cthe_0239 | Dockerin | 79.4 | 93.4 | 81.7 | 96.4 | 68.4 | 88.1 | 67.9 | 87.9 |
| Cthe_0271 | CBM_3 | 99.1 | 88.3 | 99.8 | 92.3 | 99.8 | 83.8 | 99.9 | 89.2 |
| Cthe_0392 | Transport | 100 | 98.5 | 99.8 | 95.3 | 99.9 | 97.5 | 99.7 | 96.8 |
| Cthe_0624 | Dockerin | 81.8 | 96.1 | 78.2 | 96.5 | 91.5 | 98.6 | 65.8 | 93.7 |
| Cthe_0736 | Scaffoldin | 59.1 | 86.7 | 53.6 | 85.0 | 72.8 | 96.6 | 42.4 | 75.2 |
| Cthe_1890 | Dockerin | 57.6 | 71.9 | 71.7 | 83.4 | 32.1 | 44.8 | 77.3 | 89.7 |
| Cthe_1963 | GH 10 | 86.7 | 93.1 | 93.9 | 98.1 | 91.0 | 97.7 | 92.7 | 97.5 |
| Cthe_2089 | GH 48/CelS | 99.9 | 99.9 | 99.9 | 99.9 | 99.9 | 99.9 | 99.7 | 99.8 |
| Cthe_2271 | Dockerin | 66.5 | 45.4 | 13.9 | 9.3 | 18.3 | 15.6 | 30.9 | 16.1 |
| Cthe_2949 | Dockerin | 22.1 | 28.1 | 24.3 | 27.9 | 16.9 | 22.9 | 18.5 | 24.9 |
| Cthe_3077 | CipA | 99.6 | 100 | 99.3 | 100 | 99.5 | 100 | 98.5 | 100 |
KDMM, kernel density mean of M component; RNA-seq, RNA sequencing.
Summary of genes passing significance and 2-fold differential expression thresholds
| Microarray | 2,351 | 315 |
| RNA-seq, KDMM | 280 | 192 |
| RNA-seq, UQS | 249 | 104 |
| RNA-seq, RPM | 147 | 117 |
| RNA-seq, RPKM | 21 | 7 |
| RNA-seq, TMM | 10 | 10 |
FDR <0.05 used for significance criteria. FDR, false discovery rate; KDMM, kernel density mean of M component; RPKM, reads per kilobase per million; RPM, reads per million; TMM, trimmed mean of M component; UQS, upper quartile scaling.
Figure 2Venn diagram of genes identified as significantly (FDR <0.05) differentially expressed (± 1, logscale) by microarray and RNA-seq data normalized by KDMM or RPM. Those genes common between the KDMM and microarray strategies but not present in the RPM analytical strategy are outlined in accompanying table. FDR, false discovery rate; KDMM, kernel density mean of M component; RNA-seq, RNA sequencing; RPM, reads per million.
Seventy-three genes significantly (FDR <0.05) and differentially expressed (± 1, log scale) that were in common between RNA-seq normalized by the KDMM strategy and microarray data
| Cthe_0057 | Hypothetical protein | -0.18 | 0.56 | 0.26 | -2.27 | -1.53 | |||
| Cthe_0264 | Hypothetical protein | -2.96 | -1.05 | -0.45 | -3.16 | -1.70 | |||
| Cthe_0272 | Peptidase S11 D-alanyl-D-alanine carboxypeptidase 1 | 0.80 | 0.47 | -0.06 | -0.35 | ||||
| Cthe_0414 | Manganese containing catalase | -0.84 | 0.42 | 0.23 | -3.14 | -2.45 | |||
| Cthe_0415 | Spore coat protein CotJB | -0.51 | -0.05 | -1.47 | |||||
| Cthe_0420 | Dipicolinate synthase subunit A | -2.50 | -0.41 | -0.23 | -2.08 | -0.98 | |||
| Cthe_0453 | Hypothetical protein | -3.32 | -0.67 | -0.17 | -2.96 | -1.75 | |||
| Cthe_0470 | Hypothetical protein | 1.50 | 0.73 | -0.20 | 0.07 | -0.02 | |||
| Cthe_0471 | Flagellar hook-length control protein | 2.16 | 1.29 | 0.01 | -0.05 | 0.84 | |||
| Cthe_0472 | Flagellar hook-capping protein | 2.39 | 1.13 | -0.07 | 0.16 | 0.71 | |||
| Cthe_0473 | Flagellar operon protein | 1.91 | 1.36 | -0.26 | 0.70 | 0.91 | |||
| Cthe_0516 | IS66 Orf2 family protein | -0.27 | -0.43 | 0.13 | |||||
| Cthe_0624 | Glycoside hydrolase (CelJ) | 0.13 | -1.05 | -0.44 | 0.99 | 0.52 | |||
| Cthe_0964 | Amino acid-binding ACT domain protein | -0.44 | -0.33 | -0.48 | -0.27 | ||||
| Cthe_1081 | Hypothetical protein | 0.89 | 0.23 | -3.16 | -1.92 | ||||
| Cthe_1083 | Spore coat protein, CotS family | -2.56 | -0.75 | -0.46 | -2.68 | -1.99 | |||
| Cthe_1084 | Spore coat protein, CotS family | -0.87 | -0.30 | -2.89 | -1.15 | ||||
| Cthe_1085 | Glycosyltransferase group 1 | -0.69 | -0.33 | -2.70 | -1.36 | ||||
| Cthe_1086 | NAD-dependent epimerase/dehydratase | -2.27 | -1.05 | -0.46 | -2.67 | -1.88 | |||
| Cthe_1255 | Hypothetical protein | 2.11 | 1.57 | -0.98 | 0.04 | ||||
| Cthe_1256 | Glycoside hydrolase family 3 domain protein | 0.70 | 0.84 | 0.45 | -1.47 | ||||
| Cthe_1257 | Carbohydrate-binding CenC domain protein | 1.03 | 0.43 | -0.02 | -1.57 | ||||
| Cthe_1309 | Radical SAM domain protein | -0.24 | 0.07 | 1.12 | 0.09 | -3.37 | |||
| Cthe_1310 | Accessory gene regulator B | 0.35 | -0.06 | -0.23 | 0.60 | ||||
| Cthe_1340 | Hypothetical protein | -0.84 | 1.44 | 0.00 | 0.09 | ||||
| Cthe_1404 | GTP-binding protein HSR1-related | -1.26 | 0.02 | -0.02 | -2.55 | -1.46 | |||
| Cthe_1437 | Hypothetical protein | -0.16 | -0.04 | -0.16 | -0.07 | ||||
| Cthe_1438 | RNA polymerase sigma factor, sigma-70 family | -1.90 | -0.10 | -0.09 | -1.03 | -0.16 | |||
| Cthe_1480 | Hypothetical protein | -0.38 | -0.31 | -0.10 | |||||
| Cthe_1481 | Hypothetical protein | -0.44 | -0.39 | ||||||
| Cthe_1539 | Glutamine synthetase catalytic region | -1.25 | -0.27 | 0.00 | -3.74 | ||||
| Cthe_1540 | Glutamyl-tRNA(Gln) amidotransferase, B subunit | -0.27 | -0.43 | 0.17 | -0.05 | -1.61 | |||
| Cthe_1603 | Phosphate ABC transporter, inner membrane subunit PstA | -0.06 | -1.45 | 1.51 | |||||
| Cthe_1604 | Phosphate ABC transporter, inner membrane subunit PstC | 0.53 | 0.26 | -2.83 | 1.46 | ||||
| Cthe_1605 | ABC-type phosphate transport system periplasmic component-like protein | 0.35 | 0.11 | -2.60 | 1.80 | ||||
| Cthe_1759 | Sporulation protein YunB | 0.22 | 0.02 | -2.03 | -1.03 | ||||
| Cthe_1807 | Hypothetical protein | -2.03 | -0.98 | 0.20 | 0.21 | -0.05 | 0.05 | ||
| Cthe_1809 | RNA polymerase sigma factor, sigma-70 family | -0.54 | -0.18 | 0.55 | 0.09 | ||||
| Cthe_1812 | Urease accessory protein UreD | -0.41 | -0.48 | -0.39 | 0.46 | -0.10 | |||
| Cthe_1813 | Urease accessory protein UreG | -0.46 | -0.48 | 0.01 | 0.89 | 0.15 | |||
| Cthe_1821 | Urea ABC transporter, permease protein UrtC | -0.61 | -0.21 | 0.23 | 0.22 | -2.03 | |||
| Cthe_1879 | Hypothetical protein | -0.13 | 0.04 | 1.32 | 0.17 | ||||
| Cthe_1890 | Dockerin type 1 protein | -1.07 | 2.05 | 1.38 | -0.59 | -0.24 | |||
| Cthe_2299 | CRISPR-associated helicase Cas3 | 1.65 | 0.93 | -1.38 | -0.50 | -0.35 | |||
| Cthe_2400 | Sporulation peptidase YabG | -0.66 | 0.06 | -1.62 | |||||
| Cthe_2503 | Cupin 2 conserved barrel domain protein | -2.11 | 0.40 | 0.16 | -2.73 | -1.96 | |||
| Cthe_2506 | S-layer domain-containing protein | 0.79 | 0.15 | -0.08 | |||||
| Cthe_2531 | Sulfate ABC transporter, periplasmic sulfate-binding protein | -1.62 | -0.20 | 1.72 | 0.37 | ||||
| Cthe_2532 | Sulfate ABC transporter, inner membrane subunit CysT | -1.15 | -0.19 | 5.66 | 0.92 | 0.01 | |||
| Cthe_2536 | Phosphoadenosine-phosphosulfate reductase | -1.27 | 4.11 | 1.47 | 0.50 | ||||
| Cthe_2539 | UBA/THIF-type NAD/FAD binding protein | -1.39 | 4.04 | 1.94 | 0.18 | ||||
| Cthe_2573 | ABC transporter-related | -0.31 | 0.19 | -1.81 | -1.03 | ||||
| Cthe_2575 | Hypothetical protein | -1.93 | -0.03 | 0.15 | -1.97 | -0.87 | |||
| Cthe_2582 | Dihydroneopterin aldolase | 0.14 | 0.36 | -0.68 | 1.05 | 0.26 | |||
| Cthe_2640 | UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolyzing | 0.07 | 0.00 | -1.69 | -0.94 | -3.63 | -1.38 | ||
| Cthe_2642 | Sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family | -0.35 | 0.36 | -1.73 | -0.13 | -3.95 | -2.24 | ||
| Cthe_2643 | Nucleotidyltransferase | -0.89 | 0.10 | -1.97 | |||||
| Cthe_2644 | DegT/DnrJ/EryC1/StrS aminotransferase | 0.28 | 0.20 | -1.36 | -0.64 | ||||
| Cthe_2645 | NAD-dependent epimerase/dehydratase | -2.71 | -0.63 | -0.03 | -1.63 | -0.25 | |||
| Cthe_2647 | ATP-dependent carboxylate-amine ligase domain protein ATP-grasp | -1.55 | -0.08 | 0.12 | -2.13 | -0.35 | |||
| Cthe_2649 | HpcH/HpaI aldolase | -2.44 | 0.99 | 0.22 | -2.88 | -1.45 | |||
| Cthe_2650 | Polysaccharide biosynthesis protein CapD | -1.83 | -0.59 | -0.21 | -1.93 | -0.86 | |||
| Cthe_2651 | Hypothetical protein | -3.52 | 0.37 | -0.23 | -4.29 | -2.73 | |||
| Cthe_2652 | Lipopolysaccharide biosynthesis protein | -1.41 | -0.19 | 0.26 | -2.34 | -1.57 | |||
| Cthe_2653 | Capsular exopolysaccharide family | -0.14 | 0.39 | 0.08 | -1.87 | -0.92 | |||
| Cthe_2654 | PHP domain protein | -1.08 | 0.03 | -0.16 | -1.92 | -0.76 | |||
| Cthe_2763 | Hypothetical protein | -0.44 | 0.75 | 0.51 | 0.04 | -0.29 | |||
| Cthe_3006 | ErfK/YbiS/YcfS/YnhG family protein | -1.61 | -1.35 | -0.24 | -2.74 | -1.16 | |||
| Cthe_3007 | Peptidoglycan-binding lysin domain | -0.82 | -0.20 | -0.30 | -2.02 | -1.26 | |||
| Cthe_3008 | Manganese/iron superoxide dismutase | -1.44 | 0.04 | -1.77 | -0.03 | ||||
| Cthe_3102 | Cell envelope-related function transcriptional attenuator, LytR/CpsA family | -0.87 | -0.62 | 0.78 | -0.13 | ||||
| Cthe_3307 | Hypothetical protein | 0.95 | 0.38 | -2.29 | -1.22 | ||||
| Cthe_3350 | Hypothetical protein | 0.14 | 1.39 | 0.64 | -1.16 | -1.31 | |||
Expression values are shown as log2 values. A -1 value in the (Populus 12 hours) - (Populus 37 hours) column would indicate that this gene had 2-fold higher expression in the C. thermocellum Populus 37-hour fermentation compared to the C. thermocellum Populus 12-hour sample; a positive value would indicate greater expression from the Populus 12-hour sample. Bold face values indicate values were significant for this comparison. FDR, false discovery rate; KDMM, kernel density mean of M component; RNA-seq, RNA sequencing.