| Literature DB >> 27829029 |
Cresten B Mansfeldt1, Gretchen W Heavner1, Annette R Rowe2, Boris Hayete3, Bruce W Church3, Ruth E Richardson1.
Abstract
The interpretation of high-throughput gene expression data for non-model microorganisms remains obscured because of the high fraction of hypothetical genes and the limited number of methods for the robust inference of gene networks. Therefore, to elucidate gene-gene and gene-condition linkages in the bioremediation-important genus Dehalococcoides, we applied a Bayesian inference strategy called Reverse Engineering/Forward Simulation (REFS™) on transcriptomic data collected from two organohalide-respiring communities containing different Dehalococcoides mccartyi strains: the Cornell University mixed community D2 and the commercially available KB-1® bioaugmentation culture. In total, 49 and 24 microarray datasets were included in the REFS™ analysis to generate an ensemble of 1,000 networks for the Dehalococcoides population in the Cornell D2 and KB-1® culture, respectively. Considering only linkages that appeared in the consensus network for each culture (exceeding the determined frequency cutoff of ≥ 60%), the resulting Cornell D2 and KB-1® consensus networks maintained 1,105 nodes (genes or conditions) with 974 edges and 1,714 nodes with 1,455 edges, respectively. These consensus networks captured multiple strong and biologically informative relationships. One of the main highlighted relationships shared between these two cultures was a direct edge between the transcript encoding for the major reductive dehalogenase (tceA (D2) or vcrA (KB-1®)) and the transcript for the putative S-layer cell wall protein (DET1407 (D2) or KB1_1396 (KB-1®)). Additionally, transcripts for two key oxidoreductases (a [Ni Fe] hydrogenase, Hup, and a protein with similarity to a formate dehydrogenase, "Fdh") were strongly linked, generalizing a strong relationship noted previously for Dehalococcoides mccartyi strain 195 to multiple strains of Dehalococcoides. Notably, the pangenome array utilized when monitoring the KB-1® culture was capable of resolving signals from multiple strains, and the network inference engine was able to reconstruct gene networks in the distinct strain populations.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27829029 PMCID: PMC5102406 DOI: 10.1371/journal.pone.0166234
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Diagram of the experimental and analytical procedures.
(a) Overview of the experimental design. The transcriptome of the continuously or batch fed D2 (single strain) or KB-1® cultures were monitored using microarrays, and the metabolite data was collected through chromatographic techniques. (b) Summary of the REFS™ gene network inference process. All possible edges are assigned a score based on the data in the enumeration step. Optimized networks and a consensus network are constructed from these enumerated fragments.
Fig 2Ordering Dhc pangenome array probes based on sequence similarity and captured expression profiles for the KB-1® culture.
The array contains multiple probes for Dhc orthologs. The white-to-blue shaded columns (left) display the genomic % identity of the probe sequence to gene sequences for representative members of the Cornell, Victoria, and Pinellas groups of Dhc. The yellow-to-purple columns (right) represent the correlation relationship scores of the probe intensity across all cDNA pools from all samples. Bolded (*) probes indicate those that were retained for the REFS™ analysis of the KB-1® data.
Fig 3REFS™ network summaries for the D2 (black) and KB-1® (grey) networks.
The number of (a) nodes and (b) edges remaining in the network is plotted against the frequency cutoff established for the edges. (c) The slope of (b) versus frequency; the minimum was observed at approximately f = 0.6 for both networks. (d) The D2 consensus network of all edges with f ≥ 0.6. (e) The KB-1® consensus network of all edges with f ≥ 0.6. More detailed (e.g., with labels and edge strengths) consensus networks are available in S3 and S4 Files.
Gene-gene edges modified by a discrete variable in the D2 consensus network with high frequencies (f > 0.85).
| f | r | Node 1 | Description | Node 2 | Description |
|---|---|---|---|---|---|
| If DCE (and all higher chlorinated ethenes, when provided) was available to be and was respired (1) or not (0) | |||||
| 0.85 | 0.73 | DET0006 | histidyl-tRNA synthetase | DET1386 | hypothetical protein |
| 0.91 | 0.78 | - | EA feed rate over the final 48 hrs | - | respiration rate of DCE |
| 0.88 | -0.23 | - | respiration rate of PCE | DET0315 | sensory box sensor histidine kinase |
| 0.89 | 0.92 | DET0564 | ATP synthase subunit B | DET1263 | DAK2 domain protein |
| 0.89 | 0.92 | DET1225 | hypothetical protein | DET1417 | formyltransferase |
| If TCE (and PCE, when provided) was available to be and was respired (1) or not (0) | |||||
| 0.96 | 0.88 | DET1477 | hypothetical protein | DET1554 | hypothetical protein |
| 0.90 | 0.95 | DET1510 | hypothetical protein | DET0657 | phosphoribosyltransferase |
| If PCE was available to be and was respired (1) or not (0) | |||||
| 0.98 | 0.19 | tRNA | Cys-1 | tRNA | Gly-3 |
| 0.86 | 0.66 | DET0414 | hypothetical protein | DET0165 | ISDet2, transposase orfA |
| 0.90 | 0.93 | DET1041 | PQQ enzyme repeat domain protein | DET1039 | SpoIIIJ-associated protein Jag |
| If DCE was detected above background levels (1) or not (0) | |||||
| 0.94 | 0.32 | DET0057 | ATP-dependent Clp protease, ClpC | - | concentration of DCE |
| 0.94 | -0.72 | DET0128 | cobyrinic acid a,c-diamide synthase | DET0299 | transcriptional regulator, Crp/Fnr family |
| 0.85 | 0.81 | DET0840 | adenylosuccinate lyase | DET0439 | FtsK/SpoIIIE family protein |
| 0.87 | -0.68 | DET1430 | HypA hydrogenase nickel insertion protein | - | methanogenesis rate |
| 0.79 | 0.88 | DET1431 | HypB hydrogenase accessory protein | DET1433 | hydrogenase assembly chaperone |
| 0.94 | 0.74 | DET1476 | DNA binding domain, excisionase family | DET0981 | tRNA pseudouridine synthase B |
| If PCE was detected above background levels (1) or not (0) | |||||
| 0.88 | 0.84 | DET1345 | tagatose 1,6-diphosphate aldolase | DET1530 | sensor histidine kinase |
| If an electron donor was being provided to the culture (1) or not (0) | |||||
| 0.93 | 0.74 | DET1084 | DNA packaging protein, putative | DET1070 | endolysin, putative |
| 0.88 | 0.82 | DET1244 | NADPH-dependent FMN reductase | DET1005 | transcriptional regulator, ArsR family |
| 0.90 | 0.78 | DET1570 | hydrogenase, group 4, HycG subunit | DET1106 | hypothetical protein |
| If the culture displayed inhibited growth (1) or not (0) | |||||
| 0.86 | 0.90 | DET0111 | [Ni/Fe] hydrogenase, small subunit | DET0112 | [Ni/Fe] hydrogenase, Fe-S cluster |
| 0.86 | 0.74 | DET0451 | malate dehydrogenase, NAD-dependent | DET0115 | ABC transporter, permease protein |
| 0.92 | 0.81 | DET0610 | hypothetical protein | tRNA | Cys-1 |
| 0.99 | 0.76 | DET1111 | ATP-binding protein | DET0097 | iron dependent repressor, putative |
| If the culture displayed inhibited growth because of DCE (1) or not (0) | |||||
| 0.96 | 0.92 | tRNA | Val-2 | tRNA | Ser-1 |
| 0.99 | 0.88 | DET0343 | cell division protein FtsZ | DET0342 | cell division protein FtsA |
| 0.94 | 0.70 | DET0383 | hypothetical protein | DET1438 | hypothetical protein |
| 0.95 | 0.90 | DET0591 | hypothetical protein | DET0334 | polyA polymerase family protein |
| 0.85 | 0.77 | DET0839 | phosphoribosylaminoimidazole carboxylase | DET0607 | hypothetical protein |
| 0.87 | 0.85 | DET0920 | iron-sulfur cluster-binding protein | DET0921 | hypothetical protein |
| 0.92 | 0.86 | DET1207 | protein export protein, putative | DET0543 | fatty acid/phospholipid synthesis |
| 0.96 | 0.80 | DET1350 | DNA-binding response regulator | DET0922 | twitching mobility protein |
| 0.89 | 0.32 | DET1446 | excisionase family DNA binding protein | DET0432 | DNA-binding response regulation |
The relationship identified occurs when the discrete variable is set to 1. Gray squares indicate non-gene variables related to experimental conditions. The correlation score (r value) between the intensity values of two probes across all datasets is presented to display whether the relationship between the intensity values of the probes is positive or negative.
Fig 4Transcripts in the consensus networks that are a maximum of two edges away from connecting with reductive dehalogenases in D2 (left) and KB-1® (right).
The four highest transcribed RDases in the D2 culture and the top five transcribed RDases in the KB-1® culture are displayed. Other RDases are present in the final consensus network as well. The dashed lines are indicative of negative relationships, and the solid lines represent positive relationships. For the D2 consensus network, the transcript ID and a brief description is provided. For the KB-1® consensus network, the probe ID, orthologous transcript ID in Dhc strain 195 (where applicable), and a brief annotation are provided. The grayed text in the KB-1® culture represents transcripts from a minor Cornell-type strain.
Fig 5REFS™ consensus network summary for the hup and fdh transcripts.
(a) D2 and (b) KB-1®. The connecting lines indicate edge strength scores that exceeded 0.6. Gray text in the KB-1® culture indicates the minor stain of the Cornell/Victoria type. All relationships identified in the model between these transcripts were positive.