| Literature DB >> 29424683 |
Jimmy Omony1,2, Anne de Jong1,2, Antonina O Krawczyk1,2, Robyn T Eijlander1,2,3, Oscar P Kuipers1,2.
Abstract
Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.Entities:
Keywords: Bacillus amyloliquefaciens; Bacillus subtilis; RNA-Seq; gene co-expression network; sporulation; stages of sporulation
Mesh:
Substances:
Year: 2018 PMID: 29424683 PMCID: PMC5857382 DOI: 10.1099/mgen.0.000157
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Bench-marking the B. subtilis 168 GCN generated using data from Nicholas et al. [23]. The sporulation network showing genes belonging to the Spo0A regulons (93 genes in blue) and σE regulons (144 genes in yellow); these genes met the filtering criterion for inclusion in the network generated using SPACE. Most genes in each of these two regulons cluster within the same vicinity in the network. Such a spatial distribution of genes can be expected since GCN is generated from transcriptome data while the benchmark gene lists for the regulons come from online databases.
Fig. 5.B. amyloliquefaciens gene co-expression network (1665 nodes, 8287 edges). (a) DEGs that were consistently up-regulated and down-regulated under all conditions are colored red and blue. The genes that were both differentially up-regulated and down-regulated across the time contrasts (P2 to P6) are shown in yellow nodes. The gray nodes are the non-DEGs. (b) The Reduce Visualize Gene Ontology (REVIGO) [68] projection of all enriched GO terms from the B. subtilis 168 GCN modules. (c) Equivalent projection of all modules from B. amyloliquefaciens GCN. The significance of the enrichment test is represented by the color intensity. A semantic depiction of the GO categories colored according to significant over-representation in the GCN modules. The GO terms in the blue and green bubble circles are more significantly enriched, lower log10(P-values) than those in orange and red (legends in upper right hand corner of the respective plots). The sizes of the GO terms are indicated by the circle radiuses, the larger the circle, the more over-represented the GO term. The semantic spaces x and y correspond to the multidimensional scaling of the matrices of the GO terms' semantic similarities.
Fig. 4.Sporulation sub-network of B. subtilis 168. (a) Sub-network of the Spo0A and σE regulons with a first-degree connection to the ‘Other genes’. The co-expression network consists of 500 genes and 896 edges. Edge thickness represents the strength of association between nodes; the thicker the edge, the stronger the association. The larger blue and yellow nodes are the spo0A and gerE genes which encode Spo0A and σE, respectively. (b) Bi-directional clustering of the 500 genes and 255 conditions. The genes generally group in two broad clusters (indicated in purple and blue). Most genes from the first (purple) cluster have on average low expression, while those from the second (blue) cluster have on average higher expression, as seen from the figure color key and density plot. The Euclidean distance was used throughout for the distance calculations.
Fig. 2.Detected modules in the B. subtilis 168 and B. amyloliquefaciens GCN and their structural properties. (a) The network has 2867 genes (nodes) and 4436 edges (black lines connecting nodes). The 611 modules are indicated in different colors. A threshold of ρ=0.35 was used for the partial correlation measure during network reconstruction using SPACE. The network was visualized using Cytoscape v3.2.1. The nodes shown in different colors indicate gene membership of the network modules and the number of genes in the individual modules varies widely. (b) Truncated power law distribution plot for the B. subtilis 168 and B. amyloliquefaciens GCNs, plot on a log–log scale. The model fits through the data points show significantly good fits. This means that the model provides a good description of the degree distribution of the network nodes. These plots show that many nodes have few connections while a few nodes are highly connected. Included are plots of the degree distribution of randomly connected networks with the same size and number of edges as that of the B. subtilis 168 and B. amyloliquefaciens GCN. The degree distribution of the networks for the two strains is shown in black and magenta. (c) Distribution of the corresponding average clustering coefficients for the two networks.
Fig. 3.Time progression of sporulation for the B. subtilis 168 and B. amyloliquefaciens networks. (a) and (c): Bar plots of the number of DEGs at the various stages of sporulation, time instants P2 to P6 all referenced to P1 (i.e., P2–P1 to P6–P1, see Methods). (b) and (d): Corresponding Venn diagrams indicating the overlap for the subsets of the number of DEGs found at the time contrasts (P2–P1 to P6–P1) for (b) B. subtilis 168 and (d) B. amyloliquefaciens, respectively. The number of DEGs is indicated in each sector of the Venn diagram. The colored ellipses in indicate the different contrasts for the Venn diagram.