| Literature DB >> 29849014 |
Xin Chen1, Anjun Ma2,3, Adam McDermaid4,5, Hanyuan Zhang6, Chao Liu7, Huansheng Cao8, Qin Ma9,10.
Abstract
Regulons, which serve as co-regulated gene groups contributing to the transcriptional regulation of microbial genomes, have the potential to aid in understanding of underlying regulatory mechanisms. In this study, we designed a novel computational pipeline, regulon identification based on comparative genomics and transcriptomics analysis (RECTA), for regulon prediction related to the gene regulatory network under certain conditions. To demonstrate the effectiveness of this tool, we implemented RECTA on Lactococcus lactis MG1363 data to elucidate acid-response regulons. A total of 51 regulons were identified, 14 of which have computational-verified significance. Among these 14 regulons, five of them were computationally predicted to be connected with acid stress response. Validated by literature, 33 genes in Lactococcus lactis MG1363 were found to have orthologous genes which were associated with six regulons. An acid response related regulatory network was constructed, involving two trans-membrane proteins, eight regulons (llrA, llrC, hllA, ccpA, NHP6A, rcfB, regulons #8 and #39), nine functional modules, and 33 genes with orthologous genes known to be associated with acid stress. The predicted response pathways could serve as promising candidates for better acid tolerance engineering in Lactococcus lactis. Our RECTA pipeline provides an effective way to construct a reliable gene regulatory network through regulon elucidation, and has strong application power and can be effectively applied to other bacterial genomes where the elucidation of the transcriptional regulation network is needed.Entities:
Keywords: Lactococcus lactis MG1363; RECTA; acid stress response; cis-regulatory motif finding; differentially expressed gene; gene co-expression; gene regulatory network; regulon
Year: 2018 PMID: 29849014 PMCID: PMC6027394 DOI: 10.3390/genes9060278
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The flowchart of constructing the global acid stress response (ASR) transcriptional network in MG1363. Step 1: microarray data was used to generate co-expressed gene clusters and differentially expressed genes (DEGs), and the MG1363 genome sequence was used to find operons. Step 2: a motif finding progress was carried out to identify all statistically significant motifs in each of the co-expression gene modules (CEMs). Step 3: a regulon finding procedure was designed to identify all the possible regulon candidates encoded in the genome based on motif comparison and clustering. Step 4: the motifs of each of these regulons were compared to known transcription factor binding sites (TFBSs), and differential gene expression (DGE) analysis between low pH conditions and normal conditions was used to figure out the ASR-related regulons. Step 5: regulon validation based on literature information verified the significant putative regulons and expanded the results to some insufficiently significant regulons. Step 6: the ASR-related gene regulatory network (GRN) in MG1363 was predicted and described with eight regulons, nine functional modules, and 33 genes. The combination of the above information forms a genome-scale regulatory network constructed for ASR. Abbreviations: DOOR2, Database of Prokaryotic Operons 2.0; BBC, BoBro-based motif comparison; BLAST, basic local alignment search tool; BoBro, Bottleneck Broken.
Altogether, 14 significant regulons that are verified and mapped to known transcription factors (TFs). According to analyses, operon numbers and DEG determination (yes or no), matched template TFs and mapped TFs were assigned for each significant regulon, respectively, and were aligned based on regulon ID number. Five regulons containing DEGs and having the corresponding TF at the same time were bolded, being computationally verified as the regulons responsible for acid stress in MG1363.
| Regulon ID | No. of Operons | DEG | TF Template | TF (Gene) BLAST in MG1363 |
|---|---|---|---|---|
|
| 82 | Y | spo0A | |
| Regulon #3 | 32 | Y | FoxQ1 | N/A |
| Regulon #4 | 20 | Y | SPT2 | N/A |
|
| 49 | Y | lhfB | |
| Regulon #10 | 5 | N | GAL80 | llmg_0271 |
|
| 259 | Y | CovR | |
|
| 19 | Y | c4494 | |
| Regulon #20 | 79 | Y | NHP6A | N/A |
| Regulon #28 | 5 | Y | 1Z916 | N/A |
|
| 65 | Y | ihfA | |
| Regulon #37 | 10 | N | CovR | |
| Regulon #40 | 7 | Y | Awh | N/A |
| Regulon #44 | 12 | N | YBR182C | N/A |
| Regulon #47 | 5 | N | RHE_PF00288 |
Abbreviations: N, no; Y, yes; N/A, not found.
Known ASR-related gene mapping from literature in response to pH change. Literature-supported ASR-related genes found in close species or other Lactococcus lactis strains. The template transporters and genes were first identified in published studies from the NCBI and UniProt databases. Lactococcus lactis Il1403 was used as the organism which is very close to MG1363 if template gene existed. Only 36 templates that successfully mapped to the MG1363 genome were listed, which resulted in 33 genes. All mapped genes and corresponding templated were organized by their regulated pathways which were further used as functional modules. Mapped genes were searched in 51 regulons to build the connections between functional modules and regulons.
| Template Organisms | MG1363 | |||
|---|---|---|---|---|
| Organisms | Transporters | Functions/Pathways | Mapped Genes (Locus Tag) | Regulons |
|
|
| LDH | NHP6A, | |
|
| ||||
|
| ||||
|
|
| GAD | N/A | |
|
| ||||
|
|
| ADI pathway | NHP6A, | |
|
| ||||
|
| ||||
|
| ||||
|
| ||||
| Bacteria |
| Urea degradation | N/A | |
|
|
| F0/F1ATPase | llmg_1952, llmg_1951, llmg_1950, llmg_1949, llmg_1948, llmg_1947, llmg_1946, llmg_1945 | |
|
|
| Acid response | rcfB (llmg_2512) | (Regulon39, |
|
| Chaperone, Protein repair and protease |
| ||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
|
| ||||
| Envelope alterations | llmg_0878 | NHP6A, | ||
|
| DNA repair | llmg_0374, llmg_0534, llmg_1718 llmg_1221 | (Regulon39, | |
$ Three subunits of urease enzymes coded by ureABC operon found preserved in multiple bacteria. $$ Altogether eight genes. $$$ The homolog prediction or motif research results with low homolog similarity but have meaningful biological relevance. Abbreviations: LDH, L-lactate dehydrogenase; GAD, glutamate decarboxylases; ADI, arginine deiminase; N/A, not found.
Figure 2Regulon prediction using regulon identification based on comparative genomics and transcriptomics analysis (RECTA) pipeline (red) and validation and enrichment using literature information and gene blast (blue). All processes were shown in rectangles and results were highlighted with corresponding background colors. In the computational pipeline, 51 regulons with assigned motifs and operons were analyzed sequentially through significant TFBS pairing, DEG conformation, and TF BLAST. Only regulons contained DEGs (10) which had related mapped TF (8) were believed to be the final predicted ASR-related regulons (5). These five regulons were then merged into four, using the corresponding TFs to represent their names. In the literature validation process, known ASR-related transporters were first mapped to the MG1363 genome and resulted in 33 genes. Those genes were then searched in 51 regulons and determined six related regulons. All regulons resulting from both computational pipeline and literature validation were combined, along with the information of functional modules, to determine the GRN.
Figure 3A working model of the transcriptional gene regulatory network in response to pH change in L. lactis. The mechanism is activated by the change of proton signal in a cell. Regulon RcfB is assumed to be the overall activator for the rest seven regulons and controls the ASR functional module solely. Three kinds of literature were verified; significant ASR-related regulons, llrA, llrC, and hllA, and two insufficiently significant regulons, llrD (regulon #39) and regulon #8 (llmg_1803) were predicted via our workflow but with results under a 0.8 motif similarity cutoff or a hit could not be found; one putative significant regulon NHP6A controls the seven functional modules which are experimentally verified in the close species MG1363. The other significant regulon ccpA failed to be confirmed by any literature-proved genes or transporters. Two extra functional modules, GAD, and urea degradation show no direct connection to all seven of the regulons. One or more homology genes are found in MG1363 for all the nine modules using BLAST. The solid arrows indicate regulation between regulons/TFs and functional modules/genes, and the dashed arrows indicate uncertain control processes. Additionally, two ovals indicate two trans-membrane proteins; one is confirmed as F0/F1ATPase and the other one, with the dashed line, whose related information we cannot find in the public-domain literature.