| Literature DB >> 26886880 |
Wei Wu1, Xian Sun1, Yun Gao1, Jun Jiang1, Zhenling Cui2, Baoxue Ge2, Hai Wu1, Lu Zhang1, Yao Li1.
Abstract
The transcription regulatory system of Mycobacterium tuberculosis (M. tb) remains incompletely understood. In this study, we have applied the eGLECLUBS algorithm to a group of related prokaryotic genomes for de novo genome-wide prediction of cis-regulatory binding sites (CRBSs) in M. tb H37Rv. The top 250 clusters from our prediction recovered 83.3% (50/60) of all known CRBSs in extracted inter-operonic sequences of this strain. We further demonstrated that the integration of our prediction results with the ChIP-Seq datasets is very effective in identifying true binding sites of TFs. Using electrophoretic mobility shift assays and real-time RT-PCR, we experimentally verified our prediction of CRBSs for Rv0081, an important transcription factor thought to be involved in regulation of M. tb under hypoxia.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26886880 PMCID: PMC4757040 DOI: 10.1371/journal.pone.0148965
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overrepresented clusters of each TF with known binding sites.
| TF | Known binding sites | Overrepresented clusters | ||
|---|---|---|---|---|
| Rank | PBS | P-value | ||
| Rv1033c | KBS_14( | 1 | 3.12E-03 | |
| 2 | PBS_221 | 3.12E-03 | ||
| Rv1846c | KBS_19( | 1 | PBS_28 | 3.40E-03 |
| 2 | PBS_217 | 3.40E-03 | ||
| 3 | 5.80E-03 | |||
| Rv1994c | KBS_21( | 1 | 6.94E-04 | |
| 2 | PBS_27 | 6.94E-04 | ||
| 3 | PBS_158 | 6.94E-04 | ||
| Rv2359 | KBS_23( | 1 | 4.42E-21 | |
| KBS_25( | 2 | PBS_146 | 1.63E-03 | |
| KBS_27( | 3 | PBS_209 | 5.80E-03 | |
| Rv2506 | KBS_31( | 1 | 4.87E-04 | |
| 2 | PBS_171 | 4.87E-04 | ||
| 3 | PBS_156 | 4.92E-04 | ||
| Rv3066 | KBS_46( | 1 | PBS_250 | 1.94E-04 |
| 2 | PBS_142 | 2.05E-04 | ||
| 3 | PBS_138 | 9.99E-04 | ||
| 4 | PBS_40 | 3.50E-03 | ||
| 5 | 4.07E-03 | |||
| Rv3133c | KBS_47( | 1 | 2.48E-40 | |
| 2 | 5.53E-03 | |||
| 3 | PBS_219 | 5.53E-03 | ||
| Rv3574 | KBS_50( | 1 | 3.17E-53 | |
| KBS_52( | 2 | PBS_182 | 4.57E-02 | |
| KBS_54( | 3 | PBS_19 | 4.58E-02 | |
| KBS_56( | ||||
| KBS_58( | ||||
| KBS_60( | ||||
| Rv3855 | KBS_67( | 1 | PBS_152 | 4.12E-04 |
| 2 | 6.28E-04 | |||
| 3 | PBS_110 | 6.28E-04 | ||
Abbreviation: KBS, known binding sites; PBS, predicted binding sites. PBSs corresponding to KBSs are shown in parentheses. For each TF, overrepresented predicted clusters containing KBSs are indicated in bold. P values were calculated by the hypergeometric test with multiple test correction.
Fig 1Evaluation of the top-ranked clusters.
(A) The number of predicted binding sites in the top 500 clusters. (B) The number of known binding sites recovered by the top 500 clusters. (C) Cumulative recovery rate of the known binding sites in the input motifs by the top-ranked cluster, computed as the ratio of the number of cumulative known binding sites recovered in top-ranked clusters to the number of known binding sites in the set of input motifs. (D) The number of predicted clusters of the top 250 clusters for the known TFs of M. tb H37Rv.
Predicted binding sites of Rv0081 selected for experimental verifications.
| ID | Name | Sequence | Target | P value |
|---|---|---|---|---|
| 1 | PBS_218–9 | GAAAGTTC | Rv2778c | 4.44E-02 |
| 2 | PBS_70–16 | GATGCAACGTGCAT | Rv3619c | 1.26E-03 |
| 3 | PBS_218–7 | GAGAATTT | Rv1057 | 4.44E-02 |
| 4 | PBS_206–25 | TAGACGCTAC | Rv0505c | 1.26E-03 |
| 5 | PBS_206–27 | TGGGAACAAG | Rv2329c | 1.26E-03 |
| 6 | PBS_206–35 | CGAGCCCAAT | Rv2329c | 1.26E-03 |
| 7 | PBS_218–19 | AAAACTTC | Rv0002, Rv0003, Rv0004 | 4.44E-02 |
| 8 | PBS_206–53 | AGTTTGAAAT | Rv2145c | 1.26E-03 |
| 9 | PBS_218–5 | GAGAATTC | Rv1503c, Rv1504c | 4.44E-02 |
| 10 | PBS_70–15 | GGTGTAGTTCGCAC | Rv2699c | 1.26E-03 |
P values were calculated by the hypergeometric test with multiple test correction.
Fig 2Electrophoretic mobility shift assay for Rv0081.
For each PBS, the DNA (10 mM) was incubated with increasing concentrations (0, 1nM, 1.7nM and 2.3nM) of the purified Rv0081 protein (lanes 1–4, respectively).
Fig 3Confirmation of Expression fold changes of potential target proteins of Rv0081 by real-time RT-PCR.
Expression fold change of down-stream genes of PBSs mentioned previously after upregulated Rv0081 in M. tb H37Rv stain. RNA samples were extracted at exponential growth phase (14days) under two conditions. Results are shown as average fold-change (up-regulated plamid/empty plasmid as control) values of two conditions, hypoxic conditions (A) and normal conditions (B). (p<0.05, *; 0.001