| Literature DB >> 28720801 |
Rosalba Perrone1, Enrico Lavezzo1, Erika Riello1, Riccardo Manganelli1, Giorgio Palù1, Stefano Toppo2, Roberta Provvedi3, Sara N Richter4.
Abstract
Mycobacterium tuberculosis is the causative agent of tuberculosis (TB), one of the top 10 causes of death worldwide in 2015. The recent emergence of strains resistant to all current drugs urges the development of compounds with new mechanisms of action. G-quadruplexes are nucleic acids secondary structures that may form in G-rich regions to epigenetically regulate cellular functions. Here we implemented a computational tool to scan the presence of putative G-quadruplex forming sequences in the genome of Mycobacterium tuberculosis and analyse their association to transcription start sites. We found that the most stable G-quadruplexes were in the promoter region of genes belonging to definite functional categories. Actual G-quadruplex folding of four selected sequences was assessed by biophysical and biomolecular techniques: all molecules formed stable G-quadruplexes, which were further stabilized by two G-quadruplex ligands. These compounds inhibited Mycobacterium tuberculosis growth with minimal inhibitory concentrations in the low micromolar range. These data support formation of Mycobacterium tuberculosis G-quadruplexes in vivo and their potential regulation of gene transcription, and prompt the use of G4 ligands to develop original antitubercular agents.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28720801 PMCID: PMC5515968 DOI: 10.1038/s41598-017-05867-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Number of putative G4s in both strands of the Mtb genome within 50 nts upstream of a primary TSS.
| Pattern (l_n_d) | Total G4s | G4s in TSS | |
|---|---|---|---|
| Forward strand | 2_4_15 | 33081 | 1115 |
| 2_4_11 | 30561 | 805 | |
| 2_4_7 | 21186 | 426 | |
| 3_4_15 | 902 | 29 | |
| 3_4_11 | 495 | 13 | |
| 3_4_7 | 223 | 4 | |
| 4_4_15 | 82 | 0 | |
| 4_4_11 | 10 | 0 | |
| 4_4_7 | 5 | 0 | |
| 5_4_15 | 0 | 0 | |
| 5_4_11 | 0 | 0 | |
| 5_4_7 | 0 | 0 | |
| Reverse strand | 2_4_15 | 33061 | 1122 |
| 2_4_11 | 31109 | 845 | |
| 2_4_7 | 21747 | 479 | |
| 3_4_15 | 1074 | 33 | |
| 3_4_11 | 574 | 14 | |
| 3_4_7 | 252 | 6 | |
| 4_4_15 | 21 | 1 | |
| 4_4_11 | 4 | 0 | |
| 4_4_7 | 1 | 0 | |
| 5_4_15 | 1 | 0 | |
| 5_4_11 | 0 | 0 | |
| 5_4_7 | 0 | 0 |
Position of the found G4s in the Mtb genome is available in Supplementary Files S1a and S1b.
G4 sequences upstream or overlapping TSS in the Mtb genome, forming G4s with at least three stacked tetrads (at least 3 Gs in each G-rich island) and with short or long score ≥ 2.
| Rv number | Gene name | Nts to TSSa | G4 sequence |
|---|---|---|---|
| Rv0011c | crgA | −14 |
|
| Rv3779 | −35 |
| |
| Rv0284 | eccC3 | −18 |
|
| Rv3208 | −35 |
| |
| Rv2639c | −26 |
| |
| Rv0713 | −47 |
| |
| Rv1338 | murI | 1 |
|
| Rv3435c | 35 |
| |
| Rv2597 | −25 |
| |
| Rv3802c | 18 |
| |
| Rv2732c | −22 |
| |
| Rv1539 | lspA | −12 |
|
| Rv3484 | −24 |
| |
| Rv0150c | −12 |
| |
| Rv2030c | −17 |
| |
| Rv2405 | −48 |
| |
| Rv3404c | 44 |
| |
| Rv2559c | 30 |
| |
| Rv3207c | 34 |
| |
| Rv2308 | −29 |
| |
| Rv0750 | 21 |
| |
| Rv0471c | 20 |
| |
| Rv0628c | −27 |
| |
| Rv1253 | deaD | −40 |
|
| Rv2979c | −47 |
| |
| Rv0245 | −4 |
| |
| Rv1121 | zwf1^ | −8 |
|
| Rv0392c | ndhA^ | −47 |
|
| Rv2457c | clpX^ | 10 |
|
| Rv1419 | 40 |
| |
| Rv1327c | glgE | 12 |
|
| Rv0851c | 18 |
| |
| Rv3634c | galE1 | −4 |
|
| Rv2847c | cysG | −30 |
|
| Rv0896 | gltA2 | −23 |
|
| Rv2367c | −34 |
| |
| Rv0166 | fadD5 | −36 |
|
| Rv1704c | cycA | −13 |
|
| Rv0339c | −14 |
| |
| Rv1049 | mosR^ | −38 |
|
| Rv0238 | −7 |
| |
| Rv1152 | 41 |
| |
| Rv1151c | −32 |
| |
| Rv2021c | −36 |
| |
| Rv1082 | mca | −21 |
|
G tracts with at least three Gs are shown in bold. GG tracts are underlined since they may aid G4 folding. Tracts with the potential to form a bulged G4 (i.e. GXGG, where X is any of the three remaining bases) are additionally shown in italics. The symbol^ indicates genes, the corresponding G4 sequences of which were chosen for further investigation. Rv number is the gene numeration in the considered reference strain H37Rv. aPosition of the last nt of the G4 motif with respect to the TSS. Asterisks indicate that the reported G4 sequence is in the reverse strand.
Figure 1CD spectra of the putative G4 molecules of zwf1 (a), clpx (b), mosR (c) and ndhA (d) in the presence of increasing KCl concentrations (0–150 mM).
Melting temperatures (Tm) of Mtb G4 oligonucleotides (4 µM) in the absence and presence of increasing KCl concentrations (50–150 mM) and G4 ligands (16 µM).
| G4 | K+ (mM) | G-4 ligand added | Tm (°C) | ΔTm (°C) (TmK+[50] or [100] or [150]-TmK+[0]) | Tm (°C) | ΔTm (°C) (TmK+[100]G4 ligand-TmK+[100]) |
|---|---|---|---|---|---|---|
| zwf1 | 0 | 36.8 ± 1.27 | ||||
| 50 | 44.2 ± 1.4 | 7.4 | ||||
| 100 | 48.5 ± 0.5 | 11.7 | ||||
| 150 | 52.2 ± 1.9 | 15.4 | ||||
| 100 | B19 | >90.0 | >41.5 | |||
| 100 | NDI | >90.0 | >41.5 | |||
| clpx | 0 | 40.1 ± 0.1 | ||||
| 50 | 52.0 ± 0.8 | 11.9 | ||||
| 100 | 59.2 ± 0.8 | 19.1 | ||||
| 150 | 74.2 ± 0.6 | 34.1 | ||||
| 100 | B19 | 79.9 ± 0.4 | 20.7 | |||
| 100 | NDI | 79.3 ± 1.4 | 20.1 | |||
| mosR | 0 | 37.8 ± 0.9 | ||||
| 50 | 49.4 ± 0.5 | 11.6 | ||||
| 100 | 50.8 ± 1.7 | 13.0 | ||||
| 150 | 51.7 ± 1.3 | 13.9 | ||||
| 100 | B19 | (I) 68.2 ± 1.1 | 17.4 | |||
| (II) 50.7 ± 0.6/79.7 ± 0.7 | −/28.9 | |||||
| 100 | NDI | (I) 79.4 ± 1.0 | 28.6 | |||
| (II) 53.9 ± 1.6/82.5 ± 2.3 | 3.1/31.7 | |||||
| ndhA | 0 | 60.5 ± 0.3 | ||||
| 50 | 45.4 ± 1.6/ 71.2 ± 2.1 | −/10.7 | ||||
| 100 | 38.1 ± 0.4 / 74.4 ± 1.3 | −/13.9 | ||||
| 150 | 52.6 ± 3.1 /80.2 ± 1.3 | −/19.7 | ||||
| 100 | B19 | (I) > 90 | >51.9/ > 15.6 | |||
| (II) 63.8 ± 1.5/85.2 ± 0.9 | 25.7/10.8 | |||||
| 100 | NDI | >90.0 | >51.9/ > 15.6 |
When more than one G4 species were observed in the CD spectrum (i.e. I, II), Tm values for each species were reported. B19 and NDI stand for the G4 ligands BRACO-19 and c-exNDI 2, respectively.
Figure 2Effect of the G4 ligands BRACO-19 and c-exNDI 2 on the conformation of the selected Mtb G4s. (a) Chemical structures of the G4 ligands BRACO-19 and c-exNDI 2. (b) CD spectra of G4 oligonucleotides zwf1, clpx, mosR and ndhA (final concertation 4 μM) in the presence of KCl (100 mM) and BRACO-19 or c-exNDI 2 (final concentration 16 μM) to assess G4 topology changes. The molar ratio oligonucleotide:compound was 1:4.
Figure 3Taq polymerase stop assay. (a) Sequencing PAGE of Taq-amplified zwf1, clpX, ndhA and mosR templates in the absence (lanes 1) or presence of 100 mM KCl (lanes 2) and G4 ligands BRACO-19 (lanes 3) or c-exNDI 2 (lanes 4). The control template is a sequence unable to fold in G4. Symbols *, ¤, § and # indicate pausing sites just before the G4 region of the templates. Pr indicates the band of the labeled primer. M is a marker lane obtained with the Maxam and Gilbert sequencing protocol. B and N indicate BRACO-19 and c-exNDI 2, respectively. (b) Sequences of the selected G4 oligonucleotides. The exact position of the pausing sites within the template G4 sequence is indicated by the symbols *, ¤, § and #, as shown also in (a). (c) Quantification of the intensity of the stop bands obtained in (a).
Figure 4Resazurine Microplate Assay to measure the activity of different G4 ligands (BRACO- 19 and c-exNDI2) on Mtb.