| Literature DB >> 28902898 |
Minwoo Lee1, Hyejin Um1, Michael W Van Dyke1.
Abstract
One of the primary transcriptional regulators of fatty acid homeostasis in many prokaryotes is the protein FadR. To better understand its biological function in the extreme thermophile Thermus thermophilus HB8, we sought to first determine its preferred DNA-binding sequences in vitro using the combinatorial selection method Restriction Endonuclease Protection, Selection, and Amplification (REPSA) and then use this information to bioinformatically identify potential regulated genes. REPSA determined a consensus FadR-binding sequence 5´-TTRNACYNRGTNYAA-3´, which was further characterized using quantitative electrophoretic mobility shift assays. With this information, a search of the T. thermophilus HB8 genome found multiple operons potentially regulated by FadR. Several of these were identified as encoding proteins involved in fatty acid biosynthesis and degradation; however, others were novel and not previously identified as targets of FadR. The role of FadR in regulating these genes was validated by physical and functional methods, as well as comparative genomic approaches to further characterize regulons in related organisms. Taken together, our study demonstrates that a systematic approach involving REPSA, biophysical characterization of protein-DNA binding, and bioinformatics can be used to postulate biological roles for potential transcriptional regulators.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28902898 PMCID: PMC5597230 DOI: 10.1371/journal.pone.0184796
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1REPSA section of FadR-dependent IISRE cleavage-resistant DNA species.
Shown are LICOR Odyssey images of restriction endonuclease cleavage protection assays during Rounds 1 through 5 of REPSA selection with 6 nM FadR protein. The presence of FadR or IISRE FokI (F) or BpmI (B) is indicated above each lane. Lanes include: (+/-) total DNA control, (-/F or -/B) IISRE cleavage control, and (+/F or +/B) IISRE selection with FadR. The electrophoretic mobility of the intact (T) and cleaved (X) selection template, as well as the IRD7_ST2R primer (P), are indicated at right of figure.
Fig 2Validation of REPSA-selected FadR-binding DNA species.
Shown are LICOR images of EMSAs containing pooled DNA from either Round 1 (left lanes) or Round 5 (right lanes) of REPSA selection and increasing concentrations of FadR protein (from left to right: 0, 0.6, 6, 60, or 600 nM FadR). The electrophoretic mobility of a single protein-DNA complex (S) as well as uncomplexed ST2R24 selection template (T) and IRD7_ST2R primer (P) are indicated at right of figure.
Fig 3Sequence logos of REPSA-selected FadR-binding sequences.
Sequence logos were determined using MEME software with inputs of 1000 Round 5 DNA sequences. (A) MEME performed with no filters. (B) Palindromic filter.
Fig 4Quantitative EMSA analysis of FadR-binding to consensus and mutant sequences.
Shown are LICOR images of IRD700-labeled FadR consensus or point-mutated DNAs, as indicated, incubated with a twofold (wt, m1, m3, m4, m5, m7) or tenfold (m2, m6) titration of FadR protein, as indicated. (S) FadR-DNA complex, (T) uncomplexed DNA. (A) ST2_FadR_R5_wt consensus DNA; 0, 0.038, 0.075, 0.15, 0.3, 0.6, 1.2 nM FadR. (B) ST2_FadR_R5_m1 mutant DNA; 0, 0.38, 0.75, 1.5, 3.0, 6.0, 12 nM FadR. (C) ST2_FadR_R5_m2 mutant DNA; 0, 0.06, 0.6, 6, 60, or 600 nM FadR. (D) ST2_FadR_R5_m3 mutant DNA; 0, 3.8, 7.5, 15, 30, 60, 120 nM FadR. (E) ST2_FadR_R5_m4 mutant DNA; 0, 0.75, 1.5, 3.0, 6.0, 12, 24 nM FadR. (F) ST2_FadR_R5_m5 mutant DNA; 0, 7.5, 15, 30, 60, 120, 240 nM FadR. (G) ST2_FadR_R5_m6 mutant DNA; 0, 0.06, 0.6, 6, 60, or 600 nM FadR. (H) ST2_FadR_R5_m7 mutant DNA; 0, 7.5, 15, 30, 60, 120, 240 nM FadR. Binding site sequence and KD values are indicated below each panel. Lowercase nucleotides indicate mutation from consensus FadR sequence.
FIMO of best possible match TTGGACTTAGTCCAA.
| Start | End | Sequence | Location | Pro? | Gene | ||
|---|---|---|---|---|---|---|---|
| 848942 | 848956 | 6.01e-10 | 0.00183 | –11 | Y | ||
| 380839 | 380853 | 1.66e-09 | 0.00183 | –14 | Y | ||
| –17 | Y | ||||||
| 572967 | 572981 | 1.66e-09 | 0.00183 | –5 | Y | ||
| 811792 | 811806 | 1.73e-09 | 0.00183 | +2 | Y | ||
| 1065216 | 1065230 | 3.39e-09 | 0.00287 | –14 | Y | ||
| –47 | Y | ||||||
| 9906 | 9920 | 4.48e-09 | 0.00316 | –17 | Y | ||
| 370770 | 370784 | 8.38e-09 | 0.00507 | –14 | Y | ||
| 1389097 | 1389111 | 5.04e-08 | 0.0213 | –26 | Y | ||
| –36 | Y | ||||||
| 1549819 | 1549833 | 7.54e-08 | 0.0249 | +23 | |||
| 1089565 | 1089579 | 8.10e-08 | 0.0249 | –52 | Y | ||
| +42 | Y | ||||||
| 103944 | 103958 | 6.68e-07 | 0.135 | –8 | Y | ||
| 724515 | 724529 | 6.99e-07 | 0.135 | +909 | N |
(P-value) Defined as the probability of a random sequence of the same length matching that position of the sequence with as good or better score. (Q-value) False discovery rate if the occurrence is accepted as significant. (Location) Distance of the FadR binding site from the identified start site of translation. (Pro?) Promoter identified proximally upstream of the gene.
(~) Indicates that while a promoter is present, the FadR-binding site does not overlap.
Fig 5Bioinformatic identification of T. thermophilus HB8 promoters potentially regulated by FadR.
Shown are sequences +/- 200 bp of the FadR-binding sequence of a target gene identified through FIMO analysis as being potentially regulated by FadR (see Table 1). Longest open reading frames with identical orientation as the target gene are indicated with blue nucleotides. Open reading frames with opposite orientation are indicated with green nucleotides. Black nucleotides indicate intergenic regions. Potential promoter elements (-30 and -10 boxes, +1 start site of transcription) were identified using Softberry BPROM and are indicated with blue highlighting. FadR-binding sites are indicated with yellow highlighting. Regions of overlap between FadR-binding sites and promoter elements are indicated by green highlighting. Underlining and italics indicate core promoter elements and FadR-binding sites, respectively, identified previously by Agari et al [7].
Potential FadR-regulated genes.
| Promoter | Operon | Gene | Role | Ratio ( | Fold change | Adj. |
|---|---|---|---|---|---|---|
| Y | 1 | 3-hydroxyacyl-CoA dehydrogenase | 9775.4/375.3 | 26.0 | 3.06e-5 | |
| 2 | acetyl-CoA acetyltransferase | 14762.8/426.9 | 34.6 | 1.63e-4 | ||
| 3 | acyl-CoA dehydrogenase | 11722/449.7 | 26.1 | 4.68e-4 | ||
| Y | 1 | hypothetical protein | 641.7/209.9 | 3.1 | 7.34e-3 | |
| 2 | zinc-binding dehydrogenase | 255/96.6 | 2.6 | 5.46e-2 | ||
| Y | N | hypothetical protein | 275.2/113 | 2.4 | 5.46e-2 | |
| Y | N | medium-chain-fatty-acid—CoA ligase | 868.3/307.1 | 2.8 | 7.34e-3 | |
| Y | (2) | metallo-beta-lactamase | 923.3/161.2 | 5.7 | 4.95e-3 | |
| 3 | AsnC family transcriptional regulator | 704/475.6 | 1.5 | 0.251 | ||
| 4 | CAAX amino terminal protease | 331.1/146.5 | 2.3 | 0.146 | ||
| Y | N | iron-sulfur protein | 2793.2/678.8 | 4.1 | 1.34e-2 | |
| Y | N | hypothetical protein | 472.4/269.6 | 1.8 | 0.116 | |
| Y | 1 ( | medium-chain acyl-CoA ligase-related protein | 2938.7/1085.4 | 2.7 | 5.46e-2 | |
| 2 | gluconate 5-dehydrogenase | 308.4/174.6 | 1.8 | 0.204 | ||
| 3 | acyl-CoA dehydrogenase | 224.3/143 | 1.6 | 0.212 | ||
| 4 | phosphotransferase | 137.6/64.9 | 2.1 | 0.365 | ||
| 5 | hypothetical protein | 149.8/106.5 | 1.4 | 0.445 | ||
| 6 | phosphoglycerate mutase family protein | 90.3/62.7 | 1.4 | 0.257 | ||
| Y | 1 ( | hypothetical protein | 154.9/87.1 | 1.8 | 0.203 | |
| 2 | hypothetical protein | 115.3/68.8 | 1.7 | 0.246 | ||
| 3 | methylmalonyl-CoA epimerase | 712.4/550.6 | 1.3 | 0.349 | ||
| 4 | hypothetical protein | 120.6/146.6 | 0.8 | 0.553 | ||
| Y | 1 | phosphoribosyltransferase | 1668.7/795.4 | 2.1 | 0.296 | |
| 2 | hypothetical protein | 917.5/953.7 | 1.0 | 0.965 | ||
| Y | N | long-chain fatty acid—CoA ligase | 3145.6/867.1 | 3.6 | 4.04e-2 | |
| Y | 1 | sensor histidine kinase | 166.4/183.9 | 0.9 | 0.682 | |
| 2 | response regulator receiver domain-containing protein | 121.8/62.9 | 1.9 | 0.228 | ||
| cation-transporting ATPase | 2421.4/854.7 | 2.8 | 5.46e-2 | |||
| Y | 1 | acetyl-coenzyme A dehydrogenase medium subunit | 6247.9/1088.1 | 5.7 | 1.26e-3 | |
| 2 | electron transfer flavoprotein subunit beta | 6786.9/1190.5 | 5.7 | 2.26e-3 | ||
| 3 | electron transfer flavoprotein subunit alpha | 4030.9/866.5 | 4.7 | 1.79e-3 | ||
| Y | 1 | oxidoreductase | 488.6/134.8 | 3.6 | 4.67e-2 | |
| 2 | hypothetical protein | 326.1/104.4 | 3.1 | 2.58e-2 | ||
| 3 | TetR family transcriptional regulator, | 16.5/201.2 | (0.1) | (1.26e-3) |
(Operon) Number indicates gene position within an operon. Parentheses indicates FadR site is not before the first gene of an identified operon. Values in italics indicate differences between databases in their identification of operon members. (N) Single transcriptional unit, not part of an operon. (Gene) Underlined genes indicate those identified by previous investigators [7]. (Ratio) Ratio of averaged Affymetrix GeneChip signals from ΔfadR and wild-type strains of T. thermophilus HB8 deposited in NCBI GEO by Agari et al. [7]. (Fold change) Fold change in expression, ΔfadR:wt T. thermophilus HB8 strains.
Fig 6EMSA analysis of FadR-binding to REPSA-identified promoter sequences through a titration of FadR concentrations.
Shown are LICOR images of IRD700-labeled DNA probes containing FadR-binding sites from the REPSA-identified promoter regions, as indicated, incubated with 0, 0.38, 0.75, 1.5, 3, 6, or 12 nM FadR protein. (S) FadR-DNA complex, (T) uncomplexed DNA. (A) ST2_FadR_TTHA0890 DNA. (B) ST2_FadR_TTHA0402 DNA. (C) ST2_FadR_TTHA0604 DNA. (D) ST2_FadR_TTHA0846 DNA. (E) ST2_FadR_TTHA1118 DNA. (F) ST2_FadR_TTHB017 DNA. (G) ST2_FadR_TTHA0390 DNA. (H) ST2_FadR_TTHA1463 DNA. (I) ST2_FadR_TTHA1144 DNA. (J) ST2_FadR_TTHA0103 DNA. Binding site sequence and KD values are indicated below each panel. Lowercase nucleotides indicate mutation from consensus FadR sequence.
BLASTn analysis of orthologous FadR sites.
| ttj | tth | taq | tos |
|---|---|---|---|
| ( | ( | ||
| ( | ( |
Organisms investigated include Thermus thermophilus HB8 (ttj), Thermus thermophilus HB27 (tth), Thermus aquaticus Y51MC23 (taq), and Thermus oshimai JL2 (tos). (15/15) The number of identical bases present in the orthologous gene FadR site. (TO73_1062) Genes in parentheses have FadR sites downstream of their translation start sites.
(–) Orthologous sequences not found.
FadR clusters of co-regulated orthologous operons.
| CRON | Max. score | tth | taq | dra | ddr | dge |
|---|---|---|---|---|---|---|
| 9 | 5.33 | 3w | 3s | 3s | 2s, 1w | |
| 2 | 5.28 | 1s | 1n | 1s | 1s | |
| 4 | 5.26 | 3s | 1s, 1n | 2w | 1w, 1n | |
| 1 | 5.23 | 3s | 3n | 2s | 3n | |
| 28 | 5.28 | 1s | ||||
| 12 | 5.20 | 1s | ||||
| 13 | 5.20 | 2s | ||||
| 10 | 5.18 | 2s, 6n | 3n | 3n | 3n | |
| 8 | 5.11 | 3s | 1n | 1w, 1n | 1w | |
| 20 | 5.29 | 3n | 2n | 1n | 2n | |
| 25 | 5.18 | |||||
| 3 | 5.06 | 2w, 1n | ||||
| 30 | 5.00 | 3n | ||||
| 29 | 4.81 | 5n | 1w | 3n | 3n |
(CRON) Co-regulated orthologous operon number from the RegPredict run. (Max. Score) Measure of the similarity of candidate FadR-binding sites in the orthologous promoters compared to the position-weight matrix made by the training set. Organisms investigated include T. thermophilus HB27 (tth), T. aquaticus Y51MC23 (taq); D. radiodurans R1 (dra), D. deserti VCD115 (ddr), and D. geothermalis DSM 11300 (dge). Genes within the FadR-regulated T. thermophilus HB 27 operon are shown. For the other organisms, indicated are the numbers of orthologous genes that have strong (s), weak (w), or no (n) identified FadR sites in their promoters.
(–) No orthologous operons identified.