| Literature DB >> 34854920 |
David M Wood1, Renwick C J Dobson1,2, Christopher R Horne3,4.
Abstract
Transcription is the principal control point for bacterial gene expression, and it enables a global cellular response to an intracellular or environmental trigger. Transcriptional regulation is orchestrated by transcription factors, which activate or repress transcription of target genes by modulating the activity of RNA polymerase. Dissecting the nature and precise choreography of these interactions is essential for developing a molecular understanding of transcriptional regulation. While the contribution of X-ray crystallography has been invaluable, the 'resolution revolution' of cryo-electron microscopy has transformed our structural investigations, enabling large, dynamic and often transient transcription complexes to be resolved that in many cases had resisted crystallisation. In this review, we highlight the impact cryo-electron microscopy has had in gaining a deeper understanding of transcriptional regulation in bacteria. We also provide readers working within the field with an overview of the recent innovations available for cryo-electron microscopy sample preparation and image reconstruction of transcription complexes.Entities:
Keywords: bacterial transcription; cryo-electron microscopy; gene expression and regulation; protein-DNA; transcription factors
Mesh:
Year: 2021 PMID: 34854920 PMCID: PMC8786299 DOI: 10.1042/BST20210674
Source DB: PubMed Journal: Biochem Soc Trans ISSN: 0300-5127 Impact factor: 5.407
Figure 1.Schematic model of bacterial transcription initiation and regulation by transcription factors.
(A) In bacteria, transcription is initiated by the formation of the RNA polymerase (RNAP) holoenzyme, which comprises the RNAP core and a sigma (σ) factor (e.g. σ70). Upon interaction with promoter DNA, the RNAP holoenzyme forms a closed transcription complex. The subunits of the RNAP core, σ factor and sequence-specific interactions with the promoter DNA are illustrated. RNAP core: α N-terminal/C-terminal domains (αNTD/CTD) — blue; β and β′ — grey; ω — black. σ factor — orange, linker regions (black lines). Promoter DNA (grey): UP element — cyan; −35 element — yellow; spacer — white; extended −10 (EXT) — violet; −10 element — dark red; discriminator (DISC) — peach; transcription start site (+1, arrow). Sequence-specific interactions are highlighted with dash lines. (B) Transcription factors, known as activators or repressors can activate or repress transcription initiation, respectively. The general mechanism of each is shown here. Transcriptional activators (left panel, green) bind to a site upstream of the promoter (−35 and −10 element) and transcription start site (+1, arrow) where they can recruit the RNAP holoenzyme by interacting with the C-terminal of the α-subunit (αCTD) of RNAP. This process can be enhanced by small molecules or effectors (purple hexagon) to increase the rate of transcription. In contrast, transcriptional repressors (right panel, red) bind to a site that overlaps the core −35 and −10 elements of the promoter to directly block the binding of RNAP to the promoter, switching gene transcription off (shown by red cross). In the presence of an effector (blue hexagon) the DNA affinity is reduced, and the repressor dissociates from the promoter. This allows RNAP to be recruited, switching gene transcription on.
Summary of bacterial transcription complex structures resolved by cryo-EM
| Transcription complex | Family | Organism | EMDB ID | PDB ID | Reference |
|---|---|---|---|---|---|
| EcmrR-RPo2 | MerR |
| EMD-22234 | 6XL5 | [ |
| EcmrR-RPo2 (EcmrR-spacer DNA complex) | EMD-22235 | 6XL6 | |||
| EcmrR-RPint3 with 3 nt RNA transcript | EMD-22236 | 6XL9 | |||
| EcmrR-RPint3 with 3 nt RNA transcript (EcmrR-spacer DNA complex) | EMD-22237 | 6XLA | |||
| EcmrR-RPint3 with 4 nt RNA transcript | EMD-22245 | 6XLJ | |||
| EcmrR-RPint3 with 4 nt RNA transcript (EcmrR-spacer DNA complex) | EMD-22246 | 6XLK | |||
| EcmrR-RPo2 (clearer σ70 density) | EMD-23291 | - | |||
| BmrR-RNA polymerase complex | MerR |
| EMD-30390 | 7CKQ | [ |
| CueR-RNA polymerase complex (without RNA transcript) | MerR |
| EMD-22184 | 6XH7 | [ |
| CueR-RNA polymerase complex (with RNA transcript) | EMD-22185 | ||||
| CueR-RNA polymerase complex (clearer σ70 density) | EMD-22289 | ||||
| CueR- RNA polymerase complex | MerR |
| EMD-30268 | 6LDI | [ |
| CueR- RNA polymerase complex (with fully duplex DNA) | EMD-0874 | 7C17 | |||
| NanR-dimer1/DNA complex | GntR |
| EMD-21652 | 6WFQ | [ |
| NanR-dimer3/DNA complex | EMD-21661 | 6WG7 | |||
| BusR-tetramer1/pAB DNA complex | GntR |
| EMD-13119 | 7OZ3 | [ |
| BusR-tetramer1/pAB1 DNA complex | EMD-12051 | 7B5Y | |||
| TraR-Eσ70 (state I) | LuxR |
| EMD-0348 | 6N57 | [ |
| TraR-Eσ70 (state II) | EMD-0349 | 6N58 | |||
| TraR-Eσ70 (state III) | EMD-20231 | N/A | |||
| MmfR-dimer2/DNA complex | TetR |
| EMD-20781 | N/A | [ |
| Class-II CAP-TAC1 without RNA transcript (state I) | CAP |
| EMD-20287 | 6PB5 | [ |
| Class-II CAP-TAC1 without RNA transcript (state II) | EMD-20288 | 6PB6 | |||
| Class-II CAP-TAC1 with RNA transcript (state II) | EMD-20286 | 6PB4 | |||
| Class-I CAP-TAC1 | CAP |
| EMD-7059 | 6B6F | [ |
| Class-I CAP-TAC1 (focused map on αCTD-CAP region) | EMD-7060 | ||||
| Crl-EσS-RNA polymerase complex | Crl |
| EMD-200090 | 6OMF | [ |
| Spx-RNA polymerase complex | Spx |
| EMD-31485 | 7F75 | [ |
| WhiB7-RPo2 | WhiB |
| EMD-22886 | 7KIF | [ |
| WhiB7-RPc4 | EMD-22887 | 7KIM | |||
| Rgg2-short hydrophobic peptide complex | Rgg |
| EMD-22341 | 7JI0 | [ |
| GreB-RNA polymerase elongation complex (pre-RNA cleavage) | Gre |
| EMD-4892 | 6RIN | [ |
| GreB-RNA polymerase elongation complex (post-RNA cleavage) | EMD-4885 | 6RI7 | |||
| GreB-RNA polymerase reactivated complex (before RNA extension) | EMD-4882 | 6RH3 | |||
| CarD-RPo2 | CarD |
| EMD-9037 | 6EDT | [ |
| CarD-RNA polymerase intermediate (with 8-nt RNA transcript) | EMD-9039 | 6EE8 | |||
| CarD-RPo2 (with corallopyronin A) | EMD-9041 | 6EEC | |||
| CarD-RNA polymerase holoenzyme (with corallopyronin A) | EMD-9047 | 6M7J | |||
| CarD-RPo2 (with Sorangicin A) | CarD |
| EMD-21407 | 6VVY | [ |
| CarD-S456LRPo2 (with Sorangicin A) | EMD-21408 | 6VW0 | |||
| CarD-RPo2 (with Sorangicin A) (with 8-nt RNA transcript) | EMD-21406 | 6VVX | |||
| CarD-S456LRPo2 (with Sorangicin A) (with 8-nt RNA transcript) | EMD-21409 | 6VVZ | |||
| SspA-σ70-RPo2 | GST |
| EMD-30307 | 7C97 | [ |
| DksA-RPo2 (State I) with guanosine tetraphosphate (ppGpp) | DksA |
| EMD-21881 | 7KHI | [ |
| DksA-RPo2 (State II) with guanosine tetraphosphate (ppGpp) | EMD-21883 | 7KHE | |||
| NusG- | NusG |
| EMD-7351 | 6C6U | [ |
| RfaH-NusG-N-Term- | RfaH | EMD-7350 | 6C6T | ||
| RfaH-full-length- | RfaH | EMD-7349 | 6C6S | ||
| RNAP-HelD | HelD |
| EMD-21921 | 6WVK | [ |
| HelD |
| EMD-10996 | 6YXU | [ | |
| EMD-11004 | 6YYS | ||||
| EMD- 11026 | 6Z11 | ||||
| Spt4/5-RNAP complex (with antibodies) | Spt4/5 |
| EMD- 1840 | N/A | [ |
| Mfd-dependent transcription termination complex | MfD |
| EMD- 30117 | 6M6A | [ |
| Mfd-dependent transcription termination complex with ATPγS | MfD | EMD- 30118 | 6M6B | ||
| Mfd-bound RNA polymerase elongation complex — L1 state (with ATP) | MfD |
| EMD-21996 | 6X26 | [ |
| Mfd-bound RNA polymerase elongation complex — L2 state (with ADP) | EMD-22006 | 6X2F | |||
| Mfd-bound RNA polymerase elongation complex — I state | EMD-22012 | 6X2N | |||
| Mfd-bound RNA polymerase elongation complex — II state | EMD-22039 | 6X43 | |||
| Mfd-bound RNA polymerase elongation complex — III state | EMD-22043 | 6X4W | |||
| Mfd-bound RNA polymerase elongation complex — IV state | EMD-22044 | 6X4Y | |||
| Mfd-bound RNA polymerase elongation complex — V state | EMD-22045 | 6X50 | |||
| σ70-RPo2 | σ-factor |
| EMD-0001 | 6GH5 | [ |
| σ70-RNA polymerase (intermediate partially loaded) complex | EMD-0002 | 6GH6 | |||
| σ70-RNA polymerase (initially transcribing) complex | EMD-4397 | 6GFW |
1CAP-TAC, cAMP receptor protein-dependent transcription activation complex.
2RPo, RNA polymerase-promoter open complex.
3RPint, RNA polymerase-promoter initial transcribing complex.
3RPc, RNA polymerase-promoter closed complex.
N/A, Not available.
Figure 2.Recent bacterial transcription complexes solved by cryo-EM.
Examples (A,B) of transcriptional activator complexes that distort DNA. (A) Dimeric EcmrR in complex with promoter DNA (cartoon inset, PDB-6XL6), remodels (58° kink) the promoter DNA to create the optimal promoter architecture for E. coli holoenzyme [σ70 (grey surface) RNAP (α2ββ′ω surface colour shown in shaded box)] to form the EcmrR-RPo (PDB-6XL5) [22]. (B) Overview of the E. coli class-1 CAP-TAC (PDB-6B6H). The cyclic adenosine 3′,5′-monophosphate receptor (CAP) protein dimer (dark blue and dark red, cartoon inset), binds its cognate DNA and αCTD of RNAP (green) to introduce three DNA kinks (33°, 55°, 21°). This results in a full 92° turn, optimally orienting the promoter DNA for σ70-RNAP to bind [31]. Examples of transcription factors that stabilise aspects of the transcription complex are shown in C and D. (C) Crl (dark orange cartoon) binds residues of the β′clamp (beige cartoon) on E. coli RNAP and alternative σ factor σS (grey cartoon) through a distinct interface (shown by spheres) in the left inset. This tethering action creates a Crl-σS-RNAP complex that binds alternate promoter DNA to form the Crl-σS-RPo complex (PDB-6OMF) shown to the right of the inset [32]. (D) WhiB7 (cyan, cartoon) is a transcriptional activator in Mycobacterium tuberculosis (Mtb) that binds an AT-rich ‘hook’ sequence of DNA (shown by arrow) and σA (cartoon inset). By binding the active Mtb RNAP holoenzyme (surfaces coloured as E. coli RNAP), it creates the WhiB7-RPo (PDB-7KIF) [34]. Examples of steric occlusion in transcriptional repressors are shown in E and F. (E) Three E. coli NanR dimers binds three GGTATA repeats to form a NanR-dimer3/DNA complex. Their close proximity allows intramolecular protein–protein interactions to stabilise the multimeric assembly (PDB-6WG7). The 70.5 kDa cryo-EM structure of dimeric NanR in complex with cognate DNA (PBD-6WFQ) [26]. (F) Streptococcus agalactiae (Stag) BusR binds palindromic promoter DNA as a tetramer to repress transcription (PDB-7OZ3) [27]. The 5′ and 3′ DNA strands have been annotated throughout.
Figure 3.Contemporary cryo-EM strategies for solving protein–DNA complexes.
Recent innovations in sample preparation (left panel), 3D reconstruction (middle panel) and model building (right panel) are illustrated. During sample preparation (left panel) there are three facets that must be optimised: sample stability (e.g. optimising the length and number of binding sites within the DNA scaffold); sample homogeneity (using size exclusion chromatography); and preferential orientation (optimising spot-to-plunge time). To combat structural heterogeneity during 3D reconstruction (middle panel) and explore dynamics, multi-body refinement [65], masked refinement, 3D variability analysis [66] or cryoDRGN, using a neural network [67] can be employed. Density map is shown in grey, with hypothetical flexibility shown in orange. To annotate density maps during model building (right panel), Haruspex [68] or Emap2sec+ [69] can be utilised. As transcription complexes are dynamic assemblies, flexible fitting tools (cryo_fit [70], ISOLDE [71], iMODFIT [72] or Namdinator [73]) are required to fit PDB components (e.g. protein and DNA) into a cryo-EM density map (shown in grey) to generate a model (shown on the bottom right of panel). Cryo-EM data, maps and models are from ref. 63 (EMD-21652, PDB-6WFQ).