Literature DB >> 34450657

Saturation mutagenesis charts the functional landscape of Salmonella ProQ and reveals a gene regulatory function of its C-terminal domain.

Alisa Rizvanovic1, Jonas Kjellin1, Fredrik Söderbom1, Erik Holmqvist1.   

Abstract

The global RNA-binding protein ProQ has emerged as a central player in post-transcriptional regulatory networks in bacteria. While the N-terminal domain (NTD) of ProQ harbors the major RNA-binding activity, the role of the ProQ C-terminal domain (CTD) has remained unclear. Here, we have applied saturation mutagenesis coupled to phenotypic sorting and long-read sequencing to chart the regulatory capacity of Salmonella ProQ. Parallel monitoring of thousands of ProQ mutants allowed mapping of critical residues in both the NTD and the CTD, while the linker separating these domains was tolerant to mutations. Single amino acid substitutions in the NTD associated with abolished regulatory capacity strongly align with RNA-binding deficiency. An observed cellular instability of ProQ associated with mutations in the NTD suggests that interaction with RNA protects ProQ from degradation. Mutation of conserved CTD residues led to overstabilization of RNA targets and rendered ProQ inert in regulation, without affecting protein stability in vivo. Furthermore, ProQ lacking the CTD, although binding competent, failed to protect an mRNA target from degradation. Together, our data provide a comprehensive overview of residues important for ProQ-dependent regulation and reveal an essential role for the enigmatic ProQ CTD in gene regulation.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34450657      PMCID: PMC8464044          DOI: 10.1093/nar/gkab721

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The ability of bacteria to survive and thrive in dynamic environments relies on precise and rapid regulation of gene expression. RNA-binding proteins (RBPs) are key factors in post-transcriptional regulatory networks and often work in conjunction with small regulatory RNAs (sRNAs) (1,2). At the molecular level, specific sequence and/or structural RNA motifs are recognized by RBPs through RNA-binding domains, among which the S1 domain, the cold-shock domain, the Sm domain and the K homology domain are well-studied examples (3). The arrangement and combination of RNA-binding domains define the regulatory activity of an RBP (4). To understand how RBPs regulate their RNA targets, it is therefore important to know how these domains function. Recent advances in global RBP identification have assigned RNA-binding activity to many proteins and domains not previously implicated in RNA–protein interactions (5–16). Among bacterial RPBs, ProQ has recently emerged as a global RNA binder in Salmonella enterica (S. enterica) and Escherichia coli (E. coli) (13,14,16–18) and is one of the founding members of the ProQ/FinO protein family (PF04352 Pfam) (14). The members of this family are ubiquitously found in proteobacterial species (19–22), and, in addition to ProQ, include plasmid-encoded FinO proteins (23–25), RocC (19,26) and Lpp1663 (27,28) in Legionella pneumophila, NMB1681 in Neisseria meningitidis (21,29) and S. enterica FopA (22). All of these proteins possess the well-conserved ProQ/FinO domain, which has been suggested to harbor the major RNA-binding activity in each respective protein (19,22,28,30–35). Many ProQ/FinO family members have additional N- or C-terminal domains or extensions (19,22,28–30,36), but their contribution to RNA-binding and/or gene regulatory activity largely remains unknown (19,25,30,32,34). ProQ itself was initially identified as a positive factor for proline uptake and osmoprotection (37–39). It is now well established that ProQ interacts with hundreds of different RNA targets, primarily sRNAs and 3′ untranslated regions (UTRs) of mRNAs (13,14,16,18,32). ProQ recognizes RNA stem-loop structures, flanked by 5′-located A-rich and 3′ located U-rich sequences, which often overlap with intrinsic terminators (13,32). Despite the detailed mapping of ProQ–RNA interactions, the functional consequences following binding are in most cases still unclear. Transcriptomic studies have shown that deletion or overexpression of proQ leads to global changes in gene expression (18,40,41), but a direct link between RNA binding and gene regulation has only been described for a few RNA targets. For instance, ProQ protects the cspE mRNA from 3′-dependent RNA degradation by RNase II (13), blocks RNase III-mediated cleavage of the duplex formed by sRNAs RybB and RbsZ (18) and promotes translation inhibition exerted by the sRNA RaiZ on hupA mRNA (40). Congruent with ProQ’s role as a global regulator, this protein has been linked to diverse cellular responses, including adaptation to osmotic stress (37–39), biofilm formation (42), adaptation to resource limitation (43), bacterial virulence (41) and motility (40,41). ProQ folds into two globular N- and C-terminal domains (NTD and CTD) bridged by an extended, unstructured linker (30). The NTD, spanning residues 1–119, constitutes the ProQ/FinO domain (20,30,44,45) and appears to harbor the core RNA-binding activity; the NTD alone is sufficient for binding to the majority of tested RNA ligands in vitro (32). Similarly, screening for RNA binding-defective ProQ mutants using a bacterial three-hybrid strategy identified many critical residues within the NTD but none in the CTD (31). In contrast to the ProQ NTD, the ProQ CTD is exclusively found in ProQ but absent from other proteins in the ProQ/FinO family. The CTD, spanning residues 180–228, is mostly composed of ß-sheets that form a barrel-like structure, which partially resembles the Tudor domain (30), known to mediate protein–protein interactions via methylated amino acid residues (46). However, Tudor domains are usually found in eukaryotic proteins (30), and no evolutionary relationship between the ProQ CTD and Tudor domains has been convincingly established. Compared to the NTD, the function of the CTD is understudied. Even though it may participate in RNA-binding and RNA strand exchange (30,32,39), the contribution of such activities for ProQ’s overall RNA-binding activity and/or regulatory function is so far unclear. To better understand the relationship between the RNA-binding activity and regulatory function of ProQ, we developed a screen that allowed for systemic identification of ProQ mutant proteins with impaired regulatory activity. We identified >20 residues in the NTD that are critical for ProQ-dependent gene regulation in vivo. These residues largely overlap with those affecting ProQ–RNA interactions (31), indicating an intimate relationship between RNA-binding and gene regulation. Interestingly, mutations in the NTD rendered ProQ unstable in vivo, suggesting that the protein is stabilized through interaction with RNA. In addition to the NTD, our screen identified residues in the CTD which, when mutated, lead to overstabilization of several RNA targets and rendered ProQ inactive in activating a reporter gene, without affecting protein stability in vivo. In line with this, we show that ProQ lacking the CTD, although binding competent, critically impacts the regulatory activity of ProQ.

MATERIALS AND METHODS

Bacterial strains and growth conditions

All bacterial strains used in this study are listed in Supplementary Table S1. Bacteria were routinely grown aerobically at 37°C in LB medium with shaking at 200 rpm. Where indicated, the growth medium was supplemented with kanamycin (50 μg/ml), tetracycline (15 μg/ml), chloramphenicol (30 μg/ml), ampicillin (100 μg/ml) or isopropanyl ß-D-1-thiogalactopyranoside (IPTG) at 5, 50, 100, 250, 500, 1000 or 2000 μM.

Cloning

Plasmids and oligonucleotides used in this study are listed in Supplementary Tables S2 and S3, respectively. To construct transcriptional reporter plasmid pPfliE-gfp (pAR022), plasmid pUA66 was cut with XhoI and BamHI, and ligated to a PCR product amplified from SL1344 genomic DNA (primers EHO-1144/1283) and cut with XhoI and BamHI. The ProQ-plasmid (pAR009) was constructed in three steps. First, promoter PBAD was replaced with promoter PLlacO on plasmid pCH450, yielding plasmid pAR001. Briefly, the PLlacO sequence was amplified (primers EHO-1006/1007) from plasmid pZE12-luc and ligated to the PCR product (primers, EHO-1004/1005) amplified from pCH450. The PCR products were cut with NsiI and BglII prior to ligation. Second, the lacI repressor gene was added to pAR001 to construct plasmid pAR007. The lacI sequence was amplified (primers EHO-1047/1048) from plasmid pACYCDuet-1, cut with NsiI and ligated to NsiI-digested pAR001. Finally, the proQ ORF, amplified (EHO-1000/1001) from plasmid pZE12-proQ, was inserted into plasmid pAR007. Both the insert and the vector were cut with NdeI and NotI prior to ligation. The pdTomato-ProQ plasmid (pAR018) was constructed in two steps. First, the dTomato gene was amplified (primers EHO-992/993) from strain MH235 and inserted into pAR007 to construct plasmid pAR013. Both insert and vector were cut with NdeI and EcoRI prior to ligation. Second, the proQ sequence was amplified (EHO-1000/1001) from pZE12-proQ and inserted into pAR013. Both the insert and the vector were cut with NdeI and NotI prior to ligation. pAR052 was constructed by adding a 3xFLAG sequence to the 3′-end of the proQ ORF on plasmid pAR009. The pAR009 plasmid was amplified using primers EHO-1614/1615 and ligated. The parental plasmid was digested with DpnI prior ligation. To construct plasmids expressing ProQ truncations, the sequences representing residues 1–129, 1–179 and 130–228 of proQ were amplified (primers EHO-1625/1565, EHO-1625/1626, EHO-1627/1628, respectively) from pAR052, followed by DpnI digestion to remove the parental plasmid, and ligation to construct plasmids pAR053, pAR054 and pAR055, respectively. To construct plasmids expressing 3xFLAG-tagged mutant ProQ proteins, the pAR052 plasmid was amplified using primers EHO-1551/1552, EHO-1560/1561, EHO-1558/1559 and EHO-1562/1563, and ligated, generating plasmid pAR058, pAR059, pAR060 and pAR061, respectively.

Error-prone PCR

Error-prone PCR of proQ on plasmid pdTomato-ProQ (pAR018) was carried out in a two-step PCR reaction using the GeneMorph EZClone Domain Mutagenesis Kit (Stratagene). The first PCR reaction (Mutant Megaprimer Synthesis) contained 500 ng of plasmid pAR018, dNTPs (200 μM), primers EHS-1123 and EHS-1124, Mutazyme II DNA Polymerase and Mutazyme II Reaction Buffer in a total volume of 50 μl. The cycling conditions were: 120 s at 95°C, 30 cycles of 30 s at 95°C, 30 s at 56°C, 60 s at 72°C and 10 min at 72°C. The PCR reaction was analyzed on a 1% agarose gel and purified with the GeneJET PCR Purification Kit (ThermoScientific). The second PCR reaction (EZClone Reaction) contained 50 ng of plasmid pAR018, 250 ng of the PCR product from the first PCR reaction (used as primers here), EZClone Enzyme Mix and EZClone Solution in a total volume of 50 μl. The cycling conditions were: 60 s at 95°C, and 25 cycles of 50 s at 95°C, 50 s at 60°C, 840 s at 68°C. After 2 min on ice, 1.5 μl of DpnI was added for 2 h at 37°C to remove template plasmid.

ProQ mutant library preparation

Escherichia coli TOP10 cells (Invitrogen) were made competent by washing exponentially growing cells three times in cold sterile H2O, after which 50 μl of competent cells were transformed with 10 ng of products from the EZClone Reaction by electroporation. After recovery for 1.5 h at 37°C, cells were plated on agar plates and incubated overnight at 37°C. Colonies obtained after transformation were washed off agar plates with LB medium and pooled. Mutant plasmids were extracted using the GeneJet Plasmid Miniprep Kit (ThermoScientific), and 50 ng of plasmids were transformed into competent SL1344 deleted for proQ and harboring PfliE-gfp (pAR022) as described above. Colonies obtained after transformation were washed off the agar plates with LB medium, pooled and divided into aliquots.

GFP and dTomato measurements

For bulk GFP measurements, bacterial overnight cultures grown from single colonies were diluted 1:100 in fresh LB medium and grown in 96-well black assay plates with clear flat bottom (Costar®) at 37°C. Fluorescence (GFP: excitation: 480 nm, emission 520 nm) and optical density (OD) (600 nm) were measured for 16 h at 10 min intervals in a plate reader (Tecan infinite pro). Relative GFP/OD600 values were calculated for stationary phase cells (16 h of growth). Background GFP fluorescence and OD600 values from LB medium alone were first subtracted from all values obtained from wells containing bacterial cultures. For each well, GFP fluorescence values were then divided by OD600 values to normalize for differences in cell density. A mean value and standard deviation were calculated based on the background-subtracted GFP/OD600 values from every well containing a biological replicate of the same bacterial strain. Finally, relative values between bacterial strains were calculated by dividing the mean value for each strain by the mean value for the reference strain. For single cell analysis of GFP fluorescence and dTomato fluorescence, flow cytometry was performed using a MACSQuant VYB (Miltenyi Biotec). Bacterial cultures grown from single colonies were pelleted, diluted in sterile-filtered PBS and loaded in a 96-well polystyrene plate for analysis. GFP was excited with a blue laser (488 nm; bandpass filter 525/50 nm, channel B1). dTomato was excited with a yellow laser (561 nm; bandpass filter 615/20 nm, channel Y2). A total of 20 000–100 000 events were recorded for each sample. Data were acquired with the MACSQuantify™ Software (Miltenyi Biotec) and processed with FlowJo™ Software (Becton, Dickinson and Company).

Fluorescence-activated cell sorting (FACS)

Sample aliquots of the mutant library and SL1344 control cells deleted for proQ, harboring the PfliE-gfp reporter (pAR022) and either the non-mutated pdTomato-ProQ plasmid (pAR018) or an empty control (pAR007), were diluted 100-fold in fresh LB-medium and grown overnight at 37°C. Cells were pelleted by centrifugation for 5 min at 3700 × g, resuspended in RNAlater (ThermoScientific) and diluted 100-fold in sterile PBS. FACS was carried out using a MoFlo Astrios EQ (Beckman Coulter, USA) cell sorter with 488 nm and 532 nm lasers for excitation of GFP and dTomato, respectively. The trigger was set on forward scatter at a threshold of 0.05%. Sorted cells (2 × 106 cells/pool) were re-analyzed by the MoFlo Astrios EQ cell sorter and the MACSQuant VYB flowcytometer. Data analysis of approximately 20 000 cells/pool was done using the FlowJo™ Software. The remaining sorted cells were pelleted by centrifugation and resuspended in 100 μl of H2O.

High-throughput sequencing

Each pool of sorted cells was used as template in a PCR reaction to generate sequences for high-throughout sequencing analysis. Before PCR, sorted cells were denatured at 95°C for 5 min. Each PCR reaction contained 2 μl of sample template, dTNPs, 10 μM barcode-specific primers (forward primer: EHO-1436–1447, reverse primer: EHO-1448), Phusion High-Fidelity (HF) DNA Polymerase (ThermoScientific) and Phusion HF buffer (ThermoScientific) in a total volume of 50 μl. PCR program: 30 s at 98°C, 30 cycles of 10 s at 98°C, 30 s at 70°C, 15 s at 72°C and 5 min at 72°C. PCR products were analyzed on 1% agarose gels and purified using the GeneJET PCR Purification Kit, followed by DNA concentration measurements using Qubit Fluorometric Quantitation (ThermoScientific). All samples were pooled, of which 500 ng was used for sequencing-library preparation. Library preparation and sequencing of the pooled DNA sample (barcoded amplicons) were performed at the Uppsala Genome Centre (UGC), a national facility within the National Genomic Infrastructure (NGI) hosted by Science for Life Laboratory (SciLifeLab) in Uppsala, Sweden. Sequencing libraries were constructed using the SMRTbell™ Template Prep Kit 1.0 (500 bp to 20 kb) and purified with the AMPure Clean-Up kit (PacBio). The library quality was evaluated using a Bioanalyzer Chip (Agilent) before sequencing. Sequencing was performed on a Sequel™ SMART® Cell using the Sequell II system (PacBio).

Bioinformatic analysis

Primer sequences were trimmed from the circular consensus reads (CCS) obtained from the PacBio sequencing with cutadapt v. 2.26.0 (47). To assure that all the reads were output in the same orientation, cutadapt was run with the –revcomp flag. Further sequencing read filtering and subsequent quantification of mutations were performed with python scripts (github.com/kjellinjonas/proQ_mutagenesis). Shortly, trimmed reads of the same length as unmodified ProQ were analyzed for number of mutations on both nucleotide and amino acid levels. Next, reads with mutations corresponding to single amino acid substitutions were quantified, and the distribution of mutations along the ProQ sequence was analyzed. Finally, enriched mutations were identified (FDR-adjusted P ≤ 0.1), and sample clustering was performed by principal component analysis (PCA) using DESeq2 v. 1.18.1 (48) in R v. 3.4.1 (49).

Site-directed mutagenesis

Identified mutations were re-constructed by site-directed mutagenesis. The PCR reactions contained 10 ng of template plasmid pAR018 or plasmid pAR009, dNTPs (200 μM), mutagenic primers listed in Supplementary Table S3, Phusion High-Fidelity DNA Polymerase and Phusion HF buffer in a total volume of 50 μl. The PCR products were analyzed on 1% agarose gels and purified with the GeneJET PCR Purification Kit. After removal of template plasmid with DpnI for 1 h at 37°C, the PCR products (20–100 ng) were ligated using T4 DNA ligase (ThermoScientific) and transformed into E. coli TOP10 cells. Constructed plasmids were extracted using the GeneJet Plasmid Miniprep Kit, validated by Sanger Sequencing (Eurofins) and transformed into competent SL1344 cells deleted for proQ, harboring PfliE-gfp (pAR022).

Western blotting

Bacterial cultures were grown to an OD600 of 2.0, pelleted by centrifugation and resuspended in Laemmli buffer. After denaturation for 5 min at 95°C, samples were separated on Mini-PROTEAN® TGX Stain-Free protein gels (BioRad). Proteins were transferred to PVDF membranes using the TransBlot TURBO transfer system (BioRad) according to the manufacturer’s protocol. ProQ proteins were detected using an anti-ProQ antibody at a 1:10 000 dilution (42) for 1 h at room temperature, followed by incubation with an HRP-conjugated anti-rabbit antibody (Sigma) at 1:100 000 dilution. 3xFLAG-tagged proteins were detected using an HRP-conjugated anti-FLAG antibody at 1:10 000 dilution (Sigma). Even gel loading was validated by probing the membranes with an HRP-conjugated anti-GroEL antibody (Sigma) at 1:50 000 dilution. Chemiluminescense was detected using Amersham™ ECL™ Prime reagents according to the supplier's protocol (GE Healthcare) and a ChemiDoc™ System (BioRad).

Northern blotting

RNA extraction and northern blotting were performed as described (13) with the exceptions that (i) samples were loaded on denaturing gels (5% polyacrylamide, 8 M urea, 1xTBE), (ii) Church buffer (50) was used to block non-specific binding sites on the membrane and (iii) membranes were washed twice in 1× SSC/0.1% SDS. Sequences for the oligonucleotides used for cspE mRNA (EHO-913), cspE 3′UTR (EHO-862), cspD mRNA (EHO-863), RaiZ sRNA (EHO-924), 5S rRNA (EHO-690) and transfer-messenger RNA (tmRNA, EHO-867) detection are shown in Supplementary Table S3.

RESULTS

Construction of a fluorescence-based reporter monitoring ProQ-dependent gene expression

ProQ is a global RNA-binding protein that affects the expression of a large number of genes (13,14,16,18,41). However, a direct connection between RNA binding and gene regulation has only been examined for a few RNA ligands (13,18,40). Regarding ProQ’s RNA-binding activity, a recent study used a three-hybrid strategy to identify ProQ residues critical for stable RNA interactions in vivo (31). As a complementary approach, we here developed a screen for high-throughput identification of ProQ mutants that abolish the protein’s gene regulatory capacity. To this end, we first aimed at constructing a fluorescence-based reporter amenable to high-throughput screening for ProQ-dependent effects on gene expression. Transcriptomic analyses have consistently associated deletion of proQ with reduced expression of motility genes (14,41); the majority of genes involved in flagellar assembly are downregulated in a proQ deletion strain (Figure 1A) (14). In Salmonella, the flagellar system comprises over 60 genes, which are organized into a transcriptional hiearchy of three promoter classes. On top of the hierarchy, class I promoters express the FlhDC master regulator that drives transcription, either directly or indirectly, of all downstream genes (Figure 1B) (51–54). Based on this well-characterized hierarchical regulation of flagellar gene expression, we hypothesized that the observed ProQ-dependent effects stem from a direct post-transcriptional event at the top of the hierarchy, which results in indirect effects on transcription downstream (Figure 1B). To test this, we constructed a transcriptional GFP reporter driven from the promoter of fliE, a class II gene that is directly activated by the FlhDC master regulator (Figure 1B, C and Supplementary Figure S1A). As expected, deletion of proQ resulted in a significant reduction in fluorescence from the PfliE-gfp construct, indicating that ProQ indirectly activates transcription of fliE. Complementing the ΔproQ strain with an IPTG-inducible ProQ overexpression plasmid resulted in a direct and positive correlation between IPTG concentration, ProQ levels and PfliE activity (Figure 1C and Supplementary Figure S1A, S1B). In the absence of IPTG, ΔproQ cells harboring the ProQ complementation plasmid had similar PfliE activity as wild-type cells with an empty plasmid. This is consistent with leaky expression from the complementation plasmid, giving ProQ protein levels similar to endogenous expression from the native proQ locus (Supplementary Figure S1B). Hence, the PfliE-gfp construct indirectly reports on ProQ-dependent regulation of gene expression and should enable screening of ProQ mutants with reduced regulatory activity.
Figure 1.

ProQ-dependent regulation of the flagellar pathway. (A) ProQ positively affects flagellar gene expression. Violin plot displaying the distribution of gene expression changes upon proQ deletion in Salmonella grown in LB to early stationary phase. The plot is based on RNA-seq data from (14). (B) Transcription of flagellar genes (e.g. fliE) is activated by the master regulator FlhDC. The positive effect on flagellar genes exerted by ProQ may stem from an upstream effect on FlhDC production. (C) Salmonella wild-type or ΔproQ strains were transformed with PfliE-gfp, together with empty vector (WT, ΔproQ) or ProQ overexpression plasmid (pProQ). Cells were grown in LB-medium supplemented with increasing concentrations of IPTG (0, 100, 250, 500 μM final concentration) to induce expression of ProQ. GFP fluorescence and OD600 were measured during growth in 96-well plates. Relative GFP fluorescence/OD600 values were normalized to the wild-type strain with an empty vector. The mean values and SD are based on three biological replicates.

ProQ-dependent regulation of the flagellar pathway. (A) ProQ positively affects flagellar gene expression. Violin plot displaying the distribution of gene expression changes upon proQ deletion in Salmonella grown in LB to early stationary phase. The plot is based on RNA-seq data from (14). (B) Transcription of flagellar genes (e.g. fliE) is activated by the master regulator FlhDC. The positive effect on flagellar genes exerted by ProQ may stem from an upstream effect on FlhDC production. (C) Salmonella wild-type or ΔproQ strains were transformed with PfliE-gfp, together with empty vector (WT, ΔproQ) or ProQ overexpression plasmid (pProQ). Cells were grown in LB-medium supplemented with increasing concentrations of IPTG (0, 100, 250, 500 μM final concentration) to induce expression of ProQ. GFP fluorescence and OD600 were measured during growth in 96-well plates. Relative GFP fluorescence/OD600 values were normalized to the wild-type strain with an empty vector. The mean values and SD are based on three biological replicates. Next, to allow monitoring of proQ expression by measuring fluorescence, we modified the ProQ expression plasmid by inserting the ORF encoding the red fluorescent dTomato protein in-frame with ProQ. While induction of dTomato alone did not affect the fliE promoter, induction of dTomato-ProQ resulted in similarly high GFP levels as did overexpression of native ProQ (Supplementary Figure S1C). Cells harboring both the PfliE-gfp construct and the dTomato-ProQ plasmid were subjected to single cell analysis to simultaneously monitor dTomato-ProQ expression and ProQ-dependent regulation. While the majority of cells harboring an empty plasmid gave low fluorescence levels in both the green and red channels, the majority of cells harboring the dTomato-ProQ-expressing plasmid gave high fluorescence levels for both channels (Supplementary Figure S1D). Of note, expression from the fliE promoter was heterogeneous, leading to co-existing populations of cells expressing either low or high GFP, with the distribution of cells within each state being determined by the concentration of dTomato-ProQ. This is in line with previous findings of heterogenous expression of flagellar genes, yielding co-existing on- and off-states in response to external factors (55–57). Thus, the combination of two fluorescent reporters allows for simultaneous monitoring of ProQ expression and ProQ-dependent regulation.

Generation and sorting of a ProQ mutant library

Having established a system in which the activity of ProQ can be easily detected using the readout of a fluorescence reporter, we next set out to delineate the regulatory capacity of ProQ by mutational studies. To enable high-throughput identification of ProQ mutants impaired in mediating gene regulation, we used our previously developed method for saturation mutagenesis coupled to phenotypic cell sorting and high-throughput sequencing (58). Briefly, we introduced mutations into the proQ ORF by error-prone PCR and cloned mutant proQ sequences in-frame with dTomato on the IPTG-inducible plasmid (Figure 2A). The mutant library was transformed into ΔproQ Salmonella cells carrying the PfliE-gfp reporter, yielding approximately 67 000 transformants. This number is predicted to cover each possible single nucleotide mutation in the proQ sequence (58). Single cell analysis showed that the majority of cells transformed with the mutant library gave dTomato fluorescence levels similar to the non-mutated control (Figure 2B), indicating stable expression of most dTomato-ProQ mutants. Among these cells, half resulted in high GFP fluorescence levels similar to cells expressing non-mutated dTomato-ProQ, while the other half phenocopied the low GFP levels of ΔproQ cells harboring an empty vector. These results demonstrate that the mutant library contained both ProQ mutants that retained and lost the protein’s regulatory capacity.
Figure 2.

Generation and sorting of a ProQ mutant library. (A) Overview of the experimental setup. (B) Flow cytometry analysis of Salmonella ΔproQ strains carrying a PfliE-gfp reporter construct together with the empty control vector (gray), the non-mutated dTomato-ProQ expression plasmid (pdTomato-ProQ, black) or the mutant dTomato-ProQ plasmid library (light purple: all cells, dark purple: cells with high dTomato expression levels). Cells were grown in LB-medium supplemented with IPTG (500 μM final concentration) to induce expression of dTomato-ProQ fusions. dTomato and GFP fluorescence were measured for single cells. Gates were set according to the non-mutated control (indicated by dotted line). (C) Salmonella cells harboring the ProQ mutant library were first sorted into different pools based on dTomato and GFP fluorescence levels. Pool 1, sorted for high dTomato and high GFP, represents ProQ mutants with intact function. Pool 2, sorted for high dTomato and low GFP, represents ProQ mutants with impaired function. Successful sorting was verified by independent single cell analysis. dTomato and GFP fluorescence were measured as indicated. Representative data are shown for one out of three replicates for each pool.

Generation and sorting of a ProQ mutant library. (A) Overview of the experimental setup. (B) Flow cytometry analysis of Salmonella ΔproQ strains carrying a PfliE-gfp reporter construct together with the empty control vector (gray), the non-mutated dTomato-ProQ expression plasmid (pdTomato-ProQ, black) or the mutant dTomato-ProQ plasmid library (light purple: all cells, dark purple: cells with high dTomato expression levels). Cells were grown in LB-medium supplemented with IPTG (500 μM final concentration) to induce expression of dTomato-ProQ fusions. dTomato and GFP fluorescence were measured for single cells. Gates were set according to the non-mutated control (indicated by dotted line). (C) Salmonella cells harboring the ProQ mutant library were first sorted into different pools based on dTomato and GFP fluorescence levels. Pool 1, sorted for high dTomato and high GFP, represents ProQ mutants with intact function. Pool 2, sorted for high dTomato and low GFP, represents ProQ mutants with impaired function. Successful sorting was verified by independent single cell analysis. dTomato and GFP fluorescence were measured as indicated. Representative data are shown for one out of three replicates for each pool. The mutant library was physically sorted into two pools based on dTomato and GFP levels using fluorescence-activated cell sorting (FACS). Gates for GFP fluorescence were set according to levels in cells harboring the PfliE-gfp reporter combined with either an empty control vector (low GFP) or the non-mutated dTomato-ProQ plasmid (high GFP), yielding approimately a 5-fold difference in GFP levels between gates. The gate for dTomato fluorescence was set according to fluorescence from the non-mutated dTomato-ProQ protein (Figure 2C). Pool 1 contained cells expressing high dTomato and high GFP, representing ProQ mutants that retained their ability to activate PfliE-gfp. Pool 2 contained cells expressing high dTomato and low GFP, representing ProQ mutants impaired in PfliE activation. As a control, cells carrying a non-mutated dTomato-ProQ were sorted for high dTomato and high GFP. The sorting was performed on three independent cultures, each of which gave approximately 2 million cells per pool. Succesful sorting was verified by independent single cell analysis (Figure 2C and Supplementary Figure S2).

Sequencing data analysis

After FACS sorting, sequences covering the mutated proQ ORF were PCR-amplified from each pool using barcoded primers, pooled together and subjected to high-throughput long-read sequencing. After high-throughput sequencing and read processing, we obtained between 6700 and 17 100 reads of correct length for each of the different pools. Quantifying the number of reads with mutated proQ nucleotide sequences for each pool showed that 96% (±1.0%) of reads for the non-mutated ProQ control had wild-type sequence (Figure 3A). In libraries prepared from pool 1 samples (mutant library, high GFP), 19.0% (±0.1%) of sequencing reads had wild-type sequence, in comparison to 6.2% (±0.4%) for libraries prepared from pool 2 samples (mutant library, low GFP). Thus, for pool 2 samples, representing ProQ mutants with impaired function, approximately 94% of sequencing reads had at least one nucleotide substitution.
Figure 3.

Sequencing data analysis. (A) Distribution of mutations. Percentage of reads obtained with 0, 1, 2, 3 or ≥ 4 nucleotide changes for sorted pools of the mutated library (pool 1, pool 2) and non-mutated control (pdTomato-ProQ). Mean values and SD are given for three replicates. (B) Principal component analysis of data clustering for sorted mutant libraries (pool 1, pool 2). PC1 and PC2 represents 50.7% and 17% of the variance in the data, respectively. Three replicates are shown for each sorted pool. (C) Distribution of single amino acid substitutions in samples from pools of the mutated library (pool 1, pool 2) and non-mutated control (pdTomato-ProQ). On top, a ProQ domain representation with the N-terminal domain (NTD, orange), the linker region (gray) and the C-terminal domain (CTD, yellow). (D) Substitution profile based on DESeq2 analysis of differences in mutation patterns between pool 1, sorted for high dTomato and high GFP, representing ProQ mutants with intact function, and pool 2, sorted for high dTomato and low GFP, representing ProQ mutants with impaired function. A colored bar indicates a unique amino acid substitution detected for the corresponding amino acid position. The colors indicate whether a specific amino acid substitution was over-represented (red) or underrepresented (blue) in pool 2 compared to pool 1. Only those substitutions that differed significantly (FDR-adjusted P-value ≤ 0.1) between the pools are shown. The ProQ N-terminal domain, linker region and C-terminal domain are indicated (NTD, linker, CTD). Each column indicates by which amino acid the amino acid position was replaced to. Asterisk (*) indicate stop codon. All positions of mutations that were significantly enriched can be found in Supplementary Table S4.

Sequencing data analysis. (A) Distribution of mutations. Percentage of reads obtained with 0, 1, 2, 3 or ≥ 4 nucleotide changes for sorted pools of the mutated library (pool 1, pool 2) and non-mutated control (pdTomato-ProQ). Mean values and SD are given for three replicates. (B) Principal component analysis of data clustering for sorted mutant libraries (pool 1, pool 2). PC1 and PC2 represents 50.7% and 17% of the variance in the data, respectively. Three replicates are shown for each sorted pool. (C) Distribution of single amino acid substitutions in samples from pools of the mutated library (pool 1, pool 2) and non-mutated control (pdTomato-ProQ). On top, a ProQ domain representation with the N-terminal domain (NTD, orange), the linker region (gray) and the C-terminal domain (CTD, yellow). (D) Substitution profile based on DESeq2 analysis of differences in mutation patterns between pool 1, sorted for high dTomato and high GFP, representing ProQ mutants with intact function, and pool 2, sorted for high dTomato and low GFP, representing ProQ mutants with impaired function. A colored bar indicates a unique amino acid substitution detected for the corresponding amino acid position. The colors indicate whether a specific amino acid substitution was over-represented (red) or underrepresented (blue) in pool 2 compared to pool 1. Only those substitutions that differed significantly (FDR-adjusted P-value ≤ 0.1) between the pools are shown. The ProQ N-terminal domain, linker region and C-terminal domain are indicated (NTD, linker, CTD). Each column indicates by which amino acid the amino acid position was replaced to. Asterisk (*) indicate stop codon. All positions of mutations that were significantly enriched can be found in Supplementary Table S4. We limited the downstream analysis to mutants with a single amino acid substitution. PCA revealed formation of dense clusters for the three replicates from each pool, which were well separated from samples from the other pool, indicating high reproducibility of the data (Figure 3B). In pool 1, single amino acid substitutions were distributed across the entire ProQ sequence, covering 100% of all residues (Figure 3C). In pool 2, mutations were also found in 100% of all residues but were mainly present in the N- and C-terminal domains. To find amino acid changes that were significantly associated with either retained or impaired ProQ function, DESeq2 analysis was applied to pool 1 and pool 2 samples. In total, 66 amino acid substitutions showed a statistically significant (FDR-adjusted P ≤ 0.1) association with either pool 1 or pool 2 (Figure 3D and Supplementary Table S4). As expected, significantly enriched mutations that led to nonsense codons were exclusively found in pool 2 and were excluded from further analysis. Together, 30 distinct nonsynonymous substitutions, spanning 24 residues, were significantly enriched in pool 2 (Supplementary Table S4), indicating that these mutations disrupted the ProQ regulatory function in vivo. These substitutions were located in both the ProQ NTD and CTD, while pool 2 completely lacked significant mutations in the linker region. In contrast, five substitutions mapping to the linker were significantly enriched in pool 1 samples, indicating that, in comparison to the globular NTD and CTD, changes in the linker are generally less dentrimental to ProQ function.

Loss-of-function mutations in the ProQ NTD correlate with impaired RNA-binding activity

The majority of amino acid substitutions that were significantly enriched for loss-of-function were located in the ProQ NTD and covered 21 residues (Figure 3D and Supplementary Table S4). Most of these residues are well conserved in proteins within the ProQ/FinO family (Supplementary Figure S3) and thus likely to be functionally important. The mutated residues include both surface-exposed and buried residues, based on their location in the NMR structure of the E. coli ProQ NTD (Supplementary Figure S4A) (30), which is >99% identical to the Salmonella ProQ NTD. The majority of mutated buried residues are hydrophobic, indicating that the mutations may compromise the core structure of the NTD. Mutated residues on the NTD surface included hydrophobic (L63P, L71P, V74D), basic (R20P, R80H), acidic (D82E) residues and one glycine residue (G85C). The mutated surface residues form distinct clusters on the concave- and convex sides of the domain, respectively (Figure 4A and Supplementary Figure S4A), in line with a previously proposed model for E. coli ProQ–RNA interaction (31).
Figure 4.

Mutations of ProQ NTD residues are critical for ProQ functionality in vivo. (A) Surface representation of the N-terminal domain of ProQ (PBD ID: 5NB9). Amino acid residues sensitive for substitutions linked with loss-of-function are mapped and colored according to amino acid property: basic residues (blue), acidic residues (green), hydrophobic residue (orange) and glycine (red). (B) Mutations linked with loss-of-function (R80H), or retained function (A18V), were re-introduced into dTomato-ProQ plasmids (pdTomato-ProQ) and expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct. dTomato and GFP fluorescence were measured by flow cytometry. See Supplementary Figure S4B for results obtained for five additional NTD loss-of-function mutants. Note that data values for empty vector control and non-mutated control (pdTomato-ProQ) are the same as shown in Figure 6B. (C) GFP fluorescence and OD600 of samples from (4B, S4B) were measured at population level using a microplate reader. Color indication, orange: mutation associated with retained ProQ function, red: mutation associated with loss-of-function, black: non-mutated control (pdTomato-ProQ), gray: vector. GFP fluorescence/OD600 values are relative to each wild-type strain with empty vector. Mean values and SD are given for three biological replicates. (D) Northern blot analysis of cspE mRNA levels with respect to non-mutated (pdTomato-ProQ) or mutated dTomato-ProQ as indicated. 5S rRNA served as loading control.

To validate the NTD mutation data, we selected enriched mutations from the dataset, re-introduced them in the dTomato-ProQ plasmid, and assayed them individually. For this analysis, we selected loss-of-function mutations located either in the hydrophobic core (L34Q, L57R, L83R) or at the protein surface (R80H, D82E, G85C) (Supplementary Figure S4A), as well as one mutation (A18V) enriched in pool 1. The generated mutants were tested for activation of the transcriptional PfliE-gfp reporter using flow cytometry (Figure 4B and Supplementary Figure S4B). All seven ProQ mutants displayed dTomato levels comparable to the parental non-mutated control, indicating that the mutations did not affect protein expression. Furthermore, all mutants enriched for loss-of-function (L34Q, L57R, R80H, D82E, L83R, G85C) failed to fully activate the PfliE-gfp reporter, in contrast to the A18V mutant which gave GFP levels similar to the non-mutated ProQ control. These results were confirmed by bulk GFP measurements using a microplate reader (Figure 4C). Together, these results validate the data obtained by sorting and long-read sequencing, and suggest that mutations in the ProQ NTD (L34Q, L57R, R80H, D82E, L83R, G85C) are linked to regulatory defects of ProQ. To investigate another well-known ProQ target, the identified NTD mutants were tested for ProQ-dependent stabilization of the cspE mRNA. The CspE protein is a global RBP which is critical for Salmonella pathogenicity (59). In Salmonella, ProQ stabilizes cspE mRNA by binding at the 3′ end, thereby protecting from RNase II-mediated degradation (13). RNA was harvested from ΔproQ cells expressing the ProQ NTD mutants and subjected to northern blot analysis using a probe against the cspE mRNA (Figure 4D). Complementing the proQ deletion strain with wild-type ProQ strongly elevated cspE mRNA levels, while none of the ProQ NTD mutants fully stabilized the cspE mRNA. Strikingly, the cspE levels for five out of six NTD mutants (L34Q, L57R, R80H, D82E, L83R) were comparable to that of the proQ deletion control. This corroborates our previous results (Figure 4B and Supplementary Figure S4B, S4C) and shows that mutations in the NTD (L34Q, L57R, R80H, D82E, L83R, G85C) have a general effect on ProQ-dependent regulation. Mutations of ProQ NTD residues are critical for ProQ functionality in vivo. (A) Surface representation of the N-terminal domain of ProQ (PBD ID: 5NB9). Amino acid residues sensitive for substitutions linked with loss-of-function are mapped and colored according to amino acid property: basic residues (blue), acidic residues (green), hydrophobic residue (orange) and glycine (red). (B) Mutations linked with loss-of-function (R80H), or retained function (A18V), were re-introduced into dTomato-ProQ plasmids (pdTomato-ProQ) and expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct. dTomato and GFP fluorescence were measured by flow cytometry. See Supplementary Figure S4B for results obtained for five additional NTD loss-of-function mutants. Note that data values for empty vector control and non-mutated control (pdTomato-ProQ) are the same as shown in Figure 6B. (C) GFP fluorescence and OD600 of samples from (4B, S4B) were measured at population level using a microplate reader. Color indication, orange: mutation associated with retained ProQ function, red: mutation associated with loss-of-function, black: non-mutated control (pdTomato-ProQ), gray: vector. GFP fluorescence/OD600 values are relative to each wild-type strain with empty vector. Mean values and SD are given for three biological replicates. (D) Northern blot analysis of cspE mRNA levels with respect to non-mutated (pdTomato-ProQ) or mutated dTomato-ProQ as indicated. 5S rRNA served as loading control.
Figure 6.

Mutations of ProQ CTD residues are critical for ProQ functionality in vivo. (A) Surface representation of the C-terminal domain of ProQ (PDB ID: 5NBB). Amino acid residues sensitive for substitutions linked with loss-of-function are mapped and colored according to amino acid property: polar uncharged (yellow) and glycine (red). (B) Mutations linked with loss-of-function (G185V, T200P, G216D) were re-introduced in the dTomato-ProQ plasmid (pdTomato-ProQ) and expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter by growth in LB-medium supplemented with IPTG (500 μM final concentration). dTomato and GFP fluorescence were measured by flow cytometry. Note that data values for the empty vector control and non-mutated control (pdTomato-ProQ) are the same as the ones shown in Figure 4B and Supplementary Figure S4B. (C) Mutations shown in Figure 6B were re-introduced in the ProQ plasmid (pProQ) and expressed into Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct. Cells were grown in LB-medium supplemented with IPTG (500 μM final concentration) to induce expression of ProQ mutants. GFP fluorescence and OD600 were measured at population level using a microplate reader. Color indication, red: mutation associated with loss-of-function, black: non-mutated control (pProQ), gray: empty vector control. Fluorescence/OD600 values are relative to a wild-type strain with empty vector. Mean values and SD are given for three biological replicates. Note that data values for vector control and wild-type (pProQ) are the same as the ones shown in Figure 5A. (D) The samples shown in Figure 6C were analyzed with western blot using an anti-ProQ antibody to detect ProQ CTD mutants. GroEL served as a loading control. (E) Northern blot analysis of cspE 3′UTR, cspD mRNA and RaiZ-L sRNA (longer variant) levels with respect to non-mutated ProQ (pProQ), CTD mutants (G185V, T200P, G216D) and a NTD mutant (R80H). tmRNA served as loading control.

Several studies have indicated that the NTD harbors the the main RNA-binding activity of ProQ (30–32,39). For example, Pandey et al. identified mutations in the ProQ NTD critical for binding to two RNA ligands (31). Comparing the mutation profiles obtained using ProQ-dependent regulation (this study), or RNA-binding activity (31), showed that the vast majority of NTD mutations that impaired RNA-binding activity also led to impaired gene regulation activity (Supplementary Figure S4C). This indicates that ProQ RNA-binding and regulation of gene expression are closely linked processes.

NTD mutations associated with defective RNA-binding and regulation render ProQ unstable in vivo

The experiments described above were conducted with a non-native dTomato-ProQ fusion protein. In order to test how the identified NTD mutations affect the activity of native ProQ, we re-introduced the same single amino acid substitutions (Figure 4 and Supplementary Figure S4B) into a plasmid-borne ProQ with no additional protein tag and tested their effect on the transcriptional PfliE-gfp reporter (Figure 5A). In line with previous results, we detected impaired PfliE-gfp activation for the six mutants L34Q, L57R, R80H, D82E, L83R and G85C, compared to the parental non-mutated control and the A18V mutation. The GFP levels for these six mutants were similar to that of a proQ deletion strain, indicating a strong loss-of-function phenotype. Next, we assayed protein levels by western blot using an anti-ProQ antibody (Figure 5B). Surprisingly, although all six loss-of-function mutants were expressed, their levels were substantially lower than the non-mutated control and the A18V mutant. Since wild-type and mutated ProQ proteins were expressed from the same promoter, we hypothesized that lower expression levels may stem from reduced protein stability. To test this, we selected a mutant with a substitution in the conserved R80 residue, shown to be crucial for RNA binding (30,31). Expression of the R80H mutant or wild-type ProQ was induced by IPTG until the cells reached exponential phase, after which de novo protein synthesis was arrested by changing to inducer-free medium. Protein levels were analyzed by western blot using an anti-ProQ antibody (Figure 5C). While levels of wild-type ProQ stayed stable for 1 h after media shift, the R80H mutant protein was hardly detectable after 30 min. This result indicates a clearcut relationship between RNA-binding activity and protein stability and suggests that association with RNA stabilizes the ProQ protein.
Figure 5.

Mutations in the ProQ NTD associated with defective RNA-binding and regulation lead to instable protein expression. (A and B) The mutations shown in Figure 4B,C, and Supplementary Figure S4B were re-introduced into the ProQ plasmid (pProQ), transformed in SL1344 ΔproQ carrying the PfliE-gfp reporter construct and expressed during growth in LB-medium supplemented with IPTG (500 μM final concentration). (A) GFP fluorescence and OD600 were measured at the population levels using a microplate reader. Color indication, orange: mutation associated with retained ProQ function, red: mutation associated with loss-of-function, black: non-mutated control (pProQ), gray: empty vector control. GFP fluorescence/OD600 values are relative to each wild-type strain with empty vector control. Mean values and SD are given for three biological replicates. Note that data values for vector control and the non-mutated control (pProQ) are the same as the ones shown in Figure 6C. (B) Samples were analyzed by western blot analysis and probed with anti-ProQ antibody to detect ProQ NTD mutants. GroEL served as loading control. (C) Salmonella ΔproQ carrying either the ProQ wild-type control or a R80H plasmid construct were grown in LB-medium supplemented with IPTG (500 μM final concentration) to exponential growth phase. At OD600 of 0.5, de novo protein synthesis was stopped by changing medium to inducer-free medium, after which samples were collected at indicated timepoints. Western blotting using anti-ProQ antibody was conducted to detect ProQ mutant protein levels. GroEL served as a loading control.

Mutations in the ProQ NTD associated with defective RNA-binding and regulation lead to instable protein expression. (A and B) The mutations shown in Figure 4B,C, and Supplementary Figure S4B were re-introduced into the ProQ plasmid (pProQ), transformed in SL1344 ΔproQ carrying the PfliE-gfp reporter construct and expressed during growth in LB-medium supplemented with IPTG (500 μM final concentration). (A) GFP fluorescence and OD600 were measured at the population levels using a microplate reader. Color indication, orange: mutation associated with retained ProQ function, red: mutation associated with loss-of-function, black: non-mutated control (pProQ), gray: empty vector control. GFP fluorescence/OD600 values are relative to each wild-type strain with empty vector control. Mean values and SD are given for three biological replicates. Note that data values for vector control and the non-mutated control (pProQ) are the same as the ones shown in Figure 6C. (B) Samples were analyzed by western blot analysis and probed with anti-ProQ antibody to detect ProQ NTD mutants. GroEL served as loading control. (C) Salmonella ΔproQ carrying either the ProQ wild-type control or a R80H plasmid construct were grown in LB-medium supplemented with IPTG (500 μM final concentration) to exponential growth phase. At OD600 of 0.5, de novo protein synthesis was stopped by changing medium to inducer-free medium, after which samples were collected at indicated timepoints. Western blotting using anti-ProQ antibody was conducted to detect ProQ mutant protein levels. GroEL served as a loading control.

ProQ CTD residues critical for ProQ functionality

Compared to the NTD, the function of the CTD is less well understood, although a role in RNA-binding and RNA strand exchange activity has been suggested (30,32,39). Interestingly, removing the CTD from ProQ does not impair binding to the majority of RNA ligands tested so far, neither in vivo nor in vitro (31,32). However, whether the CTD is required for ProQ’s ability to mediate gene regulation is not known. Our screen identified three mutations in CTD residues that were significantly enriched for impaired activation of the PfliE-gfp reporter (Figure 3D and Supplementary Table S4). Mapping these residues onto the NMR structure of the ProQ CTD showed that all residues were exposed on the protein surface (Figure 6A). Two out of three residues are also moderately or strongly conserved among homologous proteins (Supplementary Figure S5). Mutations of ProQ CTD residues are critical for ProQ functionality in vivo. (A) Surface representation of the C-terminal domain of ProQ (PDB ID: 5NBB). Amino acid residues sensitive for substitutions linked with loss-of-function are mapped and colored according to amino acid property: polar uncharged (yellow) and glycine (red). (B) Mutations linked with loss-of-function (G185V, T200P, G216D) were re-introduced in the dTomato-ProQ plasmid (pdTomato-ProQ) and expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter by growth in LB-medium supplemented with IPTG (500 μM final concentration). dTomato and GFP fluorescence were measured by flow cytometry. Note that data values for the empty vector control and non-mutated control (pdTomato-ProQ) are the same as the ones shown in Figure 4B and Supplementary Figure S4B. (C) Mutations shown in Figure 6B were re-introduced in the ProQ plasmid (pProQ) and expressed into Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct. Cells were grown in LB-medium supplemented with IPTG (500 μM final concentration) to induce expression of ProQ mutants. GFP fluorescence and OD600 were measured at population level using a microplate reader. Color indication, red: mutation associated with loss-of-function, black: non-mutated control (pProQ), gray: empty vector control. Fluorescence/OD600 values are relative to a wild-type strain with empty vector. Mean values and SD are given for three biological replicates. Note that data values for vector control and wild-type (pProQ) are the same as the ones shown in Figure 5A. (D) The samples shown in Figure 6C were analyzed with western blot using an anti-ProQ antibody to detect ProQ CTD mutants. GroEL served as a loading control. (E) Northern blot analysis of cspE 3′UTR, cspD mRNA and RaiZ-L sRNA (longer variant) levels with respect to non-mutated ProQ (pProQ), CTD mutants (G185V, T200P, G216D) and a NTD mutant (R80H). tmRNA served as loading control. Re-introduction of the significantly enriched CTD mutations into the pdTomato-ProQ plasmid followed by PfliE-gfp reporter activity measurements confirmed that all three mutations rendered ProQ ineffective in activating the reporter (Figure 6B). We also re-introduced the CTD mutations into the plasmid expressing native ProQ. GFP measurements again showed lower GFP levels compared to the non-mutated ProQ control (Figure 6C). However, in contrast to the NTD mutations that rendered ProQ unstable, the CTD mutant proteins were expressed at the same (G216D) or even higher (G185V, T200P) levels than wild-type ProQ (Figure 6D). This indicates that the tested CTD mutations impair the regulatory activity of ProQ without affecting protein stability and points at a key function for the CTD in ProQ-dependent gene regulation. To dissect how the identified CTD residues contribute to ProQ-dependent regulation of different targets, the CTD mutants were tested for stabilization of three known ProQ target RNAs, cspE and cspD mRNAs, as well as the sRNA RaiZ. RNA was extracted from cells expressing the three CTD mutants (G185V, T200P, G216D) and subjected to northern blot analysis (Figure 6E). As expected (13,40), the levels of all three targets increased upon complementation of ΔproQ with plasmid-expressed wild-type ProQ. Somewhat surprisingly, expression of each of the CTD mutants lead to strongly elevated levels of all three RNA targets. Taken together, these results indicate that residues within the CTD render ProQ unable to activate flagellar gene expression and overstabilizes direct RNA targets.

The ProQ CTD is required for activation of gene expression in vivo

The data presented in Figure 6 show that mutations in the CTD are detrimental for ProQ functionality in vivo. Yet, ProQ lacking the CTD is not impaired in binding to the majority of RNA ligands tested so far (31,32). To systematically test the contribution of the NTD and CTD to ProQ’s regulatory function, we cloned truncated ProQ variants in the IPTG-inducible plasmid (Figure 7A) and transformed them into ΔproQ cells harboring the PfliE-gfp reporter. In order to monitor expression of truncated ProQ proteins, all constructs were tagged with a C-terminal 3xFLAG peptide. Western blot analysis using an anti-FLAG antibody showed that all truncated ProQ protein fusions were expressed, with the full-length and the NTD + linker variants at slightly higher levels than the NTD and linker + CTD variants (Figure 7B). Consistent with results presented in Figure 1C, full-length ProQ strongly activated the PfliE-gfp reporter (Figure 7C). However, truncating ProQ by removing either the NTD or the CTD resulted in low promoter activities of the reporter gene that were indistinguishable from that of the strain lacking proQ entirely. Since the CTD is dispensable for binding to many RNA ligands (31,32), but required for gene regulation (Figure 7C), we tested whether truncated ProQ variants could compete with full-length ProQ expressed from its native locus. To this end, wild-type Salmonella cells harboring the PfliE-gfp reporter were transformed with plasmids overexpressing truncated ProQ variants and assayed for GFP fluorescence. Overexpression of full-length ProQ increased GFP expression beyond the activation achieved by chromosomally expressed ProQ, but neither the NTD nor the CTD alone had any effect on GFP expression (Figure 7D and Supplementary Figure S6). This suggests that although the NTD alone is binding competent, it is not able to compete with full-length ProQ, even when overexpressed.
Figure 7.

The ProQ CTD is required for activation of the transcriptional-based reporter system, and stabilization of RNA in vivo. (A) Schematic representation of domain organization in truncated ProQ mutants. All constructs contain an IPTG-inducible promoter (PLlacO) and a 3xFLAG-tag for protein detection. (B) Western blot analysis using an anti-FLAG antibody to detect truncated ProQ mutants. GroEL served as loading control. (C) Truncated ProQ mutant plasmids were expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct during growth in LB-medium supplemented with IPTG (500 μM final concentration). GFP fluorescence and OD600 were measured at population level using a microplate reader. Fluorescence/OD600 values are relative to a wild-type strain with empty vector. Mean values and SD are given for three biological replicates. (D) Truncated ProQ mutant plasmids or an empty control vector were transformed into wild-type Salmonella strains carrying the PfliE-gfp reporter. IPTG (500 μM final concentration) was added to indicated strains to induce expression of truncated ProQ mutants. GFP fluorescence and OD600 were measured at population level using a microplate reader. Fluorescence/OD600 values are relative to empty control for respective non-induced or induced state. Color indication, white: 0 μM IPTG, black: 500 μM IPTG. Mean values and SD are given for three biological replicates. Absolute values are shown in Supplementary Figure S6. (E) Northern blot analysis of cspE mRNA, and RaiZ sRNA (RaiZ-L [longer variant], RaiZ-S [shorter variant]) levels with respect to truncated ProQ mutants. Probing of tmRNA served as loading control.

The ProQ CTD is required for activation of the transcriptional-based reporter system, and stabilization of RNA in vivo. (A) Schematic representation of domain organization in truncated ProQ mutants. All constructs contain an IPTG-inducible promoter (PLlacO) and a 3xFLAG-tag for protein detection. (B) Western blot analysis using an anti-FLAG antibody to detect truncated ProQ mutants. GroEL served as loading control. (C) Truncated ProQ mutant plasmids were expressed in Salmonella ΔproQ cells carrying the PfliE-gfp reporter construct during growth in LB-medium supplemented with IPTG (500 μM final concentration). GFP fluorescence and OD600 were measured at population level using a microplate reader. Fluorescence/OD600 values are relative to a wild-type strain with empty vector. Mean values and SD are given for three biological replicates. (D) Truncated ProQ mutant plasmids or an empty control vector were transformed into wild-type Salmonella strains carrying the PfliE-gfp reporter. IPTG (500 μM final concentration) was added to indicated strains to induce expression of truncated ProQ mutants. GFP fluorescence and OD600 were measured at population level using a microplate reader. Fluorescence/OD600 values are relative to empty control for respective non-induced or induced state. Color indication, white: 0 μM IPTG, black: 500 μM IPTG. Mean values and SD are given for three biological replicates. Absolute values are shown in Supplementary Figure S6. (E) Northern blot analysis of cspE mRNA, and RaiZ sRNA (RaiZ-L [longer variant], RaiZ-S [shorter variant]) levels with respect to truncated ProQ mutants. Probing of tmRNA served as loading control.

The impact of the ProQ CTD on stabilization of RNA targets in vivo

The cspE mRNA has emerged as a model RNA ligand for analyzing ProQ binding and regulation. Binding of ProQ to cspE mRNA has been analyzed in vivo by crosslinking (13,18) and a three-hybrid system (31), as well as in vitro by electromobility shift assays (32) and hydrogen deuterium exchange (HDX) experiments (30). The interaction between ProQ and cspE was shown to rely solely on the NTD; ProQ lacking the CTD binds as well to cspE mRNA as does the full-length protein in vivo (31) and in vitro (32). In accordance with these results, co-immunoprecipitation experiments verified that truncating ProQ by removal of the CTD + linker conferred binding to the cspE mRNA (Supplementary Figure S7A). Since removing the CTD decreased ProQ’s ability to activate the PfliE-gfp reporter (Figure 7C), we wondered whether removing the CTD would affect ProQ’s ability to stabilize RNA targets. To test this, RNA harvested from cells expressing the truncated ProQ variants were subjected to northern blot analysis using probes against cspE mRNA and RaiZ. The results presented in Figure 7E show that while full-length ProQ strongly elevated the steady-state levels of both RNAs, all truncated versions failed to affect cspE mRNA levels, while the NTD alone conferred stability to RaiZ. To test whether the reduced cspE mRNA levels upon truncation of ProQ were due to a reduction in RNA stability, cells were subjected to transcriptional arrest by rifampicin, and RNA was isolated 2, 4 and 8 min after rifampicin addition, followed by northern blot analysis (Supplementary Figure S7B). Complementation of a ΔproQ strain with full-length ProQ, but not the ProQ NTD, restored cspE stability. The reduced mRNA stability in the NTD-expressing strain was dependent on RNase II, as introducing an rnb deletion into the strain restored stability. This suggests a function for the ProQ CTD in protecting the cspE mRNA against RNase II mediated-exoribonucleolytic activity. Thus, even though the NTD is sufficient for RNA binding, the CTD is required for mRNA stabilization.

DISCUSSION

The recent discovery of ProQ as a global RNA-binding protein in Salmonella and E. coli has spurred a revived interest in ProQ biology. However, many aspects still remain poorly understood, including how ProQ regulates its RNA targets. In this work, we have systematically charted the functional landscape of ProQ and revealed residues essential for its gene regulatory function. This was possible by exploiting the fact that ProQ activates flagellar gene expression, allowing the construction of a fluorescent reporter for ProQ-dependent regulation (Figure 1). By combining saturation mutagenesis with phenotypic cell sorting and high-throughput sequencing (Figure 2), mutations in all amino acid residues within ProQ were assessed for effects on regulatory activity (Figure 3). We show here that mutations in the globular NTD and CTD domains, but not the linker region, are detrimental for ProQ-dependent gene regulation in vivo (Figures 3–6). As judged from comparison to a recent mutational study, residues in the NTD appear to be critical for both RNA-binding and regulation (Supplementary Figure S4C) (31). Strikingly, NTD mutations impairing both RNA-binding and regulatory activity render ProQ unstable in vivo (Figure 5), which suggests that ProQ’s association with RNA enhances protein stability. While the major RNA-binding activity of ProQ seem to be specific to the NTD (30–32), our data indicate an important role for the CTD in gene regulation. Mutations in conserved CTD residues (Figure 6), or removal of the entire CTD (Figure 7), abolish ProQ-dependent gene regulation. Taken together, this indicates that ProQ’s capacity to bind and regulate RNA targets requires both the NTD and CTD. Global transcriptomic analyses have consistently associated deletion of proQ with reduced expression of motility genes, suggesting that ProQ controls flagellar gene expression (14,41). Our work shows that ProQ indirectly activates transcription from the class II fliE promoter (Figure 1C). The transcriptional reporter PfliE-gfp was exploited here for high-throughout screening of ProQ mutants with reduced regulatory activity. One important question is how ProQ activates the fliE promoter. In Salmonella and E. coli, the fliE promoter is directly activated by FlhDC, the master transcriptional regulator of the flagellar network (Figure 1B) (51–54). We speculate that ProQ either acts upstream of flhDC transcription or regulates the flhDC mRNA by a post-transcriptional mechanism. In fact, the flhDC mRNA has previously been identified as a direct ProQ ligand in cross-linking experiments (13). Beyond the scope of the current study, we are currently investigating the mechanism behind ProQ-dependent activation of flagellar genes, which will be reported elsewhere. On the molecular level, we have identified >20 NTD residues that are critical for the regulatory activity of ProQ (Figures 3–5 and Supplementary Table S4). Our data complement the findings of Pandey et al. (31) and provide a direct link between RNA-binding activity and regulation (Supplementary Figure S4C and Supplementary Table S4). Conserved residues on both the concave and convex sides of the NTD appear to be functionally important, including surface-exposed acidic, basic and hydrophobic residues, and core-located hydrophobic residues (Figure 4; Supplementary Figures S3 and S4). In fact, several of these NTD residues have been suggested to mediate RNA-binding and/or regulation in other ProQ/FinO proteins (19,22,28). For example, random mutagenesis of Legionella pneumophilia RocC revealed that several residues in the ProQ/FinO domain (ProQ numbering: L31, F21, L63 and L71) affected its ability to repress competence and to stabilize sRNA (19). ProQ is an abundant, constitutively expressed protein (14,42). However, whether ProQ levels and/or activity can be regulated remains unknown. Interestingly, our data show that mutations in the NTD RNA-binding surface affect the intracellular levels of ProQ (Figure 5). It appears that binding to RNA stabilizes the ProQ protein, while disruption of RNA-binding through mutations render ProQ unstable. We predict that ProQ-bound RNA ligands protect otherwise susceptible amino acid residues within the protein against proteolytic degradation. This seems to be a distinctive feature for native ProQ (Figure 5); fusing ProQ to other proteins such as the red fluorescent protein dTomato (Figure 4), or the alpha-subunit of RNA-polymerase (31), enhances protein stability and masks the destabilizing effect of NTD mutations (Figure 4). One important question is which protease is responsible for degradation of mutant ProQ proteins. In a recent pre-print presenting a saturation mutagenesis study of Salmonella ProQ, the authors suggest that the Lon protease is responsible for removal of RNA-binding deficient mutants of both ProQ and Hfq (El Mouali, Y., Ponath, F., Scharrer, V., Wenner, N., Hinton, J. C. D. and Vogel, J. (2021) Scanning mutagenesis of RNA-binding protein ProQ reveals a quality control role for the Lon protease. biorXiv doi: https://doi.org/10.1101/2021.07.12.452043, 12 July 2021, pre-print: not peer-reviewed). Thus, the observed instability of RNA-binding deficient ProQ NTD variants presented here (Figure 4), most likely depends on Lon. We anticipate that future studies will shine light on how ProQ protein levels are regulated and clarify the relationship between RNA-binding and protein stability. While the ProQ NTD has been the subject of several recent studies (30–32), the function(s) of the CTD has remained unclear. Our experiments reveal that mutations in three CTD residues (G185, T200P, G216) completely abolish the ability of ProQ to activate the PfliE-gfp reporter (Figure 6). The same mutations lead to strongly elevated levels of three known ProQ RNA ligands, which likely reflect overstabilization of these RNAs (Figure 6). It is so far unclear whether the increased RNA ligand stability and the failure to activate the reporter are connected phenomena, but we speculate that it may reflect a situation where ProQ becomes sequestered on RNA targets instead of mediating a regulatory event. Besides these significant mutations, two additional CTD mutations (L188, V190), which barely failed to score as significant under the chosen cut-off (Supplementary Figure S5), are likely to affect ProQ regulation. Interestingly, all of these five CTD residues overlap with regions previously suggested from HDX experiments to be involved in binding with the sRNA SraB (30). Conversely, residues exclusively in the NTD seem to be required for stable interactions with the sRNA SibB and the cspE ‘UTR in vivo (30,31). It thus appears that residues within the CTD may contribute to or impact RNA binding but only for specific RNA ligands. Along this line, it was recently proposed that while the NTD is required for binding to all ligands, the CTD may be involved in RNA-binding only to specific ligands (32). In a similar manner, N- or C-terminal extensions present in other ProQ/FinO proteins are suggested to widen target recognition for each protein member (19–21). The increased stability of RNA ligands associated with the CTD mutations identified in this study (Figure 6) may suggest that these mutant proteins harbor a CTD with increased RNA-binding activity. Perhaps the most striking result to emerge from our data is that the CTD is required for the regulatory activity of ProQ (Figure 7 and Supplementary Figure S7B). As exemplified by the model ligand cspE, the CTD is dispensible for mRNA-binding (32) (Supplementary Figure S7A) but necessary for full RNA stabilization (Figure 7E and Supplementary Figure S7B). This was true for mRNA stabilization of cspE but not sRNA stabilization of RaiZ (Figure 7E). Thus, it appears that the ProQ CTD is required for ProQ-dependent regulation for a set of RNA ligands but not for all. Given that ProQ protects the cspE mRNA from RNase II-mediated degradation in Salmonella, we speculate that the ProQ CTD is involved in blocking RNase activity. How does this occur mechanistically? As judged from co-immunoprecipitate experiments, ProQ does not seem to interact with RNase II in vivo (13), pointing against direct contact between the CTD and RNase II as a mechanism of action. It is possible that the CTD may sterically block access of RNase II to the cspE mRNA (13) or bind to other proteins that may block RNase activity, directly or indirectly. Further work will be required to identify any potential connection between the ProQ CTD and RNA stabilization. A related issue is whether ProQ is dependent on, and/or interacts with, other proteins. In an attempt to shine light on this we induced the plasmid-encoded NTD or CTD in a strain encoding full-length ProQ on the chromosome. The experiment shown in Figure 7D shows that the single domains are not able to interfere with ProQ function (which should result in reduced fluorescence from the reporter), suggesting that the domains alone do not competete for binding to trans-acting factors. However, more experiments are certainly needed for addressing this issue. In summary, we have identified functionally important residues within Salmonella ProQ, and revealed the significance of its CTD in gene regulation. We expect that future analyses will expand the individual role(s) of this domain, and elucidate how it cooperates with the NTD to orchestrate the biological role of the global RBP ProQ.

DATA AVAILABILITY

Sequencing data is available as a National Center for Biotechnology Information (NCBI) Bioproject under accession number PRJNA713973. Python scripts used for sequencing read filtering and subsequent quantification of mutations are available in the GitHub repository and can be accessed via https://github.com/kjellinjonas/proQ_mutagenesis. FCS files for flow cytometry and FACS data sets have been deposited in the FlowRepository database and can be accessed via http://flowrepository.org/id/FR-FCM-Z3JW. Click here for additional data file.
  57 in total

Review 1.  RNA-binding proteins: modular design for efficient function.

Authors:  Bradley M Lunde; Claire Moore; Gabriele Varani
Journal:  Nat Rev Mol Cell Biol       Date:  2007-06       Impact factor: 94.444

Review 2.  RNA-binding activity and regulatory functions of the emerging sRNA-binding protein ProQ.

Authors:  Erik Holmqvist; Sofia Berggren; Alisa Rizvanovic
Journal:  Biochim Biophys Acta Gene Regul Mech       Date:  2020-06-19       Impact factor: 4.490

3.  The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts.

Authors:  Alexander G Baltz; Mathias Munschauer; Björn Schwanhäusser; Alexandra Vasile; Yasuhiro Murakawa; Markus Schueler; Noah Youngs; Duncan Penfold-Brown; Kevin Drew; Miha Milek; Emanuel Wyler; Richard Bonneau; Matthias Selbach; Christoph Dieterich; Markus Landthaler
Journal:  Mol Cell       Date:  2012-06-08       Impact factor: 17.970

Review 4.  How RNA-Binding Proteins Interact with RNA: Molecules and Mechanisms.

Authors:  Meredith Corley; Margaret C Burns; Gene W Yeo
Journal:  Mol Cell       Date:  2020-04-02       Impact factor: 17.970

5.  Analysis and expansion of the role of the Escherichia coli protein ProQ.

Authors:  Daniel T Sheidy; Ryszard A Zielke
Journal:  PLoS One       Date:  2013-10-25       Impact factor: 3.240

6.  Molecular mechanism of mRNA repression in trans by a ProQ-dependent small RNA.

Authors:  Alexandre Smirnov; Chuan Wang; Lisa L Drewry; Jörg Vogel
Journal:  EMBO J       Date:  2017-03-23       Impact factor: 11.598

7.  Diverse conjugative elements silence natural transformation in Legionella species.

Authors:  Isabelle Durieux; Christophe Ginevra; Laetitia Attaiech; Kévin Picq; Pierre-Alexandre Juan; Sophie Jarraud; Xavier Charpentier
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-27       Impact factor: 11.205

8.  Structure of the Escherichia coli ProQ RNA-binding protein.

Authors:  Grecia M Gonzalez; Steven W Hardwick; Sarah L Maslen; J Mark Skehel; Erik Holmqvist; Jörg Vogel; Alex Bateman; Ben F Luisi; R William Broadhurst
Journal:  RNA       Date:  2017-02-13       Impact factor: 4.942

9.  Genetic identification of the functional surface for RNA binding by Escherichia coli ProQ.

Authors:  Smriti Pandey; Chandra M Gravel; Oliver M Stockert; Clara D Wang; Courtney L Hegner; Hannah LeBlanc; Katherine E Berry
Journal:  Nucleic Acids Res       Date:  2020-05-07       Impact factor: 16.971

View more
  3 in total

1.  The Role and Targets of the RNA-Binding Protein ProQ in the Gram-Negative Bacterial Pathogen Pasteurella multocida.

Authors:  Jai J Tree; John D Boyce; Emily L Gulliver; Brandon M Sy; Julia L Wong; Deanna S Deveson Lucas; David R Powell; Marina Harper
Journal:  J Bacteriol       Date:  2022-03-24       Impact factor: 3.476

2.  Scanning mutagenesis of RNA-binding protein ProQ reveals a quality control role for the Lon protease.

Authors:  Youssef El Mouali; Falk Ponath; Vinzent Scharrer; Nicolas Wenner; Jay C D Hinton; Jörg Vogel
Journal:  RNA       Date:  2021-09-08       Impact factor: 4.942

3.  Bacterial RNA chaperones and chaperone-like riboregulators: behind the scenes of RNA-mediated regulation of cellular metabolism.

Authors:  Kai Katsuya-Gaviria; Giulia Paris; Tom Dendooven; Katarzyna J Bandyra
Journal:  RNA Biol       Date:  2021-12-31       Impact factor: 4.766

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.