Literature DB >> 33999173

A novel bipartite antitermination system widespread in conjugative elements of Gram-positive bacteria.

Andrés Miguel-Arribas1, Jorge Val-Calvo1, César Gago-Córdoba1, José M Izquierdo1, David Abia2, Ling Juan Wu3, Jeff Errington3, Wilfried J J Meijer1.   

Abstract

Transcriptional regulation allows adaptive and coordinated gene expression, and is essential for life. Processive antitermination systems alter the transcription elongation complex to allow the RNA polymerase to read through multiple terminators in an operon. Here, we describe the discovery of a novel bipartite antitermination system that is widespread among conjugative elements from Gram-positive bacteria, which we named conAn. This system is composed of a large RNA element that exerts antitermination, and a protein that functions as a processivity factor. Besides allowing coordinated expression of very long operons, we show that these systems allow differential expression of genes within an operon, and probably contribute to strict regulation of the conjugation genes by minimizing the effects of spurious transcription. Mechanistic features of the conAn system are likely to decisively influence its host range, with important implications for the spread of antibiotic resistance and virulence genes.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33999173      PMCID: PMC8191782          DOI: 10.1093/nar/gkab360

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Transcriptional regulation contributes importantly to adaptive and coordinated gene expression, which are crucial for the survival of all biological entities in order to respond adequately to changing environmental conditions. The transcription process can be divided into three stages, -initiation, elongation and termination (1,2). Bacteria have two types of termination signals: Rho-dependent and intrinsic (aka Rho-independent) terminators (2–7). Rho-dependent terminators require the RNA-dependent helicase Rho, which, after binding to specific sites, translocates along the mRNA resulting eventually in dissociation of the transcription elongation complex (TEC) (3,8,9). Intrinsic terminators, though, do not require any additional factor to terminate transcription; they are typically characterized by a GC-rich inverted repeat followed by a stretch of Ts in the non-template DNA strand. When transcribed into RNA, this region forms a stem-loop structure followed by a tract of Us, which is sufficient to terminate transcription, although termination may be enhanced by the NusA transcription factor (10,11). Thus, a transcription terminator marks the end of a transcriptional unit that can consist of one or several genes (6). However, many terminators have evolved to terminate transcription only under certain circumstances, allowing conditional expression of the gene(s) located downstream. Such terminators, which include riboswitches, act via the so-called attenuation mechanism in which the RNA region can fold in a terminator or antiterminator structure depending on the presence or absence of a specific signal (12, and references therein). These attenuation systems that control a single terminator (13–15) are fundamentally different from the processive antitermination (P-AT) mechanisms, which do not depend on differences in the configuration of the terminator. Instead, P-AT mechanisms require the association of an antitermination (AT) factor with the TEC. Once associated, the altered TEC can read through multiple intrinsic terminators and act over great distances. So far, only a limited number of P-AT systems have been described; several of them achieve P-AT when an antiterminator protein engages with a transcribing RNAP. While the primary sequences of some of these, like the phage λ AT proteins N and Q, do not share similarity with the transcription elongation factors, others, like RfaH, ActX, TaA, UpxY and LoaP, share similarity with the elongation factor NusG. In a few other cases, the AT factor is an RNA element that achieves P-AT in largely unknown ways (16–19). The characteristics and mode of action of the relatively few known P-AT systems have recently been reviewed by Goodson and Winkler (20). Here, we describe the discovery of a novel P-AT system which we call conAn (conjugation antitermination), located at the start of the conjugation operon present on plasmid pLS20 of the Gram-positive (Gram+) bacterium Bacillus subtilis. In contrast to all known P-AT systems, this is a bipartite system composed of a large RNA element responsible for AT and a protein component that is required for P-AT. The conAn system permits transcription to read through multiple intrinsic terminators present in the very long conjugation operon, and probably contributes to proper expression of the conjugation genes by minimizing the effects of spurious transcription. The system serves at least one other purpose: differential expression of one or more genes inside the conjugation operon during conjugation and non-conjugation conditions. In addition, we show that bipartite conAn type P-AT systems are most likely present on hundreds of conjugative plasmids of Gram+ bacteria. Finally, we found that conAn systems differ in their functionality in heterologous hosts, which likely depends on the compatibility of the conAn system with the TEC of the host. This could have important consequences for the ability of conjugative elements to spread antibiotic resistance.

MATERIALS AND METHODS

Bacterial strains, plasmids, media and oligonucleotides

Escherichia coli, Bacillus subtilis and Enterococcus faecalis strains were grown in Luria-Bertani (LB) medium (21) and Listeria innocua strains were grown in Brain Heart Infusion (BHI) medium (22). Lactobacillus casei strains were grown in de Man, Rogosa and Sharpe (MRS) medium (23). All bacteria were grown in liquid media with shaking or on 1.5% LB agar plates at 37°C. When appropriate, media were supplemented with the following antibiotics: ampicillin (Amp), 100 μg/ml; erythromycin (Em), 1.5 μg/ml (B. subtilis, L. innocua, L. casei and E. faecalis) or 150 μg/ml (E. coli), chloramphenicol (Cm), 5 μg/ml; spectinomycin (Spec), 100 μg/ml; and kanamycin (Kan), 10 μg/ml (B. subtilis) or 40 μg/ml (E. coli). B. subtilis strains used were isogenic with B. subtilis strain 168. Bacterial strains are listed in Supplementary Table S7. Plasmids and oligonucleotides used are listed in Supplementary Table S8 and S9, respectively. All oligonucleotides were purchased from Isogen Life Science, The Netherlands. Synthetic DNA constructs were obtained from Shinegene (Shanghai, China). These constructs were provided in the vector pUC57Km. The inserts corresponding to the synthetic DNA constructs were amplified by PCR using oligonucleotides M13Fw and M13Rev. Next, the fragments were digested with appropriate enzymes and cloned in the final vector. Alternatively, inserts were provided directly in the final vector.

Transformation

Escherichia coli competent cells were prepared and transformed using standard methods (21). Generation of competent B. subtilis cells and transformations were prepared as described (24). E. faecalis, L. casei and L. innocua cells were transformed as described (25–27). Transformants were selected on LB, BHI and MRS plates with appropriate antibiotics.

Construction of plasmids and strains

DNA techniques were performed using standard molecular methods (21). Plasmids were isolated from E. coli strains using ‘Wizard Plus SV Minipreps DNA Purification Systems’ (Promega). PCR fragments were purified using ‘Wizard SV Gel and PCR Clean-Up System’ (Promega). All enzymes used were purchased from New England Biolabs, USA. Cloned PCR fragments were checked for their correctness by sequence analysis. Total DNA extracted from pLS20cat harbouring strain PKS11 was used as template to amplify pLS20cat regions by PCR. In the same way, total DNA extracted from the plasmid p576 and pAW63 harbouring strains Bacillus pumilus NRS576 and B. thuringiensis subp. kurstaki HD73, respectively, were used as template to amplify conAn regions of these plasmids. Specific information of the construction of each plasmid, including the oligonucleotides and restriction enzymes used to amplify and clone the fragments, is presented in Supplementary Table S8. General information of the use and characteristics of the different parental vectors is given below. Marker less in-frame deletions of gene 28 and its downstream regions on pLS20cat were constructed making use of the pMiniMAD2 vector (28). The method is based on single cross-over integration of a temperature–sensitive replicon via homologous recombination at restrictive temperature followed by permitting replication of the integrated replicon by growth at the permissive temperature to provoke deletion of replicon thereby generating the desired deletion (29). The deletion strategy is described in (30). In short, in a first step, around 600 bp of the interested ‘UP’ and ‘Down’ regions were amplified by PCR, and then joined in a second step by overlapping PCR. The resulting fused PCR product was purified and digested with NdeI and BamHI and cloned into the pMiniMAD2 vector digested with the same enzymes. Using this strategy, the following derivatives of pMiniMAD2 were constructed: pAND86, pAND98, pAND99 and pAND100. These plasmids were used for the construction of derivatives of pLS20cat. pAND86 was used to construct strain AND81 which contains pLS20cat derivative lacking most of gene 28 (pLS20Δ28). Similarly, pAND98, pAND99 and pAND100 were used to generate strains AND98 (pLS20Δds-I), AND99 (pLS20ΔP29) and AND100 (pLS20Δds-II) (see Supplementary Table S7). More details regarding the construction of plasmids is described in Supplementary Tables S8 and S9. Plasmid pAX01 is a B. subtilis lacA integration vector designed for the construction of DNA regions fused to the xylose-inducible promoter P, and the cassette is associated with an erythromycin resistance gene for selection in B. subtilis. This vector was used for controlled expression of gene(s) of interest in B. subtilis and it can be used in combination with another B. subtilis integration vector such as pDR110, which is designed for placing constructs at the amyE locus having genes fused to the IPTG-inducible promoter P. We used pAX01 to construct plasmids pPKS17 and pAND178 and the strains AND84, AND178P and CG14 (see Supplementary Table S1, S2 respectively). Double-crossover integration into the chromosome was checked by PCR using the primer sets [pAX1-Up/pAX1-Dn] and [pAX2-Up/pAX2-Dn] (Supplementary Table S3). When used to study the effect of ectopic expression of a given gene placed under the control of the inducible P promoter, an overnight grown culture was diluted in prewarmed LB supplemented with 1% of xylose. The vector pKSsfGFP was previously constructed in our lab based on the vector pJS104 (30,31). The principal characteristics of this plasmid are: (i) high copy number and AmpR in E. coli, (ii) amyE gene integrative cassette by double cross-over and CmR in B. subtilis and (iii) promotor-less superfolder gfp (gfp) gene that is preceded by the unique restriction sites BamHI, HindIII, EcoRI, NheI and SpeI. The gfp reporter gene used in these studies corresponds to a version that contains several modifications to enhance the stability and other features of the encoded green fluorescent protein (32). For simplicity, we refer to this recombinant gene as gfp. DNA fragments were cloned upstream the gfp gene on pKSsfGFP and the resulting derivatives were used to construct B. subtilis strains that contain at the amyE locus a single copy of the gfp gene transcriptionally fused to the cloned region. pKSsfGFP derivatives were isolated from E. coli strains and used to transform competent B. subtilis 168 cells. Transformants were selected on LB plates containing Cm. Double-crossover integration into the chromosome was checked by the loss of amylase activity. Plasmid pDR110 is a B. subtilis amyE integration vector that contains the IPTG-inducible Pspank promoter and is associated with a spectinomycin resistance gene for selection in B. subtilis. This vector was used to construct pAND101 that is designed for studying terminator and/or antiterminator activities in vivo. Plasmid pAND101 was constructed as follows. The superfolder gfp gene (gfp) was amplified by PCR using oligos oA98 and oA88 and pKSsfGFP vector as template. This PCR fragment was then digested with NheI and SpeI and subsequently cloned into vector pDR110 digested with the same enzymes. Vector pAND101 can be used for two objectives: (i) test the terminator activity and (ii) test the antitermination activity. For these purposes, first a PCR fragment containing a putative terminator is cloned at the multiple cloning site (MCS) located in between the P promoter and the gfp gene on plasmid pAND101. The resulting plasmid can be used to construct a B. subtilis strain containing at its amyE locus the construct P-[insert]-gfp, which can be used to study terminator or promoter activity of the cloned region. Second, a DNA fragment containing an antiterminator system conAn can subsequently be placed upstream of the first cloned fragment containing the putative terminator. The resulting derivative of pAND101 can then be used to construct a B. subtilis strain containing at its amyE locus the construct P-[insert-II (AT system)]-[insert-I (Ter)]-gfp, which can be used to study the functionality of the cloned conAn region, or to study whether the terminator is resistant or not to the antitermination system. Plasmid pAT18 shuttle vector replicates as a high-copy number plasmid in E. coli and as a low-copy number plasmid in Gram+ bacteria. The plasmid contains the replication functions of plasmid pAMβ1, allowing it to replicate in a broad host-range of Gram+ bacteria (33). It is associated with an erythromycin resistance gene for selection in Gram+ bacteria. This plasmid was used to study the conAn antitermination systems of different Gram+ plasmids in their own genus. Derivatives of pAT18 were constructed as follows. First, the cassette containing Pspank-[possible insert]-gfp-lacI was isolated from the pAND101, or derivative of interest, by digestion with PstI and BamHI. This fragment was ligated to pAT18 digested with the same enzymes. The ligation mixture was then used to transform competent E. coli DH5α cells. Positive clones were detected by colony PCR using primers oM13RevExt and oSeqpKSGFP_Dn and checked by digestion with PstI and BamHI.

Conjugation assays

Conjugation was carried out in liquid medium as described previously (34). The effect of ectopic expression of a given gene placed under the control of the inducible P and/or P promoter on conjugation was studied by adding the inducer (1 mM IPTG or 1% xylose) to prewarmed LB medium used to dilute overnight cultures of the donor cells.

RNA isolation and RNA sequencing

Total RNA was isolated from exponentially growing cells (OD600 = 0.8–1) by using RNA protect solution and RNeasy Mini Kit from Qiagen, following the manufacturer's protocol. RNA sequencing and bioinformatical analysis of RNAseq data was done as described previously (35).

Flow cytometry

Fluorescence quantification using Flow cytometry was performed as described before (30). All different bacteria were grown in liquid media with shaking until the culture reached an OD600 = 0.8–1, with the exception of E. faecalis strains that were recollected at OD600 = 0.5. Fluorescence levels were expressed as the mean value of the Geomean values of 100 000 cells obtained in three independent experiments.

Identification of putative termination sequences

Conjugative plasmid sequences were screened for the presence of putative Rho-independent transcriptional terminators by (i) the ‘ARNold’ Web server (rna.igmors.u-psud.fr/toolbox) that uses two algorithms: Erpin and RNAmotif (36,37) and by (ii) the TransTermHP Web server (transterm.cbcg.umd.edu) which uses an algorithm that is distinct from the Erpin and RNAmotif algorithms (38).

Identification of genes encoding proteins showing significant similarity to ConAn1pLS20 homologs

Each of the ConAn1 sequences was used as query in multiple rounds of Psi-blastp searches against the NCBI nr database (version 2.5.0+, November 2019) (39,40). This search resulted in the detection of 662 sequences sharing significant similarity with the nine ConAn1 sequences, with e-value less than 1e−7. The program ‘USEARCH’ (version 8.0.1517_i86linux32) was then used to identify and remove redundant sequences showing 100% identity (41), resulting in 531 unique hits. MMSEQ2 program was used to create clusters of proteins with more than 95% of sequence identity and coverage over 80%, resulting in 141 clusters. One representative sequence was selected from each cluster. The evolutionary history was inferred using the Neighbor-Joining method (42). The optimal tree with the sum of branch length = 48.74942208 is shown. The evolutionary distances were computed using the JTT matrix-based method (43) and are in the units of the number of amino acid substitutions per site. The rate variation among sites was modelled with a gamma distribution (shape parameter = 1.97). All ambiguous positions were removed for each sequence pair (pairwise deletion option). The final dataset contained a total of 343 positions. Evolutionary analyses were conducted in MEGA X (44). The tree image was generated using Dendroscope program (45). Colouring of different branches were added manually.

RESULTS

Deletion of pLS20cat gene 28 greatly reduces conjugation efficiency

The conjugation genes of pLS20cat are located in a single large operon spanning genes 28 to 74 (35, for review see, 46). Intriguingly, the putative 172-residue protein encoded by the first gene of the conjugation operon, gene 28, does not show significant similarity to any protein of known function. To gain insight into its function we constructed a derivative of pLS20cat, pLS20Δ28, containing a marker-less in-frame deletion of gene 28 and used strain AND81 harbouring pLS20Δ28 to determine its conjugation efficiency. In agreement with previous results, the conjugation efficiency obtained for wild type pLS20 (stain PKS11) was in the range of 2.0E-3 transconjugants per donor cells (34,35). However, deletion of gene 28 had a dramatic effect on conjugation, causing the efficiency to drop more than 10 000-fold (Figure 1B). This demonstrates that gene 28 plays a key role in the conjugation process. Gene 28 complementation assays are described below.
Figure 1.

pLS20cat gene 28 is required for efficient conjugation. (A) Genetic map of pLS20cat. Upper panel shows a linear map of pLS20cat. Rightward and leftward oriented genes, and the chloramphenicol resistance gene (Cm) are indicated with green, red and blue rectangles, respectively. The conjugation operon spanning genes 28–74 is indicated. A blow up of gene 28 and its downstream region, as well as the regions deleted in the various derivatives of pLS20cat studied here are shown in the lower panel. (B) Relative conjugation efficiencies of donor strains harbouring pLS20cat (wt) or the derivative pLS20Δ28 (Δ28), pLS20Δds-I (Δds-I), pLS20Δds-II (Δds-II) or pLS20ΔP (ΔP). Strains AND84 and AND178P contain an ectopic copy of gene 28 without (AND84, P-28-min) or with the 315 bp region downstream of gene 28 (AND178P, P-28-ext) placed under the control of the xylose-inducible P promoter. Conjugation efficiencies were calculated as the number of transconjugants per donor, and their efficiencies are expressed relative to the wild type plasmid pLS20cat. At least three independent experiments were performed for each donor. Error bars represent standard deviation.

pLS20cat gene 28 is required for efficient conjugation. (A) Genetic map of pLS20cat. Upper panel shows a linear map of pLS20cat. Rightward and leftward oriented genes, and the chloramphenicol resistance gene (Cm) are indicated with green, red and blue rectangles, respectively. The conjugation operon spanning genes 28–74 is indicated. A blow up of gene 28 and its downstream region, as well as the regions deleted in the various derivatives of pLS20cat studied here are shown in the lower panel. (B) Relative conjugation efficiencies of donor strains harbouring pLS20cat (wt) or the derivative pLS20Δ28 (Δ28), pLS20Δds-I (Δds-I), pLS20Δds-II (Δds-II) or pLS20ΔP (ΔP). Strains AND84 and AND178P contain an ectopic copy of gene 28 without (AND84, P-28-min) or with the 315 bp region downstream of gene 28 (AND178P, P-28-ext) placed under the control of the xylose-inducible P promoter. Conjugation efficiencies were calculated as the number of transconjugants per donor, and their efficiencies are expressed relative to the wild type plasmid pLS20cat. At least three independent experiments were performed for each donor. Error bars represent standard deviation.

Protein p28 acts at the level of transcription but not as a transcriptional activator

The lack of similarity between protein p28 with proteins of known function complicated designing an experimental approach to unravel its function in the conjugation process. We reasoned, however, that protein p28 should either play a role at the level of transcription or at a later stage, and that RNAseq could be used to discriminate between these two possibilities. We therefore compared the expression levels of the conjugation genes by RNAseq analysis of the wild type plasmid pLS20cat and its derivative pLS20Δ28. The results presented in Supplementary Figure S1 show that the absence of gene 28 severely affected the expression of most of the downstream conjugation genes but, strikingly, not that of genes 29 or 30, demonstrating that the absence of gene 28 does not affect the activity of the main conjugation promoter P located upstream of gene 28 (47). The strong decrease in expression downstream of gene 30 for pLS20Δ28 might be explained by the presence of a protein p28-activated promoter located upstream of gene 31. To test this, we constructed strain CG14 with two chromosomal cassettes: one containing the upstream region of gene 31 fused to a gfp reporter gene, and another containing a copy of gene 28 controlled by a xylose-inducible promoter. This strain did not show significant fluorescence regardless of whether grown in the presence or absence of xylose (supplementary Table S1), indicating that the upstream region of gene 31 does not contain a protein p28-inducible promoter.

pLS20cat gene 30 is followed by a functional and efficient transcriptional terminator, Ter

The results described above raised the possibility that gene 28 encodes a transcriptional antiterminator that acts on a transcriptional terminator downstream of gene 30. In line with this possibility, in silico analysis suggested the presence of a putative intrinsic transcriptional terminator near the end of gene 30, which we tentatively named Ter (Figure 2A). To test whether Ter was a genuine intrinsic terminator we constructed a series of strains in which DNA fragments with potential transcriptional terminators were inserted between an IPTG-inducible P promoter and a gfp reporter gene (Figure 2B). A functional transcriptional terminator would result in decreased fluorescence levels. Only low fluorescence levels were obtained when the cultures were grown in the absence of IPTG, showing that the P promoter was well repressed under these conditions (Figure 2C). As expected, relatively high fluorescence levels were obtained for the strain with no terminator (P-gfp, strain AND101) when grown in the presence of 1 mM IPTG. Similar high fluorescence levels were obtained for control strain AND202 with a >800 bp fragment with no known terminator (Figure 2C), demonstrating that increasing the distance between the promoter and the gfp gene by more than 800 bp did not significantly affect the expression of gfp. Importantly, only background levels of fluorescence were observed for strain AND127 with the putative Ter terminator, establishing that Ter comprises a functional transcriptional terminator.
Figure 2.

pLS20cat gene 30 is followed by a functional transcriptional terminator. (A) Features of terminator Ter located near the end of pLS20cat gene 30. pLS20cat was screened for putative intrinsic terminators by (i) the ARNold Web server (rna.igmors.u-psud.fr/toolbox) that uses two algorithms: Erpin and RNAmotif (36,37), and by (ii) the TransTermHP Web server (transterm.cbcg.umd.edu) that uses a distinct algorithm (38). The pLS20cat-region containing an inverted repeated sequence that may form a stem loop structure when transcribed into RNA is shown; nucleotides predicted to form part of the stem are shown in blue and indicated with black arrows; and those predicted to form the loop are shown in red. Ter overlaps with the C-terminal region of gene 30 (grey arrow). The right panel presents the predicted stem−loop structure including the twelve downstream nucleotides. The same colour code is used as in the left panel. (B) Schematic view of the relevant genetic features of the different strains used. The P and P promoters are indicated with light and dark blue boxes, respectively. The gfp reporter (gfp) and the chloramphenicol resistance gene (cat) are indicated with green and purple arrows, respectively. The transcriptional terminator Ter is indicated with a red lollipop symbol. (C) Fluorescence level determined by FACS analyses of cells grown in the absence or presence of 1 mM IPTG relative to the fluorescence levels obtained for strain AND101 grown in the presence of IPTG. Samples were withdrawn from late exponentially grown cultures (OD600 between 0.8 and 1). Color code bars: grey and red, strains having a construct lacking or containing Ter, respectively; light and dark, strains grown in the absence or presence of IPTG, respectively. Fluorescence levels are expressed as the mean value of the Geomean values of 100 000 cells obtained in three independent experiments. Error bars represent standard deviation.

pLS20cat gene 30 is followed by a functional transcriptional terminator. (A) Features of terminator Ter located near the end of pLS20cat gene 30. pLS20cat was screened for putative intrinsic terminators by (i) the ARNold Web server (rna.igmors.u-psud.fr/toolbox) that uses two algorithms: Erpin and RNAmotif (36,37), and by (ii) the TransTermHP Web server (transterm.cbcg.umd.edu) that uses a distinct algorithm (38). The pLS20cat-region containing an inverted repeated sequence that may form a stem loop structure when transcribed into RNA is shown; nucleotides predicted to form part of the stem are shown in blue and indicated with black arrows; and those predicted to form the loop are shown in red. Ter overlaps with the C-terminal region of gene 30 (grey arrow). The right panel presents the predicted stem−loop structure including the twelve downstream nucleotides. The same colour code is used as in the left panel. (B) Schematic view of the relevant genetic features of the different strains used. The P and P promoters are indicated with light and dark blue boxes, respectively. The gfp reporter (gfp) and the chloramphenicol resistance gene (cat) are indicated with green and purple arrows, respectively. The transcriptional terminator Ter is indicated with a red lollipop symbol. (C) Fluorescence level determined by FACS analyses of cells grown in the absence or presence of 1 mM IPTG relative to the fluorescence levels obtained for strain AND101 grown in the presence of IPTG. Samples were withdrawn from late exponentially grown cultures (OD600 between 0.8 and 1). Color code bars: grey and red, strains having a construct lacking or containing Ter, respectively; light and dark, strains grown in the absence or presence of IPTG, respectively. Fluorescence levels are expressed as the mean value of the Geomean values of 100 000 cells obtained in three independent experiments. Error bars represent standard deviation. The main conjugation promoter P, located upstream of gene 28, is considerably stronger than the fully induced P promoter (30). To test if Ter could terminate transcription from the P promoter, we constructed strains in which gfp (strain AND148) or the [Ter- gfp] construct (strain AND357) was downstream from the P promoter (Figure 2B). The much higher fluorescence levels obtained for the strain AND148 in which gfp was fused to P (Figure 2C), confirmed that P is stronger than the P promoter. Importantly, as observed for P, the presence of Ter downstream of the P promoter in strain AND357 resulted in a ∼50-fold decrease in fluorescence level (Figure 2C). These results agree with previous observations that terminators can terminate transcription with different intrinsic efficiencies but that the efficiency is not or only moderately influenced by the strength of the upstream promoter (48–52). Importantly, these results showed that Ter is an efficient transcriptional terminator.

pLS20 contains a cis-acting bipartite antitermination system composed of gene 28 and its downstream region

pLS20cat gene 29 is preceded by a constitutive weak promoter, P, that drives expression of genes 29 and 30, independent of the P promoter (30). Besides promoter P, the rather large (383 bp) intergenic gene 28–29 region contains some conspicuous features (see below). To test for possible AT activity the experiments described below were based on constructs containing gene 28 with or without 315 bp of the downstream DNA, which we refer to as extended (ext) or minimal (min) constructs, respectively. First we tested whether ectopic expression of gene 28-min or 28-ext could restore efficient conjugation of plasmid pLS20Δ28. Strains harbouring pLS20Δ28, and containing an ectopic copy of 28-min or 28-ext under the control of the xylose inducible P promoter at the chromosomal lacA locus, were used as donor strains in conjugation experiments. Ectopic expression of 28-min (strain AND84) or 28-ext (strain AND178P) did not restore efficient conjugation (Figure 1B), nor did it elicit AT at the Ter terminator located elsewhere on the chromosome (amyE), as measured by fluorescence from a P-[Ter]-gfp construct (Supplementary Table S2). To analyse whether gene 28-min/ext could antiterminate Ter in a cis configuration we engineered strains in which copies of gene 28-min or 28-ext were located upstream of Ter in the same transcriptional unit (i.e. P-[28-min/ext]-[Ter]-gfp), together with control strains lacking Ter (see Figure 3A). As expected, when grown in the presence of IPTG, high fluorescence levels were observed for the control strains AND101, AND115 and AND118 lacking Ter (Figure 3B). Importantly, similarly high fluorescence levels were also obtained for the strain in which gene 28-ext (strain AND210), but not 28-min (strain AND181), preceded Ter (Figure 3B). These results indicate that: (i) gene 28, together with (part of) the downstream region, function as a transcriptional antiterminator that efficiently eliminates transcriptional termination at Ter and (ii) the AT system is only functional when located in cis, probably to limit its activity to the conjugation operon. To support the conclusion that the gene 28 downstream region is essential for AT, we constructed plasmid pLS20Δds-I, a derivative of pLS20cat in which the 356 bp intergenic region after the stop codon of gene 28 up to the RBS of gene 29 (downstream region I) was deleted (Figure 1A). The absence of this region had similar negative effects on the efficiency of conjugation as did deletion of gene 28 (Figure 1B, strain AND98). Based on these results we refer to gene 28 as conAn1 (conjugation Antitermination element 1) and the downstream region as conAn2 (conjugation Antitermination element 2).
Figure 3.

pLS20cat gene 28 and its downstream region inhibit the termination activity of Ter30 when they precede the terminator on the same transcriptional unit. (A) Relevant genetic features of the different strains used. Light and dark brown arrows indicate pLS20cat gene 28 without (28-min) and with its 315 bp downstream region (28-ext), respectively. Oher symbols and color codes are the same as in Figure 2B. (B) FACS-determined fluorescence levels of cells of different strains normalized against the fluorescence level obtained for AND101 cells. Grey and red bars, strains having a construct lacking or containing Ter30, respectively. Relevant genotypes of the strains are given in panel A. Conditions and symbols are identical as described in legend of Figure 2C. Error bars represent standard deviation.

pLS20cat gene 28 and its downstream region inhibit the termination activity of Ter30 when they precede the terminator on the same transcriptional unit. (A) Relevant genetic features of the different strains used. Light and dark brown arrows indicate pLS20cat gene 28 without (28-min) and with its 315 bp downstream region (28-ext), respectively. Oher symbols and color codes are the same as in Figure 2B. (B) FACS-determined fluorescence levels of cells of different strains normalized against the fluorescence level obtained for AND101 cells. Grey and red bars, strains having a construct lacking or containing Ter30, respectively. Relevant genotypes of the strains are given in panel A. Conditions and symbols are identical as described in legend of Figure 2C. Error bars represent standard deviation.

The conAn2 region does not include promoter P

The antiterminator protein Q of bacteriophage λ binds between the −35 and −10 boxes of the late operon promoter PR′, which allows Q to associate with the RNA polymerase (RNAP) after it binds to PR, generating an RNAP-Q complex that can antiterminate the downstream tR′ terminator (53). To test whether ConAn1 used promoter P to associate with the RNA polymerase, we constructed two derivatives of pLS20cat lacking either the 80 bp region containing promoter P (30), or containing P but lacking the remaining intergenic gene 28–29 region (Figure 1A). The results presented in Figure 1B show that the absence of the P29 promoter (pLS20ΔP29, strain AND99) had almost no effect on the conjugation efficiency compared with pLS20cat, whereas deletion of the intergenic region (pLS20Δds-II, strain AND100) resulted in much lower conjugation efficiencies, similar to those for pLS20Δ28 and pLS20Δds-I (Figure 1B). These results demonstrate that all or part of the 283 bp DNA region located between conAn1 and the P29 promoter is required for conjugation, but P29 is not.

conAn can antiterminate other transcriptional terminators located inside or outside the conjugation operon

In silico analyses predicted that the conjugation operon of pLS20 contains 22 additional potential intrinsic terminators besides Ter, that might terminate transcripts originating from the conjugation promoter P, and thus be regulated by the conAn system (Supplementary Figure S2A and Table S3). Five of these, all predicted to constitute an intrinsic terminator by at least two of the three algorithms used, were chosen at random for further analysis. These putative transcriptional terminators, located near the end or downstream of genes 33, 35, 47, 53 and 62, are referred to as Ter, Ter, Ter, Ter and Ter, respectively. As a control, we also analysed a transcriptional terminator taken from downstream of the chloramphenicol resistance gene from an unrelated genetic element, plasmid pC194 (54), which we named Ter. The predicted features of these putative terminators are given in Supplementary Figure S3. The same strategy as used above for Ter was employed to determine whether any of these regions contain a functional terminator, and if so whether the conAn system was able to overcome transcriptional termination at that site. Supplementary Figure S2C shows that the presence of each of the six predicted terminators preceding the gfp gene resulted in a clear decrease in the level of fluorescence compared to the control strain AND101 lacking a terminator. This shows that each region indeed contains a functional transcriptional terminator, most likely corresponding to the sequences given in Supplementary Figure S3. Different reductions in fluorescence were obtained, indicating that transcription was terminated with different efficiencies. To test whether the conAn system was able to antiterminate these terminators, we engineered a set of strains in which the conAn system was placed in front of each terminator, resulting in the configuration (P-[conAn1+2]-[Terx]-gfp). When grown in the presence of IPTG, significant increases in the level of fluorescence were obtained for each of these strains compared to the corresponding strain lacking the conAn1+2 system (Supplementary Figure S2C). Such a [conAn1+2]-dependent increase in fluorescence was also observed for the strain containing Ter from plasmid pC194. Therefore, the conAn system of pLS20 acted as an antiterminator on all of the transcriptional terminators analysed, regardless of whether they form part of the conjugation operon.

ConAn2 exerts antitermination activity and ConAn1 is required for processive antitermination

We anticipated that both conAn1 and conAn2 would be required for the observed anti-termination effects. To our surprise though, we found that strain AND131 in which Ter was preceded by only conAn2 (i.e. P-[conAn2]-[Ter]-gfp) displayed similarly high fluorescence levels compared to the isogenic strain AND210 containing both conAn1 and conAn2 (Figure 4), demonstrating that the conAn2 region is sufficient for antitermination at the Ter terminator. Results presented in Supplementary Figure S2C show that conAn2 can antiterminate at least five other terminators.
Figure 4.

conAn2 exerts antitermination activity and conAn1 is required for processive antitermination. Cells of late exponential cultures grown in the absence or presence of 1 mM IPTG were subjected to FACS analyses to determine their fluorescence level. Cells corresponded to isogenic strains containing at their amyE locus a cassette in which no or different elements (see also text) are located in between the IPTG-inducible P promoter and gfp as schematically shown. Conditions and symbols are identical as described in legend of Figure 2C. The regions of conAn1 present in strains JV40 and JV41 correspond to codons 1–103 and 82–172, respectively. Error bars represent standard deviation.

conAn2 exerts antitermination activity and conAn1 is required for processive antitermination. Cells of late exponential cultures grown in the absence or presence of 1 mM IPTG were subjected to FACS analyses to determine their fluorescence level. Cells corresponded to isogenic strains containing at their amyE locus a cassette in which no or different elements (see also text) are located in between the IPTG-inducible P promoter and gfp as schematically shown. Conditions and symbols are identical as described in legend of Figure 2C. The regions of conAn1 present in strains JV40 and JV41 correspond to codons 1–103 and 82–172, respectively. Error bars represent standard deviation. These results suggest that conAn2 is mechanistically responsible for AT, which contradicts the results in the earlier section that AT on pLS20cat required both conAn1 and conAn2 (Figure 1B). One potentially important difference between the native setting on pLS20cat and the AT screening system is that in the former case Ter (the first terminator of the conjugation operon) is located 1,475 bp downstream of conAn2, whereas this distance is only 285 bp in the AT screening system. To examine if the distance between the conAn2 and the terminator affected the efficiency of conAn1-independent AT we constructed derivatives of the AT screening strain with insertions of 824 bp or 1475 bp between conAn2 and Ter (i.e. P-[conAn2]-[spacer]-[Ter]-gfp) (see Figure 4). Indeed, increasing the distance between conAn2 and the terminator by 824 bp (strain AND223) or 1475 bp (strain AND262) caused fluorescence levels to drop by 73 and 84%, respectively (Figure 4). Importantly, high level fluorescence (i.e. efficient antitermination) was regained when conAn1 was added to these constructs (i.e. conAn1+2, Figure 4 strains AND222 and AND261). These results show that conAn1 is required for conAn2 to act at long distances (i.e. P-AT).

A short GC-rich region and an inverted repeated structure are essential for conAn2 function

The sequence of the gene 28–29 intergenic region is shown in Figure 5A. The region absent in pLS20Δds-II, which had an extremely low conjugation efficiency, has four striking features. First, a 20 bp region located near the 5′ end is duplicated in inverse orientation near the 3′ end of the region. In other words, a 238 bp long central region is flanked by a pair of 20 bp inverted repeat sequences. Second, a 16 bp GC-rich region (75% versus 42% for the entire conAn2 region) is located 50 bp from the 5′ end of the fragment. Third and fourth, inverted repeated structures that could form stem-loop structures in RNA with calculated free energies of −10.1 and −6.4 kCal/mol are located around positions 75 (‘SL1’)and 225 (‘SL2’) from the 5′ end, respectively. To test if these features were important for antitermination we used our standard antitermination assay (effect on expression of gfp downstream of Ter30) to compare mutant and wild type versions of ConAn2 in a ConAn1+2 context. A schematic presentation of the positions that were subjected to mutagenesis is given in Figure 5B. Details of the introduced mutations in each fragment are given in supplementary Figure S4.
Figure 5.

Features within conAn2 crucial for antitermination. (A) Sequence of the 383 bp intergenic region between genes 28–29, together with the upstream-located TAA stop codon of gene 28 (written in red and highlighted with three red asterisks) and the downstream-located ATG start codon of gene 29 (the last 3 bp written in green and with a green letter ‘M’ reflecting the starting methionine residue). Numbering starts at the 5′ positioned 20 bp inverted repeat, which together with the 3′ positioned repeated sequence are indicated in blue and over lined with blue arrows. The sequence of the GC-rich region is shown in white on a red background. Regions highlighted in orange and purple correspond to regions predicted to form stem-loop structures in RNA with calculated free energies of −10.1 and −6.5 kcal/mol, respectively. The −35 and −10 boxes of promoter P are indicated with blue boxes, and the likely Ribosome Binding Site (RBS) of gene 29 is indicated with a grey box. (B) Schematic representation of the conAn2-region and derivatives containing mutations in one or more features together with their effect on antitermination activity. Positions of the conspicuous features within the conAn2 region are given in the top line using the same colour code as in ‘A’. Lines below the top line represent wild type (wt) and derivatives containing mutations in one or more the features. For simplicity the fragments containing different mutations are numbered as indicated. conAn2b,c,d contain mutations in the 20 bp inverted repeats located at the 5′, 3′ end or at both ends, respectively. The mutations in the 5′ and 3′-located 20 bp in conAn2b and conAn2c are complementary such that these sequences constitute an inverted repeat of 20 bp in conAn2d. The mutations in conAn2e lower the GC content in this region from 75 to 38%. The mutations in conAn2f were designed to disrupt the formation of a stem-loop when this region is transcribed into RNA. The mutations in conAn2g also correspond to the stem-loop region but alter only positions of the predicted loop region. conAn2h contains the combined mutations present in conAn2e-g. Finally, similar to conAn2f and conAn2g, the mutations in conAn2i and conAn2j were designed to disrupt the formation of a stem-loop when this region is transcribed into RNA, and to mutate positions in the loop, respectively. Left column gives the name of the strains containing the cassette with the configuration (P-[conAn1]-[conAn2x]-[Ter]-gfp), in which conAn2x represents the wild type or a derivative containing the mutations indicated. The second column refers to the fragments used for cloning. Third column indicates feature(s) containing mutations, which are indicated with red asterisks. Fourth column presents the level of antitermination activity of each derivative relative to strain AND210 containing the wild type conAn2-region. Conditions and symbols are identical as described in legend of Figure 2C. Error bars represent standard deviation.

Features within conAn2 crucial for antitermination. (A) Sequence of the 383 bp intergenic region between genes 28–29, together with the upstream-located TAA stop codon of gene 28 (written in red and highlighted with three red asterisks) and the downstream-located ATG start codon of gene 29 (the last 3 bp written in green and with a green letter ‘M’ reflecting the starting methionine residue). Numbering starts at the 5′ positioned 20 bp inverted repeat, which together with the 3′ positioned repeated sequence are indicated in blue and over lined with blue arrows. The sequence of the GC-rich region is shown in white on a red background. Regions highlighted in orange and purple correspond to regions predicted to form stem-loop structures in RNA with calculated free energies of −10.1 and −6.5 kcal/mol, respectively. The −35 and −10 boxes of promoter P are indicated with blue boxes, and the likely Ribosome Binding Site (RBS) of gene 29 is indicated with a grey box. (B) Schematic representation of the conAn2-region and derivatives containing mutations in one or more features together with their effect on antitermination activity. Positions of the conspicuous features within the conAn2 region are given in the top line using the same colour code as in ‘A’. Lines below the top line represent wild type (wt) and derivatives containing mutations in one or more the features. For simplicity the fragments containing different mutations are numbered as indicated. conAn2b,c,d contain mutations in the 20 bp inverted repeats located at the 5′, 3′ end or at both ends, respectively. The mutations in the 5′ and 3′-located 20 bp in conAn2b and conAn2c are complementary such that these sequences constitute an inverted repeat of 20 bp in conAn2d. The mutations in conAn2e lower the GC content in this region from 75 to 38%. The mutations in conAn2f were designed to disrupt the formation of a stem-loop when this region is transcribed into RNA. The mutations in conAn2g also correspond to the stem-loop region but alter only positions of the predicted loop region. conAn2h contains the combined mutations present in conAn2e-g. Finally, similar to conAn2f and conAn2g, the mutations in conAn2i and conAn2j were designed to disrupt the formation of a stem-loop when this region is transcribed into RNA, and to mutate positions in the loop, respectively. Left column gives the name of the strains containing the cassette with the configuration (P-[conAn1]-[conAn2x]-[Ter]-gfp), in which conAn2x represents the wild type or a derivative containing the mutations indicated. The second column refers to the fragments used for cloning. Third column indicates feature(s) containing mutations, which are indicated with red asterisks. Fourth column presents the level of antitermination activity of each derivative relative to strain AND210 containing the wild type conAn2-region. Conditions and symbols are identical as described in legend of Figure 2C. Error bars represent standard deviation. The strains were grown in the presence of IPTG and subjected to FACS analyses. As shown in Figure 5B, alterations in the 20 bp inverted sequences located at 5′ end (strain AND154), or at the 3′ end (strain AND156) or at both ends (strain AND157) (constructs conAn2b-d) did not significantly affect the level of antitermination. However, mutations in the 16 bp GC rich region (conAn2e, strain AND159) and/or the region SL1 (conAn2f-g, strains AND168, AND162) greatly reduced the level of fluorescence, suggesting that both of these features are required for efficient antitermination. Based on these results, it was not surprising that efficient antitermination was also abolished with conAn2h (strain AND170), which contained the combined mutations of conAn2e-g. Finally, antitermination was also affected in conAn2i-j, which contained mutations in region SL2 (strains AND165 and AND172), although the effects of these mutations were less dramatic than those in the GC rich region, or SL1. In summary, the GC-rich and SL1 regions are crucial for efficient antitermination, whereas region SL2 has a less important role.

The antitermination processivity function of ConAn1 is confined to the N-terminal half of the protein

To see whether conAn-like systems were present on other genomes or plasmids, we performed in silico analyses and found that numerous conjugative plasmids of Gram+ bacteria encode a protein that shares similarity with ConAn1 of pLS20. To distinguish between putative homologs, we have added the name of the plasmid (subscript) to the conAn element name; i.e. the conAn1 and conAn2 elements of pLS20 are referred to as conAn1 and conAn2, respectively. Remarkably, many of the identified putative ConAn1 homologs were smaller in size and corresponded to approximately the N-terminal half of ConAn1pLS20. To test whether the N- or C-terminal halves of ConAn1pLS20 were functional in vivo we constructed strains JV40 and JV41 that contained the AT screening cassette with the configuration (P-[X]-[conAn2]-[824 bp spacer]-[Ter]-gfp), in which ‘X’ corresponds to conAn1Nter or conAn1Cter, respectively. As shown in Figure 4, high and low fluorescence levels were obtained for cells of strains JV40 and JV41, respectively, showing that the function required for processive AT is contained in the N-terminal half of ConAn1pLS20.

conAn antitermination systems are present on other conjugative plasmids of Gram-positive bacteria, including pathogens

Selection of conjugative plasmids from Gram-positive bacteria to analyze if they contain a bipartite conAn-type antitermination system

We selected eight (putative) conjugative plasmids of different Gram+ bacteria, including several pathogens, to test if they possess a bipartite conAn-(like) system. Although little is known about conjugative plasmids from Gram+ bacteria in general, there are a few exceptions. The Bacillus thuringiensis plasmid pAW63 has been studied for >20 years, and the enterococcal plasmids pAD1 and pCF10 for over 40 years (55–57). This was a main reason to select these three plasmids. Another reason to include pAD1 and pCF10 was that they contain virulence determinants (55,56). Moreover, pAD1 and pCF10 are harboured by E. faecalis strains which allowed comparison of conAn-type systems present on different plasmids harboured by the same bacterial species. This was also a major reason to select pFR55, which, like pAW63, is harboured by B. thuringiensis. Plasmid p576 was chosen for a similar reason. p576 is harboured by a Bacillus pumilus strain, which is closely related to B. subtilis, and we have shown before that p576 is closely related to pLS20 (58,59). Plasmid pN2 was chosen because it is present in Lactobacillus paracasei and hence, like pLS20, may play a role in horizontal gene transfer in the guts of humans and animals. Finally, plasmids p57330 and pLM5578 were selected because they are harboured by the pathogens Clostridium sporogenes and Listeria monocytogenes, respectively. In silico analyses suggested that each of the deduced conjugation operons of these plasmids contains multiple intrinsic transcriptional terminators (not shown), suggesting that they also require a P-AT system to overcome the termination sites for proper expression of the conjugation genes. Properties of these plasmids are given in Supplementary Table S4.

In silico identification of poorly conserved putative bipartite conAn systems at the beginning of the conjugation operon on each of the eight plasmids

In silico analyses identified a putative conAn system near the beginning of the (putative) conjugation operon on each of these plasmids, that would encode either a large (p576, pLM5578) or a small (pAW63, pFR55, pAD1, pCF10, pN2 and p57330) ConAn1pLS20 ortholog. These putative ConAn1 orthologs share only limited similarity with ConAn1pLS20 (ranging from 13 to 33% identity, Supplementary Table S4); and even ConAn1 orthologs encoded by plasmids present in different strains of the same genus share low identity levels between 16 and 23%. A similar pattern was found for the putative conAn2 regions: their predicted sizes range from about 260 to 600 bp, and the levels of sequence identity with conAn2 was between 30 and 40%, except for the putative conAn2 region of Bacillus pumilus plasmid p576 which shared an identity of 70% with conAn2 (Supplementary Table S5).

All eight plasmids contain a bipartite conAn antitermination system that is functional in their native host

To study whether these regions constitute functional conAn systems we tested possible AT activity in their native genus, except for the region of C. sporogenes plasmid p57330, which was tested in both Enterococcus faecalis and B. subtilis. For this purpose, cassettes with the configuration (P-[conAnX]-[1.5 kb spacer]-[Ter]-gfp) were constructed in which conAn corresponds to one of the putative conAn systems. When B. subtilis strain 168 was used as a host, the cassettes containing the putative conAn systems were placed at the chromosomal amyE locus. For tests using other hosts, the cassettes containing the putative conAn systems were cloned onto the shuttle vector pAT18, which contains the replication functions of the low-copy broad host-range plasmid pAMβ1 (33), and then introduced into the corresponding host bacterium. Control experiments showed that terminator Ter was functional in E. faecalis JH2–2, L. innocua CLIP 11262 and L. casei BL23, and that P was induced by IPTG in these bacteria. As shown in Table 1 and Supplementary Figures S5A and B, with the exception of the putative conAn system of B. pumilus plasmid p576 (strain AND264), all the other regions tested displayed AT activity. These results demonstrate that: (i) in general, functional conAn AT systems can have very different sequences; (ii) some conAn systems from other bacteria can function in B. subtilis, even with low sequence conservation.
Table 1.

Functionality of conAn systems of conjugative plasmids of different Gram-positive bacteria

Functionality bipartite conAn system at 1.5 kb distance from Ter30Functionality conAn2 at short distance from Ter30
Host plasmid conAn system (plasmid) B. subtilis 168 L. inocua CLIP 11262 E. faecalis JH2–2 L. casei BL23 B. subtilis 168 L. inocua CLIP 11262 E. faecalis JH2–2 L. casei BL23
B. subtilis pLS20+++xxx+++xxx
B. pumilus p576-xxx++xxx
B. thuringiensis pAW63++xxx++xxx
pFR55+++xxx++xxx
L. monocytogenes pLM5578+++xx++++xx
E. faecalispAD1+x+++x+x-x
pCF10-x+++x+x++x
L. paracasei pN2++x++++++++x++
C. sporogenes P57330+++x+++x++x++x

The level of antitermination activity was studied by antitermination of Ter. Antitermination activity is expressed as the level of fluorescence observed for the different strains containing a (putative) conAn system with respect to the control strain lacking a conAn system, and considering 100% the level of fluorescence obtained by the conAn system of pLS20. +++, ≥70% fluorescence recovery; ++, (25–70%) fluorescence recovery; +, (10–30%) fluorescence recovery; -, ≤10% fluorescence recovery. x, not tested.

Functionality of conAn systems of conjugative plasmids of different Gram-positive bacteria The level of antitermination activity was studied by antitermination of Ter. Antitermination activity is expressed as the level of fluorescence observed for the different strains containing a (putative) conAn system with respect to the control strain lacking a conAn system, and considering 100% the level of fluorescence obtained by the conAn system of pLS20. +++, ≥70% fluorescence recovery; ++, (25–70%) fluorescence recovery; +, (10–30%) fluorescence recovery; -, ≤10% fluorescence recovery. x, not tested.

The conAn2 component of all eight conAn systems exerts antitermination

We also constructed strains to test if, like conAn2, the putative conAn2 regions from the various plasmids conferred AT activity when located immediately upstream of Ter. Analysis of these strains showed that all these conAn2 regions displayed moderate or (very) efficient AT activities on Ter (Table 1 and Supplementary Figure S6A and B), with the exception of conAn2 which only showed a low level of AT activity. Therefore, except perhaps for conAn2, the conAn2 regions located downstream of their respective conAn1 genes are also mechanistically responsible for AT in the other plasmids tested. Interestingly, although the conAn2 region alone resulted in good levels of AT when located just upstream of Ter (Supplementary Figure S6A, strain AND422), no AT activity was observed for the complete bipartite conAn system when tested at a large distance from Ter (Supplementary Figure S5A, strain AND264), suggesting that ConAn1p576 is not functional in B. subtilis.

conAn systems display variable antitermination activities in heterologous hosts

Having shown that the conAn system of the C. sporogenes plasmid p57330 was functional in the heterologous hosts B. subtilis 168 (Supplementary Figure S5A) and E. faecalis JH2–2 (Supplementary Figure S5B), we next tested whether one or more of the conAn2 regions alone and/or the [conAn1+2] systems of the listerial, enterococcal, lactobacillus and clostridial plasmids were functional in B. subtilis. As shown in Table 1 and Supplementary Figures S5A and S6A, the conAn2 elements and the bipartite [conAn1+2] systems of the listerial, lactobacillus and clostridial plasmids all displayed AT activities in B. subtilis with different efficiencies that ranged from moderate to good. Significant AT activities were also observed for the conAn2 regions of the enterococcal plasmids pAD1 and pCF10, although the AT levels were clearly lower than those observed for the other plasmids (Supplementary Figure S6A). Finally, while only a low level of AT was observed for the [conAn1+2] system of pAD1 (strain AND268), no AT activity was observed for the [conAn1+2] system of pCF10 (Supplementary Figure S5A, strain AND274), suggesting that, like ConAn1p576, ConAn1pCF10 is not functional in B. subtilis. In summary, the results showed: (i) that different conAn systems that have little shared similarity are functional in their native hosts; (ii) that some but not all conAn systems can be functional in a heterologous host; and (iii) proper functioning of a P-AT system depends on functionality of both the ConAn1 and conAn2 components in a given host.

The conAn system of pLS20 is the founding member of a large family of bipartite antitermination systems present on conjugative plasmids of Gram-positive bacteria

Next, we searched databases to identify genes encoding proteins sharing similarity with these ConAn1 proteins. Each of the ConAn1 sequences was used as a query in multiple rounds of Psi-blastp searches of the NCBI nr database. These nine searches resulted in a total of 662 hits having an E-value of ≤ 1e−7. The distribution of these identified hits per genus and bacterial species is shown in Supplementary Table S6. After pruning identical sequences, the list was reduced to 531 proteins. Except one hit, which has possibly been annotated incorrectly (see Supplementary Table S6), all other hits corresponded to proteins encoded by Gram+ bacteria belonging to the phylum Firmicutes (see Supplementary Table S6). 45 and 80 proteins had been annotated as PrgR or TraE, respectively, due to their similarities to the PrgR or TraE protein encoded by the enterococcus plasmids pCF10 and pAD1, respectively. All others had been annotated as ‘hypothetical protein’. The phylogenetic tree presented in Figure 6 shows that, except for the ConAn1 proteins encoded by Bacillus plasmids pLS20 and p576, all other ConAn1 protein tested are located in a separate branch. This indicates that the different ConAn1 proteins are evolutionary distantly related, which corroborates the low level of sequence similarity between them. It also suggests that the ConAn1 proteins identified here very likely do not represent all ConAn1 proteins. ConAn1pLS20 belongs to the group of ‘large’ ConAn1 proteins. All large ConAn1 proteins that contain an additional C-terminal domain form part of the two branches shown on a grey background in Figure 6, which shows that they form a minority. It should be mentioned that the eight putative ConAn1 members encoded by plasmids from Exiguobacterium and Planococcus species (plasmid pFR55 branch) also contained an additional domain. In these cases, the processivity domain (Pfam domain pXO2–34, PF17362) appeared to be fused to an N-terminal domain predicted to contain a helix-turn-helix DNA-binding motif (Pfam domain HTH_3, PF01381).
Figure 6.

Phylogenetic tree of ConAn1 proteins. The phylogenetic tree was constructed using the program ‘MEGA-X’ and phylogeny was built using the Neighbour-joining algorithm (see Materials and methods). The tree was build using a set of 141 representative proteins having less than 95% sequence identity (obtained by MMSEQ2). The 141 ConAn1 members were clustered into nine clearly distinct branches. ConAn1 members having an additional C-terminal domain of unknown function are restricted to two branches (shown on a grey background). The nine ConAn1 members for which antitermination activity has been demonstrated (see text) are indicated. The majority of ConAn1 proteins clustered within each branch belong to the same bacterial genus, which are indicated with different colours.

Phylogenetic tree of ConAn1 proteins. The phylogenetic tree was constructed using the program ‘MEGA-X’ and phylogeny was built using the Neighbour-joining algorithm (see Materials and methods). The tree was build using a set of 141 representative proteins having less than 95% sequence identity (obtained by MMSEQ2). The 141 ConAn1 members were clustered into nine clearly distinct branches. ConAn1 members having an additional C-terminal domain of unknown function are restricted to two branches (shown on a grey background). The nine ConAn1 members for which antitermination activity has been demonstrated (see text) are indicated. The majority of ConAn1 proteins clustered within each branch belong to the same bacterial genus, which are indicated with different colours.

DISCUSSION

Here, we demonstrate the presence of a novel bipartite AT system (conAn) in several plasmid conjugation operons, and which is probably widespread on conjugative plasmids of Gram+ bacteria. The system is located near the beginning of the conjugation operon; the conAn2 RNA component is essential and mechanistically responsible for antitermination, and the ConAn1 protein is required for processivity. ConAn is a genuine P-AT system because its activity is not limited to a single terminator, instead it acts on multiple terminators over a long range. The relative few processive antitermination systems described so far are constituted by either a protein or an RNA component (for review see, 20). Antiterminator proteins can act in trans and are recruited to transcription elongation complexes (TECs) in different ways, while the systems using RNA-based antitermination factors function only in cis (for review see, 20). The RNA component (conAn2) and the protein component (ConAn1) of the conAn system only function efficiently in cis, as ectopic expression of conAn1 or conAn1+conAn2 did not restore efficient conjugation of pLS20Δ28 (Figure 1B). Moreover, in the gfp-based setup in which conAn2 is located 1.5 kb upstream of Ter, ectopic expression of conAn1 also did not result in efficient conAn2-mediated antitermination (our unpublished results). The cis-dependent activity of the conAn system would contribute to limiting antitermination strictly to the conjugation operon. Presently, only two other RNA-based antiterminators are known: put elements of the lambdoid phage HK022 and the EAR element present within or upstream of an operon encoding biofilm or capsular polysaccharide proteins in Bacilli (16,18,19,60). Both are very different from the conAn2 regions described here. First and most importantly, whereas conAn2 requires ConAn1 to act at large distance, the put and EAR elements appear to act alone to achieve P-AT, although in the latter case the requirement of a cellular cofactor was not excluded (19). In addition, the conAn2 region is much larger than the put and EAR elements. Our in silico analysis suggests that conAn-type P-AT systems are widespread on conjugative plasmids of Gram+ bacteria, with over 500 conAn–like systems identified. Interestingly, most of the identified (putative) ConAn1 homologs are small monodomain proteins, corresponding to the N-terminal part of the ConAn1pLS20 protein that we showed is responsible for the processivity. In a minority of the ConAn1 proteins, including ConAn1pLS20, this processivity domain appears to be fused to another domain of unknown function. In other putative ConAn1 members, such as those encoded by Exiguobacterium and Planococcus species, the processivity domain appears to be fused to an N-terminal domain predicted to contain a helix-turn-helix DNA-binding motif. Future studies will be required to determine the function of these additional domains. At this moment, the exact mechanism by which conAn2 exerts antitermination is unknown, but it may do so according to at least one of the following three models that are in line with the obtained results. In the first model, conAn2 would exert antitermination in an indirect way, by binding to the core components of the RNAP complex or/and one or more transcription elongation factors. These interactions would alter the conformation and/or orientation of the RNA exit channel, or affect the configuration of a transcription elongation factor like NusA, such that stem-loop formation is affected when the terminator region is transcribed, or affect pausing and/or backtracking. In the second and third models, conAn2 would enable antitermination in a direct manner. In the second model, conAn2 would also bind directly to one or more protein components in the transcription elongation complex. This binding would place the conAn2 structure in a specific configuration near the RNA exit channel such that the stem-loop with calculated free energy of -10 kCal/mol and/or the GC-rich region would be able to transiently hybridize with de novo synthesized 5′ stem sequences of a terminator, thereby interfering with proper formation of a functional terminator. The third model is similar to the second one, but in this case, the conAn2 region would have no intrinsic affinity for component(s) of the transcription elongation complex, and fully depend on ConAn1 for proper configuration. The gut microbiome has high concentrations of bacteria and harbours a diverse reservoir of antibiotic resistance (AR) genes (61), so it would be expected to be a niche that is particularly apt for the transfer of AR genes between commensals and pathogenic bacteria. However, the rates of transfer of intrinsic antibiotic resistance genes between species or genera in the gut are much lower than expected (62). Our finding that several of the conAn2 elements and ConAn1 processivity factors tested were less or hardly functional in heterologous hosts may provide an explanation for the low frequency of transfer in the gut. Our results indicate that both the ConAn1 and conAn2 components must make functional interactions with the transcription machinery of the host to achieve efficient P-AT. The lower efficiency or non-functionality of conAn systems in heterologous hosts may not be surprising taking into account the low level of sequence conservation between them. We have previously shown that the pLS20 conjugation operon is under the control of the strong P promoter, whose activity is strictly regulated at different levels including DNA looping. Our new results suggest that the principal function of the P promoter is not simply to control directly the expression of all the conjugation genes, but more to regulate the expression of the surface exclusion gene ses, gene 30 of unknown function, and the conAn system. The conAn system in combination with multiple terminators most likely serves several purposes. First, it allows the coordinated expression of many genes located within a long operon. Second, in combination with additional promoters it allows differential expression of one or more genes within the conjugation operon. An example of this is the surface exclusion gene ses. When conjugation is not activated, the weak constitutive promoter P results in low levels of SespLS20, sufficient to inhibit the transfer of pLS20 between donor cells 10-fold (30). Because transcripts starting at P will terminate at Ter located downstream of gene 30, gene 29 and 30 are expressed at low levels in all donor cells. However, in cells having activated the conjugation genes, ses are also expressed from the much stronger P promoter and the elevated levels of SespLS20 inhibit >1,000-fold the transfer of pLS20 between two conjugation primed donor cells (30). In addition, these transcripts read through Ter, switching on the conjugation process. The conAn system also differentially expresses a toxin and antitoxin gene in conjugation primed and non-primed donor cells (our unpublished results, A.M., J.V., L.W., J.E. and W.M.). These examples highlight a novel function for a P-AT system: allowing differential expression of genes or cassettes embedded within the operon. However, the pLS20 conjugation operon contains many more terminators than Ter. We consider it likely that these terminators contribute to the strict regulation of expression of the conjugation genes; i.e. ensuring that transcripts encoding the conjugation proteins are only generated when environmental signals are received, processed and lead to the activation of the P promoter. ‘Unintended’ transcription from cryptic promoters, named spurious transcription, is known to occur on a rather large scale (63–65). Without terminator sites inside the large (37 kb) conjugation operon, spurious transcription with the same directionality as the conjugation genes would generate transcripts until the end of the conjugation operon and hence undermine the strict regulation of the P promoter. The presence of multiple terminators inside the conjugation operon therefore will limit the likely negative effects of spurious transcription. The bypass mechanism to allow ‘intended’ transcripts initiated at P promoter to traverse the operon is achieved by the conAn antitermination system. This view is supported by our RNAseq data of a pLS20cat-harbouring strain in which rco, -the repressor of the P promoter-, was ectopically overexpressed from a chromosomal locus. Although little transcription was observed for the plasmid region corresponding to the conjugation operon, low levels of transcripts were observed along the conjugation operon with some spikes, which disappeared at positions coinciding with the (putative) terminators. In summary, we propose that, the conAn system contributes to proper expression of the conjugation genes in three important ways: by promoting processivity across the whole 37 kb operon; by enabling differential regulation of genes within the operon; and by limiting the effects of spurious transcription.

DATA AVAILABILITY

RNAseq data have been deposited to the NCBI SRA repository (Project PRJNA717725). Flow Cytometry data is available from the corresponding author upon request. Click here for additional data file.
  62 in total

1.  Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles.

Authors:  D Gautheret; A Lambert
Journal:  J Mol Biol       Date:  2001-11-09       Impact factor: 5.469

Review 2.  Enterococcal sex pheromones: signaling, social behavior, and evolution.

Authors:  Gary M Dunny
Journal:  Annu Rev Genet       Date:  2013-09-11       Impact factor: 16.830

Review 3.  Rho Protein: Roles and Mechanisms.

Authors:  Pallabi Mitra; Gairika Ghosh; Md Hafeezunnisa; Ranjan Sen
Journal:  Annu Rev Microbiol       Date:  2017-07-21       Impact factor: 15.500

Review 4.  Mechanisms of Bacterial Transcription Termination.

Authors:  Jeffrey W Roberts
Journal:  J Mol Biol       Date:  2019-04-09       Impact factor: 5.469

5.  Efficient transformation of Lactobacillus sake by electroporation.

Authors:  Françoise Berthier; Monique Zagorec; Marie Champomier-Vergès; S D Ehrlich; Françoise Morel-Deville
Journal:  Microbiology (Reading)       Date:  1996-05       Impact factor: 2.777

6.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

Review 7.  Prospects for riboswitch discovery and analysis.

Authors:  Ronald R Breaker
Journal:  Mol Cell       Date:  2011-09-16       Impact factor: 17.970

8.  Shuttle vectors containing a multiple cloning site and a lacZ alpha gene for conjugal transfer of DNA from Escherichia coli to gram-positive bacteria.

Authors:  P Trieu-Cuot; C Carlier; C Poyart-Salmeron; P Courvalin
Journal:  Gene       Date:  1991-06-15       Impact factor: 3.688

9.  Inhibition of a transcriptional pause by RNA anchoring to RNA polymerase.

Authors:  Natalia Komissarova; Tatiana Velikodvorskaya; Ranjan Sen; Rodney A King; Sarbani Banik-Maiti; Robert A Weisberg
Journal:  Mol Cell       Date:  2008-09-05       Impact factor: 17.970

10.  Pervasive transcription: detecting functional RNAs in bacteria.

Authors:  Meghan Lybecker; Ivana Bilusic; Rahul Raghavan
Journal:  Transcription       Date:  2014-10-30
View more
  2 in total

1.  Establishment Genes Present on pLS20 Family of Conjugative Plasmids Are Regulated in Two Different Ways.

Authors:  Jorge Val-Calvo; Andrés Miguel-Arribas; Fernando Freire; David Abia; Ling Juan Wu; Wilfried J J Meijer
Journal:  Microorganisms       Date:  2021-11-29

2.  Conjugation Operons in Gram-Positive Bacteria with and without Antitermination Systems.

Authors:  Andrés Miguel-Arribas; Ling Juan Wu; Claudia Michaelis; Ken-Ichi Yoshida; Elisabeth Grohmann; Wilfried J J Meijer
Journal:  Microorganisms       Date:  2022-03-08
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.