Literature DB >> 29683387

Primary transcriptome analysis reveals importance of IS elements for the shaping of the transcriptional landscape of Bordetella pertussis.

Fabian Amman1, Alexandre D'Halluin2, Rudy Antoine2, Ludovic Huot2, Ilona Bibova3, Kristina Keidel3, Stéphanie Slupek2, Peggy Bouquet2, Loïc Coutte2, Ségolène Caboche2, Camille Locht2, Branislav Vecerek3, David Hot2.   

Abstract

Bordetella pertussis is the causative agent of whooping cough, a respiratory disease still considered as a major public health threat and for which recent re-emergence has been observed. Constant reshuffling of Bordetella pertussis genome organization was observed during evolution. These rearrangements are essentially mediated by Insertion Sequences (IS), a mobile genetic elements present in more than 230 copies in the genome, which are supposed to be one of the driving forces enabling the pathogen to escape from vaccine-induced immunity. Here we use high-throughput sequencing approaches (RNA-seq and differential RNA-seq), to decipher Bordetella pertussis transcriptome characteristics and to evaluate the impact of IS elements on transcriptome architecture. Transcriptional organization was determined by identification of transcription start sites and revealed also a large variety of non-coding RNAs including sRNAs, leaderless mRNAs or long 3' and 5'UTR including seven riboswitches. Unusual topological organizations, such as overlapping 5'- or 3'-extremities between oppositely orientated mRNA were also unveiled. The pivotal role of IS elements in the transcriptome architecture and their effect on the transcription of neighboring genes was examined. This effect is mediated by the introduction of IS harbored promoters or by emergence of hybrid promoters. This study revealed that in addition to their impact on genome rearrangements, most of the IS also impact on the expression of their flanking genes. Furthermore, the transcripts produced by IS are strain-specific due to the strain to strain variation in IS copy number and genomic context.

Entities:  

Keywords:  bordetella pertussis; insertion sequence; transcriptome

Mesh:

Substances:

Year:  2018        PMID: 29683387      PMCID: PMC6161684          DOI: 10.1080/15476286.2018.1462655

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Introduction

Bordetella pertussis, the main causative agent of whooping cough, is a strictly human pathogen, responsible world-wide for an estimated 24.1 million pertussis cases, associated with 160,700 pertussis-linked death in 2014 [1]. The disease affects mainly children, but adolescents and adults are also susceptible [2]. Vaccination programs, first introduced in the 1950's, have resulted in a drastic decrease of pertussis incidence [3]. However, despite a high global vaccination coverage, in many countries a re-emergence of the disease has been observed over the past 10 years, which qualifies whooping cough today as the most prevalent vaccine-preventable childhood disease [4-6]. Pertussis resurgence has been hypothesized to result from a number of possible reasons, including waning of acellular pertussis vaccine efficiency and asymptomatic infections and transmission [7-11]. In addition, recently, many B. pertussis strains lacking pertactin, one of the major vaccine components of acellular vaccines, have been isolated in countries where acellular vaccines are used [12-15]. The lack of pertactin production in these strains is due to various mechanisms, including frame-shift mutations, deletions, insertions and inversions, suggesting vaccine-induced pressure. Massive gene loss, pseudogene formation and a reduction of the genome size of B. pertussis are among the main genomic features of this organism that became apparent when the genomes of several Bordetella species were compared [16]. Observed changes resulted mostly from major genetic rearrangements, including inversion and disruption of genomic islands [17-19]. These rearrangements are essentially mediated by the Insertion Sequence IS481, a mobile genetic element present in more than 230 copies in the B. pertussis genome [16]. Compared to other Bordetella species, this represents an exceptionally high potential for genomic rearrangements and could be one of the drivers of the pathogen ability to escape from vaccine-induced immunity [20]. Insertion Sequences (IS) encode a transposase and are flanked by inverted repeats involved in excision and insertion at a specific consensus target site [21]. These mobile genetic elements are tightly linked to the evolution and adaptation of pathogens to their hosts, to antibiotic resistance and to the response to an environmental stress [22-27]. The exceptional expansion of IS481 in the B. pertussis genome leads to a high structural genome diversity between strains, resulting from homologous recombination and site-directed insertion, despite an otherwise rather conserved genetic content [28]. IS have also the ability to activate expression of the neighboring genes, either by an extended transcription from an internal promoter “escaping the IS” (named Pout or Pin) or by the generation of a hybrid promoter (Phyb) [29,30]. In the latter case, a strong promoter can be formed by insertion of an IS carrying −35 box that is in proper distance from a −10 box located in adjacent chromosomal sequence. It has been shown that the katA gene in the B. pertussis strain BP504 and the bteA gene in the vaccine-derived strain BP155 are transcribed by the outward promoter (Pout) of the neighboring IS481 [31,32]. Because of the numerous copies of IS481 in the B. pertussis genome, their impact on gene transcription is likely to occur at many sites. Thus, IS481 could generate strain-specific transcripts (messenger and regulatory RNA), which might contribute to B. pertussis strain adaptation. To assess the global impact of IS481 on the transcriptome of B. pertussis, we used high through-put sequencing (HTS) combined with different bioinformatics analysis methods to establish the primary transcriptome of the Tohama I reference strain. This study revealed that most IS481 elements impact the transcriptional level of their flanking genes and can result in an uncharacterized regulation. Furthermore, as the distribution of IS elements is discriminative in each strain during the evolution, the transcripts driven by IS481 are strain-specific.

Results

Bordetella pertussis transcriptome analysis

To characterize the primary transcriptome of Bordetella pertussis we used HTS of strand-specific cDNA obtained from transcripts of the Tohama I derivative BPSM, grown in standard conditions at the exponential growth phase [33]. We combined two independent sequencing approaches to maximize information about the transcriptome architecture, a classical RNA-sequencing method (RNA-seq) on a library of strand-orientated cDNA prepared from total RNA depleted for rRNA and differential RNA-seq method (dRNA-seq) detecting the transcriptional start sites (TSS) by using terminator 5′-phosphatedependent Exonuclease (TEX)-treated and -untreated RNA samples. Libraries from four independent biological replicates were created. A total of 36.7 million reads could be mapped to the four samples (19.9 million for TEX-treated and 16.8 million for the untreated libraries). The data obtained by the two approaches were combined to quantify the transcripts abundance and to determine the primary transcriptome architecture (BioProject ID: PRJNA430365). All raw and processed data are compiled in an Assembly Hub (Fig. 1) and can be publicly accessed and incorporated into any UCSC Genome Browser instance by the following address: http://bit.ly/2Euo5HZ.
Figure 1.

Genome-wide RNA-seq and dRNA-seq data representation and deduced transcriptome organization, focus on a selected region. (A) Visualization of RNA-seq normalized dataset (blue: positive strand depth sequencing graph and grey: negative strand depth sequencing graph). Visualization of dRNA-seq normalized dataset (brown: positive strand depth sequencing graph and red: negative strand depth sequencing graph. The color intensity levels depict the different replicates). (B) Deduced transcriptome annotation in regard to published annotation file (NC_002929). Genes annotation are in green and dark green arrows. Pseudogenes are in firebrick red arrows. Repeated regions are in cyan arrows. Predicted transcripts annotation is indicated with orange and purple arrows. TSS positions are in red and dark red. (C) Detailed view on a few annotated genes.

Genome-wide RNA-seq and dRNA-seq data representation and deduced transcriptome organization, focus on a selected region. (A) Visualization of RNA-seq normalized dataset (blue: positive strand depth sequencing graph and grey: negative strand depth sequencing graph). Visualization of dRNA-seq normalized dataset (brown: positive strand depth sequencing graph and red: negative strand depth sequencing graph. The color intensity levels depict the different replicates). (B) Deduced transcriptome annotation in regard to published annotation file (NC_002929). Genes annotation are in green and dark green arrows. Pseudogenes are in firebrick red arrows. Repeated regions are in cyan arrows. Predicted transcripts annotation is indicated with orange and purple arrows. TSS positions are in red and dark red. (C) Detailed view on a few annotated genes.

TSS localization and transcript extent of annotated coding genes

We first used the above described data to determine the transcriptome organization for the annotated genes to determine the TSS and abundance of mRNA transcripts and their relative arrangement. The analysis of the dRNA-seq data using the software TSSAR revealed 2,722 potential TSSs (Fig. S1) [34]. The TSSs were classified according to their genomic context using the annotation file NC_002929.2. A TSS within ≤ 250 nt upstream of annotated genes was classified either as primary TSS for the furthest upstream, and as secondary TSS for the others. TSS localized in the forward orientation within open reading frame (ORF) of annotated genes was classified as internal TSS. The position of 614 primary TSSs regarded as the potential +1 position of transcription could be identified (Table S1). Among these 614 genes, 99 also display at least one secondary TSS. We validated the TSS of several transcripts by 5′RACE. The TSS of several coding transcripts (ptxA, bipA, fim2) and of candidate transcripts (see below) were confirmed or determined using this approach (Fig. S1). The TSSs were then used to determine putative promoter sequence motifs in their upstream region using MEME [35]. Beside a convincing TATA box motif (sequence TANAAT) in the −10 regions, no significant motif was shared by more than 50 TSS (Fig. S2). Among the 614 primary TSS, 35 were localized at the position of the translational start codon of the annotated gene (Fig. S2). More than a third of them are of unknown function or are annotated to code for hypothetical proteins. However, within the functionally annotated genes a large fraction (∼26%) consists of genes encoding regulators, including ompR (BP3222) and basR (BP3534). A hypergeometric test for these 35 genes confirmed that regulators are significantly enriched in the set of leaderless transcripts (p-value 0.0014). On the other side, 120 genes were preceded by a long 5′UTR (> 100 nt). Long 5′ UTRs often contain regulatory structures such as riboswitches or thermosensors. Among the 120 long 5′UTRs, 7 riboswitches were predicted in silico (Fig. S3). Using RT-PCR spanning the region between the predicted riboswitch and adjacent gene, 4 of them were confirmed to be co-transcribed with the adjacent gene (Fig. S3). The three remaining predicted riboswitches (type yybPykoY riboswitch, Cobalamin riboswitch and FMN riboswitch in front of BP3410, BP3595 and ribB, respectively) could not be confirmed by RT-PCR to be part of the same transcript. However, all three correspond to riboswitsches, which are expected to be switched off (leading to aborted transcripts) under the growth conditions used here (i.e. no manganese ion for yybP-ykoY riboswitch, presence of adenosylcobalamin for cobalamin riboswitch and presence of FMN for FMN-riboswitch) [36]. The abortive transcription was confirmed by the absence of RT-PCR products corresponding to the adjacent gene, whereas RT-PCR products were detected using the 5′UTR-specific (Fig. S3). 5′RACE also confirmed the TSSs of the SAH, SAM-alpha and Glycine riboswitches (Fig. S1). To characterize the transcriptional organization of the annotated genes we used the data obtained from RNA-seq. The transcript organization was deduced for 2,262 out of the 3,871 annotated ORF (including IS transposase genes) allowing us to determine the mono- and poly-cistronic genes (NC_002929.2) (see Fig. 1). We could determine 1,315 transcript structures, of which 838 were mono- and 477 were polycistronic transcripts (table S2). A histogram of number of cistrons/operon structure is shown in supplementary data Fig. S4. Furthermore, 5′ overlapping, divergent head-to-head and 3′ overlapping, convergent tail-to-tail transcript organizations were identified. Occasionally the 3′UTR of a transcript extends well beyond the annotated stop codon, sometimes yielding a 3′UTR of up to several hundreds of nt reaching into the antisense region of the neighboring gene. A striking example of this structure is the transcriptional organization of bvgR and bvgAS genes, a major virulence gene regulator system in Bordetella pertussis. The transcript of bvgR overlaps with the bvgS ORF by more than 410 nt and the bvgS transcript overlaps with the bvgR ORF by more than 170 nt (Fig. 2). Thus, including intergenic regions (IGR) these two transcripts share an overlapping region composed of 630 nt. This overlap was confirmed by a two-step RT-PCR described in supplementary data part 6. Within the entire genome, overlapping transcript were detected 37 times for 5′ divergent tail-to-tail overlaps and 89 times for 3′ convergent head-to-head overlaps (table S2).
Figure 2.

bvgR and bvgAS transcription organization. Detailed view of genome browser for the bvgAS and bvgR sequence region. Prediction of transcripts organization is indicated.

bvgR and bvgAS transcription organization. Detailed view of genome browser for the bvgAS and bvgR sequence region. Prediction of transcripts organization is indicated.

TSS classification and identification of sRNA

In addition to determining the transcript organization of the coding genes, we also used the dRNAseq data to define the TSS of internal (I) and antisense (A) transcripts, corresponding to TSS within an annotated ORF on the sense and the anti-sense strand, respectively. Furthermore, we identified orphan (O) TSS, localized in IGR at a ≥ 250 nt distance from the closest gene in the same orientation. The “A” and “O” TSSs can be considered as the transcription starts of potentially new transcripts not detected so far and will be referred in the rest of the text as candidate transcripts. Among the total 362 candidate transcripts deduced from A and O TSS, 118 were localized within IGR (O) and 244 were antisense to annotated genes (Fig. 3). Moreover, the RNA-seq data revealed 268 novel candidate transcripts not overlapping with any of the annotated genes. 218 of them are in the antisense orientation to annotated genes, either within the ORF or overlapping with the 5′ or 3′ ends, and the 50 remaining candidate transcripts are within IGR. The distribution of antisense and orphan candidate transcripts identified by these two approaches is detailed in Fig. 3.
Figure 3.

Candidate transcripts classification. Venn diagrams of candidate transcripts numbers deduced from dRNA-seq and RNA-seq. Number of antisens and orphan candidate transcripts are counted as the function of the detection methods.

Candidate transcripts classification. Venn diagrams of candidate transcripts numbers deduced from dRNA-seq and RNA-seq. Number of antisens and orphan candidate transcripts are counted as the function of the detection methods. Some of these transcripts are localized in the IGR in the vicinity and in the same orientation as the adjacent annotated gene, either upstream, close to the translational start codon, or downstream, next to a stop codon. However, they may potentially correspond to long 5′ or 3′ UTR, respectively. To investigate this possibility further we analyzed three candidate transcripts (numbers 229, 233 and 235). As for the riboswitches, the independent transcription of each candidate with respect to the adjacent gene was tested by a two-step RT-PCR spanning a region between the candidate transcript and the neighboring ORF (Fig. S3). Candidate transcript 233 was detected as an independent transcript by RT-PCR. However, it is upstream of the threonine-tRNA ligase gene and could therefore result from an abortive transcript of the thrS transcript [37,38]. The two remaining candidate transcripts (229 and 235) were confirmed as independent transcripts. The predicted TSSs of 3 other candidate transcripts were validated by 5′RACE: transcript 110 (antisense sRNA to cyaA), 273 (antisense to fhaB) and 521 (Fig. S1 and table S3). All these novel predicted transcripts can be seen as a potential reservoir of regulatory RNAs. Therefore, we plan to analyze the gene expression profiles of identified non-coding RNAs under different growth conditions.

A large proportion of antisense RNA results from transcriptional activity of IS

The B. pertussis IS481 element harbors a promoter, called Pin, which drives the transcription of the transposase gene and a second, outward facing promoter, Pout, localized downstream of Pin in the opposite direction [39]. The Pout activity yields a small transcript which blocks the translation of the transposase mRNA by hybridizing to the ribosomal binding site and destabilizing the mRNA [40,41]. The transcript issued from Pout can also serve as a transcriptional activator for flanking genes in the same orientation. We screened the expression profiles of all IS elements in the BP Tohama I genome (supplementary table S4). For that purpose a sequencing read mapping procedure was performed without filtering of multi-mapping reads in order to detect transcription signal in the repeated sequence regions of the IS. We therefore obtained a profile of expression within all IS which corresponds to the mean IS expression. The promoters' expression levels were assessed by specific reads overlapping IS extremities and genomic flanking regions. The Pin and Pout transcription activities were clearly detected (see example of the IS BP0704 on Fig. 4A). We observed 70 IS (66 IS481, 2 IS1002 and 2 IS1663) for which the Pout transcript extend into the flanking gene which is in the same orientation, resulting in a potential activation of its transcription as described for katA and bteA. Such a configuration is illustrated by Fig. 4B. Interestingly, for 150 other IS (139 IS481, 2 IS1002 and 9 IS1663) the transcript from Pout extends into the flanking gene which is in opposite orientation resulting in the generation of a transcript antisense to these genes (Fig. 4C). Finally, for 23 IS (18 IS481 and 5 IS1663) the Pout transcription extends into an IGR generating potential non-coding RNAs. The transcription starts for the Pout of 10 randomly chosen IS were checked by 5′RACE (Fig. S1) and were all confirmed at the predicted positions as shown in Fig. 4.
Figure 4.

Promoters and transcription in insertion sequences. Gene ORF are indicated as plain blue arrows. IS and inverted repeat left and right (IRL and IRR) are indicated as plain brown arrows. The sequencing depth graphs are shown underneath in green for positive (: same orientation as published sequence) orientated reads and in red for negatively orientated reads. Promoter positions and orientation are indicated as black arrows. (A) General organization of transcription in IS region. (B) Representative example of flanking gene transcription activation due to Pout activity. (C) Representative example of antisense transcription in regard to flanking gene due to Pout activity. (D) Pin polar transcription activity in the gene downstream of the transposase.

Promoters and transcription in insertion sequences. Gene ORF are indicated as plain blue arrows. IS and inverted repeat left and right (IRL and IRR) are indicated as plain brown arrows. The sequencing depth graphs are shown underneath in green for positive (: same orientation as published sequence) orientated reads and in red for negatively orientated reads. Promoter positions and orientation are indicated as black arrows. (A) General organization of transcription in IS region. (B) Representative example of flanking gene transcription activation due to Pout activity. (C) Representative example of antisense transcription in regard to flanking gene due to Pout activity. (D) Pin polar transcription activity in the gene downstream of the transposase. In addition, in 101 cases (97 IS481, 1 IS1002 and 3 IS1663) the transcription of the transposase mRNA driven by Pin extends beyond the IS element, into the downstream gene, either in the same orientation (46 times) or in the opposite orientation (38 times) of the downstream gene, or into an IGR (17 times). Some transcripts (6 IS481) appear to result from a hybrid promoter, where the −10 box is provided by the gene lying close to insertion site and the −35 box is located in the inverted repeat of the IS. The transcription activity of one hybrid promoter (from candidate transcript 256) was confirmed by 5′RACE (Fig. S1). Globally, almost all ISs show an impact on transcriptional activity in the neighboring regions as only 6 IS do not display any additional transcriptional activity under the studied condition. Some IS (94) use simultaneously a Pin and Pout causing intense transcription activity, and two of them show in addition also a Phybrid activity. Table S4 summarizes the detected transcripts resulting from Pout, Pin and Phybrid of all IS in the Tohama I genome. Most of them were annotated by our analysis pipeline as new transcripts. This pool of transcripts may have specific regulatory function which is yet to be characterized. Alternatively, they could also account for functionless byproducts of random IS displacement and therefore, these specific candidates are labeled as such in the ‘Comments' column of the table S3.

Impact of IS number and localization on the B. pertussis transcriptome landscape

In order to assess the potential impact on the transcription generated from the Pout on the B. pertussis transcriptome in general, we examined the numbers and genomic contexts of all the Pout of 21 fully sequenced and annotated B. pertussis genome sequences (Fig. S5). Unique Pout structures identified only in one given strain are listed in the table of Fig. S5. As an example, when we compared Tohama I strain, representing the reference strain of B. pertussis to the strain D420, representing the currently circulating strains, we observed that 9 out of the 246 Pout identified in Tohama I are localized in a unique genomic context (one of them was verified by RT-PCR, Fig. S5) [42]. On the other side, 15 out of 252 D420 Pout promoters are in a unique context (one of them was verified by RT-PCR, Fig. S5). More globally, almost all strains present at least one difference compared to all the other strains. Some strains, such as CS, B203, 137 and, at the extreme, 18323, have a high number of unique Pout contexts. These differences result from differences in IS insertion and rearrangement events. Each difference in the Pout context between strains can thus result in a different levels of gene expression. The levels of transcription driven by Pout were measured by calculating RPKM in the 200-nt region downstream of the left inverted repeat of each IS481 element. A wide range of transcriptional levels was observed. Some Pout had no detectable activity, while some Pout induced high transcriptional rates. A deep RNA-seq experiment was performed on D420 grown in the same condition as BPSM and the resulting Pout RPKM were compared. A high correlation (R2 = 0.8) between log2(RPKM) of the Pout of these two strains was observed indicating the same level of Pout transcriptional activity between the two strains (Fig. S5).

Discussion

In this study we combined two approaches, RNA-seq and differential RNA-seq, to elucidate the detailed organization of the transcriptional landscape of B. pertussis Tohama I, including the operon structures, 5′ UTR, 3′ UTR and novel transcripts. A total of 2,722 potential TSS were identified, 614 of which defined the +1 position of transcription, and 8 were confirmed by 5′RACE. A total of 1,315 transcript structures could be determined, of which 838 were mono- and 477 polycistronic operons. The primary transcriptome of B. pertussis shows a complex regulatory network, defined by transcriptomic structures seen in other pathogens already [43,44]. Among the 614 TSS, 120 yielded long 5′UTR (> 100 nt). Seven out of these 120 long 5′UTR are predicted to be riboswitches, which are involved in functions as diverse as enzyme cofactor sensing (SAM, SAH, TPP, FMN and cobalamin), amino acid sensing (glycine) and metal ion sensing (yybP-ykoY). The presence of riboswitch elements, of which four could be confirmed by RT-PCR to be part of the same operon as the downstream ORF, provides yet another level of possible regulation mechanism of B. pertussis to adequately and rapidly adapt to changing conditions during infection for example in order to coordinate control of crucial processes along the respiratory track of the host. In addition to these transcript structures containing a long 5′-UTR, 35 leaderless transcripts (0.90% of annotated genes) were also detected. Leaderless transcripts defy the well-accepted paradigm by which translation of bacterial mRNA is initiated by binding of the ribosome via the complementary region of 16S rRNA on a Shine-Dalgarno sequence in the 5′ UTR [45]. The number of leaderless transcripts found in B. pertussis is in the range of those found in other bacteria for which global transcriptome landscape was determined, with 18 (0.46%), 23 (0.48%), and 83 (1.41%) leaderless mRNAs identified in Helicobacter pylori, Salmonella typhimurium, and Klebsiella pneumonia, respectively, but is far lower than the over 500 (>12.4%) leaderless transcripts found in Mycobacterium tuberculosis [43,46-48]. Intriguingly, many of the 35 B. pertussis leaderless transcripts code for putative regulatory proteins, which suggests a highly specific regulatory role of these genes in B. pertussis. The transcriptomic landscape of Bordetella pertussis show multiple antisense transcript shaping potential regulatory effect between close related genes. A total of 37 5′-overlapping divergent head-to-head and, among these unusual transcript structures, the example of a TetR-family transcriptional regulator (BP1202) stands out in particular, since the entire gene overlaps with the 5′ UTR of the convergent transcript coding for the two uncharacterized proteins BP1203 and BP1204 [49]. This transcriptional organization corresponds to the definition of an excludon as proposed by Wurtzel et al. and might therefore indicate that BP1203 and BP1204 encode products which functions might be opposite to the function of the transcriptional regulator TetR [50]. 89 3′-overlapping convergent tail-to-tail transcripts were also found, sometimes extending well beyond the stop codon up to several hundred nucleotides into the antisense region of the neighboring gene. The overlapping of bvgAS and bvgR transcripts arises a potential new level of complexity in the regulation of the master two-component system of Bordetella pertussis [51,52]. Indeed, this configuration implies a potential cross-regulation of bvgAS and bvgR transcripts. The 3′UTR overlapping of two flanking genes has already been seen in different bacteria, and has been suggested to induce the decay of one of the mRNA depending on sigma factor expression [53-56]. One of the major findings of this study was that a large number of transcripts arise from the invading transcription driven by IS elements. Transcription issued from internal promoters, Pin and Pout, or newly formed Phyb promoters can extend into the neighboring genomic regions and affect the gene expression as they can initiate the transcription of the downstream gene, as shown in Bordetella pertussis and other prokaryotes, including pathogens [26,29,32]. However, our primary transcriptome analysis reveals new types of IS-driven RNAs either transcribed within intergenic regions or antisense to their flanking gene(s). Interestingly, the new candidate RNAs transcribed antisense to neighboring gene(s) are highly prevalent over those affecting sense genes. Antisense RNA originating from IS elements could have a regulatory function for those genes, probably by inducing degradation of the mRNA as already shown with antisense small RNA transcribed at the 3′ of a gene [57]. Genome rearrangements mediated by IS481 are considered as one of the driving forces of the B. pertussis adaptation to human respiratory tract and vaccine-induced immunity. The different localization and number of those elements in Bordetella pertussis strains show strain-specific transcripts with a potential regulatory function validated by RT-PCR. Thus, this evidence could explain some gene expression alterations associated with IS481 even without insertion in the gene sequence. This could be the case of some antigens used in acellular vaccine such as fim3 and fhaB gene, which are altered in Bpe280 strain after 12 in vitro passages on agar plate and located near an IS481 element [58]. More importantly, some of the resulting gene reshufflings appeared to be conserved through positive selection indicating that some of the IS481-induced changes in gene expression are beneficial to these isolates. We also found that transcription levels driven by IS481 Pout promoters can be variable depending on the nature of the locus the IS element is inserted into, but that conserved loci drive similar Pout transcriptional levels in different strains. Thus, the exceptionally high number of IS481 elements contributes to B. pertussis evolution not only by gene rearrangements but also by alteration of the transcriptional landscape. As many different IS show some outgoing transcription from those promoters, this global transcriptional impact can extend to other prokaryotic pathogen containing high numerous IS, as seen in Shigella flexneri strain 2457T and Yersinia pestis strain CO92 [59,60]. The regulatory functions of the antisense RNAs originating from IS elements will be further investigated.

Material and methods

Bacterial strain and oligonucleotides

Bordetella pertussis BPSM, a streptomycin-resistant derivative of Tohama I was grown as previously described at 37°C in modified Stainer-Scholte medium, containing 100 μg/ml streptomycin (Sigma Chemicals) [33,52]. Cells were harvested at exponential phase (OD600 nm = 2.1) by adding 2 ml of 5:95 (v:v) phenol/ethanol to 8 ml of culture medium. After centrifugation for 8 min at 2800 × g the pellets were stored at −80°C until further use. Primers (IDT DNA Technologies) were designed using FastPCR and sequence specificity were checked using Blast [61,62].

RNA-seq library preparation and Illumina sequencing

Bacterial pellets were resuspended in 400 µl of 1 mg/ml Lysozyme (Sigma Aldrich). Total RNA was extracted with the TRI Reagent Kit (Ambion) according to the manufacturer's protocols. Ten µg of total RNA were first DNase treated using DNase I (Sigma Aldrich) for 10 min at room temperature and the ribosomal RNA was then depleted using Ribo-Zero rRNA removal kit (Epicentre) according to manufacturer instructions. Quantity and quality assessment were checked using the Nanodrop 2000 and the Bioanalyzer 2100 (Agilent Technologies), respectively. Sequencing libraries were prepared using TruSeq stranded total RNA library prep kit (Illumina) and the sequencing run was performed on a HiSeq® 2500 sequencing system (Illumina).

Differential RNA-seq libraries preparation and Ion Torrent PGM sequencing

Five µg of DNase treated total RNA were treated with 1U of Terminator™ 5′phosphate-dependant exonuclease (TEX) (Epicentre) for 60 min at 30°C. Reaction was stopped by adding 1 µl of EDTA 100 mM pH 8 and purified by an organic extraction. An equal amount of treated (TEX plus) and untreated (TEX minus) RNA with the Terminator™ 5′phosphate-dependant exonuclease were incubated with 1U of Tobacco Acid Pyrophosphatase (Epicentre) for 60 min at 37°C. cDNA libraries were constructed with 500 ng of TEX plus or TEX minus RNA using the Ion Total RNA-seq Kit v2 (Life Technologies), following the manufacturer's recommendations and purified twice with 1.8 volume of Agencourt AMPure (Beckman Coulter). Emulsion PCR, enrichment and sequencing were made using the Ion PGM™ template OT2 400 Kit and the Ion PGM™ sequencing 400 Kit with 12 pM of each library. Enriched beads were sequenced on an Ion Torrent PGM machine using a 318 v2 chip.

Bioinformatics analysis of dRNA-seq data

The four independent replicates of TEX-treated and untreated library pairs were quality controlled using fastqc, reads were quality trimmed using fastq_quality_trimmer from the fastx-toolkit, with a phred cutoff of 22, reads with length below 16 bases after trimming were discarded. The libraries were mapped on the BP reference genome (NC_002929.2) using the short read aligner Segemehl with default settings [63]. For all libraries only the uniquely mapped reads were further considered. All four TEX treated-untreated library pairs were analyzed with TSSAR demanding a minimal peak size of 2. For each genomic position the relative enrichment of signal intensity in the TEX treated library in the single analysis were merged applying Fisher's combined probability test and corrected for multiple testing with the Benjamini & Hochberg method [34,64,65]. Positions with an enrichment in the TEX treated library in comparison to the untreated library based on statistical significance level of 0.1 were denominated as transcription start sites (TSS). Finally, consecutive TSS positions separated by less than 10 nt were joined and represented by the most prominent peak.

5′race using PGM Ion Torrent sequencing technology

Starting sites were determined by 5′RACE for NGS sequencing as described by Beauregard et al [66]. PCR was performed with a first primer specific for the adaptor sequence and a second primer specific for the target cDNA (see Table in supplementary data part 1). A and P1 sequences for PGM sequencing were ligated using the Ion Plus Fragment Library Kit, following manufacturer's recommendations (Life Technology). PCR products were purified with 1.8 volume of Agencourt AMPure (Beckman Coulter) and sequenced on a 314 chip using Hi-Q kits for PGM Ion Torrent machine (Life Technology). Reads trimming, filtering and counting to determine transcript start was as described in supplementary data part 1.

Operonic structure determination and novel transcripts annotation

For the annotation of transcription units, mapped reads from all four TEX minus libraries from the dRNA-seq experiments were merged. The position wise read coverage were calculated using bedtools genomecov [67]. Since strand-specificity imposes a problem in current NGS protocols, we developed a simple yet effective approach to detect and correct for the fraction of anti-sense shadow and predicted transcripts architectures by calculation taking care of TSS, transcription terminators, annotated genes and observed transcription (detailed in supplemental material part 6) [68].
  64 in total

Review 1.  Bordetella pertussis, molecular pathogenesis under multiple aspects.

Authors:  C Locht; R Antoine; F Jacob-Dubuisson
Journal:  Curr Opin Microbiol       Date:  2001-02       Impact factor: 7.934

2.  Comparative analysis of the genome sequences of Bordetella pertussis, Bordetella parapertussis and Bordetella bronchiseptica.

Authors:  Julian Parkhill; Mohammed Sebaihia; Andrew Preston; Lee D Murphy; Nicholas Thomson; David E Harris; Matthew T G Holden; Carol M Churcher; Stephen D Bentley; Karen L Mungall; Ana M Cerdeño-Tárraga; Louise Temple; Keith James; Barbara Harris; Michael A Quail; Mark Achtman; Rebecca Atkin; Steven Baker; David Basham; Nathalie Bason; Inna Cherevach; Tracey Chillingworth; Matthew Collins; Anne Cronin; Paul Davis; Jonathan Doggett; Theresa Feltwell; Arlette Goble; Nancy Hamlin; Heidi Hauser; Simon Holroyd; Kay Jagels; Sampsa Leather; Sharon Moule; Halina Norberczak; Susan O'Neil; Doug Ormond; Claire Price; Ester Rabbinowitsch; Simon Rutter; Mandy Sanders; David Saunders; Katherine Seeger; Sarah Sharp; Mark Simmonds; Jason Skelton; Robert Squares; Steven Squares; Kim Stevens; Louise Unwin; Sally Whitehead; Bart G Barrell; Duncan J Maskell
Journal:  Nat Genet       Date:  2003-08-10       Impact factor: 38.330

Review 3.  Modes and modulations of antibiotic resistance gene expression.

Authors:  Florence Depardieu; Isabelle Podglajen; Roland Leclercq; Ekkehard Collatz; Patrice Courvalin
Journal:  Clin Microbiol Rev       Date:  2007-01       Impact factor: 26.132

4.  Pertactin-negative variants of Bordetella pertussis in the United States.

Authors:  Anne Marie Queenan; Pamela K Cassiday; Alan Evangelista
Journal:  N Engl J Med       Date:  2013-02-07       Impact factor: 91.245

5.  Molecular characterization of catalase from Bordetella pertussis: identification of the katA promoter in an upstream insertion sequence.

Authors:  D DeShazer; G E Wood; R L Friedman
Journal:  Mol Microbiol       Date:  1994-10       Impact factor: 3.501

6.  Translational control of IS10 transposition.

Authors:  R W Simons; N Kleckner
Journal:  Cell       Date:  1983-09       Impact factor: 41.582

Review 7.  Pertussis: a reemerging infection.

Authors:  Jonathan M Kline; William D Lewis; Eleanor A Smith; Lloyd R Tracy; Sarah K Moerschel
Journal:  Am Fam Physician       Date:  2013-10-15       Impact factor: 3.292

8.  tRNA as a positive regulator of transcription antitermination in B. subtilis.

Authors:  F J Grundy; T M Henkin
Journal:  Cell       Date:  1993-08-13       Impact factor: 41.582

9.  Comparative genomics of the classical Bordetella subspecies: the evolution and exchange of virulence-associated diversity amongst closely related pathogens.

Authors:  Jihye Park; Ying Zhang; Anne M Buboltz; Xuqing Zhang; Stephan C Schuster; Umesh Ahuja; Minghsun Liu; Jeff F Miller; Mohammed Sebaihia; Stephen D Bentley; Julian Parkhill; Eric T Harvill
Journal:  BMC Genomics       Date:  2012-10-10       Impact factor: 3.969

10.  Asymptomatic transmission and the resurgence of Bordetella pertussis.

Authors:  Benjamin M Althouse; Samuel V Scarpino
Journal:  BMC Med       Date:  2015-06-24       Impact factor: 8.775

View more
  17 in total

1.  Radical genome remodelling accompanied the emergence of a novel host-restricted bacterial pathogen.

Authors:  Gonzalo Yebra; Andreas F Haag; Maan M Neamah; Bryan A Wee; Emily J Richardson; Pilar Horcajo; Sander Granneman; María Ángeles Tormo-Más; Ricardo de la Fuente; J Ross Fitzgerald; José R Penadés
Journal:  PLoS Pathog       Date:  2021-05-20       Impact factor: 7.464

2.  Signal transduction-dependent small regulatory RNA is involved in glutamate metabolism of the human pathogen Bordetella pertussis.

Authors:  Kristina Keidel; Fabian Amman; Ilona Bibova; Jakub Drzmisek; Vladimir Benes; David Hot; Branislav Vecerek
Journal:  RNA       Date:  2018-08-10       Impact factor: 4.942

3.  A high resolution A-to-I editing map in the mouse identifies editing events controlled by pre-mRNA splicing.

Authors:  Konstantin Licht; Utkarsh Kapoor; Fabian Amman; Ernesto Picardi; David Martin; Prajakta Bajad; Michael F Jantsch
Journal:  Genome Res       Date:  2019-08-19       Impact factor: 9.043

4.  iRAPs curb antisense transcription in E. coli.

Authors:  Andrés Magán; Fabian Amman; Fatinah El-Isa; Natascha Hartl; Ilya Shamovsky; Evgeny Nudler; Renée Schroeder; Nadezda Sedlyarova
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

5.  Comparative genomics of whole-cell pertussis vaccine strains from India.

Authors:  Shweta Alai; Vikas C Ghattargi; Manish Gautam; Krunal Patel; Shrikant P Pawar; Dhiraj P Dhotre; Umesh Shaligram; Sunil Gairola
Journal:  BMC Genomics       Date:  2020-05-07       Impact factor: 3.969

6.  Combined RNAseq and ChIPseq Analyses of the BvgA Virulence Regulator of Bordetella pertussis.

Authors:  Loïc Coutte; Rudy Antoine; Stephanie Slupek; Luis Solans; Julien Derop; Amelie Bonnefond; David Hot; Camille Locht
Journal:  mSystems       Date:  2020-05-19       Impact factor: 6.496

7.  Comparative Omics Analysis of Historic and Recent Isolates of Bordetella pertussis and Effects of Genome Rearrangements on Evolution.

Authors:  Ana Dienstbier; Fabian Amman; Denisa Petráčková; Daniel Štipl; Jan Čapek; Jana Zavadilová; Kateřina Fabiánová; Jakub Držmíšek; Dilip Kumar; Mark Wildung; Derek Pouchnik; Branislav Večerek
Journal:  Emerg Infect Dis       Date:  2021-01       Impact factor: 6.883

8.  A Mutation Upstream of the rplN-rpsD Ribosomal Operon Downregulates Bordetella pertussis Virulence Factor Production without Compromising Bacterial Survival within Human Macrophages.

Authors:  Jakub Novák; David Jurnečka; Irena Linhartová; Jana Holubová; Ondřej Staněk; Daniel Štipl; Ana Dienstbier; Branislav Večerek; Nayara Azevedo; Jan Provazník; Vladimír Beneš; Peter Šebo
Journal:  mSystems       Date:  2020-12-08       Impact factor: 6.496

9.  Identification of RNA 3´ ends and termination sites in Haloferax volcanii.

Authors:  Sarah J Berkemer; Lisa-Katharina Maier; Fabian Amman; Stephan H Bernhart; Julia Wörtz; Pascal Märkle; Friedhelm Pfeiffer; Peter F Stadler; Anita Marchfelder
Journal:  RNA Biol       Date:  2020-02-10       Impact factor: 4.652

10.  A Bordetella pertussis MgtC homolog plays a role in the intracellular survival.

Authors:  Juan Hilario Cafiero; Yanina Andrea Lamberti; Kristin Surmann; Branislav Vecerek; Maria Eugenia Rodriguez
Journal:  PLoS One       Date:  2018-08-30       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.