| Literature DB >> 21283790 |
Azeem Siddique1, Nicolas Buisine, Ronald Chalmers.
Abstract
Neisseria meningitidis is the primary causative agent of bacterial meningitis. The genome is rich in repetitive DNA and almost 2% is occupied by a diminutive transposon called the Correia element. Here we report a bioinformatic analysis defining eight subtypes of the element with four distinct types of ends. Transcriptional analysis, using PCR and a lacZ reporter system, revealed that two ends in particular encode strong promoters. The activity of the strongest promoter is dictated by a recurrent polymorphism (Y128) at the right end of the element. We highlight examples of elements that appear to drive transcription of adjacent genes and others that may express small non-coding RNAs. Pair-wise comparisons between three meningococcal genomes revealed that no more than two-thirds of Correia elements maintain their subtype at any particular locus. This is due to recombinational class switching between elements in a single strain. Upon switching subtype, a new allele is available to spread through the population by natural transformation. This process may represent a hitherto unrecognized mechanism for phase variation in the meningococcus. We conclude that the strain-to-strain variability of the Correia elements, and the large number of strong promoters encoded by them, allows for potentially widespread effects within the population as a whole. By defining the strength of the promoters encoded by the eight subtypes of Correia ends, we provide a resource that allows the transcriptional effects of a particular subtype at a given locus to be predicted.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21283790 PMCID: PMC3024310 DOI: 10.1371/journal.pgen.1001277
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1The eight classes of the consensus CE.
(A) Nucleotide sequence alignment of the 8 CE consensus subtypes. Dashes within the alignment indicate gaps. Asterisks mark the positions of the R and Y nucleotides where the consensus sequence is polymorphic. “R” represents either A or G, with adenosine more frequently present at this position. “Y” represents T, or more frequently C. Nucleotides are colored according to their identity except for the two polymorphisms and the flanking TA dinucleotide repeats, which are black. The CE −10 and −35 transcriptional start sequences and the equivalent sequences for the consensus E. coli σ70 promoter are indicated below the alignment. The total number of elements of each subclass within the N. meningitidis Z2491, MC58 and FAM18 genomes is indicated beside the alignment. These numbers represent approximately half of the total number of elements present in the 3 genomes. (B) Alignment of the α and β TIRs from the left and right ends of the CE. Asterisks mark the 2 nucleotides within the inverted repeat that differ between the left and right ends. The CE −10 and −35 transcriptional start sequences are underlined. For comparison purposes, the −10 and −35 transcriptional start sequences of the consensus E. coli σ70 promoter are provided below the alignment. Also given are the mutated −10 and −35 sequences constructed to replace the wild-type sequences in the Correia end-lacZ reporter plasmids in Table 1. (C) A graphical illustration of sequence variation, relative to the consensus sequence, within the set of 121 α-α CEs.
β-galactosidase reporter assays for consensus Correia ends.
| Plasmid | Description | Miller units | Standard error | Fold difference |
| - | 0.54 | 0.11 | - | |
| pRC746 | vector control | 7.4 | 0.28 | 1.0 |
| pRC759 | α-left | 540 | 5.0 | 73 |
| pRC748 | α-right | 75 | 7.2 | 10 |
| pRC750 | α-rightY128T | 670 | 22 | 90 |
| pRC761 | α-rightY128T; −10 box altered | 19 | 0.98 | 2.6 |
| pRC779 | α-rightY128T; −35 box altered | 46 | 1.9 | 6.3 |
| pRC760 | β-left | 97 | 8.8 | 13 |
| pRC749 | β-right | 86 | 6.1 | 12 |
| pRC751 | β-rightY128T | 140 | 7.4 | 19 |
| pRC762 | β-rightY128T; −10 box altered | 51 | 4.3 | 6.8 |
| pRC781 | β-rightY128T; −35 box altered | 89 | 7.8 | 12 |
The consensus CE ends shown in Figure 1B (along with 4 nucleotides immediately upstream of the Correia repeat and the downstream flanking dinucleotide “AT”) were cloned into a low-copy lacZ reporter plasmid (pRC746). Details are provided in Table S1. The E. coli host strain is MC4100.
β-galactosidase activity is measured in Miller units. The following formula was used to calculate Miller units: [(spectrophotometer reading at 420 nm wavelength)×1000]/[(spectrophotometer reading at 600 nm wavelength)×(cell volume (mls))×(time (mins))].
Based on results from 3 independent experiments.
Relative to vector control (pRC746).
Unless designated otherwise, the base at position 128 is cytosine.
Altered −10 and −35 box sequences are shown in Figure 1B.
β-galactosidase reporter assays for chromosomally integrated Correia elements in log phase cultures.
| Correia element | wild-type strain | Fold difference | ||||
| ← → | ←reverse | →forward | ←reverse | →forward | ←reverse | →forward |
| αL - αR | 120±5.5 | 20±0.8 | 140±4.2 | 20±0.6 | 0.8 | 1.0 |
| αL - αR′ | 200±7.1 | 21±1.3 | 200±6.4 | 19±0.3 | 1.0 | 1.1 |
| αLR52G - αRY128T | 97±1.0 | 470±24 | 110±5.4 | 590±7.3 | 0.9 | 0.8 |
| βL - βR | 39±5.3 | 24±0.2 | 38±0.5 | 29±0.3 | 1.1 | 0.8 |
| βL - βR′ | 44±0.8 | 31±0.6 | 45±0.7 | 31±0.8 | 1.0 | 1.0 |
| βLR52G- βRY128T | 26±0.3 | 55±4.4 | 31±0.3 | 80±1.1 | 0.8 | 0.7 |
| αL - βR | 170±7.9 | 67±1.5 | 120±5.4 | 73±2.4 | 1.4 | 0.9 |
| αL - βR′ | 130±0.9 | 62±1.0 | 140±12 | 52±1.6 | 0.9 | 1.2 |
| βL - αR | 32±1.2 | 24±0.7 | 36±0.4 | 24±0.3 | 0.9 | 1.0 |
| βL - αR′ | 75±0.8 | 31±1.9 | 72±0.8 | 27±0.8 | 1.0 | 1.1 |
| vector | 0.6±0.05 | 0.6±0.01 | 1.0 | |||
Unless designated otherwise, the bases at positions 52 and 128 of the CE are adenine and cytosine, respectively.
E. coli strain NR289.
IHF+/IHF−.
The units of measurement, Miller units, are calculated using the formula provided in Table 1. For each construct, the mean Miller unit measurement and the standard error from at least three independent experiments are provided.
β-galactosidase reporter assays for chromosomally integrated Correia elements in stationary phase cultures.
| Correia element | wild-type strain | Fold difference | ||||
| ← → | ←reverse | →forward | ←reverse | →forward | ←reverse | →forward |
| αL - αR | 170±3.2 | 24±1.6 | 170±5.5 | 26±1.7 | 1.0 | 0.9 |
| αL - αR′ | 280±2.0 | 27±1.3 | 280±4.1 | 25±1.9 | 1.0 | 1.1 |
| αLR51G - αRY128T | 130±2.1 | 660±26 | 140±4.2 | 670±41 | 0.9 | 1.0 |
| βL - βR | 41±2.3 | 27±1.7 | 47±3.0 | 32±2.6 | 0.9 | 0.8 |
| βL - βR′ | 57±2.5 | 32±2.7 | 57±1.8 | 34±3.1 | 1.0 | 0.9 |
| βLR52G- βRY128T | 36±2.1 | 80±9.6 | 41±3.0 | 100±11 | 0.9 | 0.8 |
| vector | 0.6±0.02 | 0.5±0.02 | 1.2 | |||
Unless designated otherwise, the bases at positions 52 and 128 of the CE are adenine and cytosine, respectively.
E. coli strain NR289.
IHF+/IHF−.
The units of measurement, Miller units, are calculated using the formula provided in Table 1. For each construct, the mean Miller unit measurement and the standard error from at least three independent experiments are provided.
Figure 2Transcript mapping by primer extension.
(A) Primer extension analysis was performed with 20 µg of RNA and 5 pmoles of radiolabeled primer, as described in the Materials and Methods. RNA was prepared from E. coli MC4100 strains harboring lacZ reporter plasmid pRS415 [52] and derivatives containing the CEs indicated at the top of the gel. Reaction products were electrophoresed on a denaturing 8% polyacrylamide gel. The primary primer extension products and free primer are indicated beside the gel. (B) Nucleotide sequence of the Correia α-rightY128T repeat and downstream sequences highlighting the transcriptional start points observed in (A).
Figure 3Transcriptional analysis of three loci containing CEs.
(A) Schematic diagram of the region between the NMA0073 and NMA0074 ORFs showing the location of primers I, II and III (sequences given in Table S2) used in the accompanying PCR analysis. The solid black arrows denote genes. The hatched box represents a strong predicted transcriptional terminator, with the arrow(s) indicating its polarity: in this case, the terminator is predicted to function in both directions. The CE inverted repeats are indicated by grey arrowheads and the direction of transcription from the α-rightY128T repeat is indicated by a bent arrow. Primer I was used for reverse transcription (RT). The subsequent PCR step was performed with the indicated pairs of primers and analyzed on an ethidium bromide-stained 3% Metaphor agarose TAE gel (right panels). Results from the RT-PCR are shown in the middle panel and those from the genomic DNA PCR are shown on the far right. The latter provides molecular weight standards for the RT-PCR products and a control for the efficiency of the various primer pairs. (B) The region between NMA0057 and NMA0059 is shown. Annotations are as in (A). The inverted repeats of an ATR element are depicted as black arrowheads. (C) The region between NMA0530 and NMA0531 is shown. The short NMA0530A ORF is not likely to code for a protein and has been omitted from the schematic. Annotations are as in (A).
Figure 4Distribution of the Correia α-rightY128T repeats in the N. meningitidis Z2491, MC58, and FAM18 genomes.
Horizontal black lines represent the N. meningitidis genomes. Colored triangles represent the α-rightY128T promoters, of which there are almost 40 in each genome. The letters above or below each triangle refer to the dinucleotides flanking the TATA sequence at the end of each repeat. Correia ends represented above and below the line drive transcription to the right and left, respectively. The grey lines between genomes connect Correia α-rightY128T repeats inserted at homologous positions in their respective genomes. The synteny of loci was established first by using a 20 bp sequence tag, and then by examining the ORF landscape surrounding each element (see Figure S1). The diagram is drawn to scale.
Correia α-rightY128T repeat distribution in the genome of N. meningitidis serogroup A strain Z2491.
| Nucleotide sequence of the Correia element end (29 bp) | Adjacent ORF or other feature (downstream of Correia promoter) | Distance of nearest ORF to Correia element (bp) | Direction of transcription | Description of putative gene product | |
| 1 | ATR repeat followed by NMA0059 | 176 | convergent | modification methylase (probable pseudogene) | |
| 2 | NMA0074 ( | 42 | tandem | tRNA uridine 5-carboxymethyl aminomethyl modification enzyme | |
| 3 | 69 | tandem | threonine tRNA & putative integral membrane protein | ||
| 4 | NMA0277 | 155 | convergent | putative lipoprotein | |
| 5 | RS-dRS3 repeat array | ||||
| 6 | NMA0381 ( | 42 | convergent | putative transcriptional activator | |
| 7 | single RS-dRS3 repeat followed by NMA0402 ( | 218 | tandem | tRNA pseudouridine synthase | |
| 8 | NMA0412 | 79 | convergent | putative glucokinase | |
| 9 | NMA0434 | 30 | tandem | hypothetical inner membrane protein | |
| 10 | RS-dRS3 repeat array & another CE | ||||
| 11 | NMA0495 ( | 107 | tandem | 50S ribosomal protein L31 | |
| 12 | NMA0530A & NMA0531 | 259 | convergent | possible pseudogene & putative transposase | |
| 13 | NMA0590 | 282 | tandem | hypothetical protein | |
| 14 | ATR repeats | ||||
| 15 | NMA0788 | 127 | convergent | putative secreted toxin (probable pseudogene) | |
| 16 | RS-dRS3 repeat array | ||||
| 17 | NMA0823 | 28 | convergent | putative transposase (probable pseudogene) | |
| 18 | RS-dRS3 repeat array | ||||
| 19 | RS-dRS3 repeat array | ||||
| 20 | NMA0940 | 118 | tandem | putative membrane protein | |
| 21 | inserted within another CE; nearest ORF is NMA0975 | 274 | tandem | putative integral membrane protein | |
| 22 | NMA0975 | 175 | tandem | putative integral membrane protein | |
| 23 | NMA1079 | 55 | tandem | hypothetical protein | |
| 24 | NMA1111 | CE end lies within ORF | convergent | hypothetical protein | |
| 25 | RS-dRS3 repeat array | ||||
| 26 | NMA1331 | 105 | tandem | ABC transporter ATP-binding protein (probable pseudogene) | |
| 27 | NMA1411 | 95 | tandem | hypothetical protein | |
| 28 | NMA1414 | 131 | tandem | putative periplasmic protein | |
| 29 | NMA1588 ( | 41 | convergent | tRNA pseudouridine synthase B | |
| 30 | NMA1626 | 124 | convergent | putative RTX-family exoprotein | |
| 31 | NMA1696 ( | 304 | tandem | succinate semialdehyde dehydrogenase | |
| 32 | NMA1808 ( | 55 | tandem | O-succinylhomoserine sulfhydrolase | |
| 33 | NMA1935 & NMA1934 ( | 24 | convergent | hypothetical protein & glycine dehydrogenase | |
| 34 | NMA1954 | 42 | tandem | putative integral membrane protein | |
| 35 | NMA1960 & NMA1959 ( | CE end lies within ORF | convergent | hypothetical protein & α 1,2 N-acetylglucosamine transferase | |
| 36 | NMA2029 | 79 | convergent | hypothetical protein | |
| 37 | single RS-dRS3 repeat followed by NMA2051 | 283 | tandem | conserved hypothetical protein | |
| 38 | NMA2108 ( | 138 | convergent | oxygen-independent co-proporphyrinogen III oxidase | |
| 39 | NMA2205 | 108 | tandem | hypothetical protein |
Sequences are listed in order of ascending genome coordinate (coordinates not shown).
“NMA” refers to open reading frames (ORFs) in N. meningitidis Z2491; the numbers indicate the physical order of the ORFs within the genome.
Direction of transcription from the Correia α-rightY128T end relative to that of the nearest ORF.
Locus where a rho-independent transcriptional terminator is present no more than 300 bp from a Correia α-rightY128T.
The thr tRNA gene and NMA0530A, NMA1935 and NMA1960 ORFs are very small (75 bp, 84 bp, 147 bp, and 171 bp, respectively), so the name and description of the next ORF is given as well.