| Literature DB >> 35311568 |
Cornelia Große1, Jan Grau2, Ivo Große2, Dietrich H Nies1.
Abstract
The genome of the metal-resistant, hydrogen-oxidizing bacterium Cupriavidus metallidurans contains a large number of horizontally acquired plasmids and genomic islands that were integrated into its chromosome or chromid. For the C. metallidurans CH34 wild-type strain growing under nonchallenging conditions, 5,763 transcriptional starting sequences (TSSs) were determined. Using a custom-built motif discovery software based on hidden Markov models, patterns upstream of the TSSs were identified. The pattern TTGACA, -35.6 ± 1.6 bp upstream of the TSSs, in combination with a TATAAT sequence 15.8 ± 1.4 bp upstream occurred frequently, especially upstream of the TSSs for 48 housekeeping genes, and these were assigned to promoters used by RNA polymerase containing the main housekeeping sigma factor RpoD. From patterns upstream of the housekeeping genes, a score for RpoD-dependent promoters in C. metallidurans was derived and applied to all 5,763 TSSs. Among these, 2,572 TSSs could be associated with RpoD with high probability, 373 with low probability, and 2,818 with no probability. In a detailed analysis of horizontally acquired genes involved in metal resistance and not involved in this process, the TSSs responsible for the expression of these genes under nonchallenging conditions were assigned to RpoD- or non-RpoD-dependent promoters. RpoD-dependent promoters occurred frequently in horizontally acquired metal resistance and other determinants, which should allow their initial expression in a new host. However, other sigma factors and sense/antisense effects also contribute-maybe to mold in subsequent adaptation steps the assimilated gene into the regulatory network of the cell. IMPORTANCE In their natural environment, bacteria are constantly acquiring genes by horizontal gene transfer. To be of any benefit, these genes should be expressed. We show here that the main housekeeping sigma factor RpoD plays an important role in the expression of horizontally acquired genes in the metal-resistant hydrogen-oxidizing bacterium C. metallidurans. By conservation of the RpoD recognition consensus sequence, a newly arriving gene has a high probability to be expressed in the new host cell. In addition to integrons and genes travelling together with that for their sigma factor, conservation of the RpoD consensus sequence may be an important contributor to the overall evolutionary success of horizontal gene transfer in bacteria. Using C. metallidurans as an example, this publication sheds some light on the fate and function of horizontally acquired genes in bacteria.Entities:
Keywords: Cupriavidus metallidurans; RpoD; metal resistance
Mesh:
Substances:
Year: 2022 PMID: 35311568 PMCID: PMC9045368 DOI: 10.1128/spectrum.00121-22
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
Distribution of the scores of the identified transcriptional start sites
| TSS score | No. of TSSs | % of TSSs identified |
|---|---|---|
| >10,000 | 19 | 0.33 |
| 3,000–9,999 | 76 | 1.3 |
| 1,000–2,999 | 292 | 5.1 |
| 300–999 | 481 | 8.3 |
| 100–299 | 1,125 | 20 |
| 30–99.9 | 1,900 | 33 |
| <30 | 1,870 | 32 |
A total of 5,765 transcriptional start sites (TSSs) were identified in RNA isolated from C. metallidurans CH34 cells cultivated under nonchallenging conditions in Tris-buffered mineral salts medium in three biological repeats. Two TSSs could not be assigned to the C. metallidurans genome. The scores and positions of the remaining 5,763 TSSs are listed in Table S1. A cutoff for the TSS score of 10 was used. The score was defined as the ratio of RRS_TSS to RRS_control, with RRS defined as the number of reads n at a position i and orientation o divided by the total number of reads N, multiplied by 106.
FIG 1Overview of the motifs discovered in the −10 and −35 regions. For each motif, a sequence logo and a histogram of the positions of motif matches are shown. In the −10 region, 5 motifs were discovered (from the top to the bottom, components 0,1 to 0,5), of which one (component 0,1 at the top) covered the majority of TSSs and showed strict positioning. Only promoters of TSSs containing this motif were considered when searching for a second, upstream motif. Of the 5 motifs discovered, only two components (C = 1,1 and C = 2,1) were clearly positioned and showed similarities to known −35 consensus sequences.
DNA sequences upstream of transcriptional start sites corresponding to −35 model number 1 or number 2 combined with −10 model number 1 upstream of housekeeping genes
| TSS rank | Start at −35 | Sequence for −35 model | Start at −10 | Sequence for −10 model | Locus tag | Gene | RpoD score |
|---|---|---|---|---|---|---|---|
| 2,376 | −34 |
| −13 |
| Rmet_R0003 |
| 13.6 |
| 2 | −35 |
| −15 |
| Rmet_6415 |
| 12.6 |
| 53 | −34 |
| −13 |
| Rmet_0722 |
| 10.6 |
|
| −36 |
| −14 |
| Rmet_0749 |
| 10.9 |
| 1,211 | −37 |
| −15 |
| Rmet_0920 |
| 11.4 |
| 886 | −36 |
| −14 |
| Rmet_R0015 |
| 12.4 |
| 714 | −37 |
| −16 |
| Rmet_1161 |
| 10.8 |
| 2,673 | −36 |
| −13 |
| Rmet_1162 |
| 11.4 |
|
| −35 |
| −14 |
| Rmet_1166 |
| 10.6 |
|
| −39 |
| −15 |
| Rmet_1975 |
| 6.9 |
| 49 | −35 |
| −13 |
| Rmet_1979 |
| 12.2 |
| 1,413 | −36 |
| −15 |
| Rmet_2134 |
| 12.3 |
| 14 | −35 |
| −15 |
| Rmet_2135 |
| 9.1 |
| 31 | −35 |
| −13 |
| Rmet_2137 |
| 12.2 |
| 7 | −35 |
| −13 |
| Rmet_2870 |
| 14.2 |
| 38 | −36 |
| −14 |
| Rmet_2904 |
| 14.4 |
| 33 | −35 |
| −14 |
| Rmet_3106 |
| 12.6 |
| 1,622 | −39 |
| −15 |
| Rmet_3291 |
| 4.9 |
| 60 | −34 |
| −14 |
| Rmet_3307 |
| 11.6 |
| 526 | −34 |
| −13 |
| Rmet_3317 |
| 11.6 |
| 219 | −34 |
| −13 |
| Rmet_3327 |
| 11.6 |
| 2,084 | −37 |
| −14 |
| Rmet_3336 |
| 9.9 |
| 11 | −35 |
| −14 |
| Rmet_R0059 |
| 14.6 |
| 2,648 | −36 |
| −13 |
| Rmet_R0060 |
| 8.9 |
| 2,789 | −36 |
| −13 |
| Rmet_R0063 |
| 12.4 |
| 2,182 | −35 |
| −14 |
| Rmet_R0053 |
| 9.6 |
| 1,312 | −35 |
| −11 |
| Rmet_R0063 |
| 9.2 |
| 235 | −34 |
| −13 |
| Rmet_R0064 |
| 12.6 |
| 19 | −35 |
| −13 |
| Rmet_3501 |
| 14.2 |
| 1,108 | −39 |
| −13 |
| Rmet_3501 |
| 1.9 |
| 15 | −35 |
| −14 |
| Rmet_0260 |
| 12.6 |
|
| −33 |
| −13 |
| Rmet_0927 |
| 5.1 |
| 1,708 | −33 |
| −13 |
| Rmet_2039 |
| 6.1 |
| 389 | −36 |
| −14 |
| Rmet_2188 |
| 11.4 |
| 2,159 | −34 |
| −14 |
| Rmet_2188 |
| 7.6 |
| 1,336 | −34 |
| −12 |
| Rmet_2621 |
| 12.2 |
| 2,410 | −34 |
| −13 |
| Rmet_2623 |
| 10.6 |
|
| −36 |
| −14 |
| Rmet_3227 |
| 11.4 |
|
| −37 |
| −15 |
| Rmet_3230 |
| 9.4 |
| 1,092 | −36 |
| −13 |
| Rmet_3230 |
| 10.9 |
|
| −39 |
| −14 |
| Rmet_2031 |
| 2.4 |
| 391 | −35 |
| −13 |
| Rmet_2192 |
| 13.2 |
| 861 | −36 |
| −15 |
| Rmet_2489 |
| 13.8 |
|
| −38 |
| −13 |
| Rmet_2486 |
| 4.4 |
| 2,042 | −33 |
| −13 |
| Rmet_2895 |
| 7.1 |
| 2,941 | −38 |
| −14 |
| Rmet_3729 |
| 7.4 |
| 1,295 | −36 |
| −15 |
| Rmet_4268 |
| 10.8 |
| 723 | −35 |
| −14 |
| Rmet_5296 |
| 12.6 |
The 3,000 transcriptional start points identified in C. metallidurans CH34 with the highest TSS scores (Table S1), as indicated by the position in the score ranking, were searched with a pattern-finding software: 2,832 were assigned to −35 and −10 models, and 2,132 TSSs with DNA sequences corresponding to −35 model number 1 (rank in standard black letters) or 2 (rank in boldface letters) in combination with −10 model number 1 were further characterized (Table S1). Among these, 48 putative promoter sequences upstream of housekeeping genes were used to develop a score for RpoD consensus motifs in C. metallidurans. The results are indicated in the last row on the right hand. Strong RpoD-dependent promoters according to the RpoD score are on a white field, medium strong one are shaded in light gray, and weak are shaded in medium gray.
Distribution of the scores of the identified transcriptional start sites
A total of 5,763 of the 5,765 TSSs were analyzed for RpoD promoter consensus motifs. “Position” indicates if the position of the −35 motif was between −31 and −39 and the −10 motif between −10 and −18 or not. The result of the RpoD scoring is shown in the next row, followed by the total number of TSSs in this category and the distribution of these TSSs to the components 1,1 or 2,1 or the remaining components, 3,1, 4,1, or 5,1 (Fig. S3). “Category” shows the category of the TSS as indicated in Fig. S1 with the corresponding cell color in case of categories s, m, w, no, and nu; for sba and ba, the cell colors of the (s)-sba and (s)-ba sites in Table S1 are indicated. The percentage of the total number of TTSs is that of 5,763 TSSs, and that of the components is the portion of TSSs of this category. “Wrong, s.d.” means both positions are wrong, but positions −35 and −10 are either both too far upstream or downstream of the TSS, and “all wrong” indicates both are too close together. n.a., analysis of the RpoD score not shown.
FIG 2Map of the cobalt-zinc-cadmium resistance determinant czc on plasmid pMOL30. The map shows the first part of the czc determinants in the indicated regions with NPKM values on one DNA strand (red) or the other direction of transcription (blue). Above are the Rmet locus and gene names, the mean NPKM, and response values (24). TSSs (flags) are indicated with the corresponding TSS score as white with a score of <50, shades of red for strong (>1,000) or orange for medium (50 to 1000) transcription initiation from RpoD promoters (medium or strong RpoD score), and shades of blue if not associated or only weakly associated with the RpoD model. Transcripts (48) and operons (25) are also indicated in the transcript map. TSSs with scores of <50 (white arrows) are shown without TSS scores.
RpoD- and non-RpoD-dependent promoters and possible antisense transcripts involved in expression of metal resistance determinants in C. metallidurans
| RpoD-dependent promoters | Non-RpoD-dependent promoters | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| TSS | TSS score | S | Putative transcript | Position | TSS | TSS score | S | Putative transcript | Position |
|
| |||||||||
| | 26 ± 10 | + | 1,395 bp upstr. Rmet_5984 |
| 17 ± 5 | + | 844 bp upstr. Rmet_5984 | ||
| | 55 ± 23 | − |
| 482 bp upstr. Rmet_5985 |
| 17 ± 4 | + | 834 bp upstr. Rmet_5984 | |
| | 22 ± 5 | + |
| 284 bp upstr. Rmet_5983 |
| 232 ± 22 | + | 643 bp upstr. Rmet_5984 | |
| | 29 ± 5 | − | 1,103 bp upstr. Rmet_5985 |
| 110 ± 27 | + |
| 97 bp upstr. Rmet_5983 | |
| | 1,114 ± 145 | + |
| 53 bp upstr. Rmet_5983 |
| 131 ± 22 | + |
| 31 bp upstr. Rmet_5983 |
|
| 50 ± 12 | − | 1,351 bp upstr. Rmet_5985 |
| 32 ± 12 | + |
| 308 bp upstr. Rmet_5980 | |
|
| 19 ± 2 | − | 4,607 bp upstr. Rmet_5985 |
| 17 ± 5 | − | 5,460 bp upstr. Rmet_5985 | ||
|
| 17 ± 5 | − |
| 47 bp upstr. Rmet_5975 |
| 36 ± 4 | + |
| 1,003 bp upstr. Rmet_5979 |
| | 15 ± 2 | + |
| 528 bp upstr. Rmet_5971 |
| 26 ± 5 | + |
| 37 bp upstr. Rmet_5976 |
|
| 23 ± 4 | − |
| 92 bp upstr. Rmet_5975 | |||||
|
| 89 ± 26 | − | 199 bp upstr. Rmet_5974 | ||||||
|
| 160 ± 35 | − |
| 12 bp upstr. Rmet_5972 | |||||
|
| 24 ± 6 | + |
| 41 bp upstr. Rmet_5970 | |||||
|
| |||||||||
|
| 204 ± 31 | + |
| 23 bp upstr. Rmet_6205 |
| 80 ± 14 | + |
| 966 bp upstr. Rmet_6205 |
|
| 47 ± 8 | + |
| 989 bp upstr. Rmet_6210 |
| 87 ± 17 | + |
| 24 bp upstr. Rmet_6208 |
|
| 32 ± 9 | + |
| 2,582 bp upstr. Rmet_6211 |
| 164 ± 16 | + |
| 763 bp upstr. Rmet_6211 |
|
| |||||||||
| | 743 ± 13 | − |
| 12 bp upstr. Rmet_4594 | |||||
| | 155 ± 52 | + | 474 bp upstr. Rmet_4596 | ||||||
|
| 317 ± 65 | − | 25 bp upstr. Rmet_4599 | ||||||
|
| 27 ± 5 | + | 55 bp upstr. Rmet_4465 |
| 41 ± 9 | + | 118 bp upstr. Rmet_4465 | ||
| | 12,866 ± 1,637 | − | 231 bp upstr. Rmet_4472 |
| 159 ± 30 | + | 8 bp upstr. Rmet_4467 | ||
| | 2,807 ± 1,154 | − |
| 28 bp upstr. Rmet_5320 |
| 22 ± 3 | − | 247 bp upstr. Rmet_5322 | |
| | 530 ± 145 | + | 54 bp upstr. Rmet_5321 |
| 66 ± 35 | − | 72 bp upstr. Rmet_5324 | ||
| | 30 ± 6 | − | 29 bp upstr. Rmet_5322 |
| 45 ± 9 | + | 27 bp upstr. Rmet_5326 | ||
| | 278 ± 19 | + |
| 3 bp upstr. Rmet_5323 |
| 19 ± 2 | − | 1,986 bp upstr. Rmet_5328 | |
| | 16 ± 3 | + |
| 1,215 bp upstr. Rmet_5325 |
| 122 ± 52 | + |
| 18 bp upstr. Rmet_5331 |
|
| 142 ± 16 | − | 449 bp upstr. Rmet_5322 |
| 28 ± 11 | − | 243 bp upstr. Rmet_5330 | ||
|
| 189 ± 76 | − | 56 bp upstr. Rmet_5330 | ||||||
| | 36 ± 4 | + |
| 44 bp upstr. Rmet_5331 | |||||
|
|
| 29 ± 8 | + |
| 850 bp upstr. Rmet_6144 | ||||
|
| |||||||||
|
| 495 ± 50 | + | 181 bp upstr. Rmet_3836 | ||||||
|
| 60 ± 32 | + | 71 bp upstr. Rmet_3836 | ||||||
|
|
| 375 ± 63 | + |
| 111 bp upstr. Rmet_4119 | ||||
|
| 74 ± 6 | + |
| 90 bp upstr. Rmet_4119 | |||||
|
| |||||||||
| | 25 ± 6 | + | 147 bp upstr. Rmet_5672 |
| 35 ± 6 | + | 75 bp upstr. Rmet_5672 | ||
|
| 58 ± 11 | + | 46 bp upstr. Rmet_5672 | ||||||
|
| 94 ± 22 | − | 94 bp upstr. Rmet_5678 | ||||||
|
| 168 ± 42 | − | 12 bp upstr. Rmet_5680 | ||||||
|
| 1,025 ± 113 | − | 0 bp upstr. Rmet_5683 | ||||||
|
| |||||||||
|
| 81 ± 16 | + | 74 bp upstr. Rmet_3013 |
| 70 ± 15 | + | 123 bp upstr. Rmet_3013 | ||
|
| 16 ± 4 | + | anti. | 507 bp upstr. Rmet_3014 |
| 50 ± 9 | − | 113 bp upstr. Rmet_3017 | |
|
| 55 ± 5 | − | 304 bp upstr. Rmet_3016 | ||||||
|
| 22 ± 3 | − | 1,277 bp upstr. Rmet_3016 | ||||||
|
| |||||||||
| | 2,235 ± 160 | + |
| 20 bp upstr. Rmet_5030 |
| 1,233 ± 204 | − | anti. | 24 bp upstr. Rmet_5036 |
| | 284 ± 48 | − | anti. | 30 bp upstr. Rmet_5037 | |||||
|
| |||||||||
| | 133 ± 45 | − |
| 16 bp upstr. Rmet_6133 |
| 51 ± 14 | − |
| 31 bp upstr. Rmet_6133 |
|
| |||||||||
| | 1,256 ± 163 | − |
| 43 bp upstr. Rmet_2302 |
| 29 ± 2 | − | anti. | 3,500 bp upstr. Rmet_2302 |
| | 58 ± 21 | + |
| 2,949 bp upstr. Rmet_2304 | |||||
|
| |||||||||
| | 21 ± 2 | + |
| 2,793 bp upstr. Rmet_5946 |
| 36 ± 10 | − |
| 150 bp upstr. Rmet_5949 |
|
| 21 ± 3 | − | 499 bp upstr. Rmet_5948 |
| 33 ± 10 | − | 2,198 bp upstr. Rmet_5947 | ||
|
| 27 ± 6 | − | 28 bp upstr. Rmet_5947 |
| 47 ± 10 | − | 2,508 bp upstr. Rmet_5947 | ||
|
| 215 ± 13 | + |
| 21 bp upstr. Rmet_5946 |
| 25 ± 2 | − | 2,907 bp upstr. Rmet_5947 | |
|
| 273 ± 29 | + |
| 39 bp upstr. Rmet_5945 | |||||
|
| 39 ± 6 | + |
| 35 bp upstr. Rmet_5944 | |||||
|
| |||||||||
|
| 25 ± 6 | + | 1,441 bp upstr. Rmet_6382 |
| 45 ± 3 | − | 14 bp upstr. Rmet_6120 | ||
|
| 21 ± 6 | + |
| 101 bp upstr. Rmet_6119 |
| 26 ± 8 | − | 128 bp upstr. Rmet_6112 | |
|
| 148 ± 32 | − | 2,875 bp upstr. Rmet_6120 |
| 28 ± 9 | + |
| 56 bp upstr. Rmet_6111 | |
|
| 190 ± 22 | − | 3,960 bp upstr. Rmet_6120 | TSS_191954-4 | 51 ± 12 | − | 755 bp upstr. Rmet_6112 | ||
|
| 21 ± 3 | − | 381 bp upstr. Rmet_6112 |
| 19 ± 2 | + |
| 547 bp upstr. Rmet_6109 | |
|
| 35 ± 12 | + |
| 278 bp upstr. Rmet_6109 |
| 20 ± 4 | + |
| 127 bp upstr. Rmet_6106 |
|
| 69 ± 12 | + |
| 22 bp upstr. Rmet_6106 | |||||
| | 185 ± 14 | − | 2,354 bp upstr. Rmet_6107 | ||||||
|
| |||||||||
| | 25 ± 6 | + |
| 147 bp upstr. Rmet_5672 |
| 32 ± 8 | + | anti. | 118 bp upstr. Rmet_5667 |
|
| 35 ± 6 | + |
| 75 bp upstr. Rmet_5672 | |||||
|
| 58 ± 11 | + |
| 46 bp upstr. Rmet_5672 | |||||
|
|
| 82 ± 13 | − |
| 39 bp upstr. Rmet_3524 | ||||
|
| 1,312 ± 146 | − | 39 bp upstr. Rmet_3526 | ||||||
|
| 26 ± 7 | − | 94 bp upstr. Rmet_3526 | ||||||
|
| |||||||||
| | 149 ± 61 | − |
| 0 bp upstr. Rmet_6203 |
| 15 ± 2 | + | 6,297 bp upstr. Rmet_6204 | |
|
| 300 ± 44 | + | 3,732 bp upstr. Rmet_6204 | ||||||
|
| 16 ± 1 | + | 2,605 bp upstr. Rmet_6204 | ||||||
|
| 61 ± 16 | + | 1,553 bp upstr. Rmet_6204 | ||||||
|
| |||||||||
| | 60 ± 3 | − |
| 0 bp upstr. Rmet_3867 |
| 36 ± 11 | − |
| 931 bp upstr. Rmet_3864 |
|
| |||||||||
| | 164 ± 57 | − |
| 0 bp upstr. Rmet_0333 | |||||
Transcriptional start sites (TSSs) are listed in the vicinity of metal resistance genes with their TSS score and DNA strand (S). TSSs in boldface are on the “+” strand, and those in italic are on the “−” strand. The putative transcript is given with fields shaded gray representing antisense transcripts. The position of the TSS with respect to a gene (Rmet sequence tag) is also provided. All TSSs were sorted into the group of RpoD- and non-RpoD-dependent promoters as outlined in the text. upstr., upstream; anti., antisense.