| Literature DB >> 21264064 |
Yili Chen1, Dinesh C Indurthi, Shawn W Jones, Eleftherios T Papoutsakis.
Abstract
The genus Clostridium includes major human pathogens and species important to cellulose degradation, the carbon cycle, and biotechnology. Small RNAs (sRNAs) are emerging as crucial regulatory molecules in all organisms, but they have not been investigated in clostridia. Research on sRNAs in clostridia is hindered by the absence of a systematic method to identify sRNA candidates, thus delegating clostridial sRNA research to a hit-and-miss process. Thus, we wanted to develop a method to identify potential sRNAs in the Clostridium genus to open up the field of sRNA research in clostridia. Using comparative genomics analyses combined with predictions of rho-independent terminators and promoters, we predicted sRNAs in 21 clostridial genomes: Clostridium acetobutylicum, C. beijerinckii, C. botulinum (eight strains), C. cellulolyticum, C. difficile, C. kluyveri (two strains), C. novyi, C. perfringens (three strains), C. phytofermentans, C. tetani, and C. thermocellum. Although more than one-third of predicted sRNAs have Shine-Dalgarno (SD) sequences, only one-sixth have a start codon downstream of SD sequences; thus, most of the predicted sRNAs are noncoding RNAs. Quantitative reverse transcription-PCR (Q-RT-PCR) and Northern analysis were employed to test the presence of a randomly chosen set of sRNAs in C. acetobutylicum and several C. botulinum strains, leading to the confirmation of a large fraction of the tested sRNAs. We identified a conserved, novel sRNA which, together with the downstream gene coding for an ATP-binding cassette (ABC) transporter gene, responds to the antibiotic clindamycin. The number of predicted sRNAs correlated with the physiological function of the species (high for pathogens, low for cellulolytic, and intermediate for solventogenic), but not with 16S rRNA-based phylogeny.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21264064 PMCID: PMC3025663 DOI: 10.1128/mBio.00340-10
Source DB: PubMed Journal: mBio Impact factor: 7.867
Summary of predicted clostridial sRNAs
| Organism | No. of predicted sRNAs (excludes tRNAs) | Genome size (bp) | No. of genes | Avg IR size (bp) | No. of genes that overlap with Rfam (v9.1) (excludes tRNAs) |
|---|---|---|---|---|---|
| Pathogenic | |||||
| | 219 | 3,903,260 | 3,590 | 195 | 31 |
| | 233 | 3,863,450 | 3,551 | 198 | 31 |
| | 231 | 3,760,560 | 3,404 | 210 | 31 |
| | 249 | 4,259,691 | 3,984 | 200 | N/A |
| | 246 | 4,107,013 | 3,852 | 209 | N/A |
| | 245 | 3,847,969 | 3,520 | 201 | N/A |
| | 251 | 3,659,644 | 3,256 | 194 | N/A |
| | 257 | 4,012,918 | 3,659 | 208 | 31 |
| | 264 | 4,298,133 | 3,753 | 194 | 31 |
| | 119 | 2,547,720 | 2,315 | 146 | 30 |
| | 193 | 3,085,740 | 2,723 | 184 | 26 |
| | 181 | 3,256,683 | 2,876 | 183 | 30 |
| | 131 | 2,921,996 | 2,578 | 204 | 12 |
| | 137 | 2,873,333 | 2,432 | 171 | 32 |
| Solventogenic and | |||||
| | 113 | 4,132,880 | 3,848 | 153 | 32 |
| | 336 | 6,000,632 | 5,020 | 240 | 27 |
| | 126 | 4,023,800 | 3,913 | 161 | 41 |
| | 136 | 3,955,303 | 3,523 | 186 | N/A |
| Cellulolytic | |||||
| | 45 | 4,068,724 | 3,390 | 159 | N/A |
| | 42 | 4,847,594 | 3,902 | 228 | N/A |
| | 15 | 3,843,301 | 3,189 | 197 | 12 |
N/A, not available.
FIG 1 Q-RT-PCR validation of a select set of predicted sRNAs in Clostridium acetobutylicum and three Clostridium botulinum strains. Cycle threshold (C) values are shown for genes in four groups: positive-control genes, predicted sRNAs, low-expression genes, and negative-control genes. C values are averages of 5 or 6 replicates (error bars represent standard deviations). The numbers shown above the symbols (1 to 47) represent specific genes as listed below; the gene name is given first, and the gene number is shown in parentheses after the gene name. (A) Q-RT-PCR test of sRNA expression in C. acetobutylicum using pool CAC1 as described in Text S1 in the supplemental material. Positive-control genes (genes 1 to 6) include the 16S rRNA and genes known to be expressed well on the basis of microarray data (34): 16S (1), 6S (2), CAC0681 (3), CAC2957 (4), CAC1322 (5), and CAC2139 (6). Low- or no-expression genes (under normal culture conditions) (genes 37 to 42) based on microarray data (34) include the following: CAC3313 (37), CAP0060 (38), CAC0428 (39), CAC1094 (40), CAC2614 (41), and CAC2179 (42). The negative controls (genes 43 to 47) include the following intergenic region (IR) sequences (Ig stands for intergenic): IgCAC_1350 (43), IgCAC_232 (44), IgCAC_2999 (45), IgCAC_2630 (46), and IgCAC_2996 (47). The predicted sRNAs (genes 7 to 36) include the following: sCA-P60 (7), sCAC1449 (8), sCAC3821 (9), sCAC137 (10), sCA-P189 (11), sCAC3283 (12), sCAC1645 (13), sCAC646 (14), sCAC1132 (15), sCAC1582 (16), sCAC1760 (17), sCAC2795 (18), sCAC3723 (19), sCAC903 (20), sCAC3340 (21), sCAC3825 (22), sCAC975.1 (23), sCAC1315 (24), sCA-P18 (25), sCAC610 (26), sCAC2819.1 (27), sCAC1313 (28), sCA-P105 (29), sCA-P18.1 (30), sCAC3039 (31), sCAC500 (32), sCAC1594 (33), sCAC2709 (34), sCAC3850 (35), and sCAC349 (36). Notice that the values of the genes expressed well (genes 4 to 6) and the low-expression genes (genes 37 to 42) overlap. Thus, three zones of approximate expression are indicated: expressed (C ≤ 21), lowly expressed if at all (21 < C < 24), and probably not expressed (C ≥ 24). The C value of 21 was chosen as just below the C value of the lowest low-expression control genes; the C value of 24 was chosen based on the subsequent finding (Fig. 3) that sCAC610 represented by gene 26 is highly expressed. (B) Q-RT-PCR test of sRNA expression in C. botulinum A3 Loch Maree,C. botulinum A ATCC 3502, and C. botulinum A Hall using pool CLK, pool CBO, and pool CLC, respectively, as described in Text S2 in the supplemental material . The positive controls include 23S rRNA (genes 48 to 50) and 16S rRNA (genes 51 to 53) in the three C. botulinum strains. The negative controls (genes 75 to 78) include the following intergenic region sequences: IgCLC_2904 (75), IgCLC_2422 (76), IgCLC_2026 (77), and IgCLC_2348 (78). The predicted sRNAs (genes 54 to 74) are sCLK_200 (54), sCLK_3269 (55), sCLK_3642 (56), sCBO3039 (57), sCLC_2889 (58), sCLK_3105 (59), sCLK_3427 (60), sCBO2696 (61), sCLK_3557 (62), sCBO3480 (63), sCBO1976 (64), sCLK_2040 (65), sCLK_2759 (66), sCLC_3353 (67), sCLC_1905 (68), sCLC_2101 (69), sCBO2173 (70), sCLC_3476 (71), sCBO3602 (72), sCLK_1206 (73), and sCLK_3693 (74).
FIG 2 Northern analysis of predicted sRNAs. Validation of expression for a select set of sRNAs in C. acetobutylicum and C. botulinum by Northern analysis using single-stranded oligonucleotide probes. RNA from pool CAC2 was used for the C. acetobutylicum sRNAs, and RNA from pool CLK was used for the C. botulinum sRNAs. These RNA pools are described in Text S2 in the supplemental material. Ethidium bromide gels of 16S RNA are shown as qualitative and approximate loading controls. (A) Four predicted sRNAs from C. acetobutylicum (sCAC1449, sCAC1760, sCAC3821, and sCAC3340). (B) Two predicted sRNAs from C. botulinum A3 Loch Maree (sCLK_200 and sCLK_3105). (C) The predicted sCAC137 was validated with Northern analysis using a single-stranded oligonucleotide probe. The upstream annotated tRNA, CACt008, was also probed to confirm the tRNA size using a single-stranded oligonucleotide probe.
FIG 3 Expression of sCAC610 and CAC0528 in response to clindamycin treatment of C. acetobutylicum cultures. A single-stranded oligonucleotide probe was used in Northern analysis of sCAC610, and a double-stranded oligonucleotide probe was used for CAC0528. (A) Relative expression, by Q-RT-PCR analysis, of sCAC610 and CAC0528 upon clindamycin treatment, compared to untreated cells. The cells were treated with vehicle (no-clindamycin control) or with 50, 75, and 100 μg/ml clindamycin for 30 and 60 min. (B) Differential expression of sCAC610 and CAC0528 was confirmed using Northern analysis. The cells were treated with 50 µg/ml clindamycin for 30 and 60 min or not treated with clindamycin (0 µg/ml). Ethidium bromide gels of 16S RNA are shown as qualitative loading controls. (C) Predicted rho-independent terminators downstream of CAC0528. (D) DNA sequence of the entire intergenic region upstream of CAC0528. The sCAC610 transcript determined by the 5′ and 3′ RACE reactions is shown in bold type.
FIG 4 Conservation of predicted sRNAs in different clostridial species. (A) The number of predicted sRNAs in representative strains that are conserved in other clostridial species which includes all substrains. The number in parentheses next to the strain name is the total number of predicted sRNAs in each strain. (B to E) sRNA conservation between different clostridial species. Because an sRNA sequence may have multiple conserved sequences in another organism, the number of sRNAs conserved between two organisms could vary. The common number of sRNA sequences between the three species in each Venn diagram is the maximum number of conserved sRNAs in the three species, and the number in parentheses is the minimum number of conserved sRNAs in the three species.
FIG 5 Number of predicted sRNAs versus genome size. The number of predicted sRNAs for each clostridial species is plotted against the size of its genome. The number of predicted sRNAs in pathogenic clostridial strains varies linearly with genome size (R2 = 0.8901). For species with similar genome sizes (shown within the dashed-line box), the number of predicted sRNAs varies systematically with the type (grouped based on physiological niche) of clostridial species: very low in cellulolytic species, high for pathogenic species, and intermediate for solventogenic and C. kluyveri (this is statistically significant; the t test comparing any two of the three clostridial types gives P values of <0.005).