| Literature DB >> 19923228 |
Marco Albrecht1, Cynthia M Sharma, Richard Reinhardt, Jörg Vogel, Thomas Rudel.
Abstract
Chlamydia trachomatis is an obligate intracellular pathogenic bacterium that has been refractory to genetic manipulations. Although the genomes of several strains have been sequenced, very little information is available on the gene structure of these bacteria. We used deep sequencing to define the transcriptome of purified elementary bodies (EB) and reticulate bodies (RB) of C. trachomatis L2b, respectively. Using an RNA-seq approach, we have mapped 363 transcriptional start sites (TSS) of annotated genes. Semi-quantitative analysis of mapped cDNA reads revealed differences in the RNA levels of 84 genes isolated from EB and RB, respectively. We have identified and in part confirmed 42 genome- and 1 plasmid-derived novel non-coding RNAs. The genome encoded non-coding RNA, ctrR0332 was one of the most abundantly and differentially expressed RNA in EB and RB, implying an important role in the developmental cycle of C. trachomatis. The detailed map of TSS in a thus far unprecedented resolution as a complement to the genome sequence will help to understand the organization, control and function of genes of this important pathogen.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19923228 PMCID: PMC2817459 DOI: 10.1093/nar/gkp1032
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Characterization of the cDNA libraries. (A) Length distribution of reads after 5′ end-linker and polyA-tail clipping of four sequenced C. trachomatis cDNA libraries generated from either total RNA or total RNA enriched for primary transcripts of reticulate bodies (RB) or elementary bodies (EB), respectively. Shown are relative numbers of groups of sequence length in relation to the total number of reads per library. (B) Sequence read distribution of the four cDNA libraries grouped into different classes of RNAs. Transcripts antisense to ribosomal RNAs, transfer RNAs and messenger RNAs are not shown since they represent <1% of total sequences. Transcripts located in intergenic regions (IGR) can either be sRNA candidates, part of 5′- or 3′-UTRs of mRNAs or unannotated coding genes. The majority of the fraction of reads that could not be mapped to the chlamydial genome corresponds to contaminating host cell RNA, mainly mitochondrial RNAs.
Transcripts enriched in EB and RB
| Gene | Description | Gene no. | Homolog | Microarray ( |
|---|---|---|---|---|
| Elementary bodies | ||||
| Pseudogen | CTLon_0548 | CT300 | – | |
| Hypothetical protein | CTLon_0186 | CT814.1 | Late | |
| Hypothetical protein | CTLon_0332 | CT081 | Late | |
| Type III secretion chaperone (low calcium response protein H) | CTLon_0833 | CT576 | Late | |
| Putative type III secretion system protein | CTLon_0836 | CT579 | Late | |
| Hypothetical protein | CTLon_0250 | CT875 | Late | |
| Hypothetical protein | CTLon_0333 | CT082 | Late | |
| Cysteine-rich outer membrane protein | CTLon_0699 | CT444 | Late | |
| Hypothetical protein | CTLon_0880 | CT622 | Late | |
| Hypothetical protein | CTLon_0834 | CT577 | Late | |
| Hypothetical protein | CTLon_0242 | CT867 | – | |
| Hypothetical protein | CTLon_0185 | CT814 | Late | |
| Cysteine-rich membrane protein | CTLon_0697 | CT442 | Late | |
| Putative integral membrane protein | CTLon_0398 | CT147 | Very late | |
| Putative lipoprotein | CTLon_0700 | CT444.1 | – | |
| Hypothetical protein | CTLon_0428 | CT181 | Late | |
| Late transcription unit B protein | CTLon_0331 | CT080 | Late | |
| Putative type III secretion system membrane protein | CTLon_0835 | CT578 | Late | |
| Hypothetical protein | CTLon_0609 | CT357 | – | |
| Hypothetical protein | CTLon_0255 | CT005 | Late | |
| Putative protein ligase | CTLon_0285 | CT035 | Very late | |
| Hypothetical protein | CTLon_0608 | CT356 | Late | |
| 60 kDa cysteine-rich outer membrane protein | CTLon_0698 | CT443 | Late | |
| 1-Acyl-sn-glycerol-3-phosphate acyltransferase | CTLon_0144 | CT775 | Late | |
| Putative integral membrane protein | CTLon_0617 | CT365 | Late | |
| Late transcription unit A protein | CTLon_0629 | CT377 | Midlate I | |
| Hypothetical protein | CTLon_0461 | CT214 | Late | |
| Putative oxidoreductase | CTLon_0627 | CT375 | Late | |
| Hypothetical protein | CTLon_0240 | CT865 | – | |
| ABC transporter, ATP-binding protein | CTLon_0427 | CT180 | – | |
| Hypothetical protein | CTLon_0334 | CT083 | Late | |
| Malate dehydrogenase | CTLon_0628 | CT376 | Midlate I | |
| Pseudogen | CTLon_0610 | CT358 | – | |
| Hypothetical protein | CTLon_0536 | CT288 | Late | |
| Long chain fatty acid-[acyl-carrier-protein] ligase | CTLon_0145 | CT776 | Late | |
| Hypothetical protein | CTLon_0477 | CT229 | – | |
| Hypothetical protein | CTLon_0028 | CT659 | Late | |
| Ribose-5-phosphate isomerase A | CTLon_0460 | CT213 | Late | |
| Reticulate bodies | ||||
| Polymorphic outer membrane protein | CTLon_0667 | CT414 | Midlate I | |
| Ribonucleotide-diphosphate reductase subunit alpha | CTLon_0199 | CT827 | Midlate I | |
| Hypothetical protein | CTLon_0267 | CT017 | Late | |
| Hypothetical protein | CTLon_0859 | CT602 | – | |
| Cytochrome | CTLon_0263 | CT013 | Midlate I | |
| Putative type III secretion system chaperone | CTLon_0294 | CT043 | Midlate I | |
| Putative cation efflux protein | CTLon_0678 | CT423 | – | |
| Type III secretion structural protein (outer membrane ring) | CTLon_0043 | CT674 | Midlate I | |
| Type III secretion system protein, membrane component | CTLon_0816 | CT559 | Midlate I | |
| Hypothetical protein | CTLon_0251 | CT001 | Late | |
| Transaldolase B | CTLon_0561 | CT313 | Midlate I | |
| Cell division protein | CTLon_0213 | CT841 | Late | |
| Major outer membrane protein | CTLon_0050 | CT681 | Midlate II | |
| Putative serine/threonine-protein kinase (TTSS effector protein) | CTLon_0042 | CT673 | – | |
| Holliday junction DNA helicase B | CTLon_0291 | CT040 | – | |
| Putative lipoprotein | CTLon_0501 | CT253 | – | |
| Integration host factor alpha-subunit | CTLon_0515 | CT267 | Midlate I | |
| Putative helicase | CTLon_0077 | CT708 | Midlate I | |
| Polymorphic outer membrane protein | CTLon_0245 | CT870 | Midlate I | |
| Hypothetical protein | CTLon_0624 | CT372 | Midlate I | |
| Hypothetical protein | CTLon_0152 | CT783 | Midlate II | |
| Tyrosine-specific transport protein | CTLon_0190 | CT818 | – | |
| ATP-dependent Clp protease | CTLon_0534 | CT286 | Midlate II | |
| Hypothetical protein | CTLon_0537 | CT289 | Midlate I | |
| Low calcium response protein E (TTSS effector protein) | CTLon_0340 | CT089 | Midlate I | |
| Polymorphic outer membrane protein | CTLon_0246 | CT871 | Midlate I | |
| Putative sugar phosphate permease | CTLon_0801 | CT544 | Midlate I | |
| Polymorphic outer membrane protein | CTLon_0247 | CT872 | Midlate I | |
| 50S ribosomal protein L13 | CTLon_0376 | CT125 | – | |
| Hypothetical protein | CTLon_0876 | CT618 | Midlate I | |
| Polymorphic outer membrane protein | CTLon_0244 | CT869 | Late | |
| Hypothetical protein | CTLon_0572 | CT324 | – | |
| ABC transport protein, ATPase component | CTLon_0059 | CT690 | – | |
| Polymorphic outer membrane protein | CTLon_0666 | CT413 | Midlate I | |
| Hypothetical protein | CTLon_0521 | CT273 | Midlate I | |
| Hypothetical protein | CTLon_0702 | CT446 | – | |
| Ribosomal large subunit pseudogenuridine synthase D | CTLon_0027 | CT658 | – | |
| Hypothetical protein | CTLon_0607 | CT355 | – | |
| Putative DNA methyltransferase | CTLon_0733 | CT477 | – | |
| Histone-like protein 2 | CTLon_0297 | CT046 | Late | |
| Hypothetical protein | CTLon_0879 | CT621 | Midlate I | |
| Nucleoside diphosphate kinase | CTLon_0757 | CT500 | Midlate I | |
| Polymorphic outer membrane protein | CTLon_0249 | CT874 | Midlate I | |
| CDP-diacylglycerol–glycerol-3-phosphate 3-phosphatidyltransferase | CTLon_0752 | CT496 | Midlate I | |
| Na(+)-translocating NADH-quinone reductase subunit A | CTLon_0002 | CT634 | Midlate I | |
| Hypothetical protein | CTLon_0635 | CT382.1 | – | |
A semi-quantitaive analysis of differentially expressed genes returned 84 protein coding genes, 38 overrepresented in EB and 46 more abundant in RB. Sequence read numbers were counted as for each ORF including the 5′-UTR if present and normalized by the total number of mRNA transcripts for each library. Genes with a total number of at least 20 sequence reads and a 2-fold regulation were considered as differentially expressed. A comparison to a microarray based gene expression study shows a high correlation of genes abundant in the late infection cycle phase to EB and genes abundant in the midlate phases to RB.
Figure 2.Validation of putative small RNAs by northern analysis. (A) Expression of eight out of twelve new candidate sRNAs could be confirmed by northern blotting whereas the length of the probed RNA corresponded to the calculated length from the sequencing data. The genomic location of the sRNAs is shown in (B). ctrR1 is not a primary transcript but presumably processed from a larger transcript containing tRNA:Thr. ctrR2, 3, 5–7 are located intergenic and primary transcripts. Two sRNAs are transcribed from ctrR4 and the longer transcript overlaps the gene 403 on the opposite strand. ctrR8 is located antisense to gene 807. ctrR0332 has previously been identified as ORF CTLon_0332, but represents two RNA species processed from a larger transcript lacking an ORF. Details on the genomic location and the sequence of ctrR0332 are given in Supplementary Figure S2. Two housekeeping RNAs (tmRNA and SRP RNA) and ihtA is the only so far reported chlamydial sRNA are shown in the right lanes. Genomic locations, sequence read numbers and hybridization probes are given in Supplementary Table S2.
Putative targets of chlamydial non-coding RNAs
| sRNA | Start | End | Length | RB read number | EB read number | Putative target mRNAs |
|---|---|---|---|---|---|---|
| ihtA | 53 029 | 53 137 | 109 | 77 | 17 | hctA, 317, 097, mgtE |
| ctrR1 | 53 220 | 53 363 | 144 | 136 | 227 | 819, 370, 505 |
| ctrR2 | 1 26 638 | 1 26 536 | 103 | 1 | 2 | 492, |
| ctrR3 | 3 42 947 | 3 42 838 | 110 | 981 | 1761 | 248, |
| ctrR4 | 4 98 126 | 4 97 963 | 164 | 19 | 3 | xerC, pknD, 626 |
| ctrR5 | 5 86 269 | 5 86 341 | 73 | 8 | 3 | dacC, gnd |
| ctrR6 | 7 25 411 | 7 25 268 | 144 | 75 | 19 | 824, |
| ctrR7 | 8 76 694 | 8 76 384 | 311 | 71 | 15 | copB, 847, glnQ, 531, ssc2, 641, 623, 652, lpdA, ihfA, 597, tig, nusG, aroA |
| ctrR8 | 9 38 403 | 9 38 509 | 107 | 59 | 1 | |
| ctrR0332 | 4 16 302 | 4 16 641 | 340 | 1039 | 16 960 | 333, atpK, 292, 586, gnd, 822, glgC |
| pL2-sRNA1 | 213 | 292 | 90 | 8332 | 1075 |
Based on binding probability of the newly identified small non-coding RNAs putative binding partners were identified using the TargetRNA algorithm. Hits were filtered for binding of the sRNA to the translation start codon or the ribosome binding site. None of the sRNAs is conserved in other bacteria than Chlamydiae. However some seed sequences are conserved among at least two other bacterial phyla. Targets putatively bound by these conserved sequence elements are underlined. Putative target mRNAs are listed in order of their binding score starting with the highest score.
Figure 3.Transcriptome of the cryptic plasmid. (A) Calculated sequence graphs for C. trachomatis cryptic plasmid. Graphs show the number of sequence reads for every nucleotide of the plasmid up to the cut-off value of 10. Total RNA library reads are shown in grey and are overlaid by red graphs for the TSS enriched libraries. Annotated ORFs are shown as black bars for the plus strand (upper) and minus strand (lower), respectively. Note that total numbers for single transcripts are much higher than 10 but omitted for better visualization. TSS are indicated by horizontal arrows marked with the start positions. Hybridization probe binding sites used for northern detection are marked by vertical arrowheads and the probe name. (B) Northern analysis of C. trachomatis plasmid transcripts reveals a highly abundant small RNA pL2-sRNA1 of ∼100 nt in length antisense to pL2-07a. The region located antisense to the ORF pL2-02 encodes several transcripts in the range of ∼80–450 nt. Probe binding sites are marked by an arrow in (A). The names of the transcripts correspond to the first nucleotide of the probe binding site.