| Literature DB >> 25143534 |
Christian W Remmele1, Yibo Xian2, Marco Albrecht2, Michaela Faulstich2, Martin Fraunholz2, Elisabeth Heinrichs2, Marcus T Dittrich1, Tobias Müller1, Richard Reinhardt3, Thomas Rudel4.
Abstract
The WHO has recently classified Neisseria gonorrhoeae as a super-bacterium due to the rapid spread of antibiotic resistant derivatives and an overall dramatic increase in infection incidences. Genome sequencing has identified potential genes, however, little is known about the transcriptional organization and the presence of non-coding RNAs in gonococci. We performed RNA sequencing to define the transcriptome and the transcriptional start sites of all gonococcal genes and operons. Numerous new transcripts including 253 potentially non-coding RNAs transcribed from intergenic regions or antisense to coding genes were identified. Strikingly, strong antisense transcription was detected for the phase-variable opa genes coding for a family of adhesins and invasins in pathogenic Neisseria, that may have regulatory functions. Based on the defined transcriptional start sites, promoter motifs were identified. We further generated and sequenced a high density Tn5 transposon library to predict a core of 827 gonococcal essential genes, 133 of which have no known function. Our combined RNA-Seq and Tn-Seq approach establishes a detailed map of gonococcal genes and defines the first core set of essential gonococcal genes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25143534 PMCID: PMC4176332 DOI: 10.1093/nar/gku762
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
cDNA sequencing results, adapter removal and mapping overview
| No. of reads | Illumina A -TEX | Illumina A +TEX | Illumina B -TEX | Illumina B +TEX | 454 C -TEX | 454 C +TEX | 454 D -TEX | 454 D +TEX |
|---|---|---|---|---|---|---|---|---|
| Sequenced | 5 410 513 | 3 223 994 | 2 153 437 | 5 589 533 | 299 475 | 328 427 | 322 146 | 299 755 |
| Processed | 5 373 804 | 3 214 281 | 2 129 429 | 5 582 207 | 292 628 | 322 252 | 313 940 | 288 842 |
| Mapped | 5 318 864 | 3 209 065 | 2 084 169 | 5 576 139 | 280 501 | 315 114 | 89 925 | 206 705 |
| Uniquely Mapped | 2 484 119 | 2 887 125 | 925 437 | 5 489 994 | 169 728 | 200 189 | 45 887 | 98 145 |
Figure 1.Transcriptome annotation and expression. (A) Frequency for transcripts containing different numbers of genes. (B) 5′-UTR length distribution of 918 annotated protein coding genes on the ring chromosome for which a distinct primary TSS could be assigned. 5′-UTRs longer than 1000 nt were collapsed into one colored bin. (C) 3′-UTR length distribution of 709 annotated protein coding genes for which a distinct TES could be assigned. 3′-UTRs longer than 600 nucleotides were collapsed into one colored bin. (D) Expression values in RPKM for genes (green), antisense transcription (red) and intergenic regions (blue).
Figure 2.Novel transcripts and sRNA. (A) Northern blot validation of putative small RNAs. Expression of three new transcripts and two previously published sRNAs could be confirmed in MS11 strain. Wt MS11 as well as mutant strains were analyzed by northern blotting. Calculated RNA length were: 115 nt (short transcript) and 191 nt (long transcript) for NgncR_198, 88 nt for NgncR_162, 75 nt for NgncR_011, 239 nt for NgncR_036 and 109 nt for NgncR_094. Multiple bands visible for NgncR_198, NgncR_036 and NgncR_094 may have originated from degradation of the full-size products. (B) Visualization of sequence reads using the integrated genome browser for putative small RNA NgncR_011 and NgncR_198 obtained from non-treated and TEX-treated (+ TEX) libraries.
Figure 3.Phage gene cluster. Visualization of sequence reads of the RNA-Seq and TIS of the Tn-Seq using the integrated genome browser for the phage gene cluster NGFG_01323 - NGFG_02102 and a magnification of the transcriptionally active region including the phage repressor protein NGFG_01312.
Figure 4.Promoter analysis and motif search. (A) Genome-wide motif analysis of promoter regions identified a TATA box-like motif in all promoters. (B) Multiple sequence alignment and sequence logo of the DUS and the TATA-box motifs including the in-between sequence region of 34 promoters where the DUS motif is anchored around the −35 position. (C) Positional distribution of the DUS and the TATA-box like motif in these sequences. (D) A conserved TTGACA motif was identified by exact matching around the position −35 in 30 promoter sequences. (E) Network analysis of highly similar promoter sequences reveals a heterogeneous largest component of 30 promoters including densely clustering opacity proteins. Nodes in the network represent promoters and edges depict significant similarity between the promoter sequences; stronger edges represent higher similarity scores.
Tn sequencing, processing and mapping results
| No. of reads | A | B | A + B |
|---|---|---|---|
| Sequenced | 54 346 653 | 30 541 365 | 84 888 018 |
| Processed | 38 534 885 | 17 804 632 | 56 339 517 |
| Mapped | 36 258 405 | 16 360 608 | 52 619 013 |
| Unique TIS | 84 335 | 86 327 | 112 822 |
Figure 5.Tn-Seq analysis. (A) Scatterplot of TIS counts versus gene length. Red dots show genes with significantly lower TIS counts while green dots represent genes which are not significant. Triangles symbolize dots outside the borders of the plot. (B) Distribution of TIS counts per gene. (C) Distribution of identified TIS (black vertical lines) on a segment of the N. gonorrhoeae MS11 genome. For each position, the number of TIS in range of 200 nt in both directions was plotted. CDS annotations (light green) and transcript annotations (dark green) on plus and minus strands were visualized. The non-essential genes NGFG_01803, NGFG_ 04088 and NGFG_01809 have dense TIS, while essential modification methylases HphIA (NGFG_04087) and HphIB (NGFG_01806) display very few insertions and thus constitute essential genes. The potential promoter region of TRANS_1808 (modification methylases HphIA and HphIB) upstream of NGFG_04087 shows no TIS and, therefore, seems to be essential as well. (D) Relative COG class distribution of all CDS (dark gray) and of essential CDS (light gray). Significantly enriched or depleted COG classes are marked with stars.