| Literature DB >> 29370758 |
Manuel Wittchen1, Tobias Busche1,2, Andrew H Gaspar3, Ju Huck Lee4,5, Hung Ton-That4, Jörn Kalinowski6, Andreas Tauch1.
Abstract
BACKGROUND: The human pathogen Corynebacterium diphtheriae is the causative agent of diphtheria. In the 1990s a large diphtheria outbreak in Eastern Europe was caused by the strain C. diphtheriae NCTC 13129. Although the genome was sequenced more than a decade ago, not much is known about its transcriptome. Our aim was to use transcriptome sequencing (RNA-Seq) to close this knowledge gap and gain insights into the transcriptional landscape of a C. diphtheriae tox+ strain.Entities:
Keywords: Corynebacterium diphtheriae; Diphtheria toxin; DtxR; Promoter; RNA-Seq; Transcription start site; Transcriptome sequencing
Mesh:
Substances:
Year: 2018 PMID: 29370758 PMCID: PMC5784534 DOI: 10.1186/s12864-018-4481-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Classification of detected transcription start sites (TSS). The identification, curation and classification of TSS is shown. From the automatically detected 3987 TSS 2310 false positive TSS and 21 TSS belonging to rRNA and tRNA genes were removed, resulting in 1656 putative TSS assigned to different transcript types
Fig. 2Promoter motifs for the sigma factor σA of C. diphtheriae. The size of the nucleotide represents the relative abundance of the particular nucleotide at this position. The −10 motif was found upstream of 1190 TSS and the −35 motif upstream of 1031 TSS. The data was visualized with Weblogo [54]
Fig. 3Histogram of 5’-UTR lengths and Ribosomal binding site motif in C. diphtheriae. a For the 5’-UTR analysis 1232 TSS from known genes and novel transcripts were used. The distribution of 5’-UTR lengths ranges from 0 nt (leaderless) to a maximum of 463 nt. Transcripts with a 5’-UTR of up to 5 nucleotides make up one third of all protein-coding genes. The bars represent UTR length increments of five nucleotides (1–5 nt, 6–10 nt, etc.), except for the first bar which represents UTR length of zero nucleotides. b Ribosomal binding site motif and mean distance to start codon. All 5’-UTRs longer than 5 nt were used for motif search. The y-axis shows the information content in bits for every nucleotide position. The diagram was created with Weblogo [54]
Predicted cis-regulatory elements in the 5’-UTRs. The predictions were obtained by using Infernal 1.1.2 with the Rfam 12.1 database and compared with the primary 5′-end data set. The list is sorted by Bit Score in descending order. Abbreviation: n.a., not applicable
| Infernal prediction with Rfam | RNA-Seq detection | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Name | ID | Start | End | Bit Score | Strand | Status | Start | End | Gene |
| Cobalamin riboswitch | RF00174 | 1,066,117 | 1,066,317 | 90.8 | + | observed | 1,066,117 | 1,066,317 | DIP1084 |
| Cobalamin riboswitch | RF00174 | 862,214 | 862,414 | 81.8 | + | not observeda | n.a. | n.a. | n.a. |
|
| RF01766 | 285,506 | 285,923 | 64.8 | + | observed | 285,506 | 285,923 | DIP0320 / |
|
| RF01746 | 1,640,095 | 1,639,987 | 70.2 | – | observedb | 1,640,095 | 1,639,987 | DIP1606 / |
| TPP riboswitch | RF00059 | 924,778 | 924,890 | 60.8 | + | observedc | 924,778 | 924,890 | DIP0953 |
| TPP riboswitch | RF00059 | 922,838 | 922,969 | 60.0 | + | observedc | 922,838 | 922,969 | DIP0951 |
|
| RF01747 | 509,334 | 509,277 | 48.2 | – | observed | 509,334 | 509,277 | DIP0539 |
| TPP riboswitch | RF00059 | 27,456 | 27,566 | 57.6 | + | not observeda | n.a. | n.a. | n.a. |
aNo TSS detected and insufficient read coverage in that area
bMotif of the mraW region is located upstream of the TSS of DIP1606 / mraW
cLast base of TPP riboswitch is located in the respective CDS
Fig. 4Illustration of an intergenic TSS assigned to a novel transcript. The intergenic TSS at position 1,866,670 bp of the C. diphtheriae NCTC 13129 genome has a corresponding coverage in the downstream direction, giving hint for a novel, not yet annotated transcript. An ORF given the locus tag DIP1817A was found, which is predicted to encode a helix-turn-helix (HTH) family transcriptional regulator
Fig. 5Number of monocistronic genes, primary operons and sub-operons of C. diphtheriae NCTC 13129. The number of genes included in primary and sub-operons is color-coded
Largest primary operons in C. diphtheriae NCTC 13129. The predictions from the eggNOG database (v4.5) were used to classify genes by their functions
| Genes | Number of genes | Strand | TSS | Gene names and classification by eggNOG database |
|---|---|---|---|---|
| DIP0472 - DIP0482 | 11 | + | detected | |
| DIP0203 - DIP0209 | 8 | + | not detected | DIP0203, DIP0204, DIP0205, DIP0206, DIP0207, DIP0208, DIP0208A, DIP0209. |
| DIP0407 - DIP0414 | 8 | + | detected | |
| DIP0438 - DIP0445 | 8 | + | detected | DIP0438, DIP0439, DIP0440, DIP0441, DIP0442, DIP0443, DIP0444, DIP0445.Inorganic ion transport and metabolism; Function unknown |
| DIP0719 - DIP0726 | 8 | + | detected | DIP0719, DIP0720, DIP0721, DIP0722, DIP0723, DIP0724, DIP0725, |
| DIP0969 - DIP0976 | 8 | + | detected | DIP0969, DIP0970, DIP0971, DIP0972, |
| DIP1248 - DIP1241 | 8 | – | detected | DIP1248, DIP1247, DIP1246, DIP1245, DIP1244, DIP1243, |
| DIP1603 - DIP1596 | 8 | – | not detected | |
| DIP1720 - DIP1713 | 8 | – | not detected | |
| DIP1779 - DIP1772 | 8 | – | not detected | |
| DIP1857 – DIP1850 | 8 | – | detected |
Fig. 6M/A plot of differentially transcribed genes in the ΔdtxR mutant. Genes with an adjusted p-value below 0.05 are shown in red. The blue lines indicate the log2(fold change) threshold of +1.0 and −1.0, respectively. Genes with a known or predicted DtxR binding site are labeled
Differential expression and σA promoter region of genes with known or predicted DtxR binding site. Based on the differential expression values the genes are classified into three groups: repressed, activated and not differentially expressed. In case no TSS could be assigned to the gene only the translation start codon is shown. In case a TSS could be assigned, but no −10 motif could be identified the start codon is preceded with a dotted line. Counts, normalized read counts; LFC, log2(fold change); Start, start codon. The asterisks mark genes with an experimentally shown DtxR regulation
aNo TSS detected, hence no promoter motif predictable
bGene is part of a primary operon and lacks own TSS
cMultiple TSS detected and only TSS closest to DtxR binding site shown. In case of the tox gene the −10 motif of the TSS (TSS 2) closest to the start codon is shown
dGene is the first one in a sub-operon and therefore has own TSS
Fig. 7Transcription profile of the phage island and the tox gene. a The transcription profile of the phage island (framed by two red lines) of the wild type and the isogenic ΔdtxR mutant is shown. The genes which could be assigned with a TSS are labeled with a black arrow. Primary operon structures are indicated by dashed arrows. The tox gene is shown in dark yellow. TSS assigned to intergenic and antisense transcripts were omitted for clarity. b Detailed view on the monocistronic tox gene. It possesses two TSS (TSS 1 and TSS 2) with 5’-UTR lengths of 43 nt and 38 nt. The DtxR binding site (red box) is located close to the start codon of the CDS and overlaps both TSS and their corresponding −10 promoter motifs. Two additional TSS for putative antisense RNAs were found (light green arrows) and two additional TSS (intergenic TSS 1 and intergenic TSS 2) on the forward and reverse strand, respectively, were identified upstream of the tox gene, indicating two putative novel transcripts. The image is a modified screenshot from ReadXplorer [48] showing non-normalized coverage data of one exemplary replicate per condition