| Literature DB >> 25806405 |
José Vicente Gomes-Filho1, Livia Soares Zaramela, Valéria Cristina da Silva Italiani, Nitin S Baliga, Ricardo Z N Vêncio, Tie Koide.
Abstract
The existence of sense overlapping transcripts that share regulatory and coding information in the same genomic sequence shows an additional level of prokaryotic gene expression complexity. Here we report the discovery of ncRNAs associated with IS1341-type transposase (tnpB) genes, at the 3'-end of such elements, with examples in archaea and bacteria. Focusing on the model haloarchaeon Halobacterium salinarum NRC-1, we show the existence of sense overlapping transcripts (sotRNAs) for all its IS1341-type transposases. Publicly available transcriptome compendium show condition-dependent differential regulation between sotRNAs and their cognate genes. These sotRNAs allowed us to find a UUCA tetraloop motif that is present in other archaea (ncRNA family HgcC) and in a H. salinarum intergenic ncRNA derived from a palindrome associated transposable elements (PATE). Overexpression of one sotRNA and the PATE-derived RNA harboring the tetraloop motif improved H. salinarum growth, indicating that these ncRNAs are functional.Entities:
Keywords: HgcC RNA family; IS1341; IS1341-type; IS200/605; RNA-seq; non-coding RNA; overlaping transcripts; sense overlaping transcripts; tetraloop; transposons
Mesh:
Substances:
Year: 2015 PMID: 25806405 PMCID: PMC4615843 DOI: 10.1080/15476286.2015.1019998
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
IS1341-type transposases (tnpB genes) analyzed in this work and their respective sense overlapping transcripts (sotRNAs)
| Gene | IS | chromosome | strand | sotRNA transcript start | sotRNA transcript end | sotRNA size [nt] | R1 strain equivalent genes | |||
|---|---|---|---|---|---|---|---|---|---|---|
| VNG0013C | IS | chr | 12734 | 11478 | reverse | 11641 | 11428 | 214 | OE1019R | |
| VNG0026C | IS | chr | 21789 | 20533 | reverse | 20701 | 20498 | 204 | OE1040R | |
| VNG0042G | IS | chr | 35931 | 34651 | reverse | 34775 | 34570 | 206 | OE1070R | |
| VNG0044H | IS605 (VNG0043H – VNG0044H), convergent | chr | 36715 | 37971 | forward | 37813 | 38042 | 230 | OE1074F | |
| VNG0286C | IS | chr | 229443 | 230597 | forward | 230444 | 230629 | 186 | OE1440F | |
| VNG2652H | IS | chr | 1989188 | 1987884 | reverse | 1988005 | 1987819 | 187 | OE4727R | |
| VNG6181H | IS | pNRC200 | 155603 | 154368 | reverse | 154508 | 154259 | 250 | OE5062R | |
| VNG6221H | IS | pNRC200 | 183054 | 181795 | reverse | 182008 | 181724 | 285 | OE5102R | |
| VNG6361G | IS | pNRC200 | 287504 | 288844 | forward | 288696 | 288915 | 220 | OE6034F | |
| VNG6406H | IS | pNRC200 | 316931 | 315786 | reverse | 315833 | 315631 | 203 | OE6089R |
Figure 1.Identification of sotRNAs in IS1341-type transposases using tiling array data. (A) (I) Tiling array signal in reference condition for VNG0042G and VNG_sot0042; (II) expression profiles over the growth curve; (III) expression profiles during TfbD overexpression. Heatmaps are color-coded according to log10 expression ratios between each of the 13 time points relative to reference condition. (B) Tiling array signal in reference condition and expression profiles of the remaining 9 IS1341-type transposases (arrows in yellow for genes in forward strand and in orange for genes in reverse strand) and their sotRNAs (light blue arrows) over the growth curve and TfbD overexpression, as described in (A).
Figure 2.Analysis of sRNA-seq and expression profiles for VNG0042G and VNG_sot0042. (A) Strand-specific sRNA-seq data was used to map the 5' end of sotRNAs in IS1341-type transposases of H. salinarum NRC-1. sRNA-seq data is visualized as log2 of total reads aligned in each genomic position for VNG0042G and VNG_sot0042. Enrichment of 5' ends of mapped reads are visualized as peaks right below small RNA-seq coverage. Light blue arrow: sotRNA. Dark orange arrows: genes annotated on reverse strand. (B) Expression profiles of VNG0042G (orange) and VNG_sot0042 (blue) in different environmental and genetic backgrounds. Each pair of orange/blue columns represent one published experimental condition. Data for the remaining genes and for each individual experiment is available in Supplementary Figure 1. Data for intergenic ncRNAs can be found in and Supplementary Figure 7.
Figure 4.(A) Strand-specific sRNA-seq data was used to map 5' ends of intergenic RNAs in H. salinarum NRC-1. sRNA-seq data is visualized as log2 of total reads aligned in each genomic position for VNG_R0052. Enrichment of 5' ends of mapped reads are visualized as peaks right below small RNA-seq coverage profile. (B) Expression profiles of VNG_R0052 in different environmental and genetic backgrounds.
Figure 3.Tetraloop motif found in most sotRNAs in H. salinarum NRC-1. Colors represent base pairing groups and color saturation corresponds to conservation, parenthesis correspond to pairing and dots to unpaired regions. Gray blocks below the sequences represent a histogram of the most conserved nucleotide. Raw data is available at Supplementary File 2.
Intergenic RNAs harboring the RE-like tetraloop motif
| ncRNA | VNG_R0052 | VNG_R6334 |
|---|---|---|
| Start | 14397 | 268030 |
| End | 14299 | 268304 |
| Length | 99 | 275 |
| Chr | chr | pNRC200 |
| Strand | reverse | forward |
| RE-like Start | 14359 | 268256 |
| RE-like End | 14340 | 268277 |
Figure 5.Growth curve of 4 different strains of H. salinarum NRC-1 under reference conditions. Strains overexpressing RNAs (VNG_R0052 and VNG_sot0042) that harbors the RE-like tetraloop present higher growth when compared to control strains (NRC-1, wild type and PMTF-0, which is the wild type strain with an empty overexpression vector).