| Literature DB >> 22783241 |
David L Bernick1, Patrick P Dennis, Lauren M Lui, Todd M Lowe.
Abstract
A great diversity of small, non-coding RNA (ncRNA) molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs (sRNA) in archaea is limited. We employed RNA-seq to identify novel sRNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense sRNAs encoded opposite to key regulatory (ferric uptake regulator), metabolic (triose-phosphate isomerase), and core transcriptional apparatus genes (transcription factor B). We also found a large increase in the number of conserved C/D box sRNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these sRNAs indicates they are relatively recent, stable adaptations.Entities:
Keywords: C/D box small RNA; antisense small RNA; archaea; comparative genomics; gene regulation; transcriptome sequencing
Year: 2012 PMID: 22783241 PMCID: PMC3388794 DOI: 10.3389/fmicb.2012.00231
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Small RNA transcript abundance in four species of . Sense oriented reads (+) and antisense-oriented reads (−) shown in barplots for each species. Samples of each species were taken at both exponential (Exp) and stationary phases (Stat). RNA classifications were made based on mapping to genes coding for C/D box sRNA (C/D sRNA), H/ACA-like sRNA, CRISPR arrays (crRNA), fragments of coding regions (coding), ribosomal RNA (rRNA), and transfer RNA (tRNA).
Orthologous genes with 5′ sequencing reads. Orthologous groups are shown in each row where the locus tag number (e.g., 1645 for gene PAE1645) is followed by counts of (antisense, sense) reads. Groups are ranked by the total number of reads found within groupings formed by the number of species in a group with antisense sequencing reads. Read counts are accumulated by considering the largest region covered by at least one read in an overlapping region along a given strand, and assigning the read count to that region. Footnoted gene IDs have associated snoRNA-like sRNA (C/D box or H/ACA-like) – a, antisense oriented; s, sense oriented.
| Product | ||||
|---|---|---|---|---|
| Transcription initiation factor IIB | 1645 (225,409) | 1976 (12,16) | 0584 (1,1) | 1667 (8,39) |
| DNA-cytosine methyltransferase | 1659 (4,0) | 1839 (1,0) | 0576 | 1675 (2,0) |
| Rhomboid family protein | 1099 (3,0) | 0267 (1,0) | 0686 | 1249 (2,0) |
| Ferric uptake regulator, Fur family | 2309 (40,11) | 1526 | 1653 (1,0) | 1023 |
| 30S ribosomal protein S12P | 0670 | 2326 (2,0) | 2096 | 0698 (7,1) |
| Cobalamin adenosyltransferase | 1715 | 0782 | 0623a (86,4) | 1701 (0,3) |
| Thiol:disulfide interchange protein | 3152 | 1672 | 1794 | 0523 (32,0) |
| 30S ribosomal protein S11P | 3179 (15,2) | 1654 (0,2) | 1813 | 0540 |
| NAD-dependent deacetylase | 3500 | 1959a (12,3) | 1963 | 0793 |
| NADH dehydrogenase subunit A | 3520 (9,0) | 1954 (0,1) | 1983 | 0847 |
| 30S ribosomal protein S3P | 1779 | 0769 | 0553 | 1729 (9,0) |
| Translation initiation factor IF–1A | 1072 (7,0) | 0278 | 0681 | 1256 |
| Putative transcriptional regulator, GntR family | 2315 (0,10) | 1532 (4,2) | 1659 (0,2) | 1028 |
| Valyl-tRNA synthetase | 2297 (4,0) | 1497 (0,1) | 1649 | 1019 (0,1) |
| Putative signal-transduction protein with CBS domains | 2961 (4,0) | 1332 (0,1) | 1143 | 0364 |
| Major facilitator superfamily MFS_1 | 1550s (3,5) | 0660s (0,2) | 0530s (0,2) | |
| Elongation factor 1, beta/beta’/delta chain | 0695 (3,1) | 2345 | 2114 | 0684 |
| Egghead-like protein | 0042 (3,2) | 1076 | 2043 (0,1) | 0056 |
| V-type ATP synthase subunit B | 1146 | 0237 (3,0) | 0698 | 1264 |
| Conserved protein (possible ATP binding) | 0793 (3,11) | 0044 | 2138 | 1084 |
| Putative transcriptional regulator, ModE family | 0813 (2,0) | 0057 | 0023 | 1100 |
| 50S ribosomal protein L18e | 0672 | 2328 (2,0) | 2098 | 0696 (0,1) |
| ABC transporter related | 1393 (2,0) | 0445 | 1879 | 1525 |
| Peptidase M50 | 1702 | 2238 (2,1) | 0618 | 1696 |
| Cation diffusion facilitator family transporter | 0568 (2,0) | 2239 | 1215 | 0125 (0,2) |
| paREP10 | 1480 | 0613 (2,0) | 0811 (0,1) | 1575 |
| Exosome complex RNA-binding protein Rrp42 | 2206 (2,1) | 1938 | 0932 | 0835 |
| Inner-membrane translocator | 3412 (2,0) | 1174 | 1046 | 0977 |
| NADH-ubiquinone oxidoreductase subunit | 2274 | 2047 | 0329 (1,0) | |
| CopG domain protein DNA-binding domain protein | 2357 (1,0) | 1561 | 1689 | 0622 |
| Inner-membrane translocator | 3348 (1,0) | 1760 | 0444 | 0590 |
| Amino acid-binding ACT domain protein | 2296 | 1510 | 1648 | 1018 (1,1) |
| Hydrogen sulfite reductase | 2596 (1,0) | 1213 | 1457 | |
| DNA-directed RNA polymerase subunit P | 2258 (0,1) | 1825 | 1624 (1,0) | 0899 |
| DNA polymerase, beta domain protein | 1893 | 0821 (1,0) | 1502 | |
| Phosphate ABC transporter, inner membrane subunit PstC | 1396 | 0443 | 1881 | 1527 (1,0) |
| Nicotinamide-nucleotide adenylyltransferase | 1438 | 0405 (1,0) | 0794 | 1561 |
| Peptidase S8 and S53, subtilisin, kexin, sedolisin | 1983 | 2056 | 1805 (1,0) | |
| Glu/Leu/Phe/Val dehydrogenase,C terminal | 3438 | 1871 | 1031 | 0980 (1,0) |
| D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding | 3320 (1,1) | 1736 | 1741 | 0566 |
| Electron transfer flavoprotein, alpha subunit | 0721 | 2372 (1,0) | 2132 | 0645 (0,1) |
| Sua5/YciO/YrdC/YwlC family protein | 2978 (1,0) | 1345 | 1129 | 0378 (0,1) |
| AAA ATPase | 3527 | 1626 (1,0) | 1978 | 0145 |
| MazG nucleotide pyrophosphohydrolase | 1159 (1,0) | 0222 | 0722 | |
| 30S ribosomal protein S8P | 2098 | 2009 (0,2) | 0176 | 1865 (1,0) |
| Transcriptional regulator, XRE family | 0783 | 0037 (1,0) | 2145 | 1076 |
| Acyl-CoA dehydrogenase domain protein | 2070 | 2103 | 0199 | 1853 (1,0) |
| 2-dehydropantoate 2-reductase | 3409 (1,0) | 2003 | 0383 | 1363 |
| FHA domain containing protein | 0816 | 0060 (1,0) | 0026 | 1103 |
| PaREP1 domain containing protein | 3235 | 0464 (0,3) | 1514 (1,0) | |
| 30S ribosomal protein S19e | 3043 (1,1) | 1790 | 0988 | 0440 (0,39) |
| CutA1 divalent ion tolerance protein | 2325 | 1539 | 1667 | 1044 (1,0) |
| Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase | 2075 (1,1) | 2019 | 0203 | 1857 |
| Inner-membrane translocator | 2083 (1,0) | 1504 | 0826 | 0317 (0,2) |
| 30S ribosomal protein S7P | 0733 (1,1) | 0001 | 0006 | 0655 (0,3) |
| Ribosomal protein L11 | 3104 (1,0) | 1602 | 1832 | 0464 |
| Metallophosphoesterase | 3211 | 1639 (1,0) | 0239 | 1924 |
| Acetolactate synthase, large subunit, biosynthetic type | 3300 | 1724 (1,1) | 1753 | 0554 (0,2) |
| NAD+ synthetase | 1219 (0,1) | 0310 (1,0) | 0793 | 1302 |
| 30S ribosomal protein S3Ae | 3472 (1,0) | 1852 | 1182 | 0771 (0,1) |
| Band7 protein | 0750 | 0015 | 2166 (1,0) | 1055 |
| TGS domain protein | 1649 | 1844 | 0581 | 1670 (1,0) |
| MoaD family protein | 0727 (0,1) | 2368 | 2136 (1,0) | 0649 |
| Putative circadian clock protein, KaiC | 0729 (1,1) | 2366 (0,2) | 0010 | 0651 |
| Tryptophanyl-tRNA synthetase | 3091 (1,0) | 1612 | 1822 | 0454 |
| Aldehyde ferredoxin oxidoreductase | 0622 (1,4) | 2285 | 2057 | 0738 |
| Inner-membrane translocator | 3350 | 1761 | 0445 (0,1) | 0591 (1,0) |
| Tyrosyl-tRNA synthetase | 0630 | 2290 (1,0) | 2062 | 0733 |
| NADH-quinone oxidoreductase, B subunit | 2928 | 1001 | 1957 | 0336 (1,0) |
| Prephenate dehydratase | 0893s (0,51) | 0111 | 0075 | 1150 (1,0) |
Hypothetical genes with 3′ sequencing reads. Orthologous groups, read counts, and footnotes displayed are as described in Table A1.
| Product | ||||
|---|---|---|---|---|
| Protein of unknown function DUF6, transmembrane | 1545s (16,32) | 0667s (2,20) | 0535s (1,53) | 1614s (2,7) |
| Hypothetical protein | 1519a (550,1) | 0634a (197,0) | 0875a (202,0) | 1596 |
| Hypothetical protein | 0577a (6,0) | 2243a (18,7) | 1195a (2,0) | 0121 |
| Hypothetical protein | 1836 (0,2) | 0710 (1,0) | 0502 (5,0) | 1752 (2,0) |
| Hypothetical protein | 3249 (4,0) | 1805 | 1855 | 0485 (2148,0) |
| Hypothetical protein | 1234 (349,0) | 0295 (1,0) | 1511 | 0244 |
| Protein of unknown function DUF1614 | 2020 | 2082 (15,0) | 1014 | 1832 (58,0) |
| Hypothetical protein | 1687a (48,4) | 2232a (17,3) | 0565 | 1690 |
| Hypothetical protein | 3138a (15,23) | 1680a (27,2) | 1786 | 0515 |
| Hypothetical protein | 0889 | 0108 | 0073 (4,0) | 1147a (28,3) |
| Protein of unknown function DUF192 | 2955a (15,1) | 1329 | 1147a (10,4) | 0358 |
| Hypothetical protein | 3550 | 1622 | 1974 (1,0) | 0930s (2,44) |
| Hypothetical protein | 3005a (177,0) | 1260 | 1094 | 0416 |
| Hypothetical protein | 3630 | 1030 | 2015 | 0003 (74,0) |
| Hypothetical protein | 3245a (63,1) | 1808 | 0487 | |
| Hypothetical protein | 2069 | 2102 | 0198a (44,0) | 1852 |
| Hypothetical protein | 3468 | 1856 | 1186 | 0994 (31,0) |
| Hypothetical protein | 0730 | 2365 (18,0) | 0009 | 0652 |
| Hypothetical protein | 3497 | 1956 | 1958 (12,6) | 0790 |
| Hypothetical protein | 0936 | 0136 (8,0) | 0105 | 1172 |
| Hypothetical protein | 3135 (0,1) | 1683 | 1783s (8,6) | 0512s (0,116) |
| Hypothetical protein | 3156 | 1669 | 1798 | 0526 (6,0) |
| Hypothetical protein | 0748 | 0013s (6,6) | 1058s (0,1982) | |
| Protein of unknown function DUF62 | 3627 | 2209s (0,5) | 2013 | 0002 (6,0) |
| Hypothetical protein | 1177 | 0209 | 0746a (5,16) | 1317 |
| Hypothetical protein | 3189 | 1645 | 1819 | 0545 (4,0) |
| Hypothetical protein | 3295 (4,0) | 1719 | 1758 | 0550 |
| Hypothetical protein | 2549 | 0845 (3,0) | 1351 | 0257 |
| Hypothetical protein | 1173 (3,0) | 0212 | 0743 | 1320 |
| Protein of unknown function UPF0027 | 0998 | 0172 (3,0) | 0141 | 1219 |
| Hypothetical protein | 2504 (3,0) | 1412 | 1927 | |
| Hypothetical protein | 2326 | 1541 | 1669 | 1042 (3,0) |
| Hypothetical protein | 1549s (3,5) | 0661s (0,2) | 0531s (0,5) | 1618s (0,4) |
| Hypothetical protein | 0611s (2,15) | 0813s (0,24) | 1573 (0,23) | |
| Protein of unknown function DUF64 | 0371 | 1533 | 1367 (2,0) | |
| Hypothetical protein | 2285 (2,0) | 1583 | 1640 | 1010 |
| Hypothetical protein | 1307 | 0473 | 1351s (2,14) | |
| Hypothetical protein | 1816 | 0724 (2,0) | 0422 (0,1) | |
| Hypothetical protein | 3251 (2,0) | 1803 | 1853 | 0483 |
| Hypothetical protein | 1497 | 0619 | 0805 | 1582 (2,0) |
| Hypothetical protein | 1895 | 0678 (2,0) | 0646 | |
| Protein of unknown function DUF224, cysteine-rich region domain protein | 1762 | 0754 | 0545 | 1721 (2,0) |
| Hypothetical protein | 3568 | 2225 | 1996 (2,0) | 0916 |
| Hypothetical protein | 1998 | 2066 | 1030 | 1817 (2,0) |
| Protein of unknown function DUF115 | 2328 | 1542 | 1670 | 1041 (2,0) |
| Hypothetical protein | 2337 | 1548 | 1676 | 0635 (2,0) |
| Protein of unknown function DUF100 | 0944 (1,0) | 0141 | 0111 | 1181 |
| Protein of unknown function DUF340, membrane | 1479 | 0612 | 0812 | 1574 (1,0) |
| Hypothetical protein | 3304 (1,0) | 1727 | 1750 | 0557 |
| Hypothetical protein | 0882 (1,0) | 0102 | 0067 | 1141 |
| Hypothetical protein | 1130 | 0243 (1,0) | 0697 | 1241 |
| Hypothetical protein | 1449 | 0601 | 0800 | 1567 (1,0) |
| Hypothetical protein | 2190 | 1946 | 0927 | 0827 (1,0) |
| Hypothetical protein | 0927 (1,0) | 0131 | 0083 | 1161 |
| Hypothetical protein | 0708 | 2353 | 2122 | 0676 (1,0) |
| Hypothetical protein | 2606 | 1232 (1,0) | 1431 | |
| Hypothetical protein | 2187 | 0791 (1,0) | 1080 | 0809 (0,1) |
| Hypothetical protein | 0239 | 1512 (1,0) | 0690 | |
| Protein of unknown function DUF1028 | 3380 | 1006 (1,0) | 0160 | |
| Hypothetical protein | 2154 (1,0) | 0817 | 1069 | 0942 |
| Hypothetical protein | 3161 | 1666 | 1803 | 0529 (1,0) |
| Hypothetical protein | 2058 | 2311 | 0401 | 1850 (1,0) |
| Hypothetical protein | 0840 | 0083 (1,0) | 0046 | 1122 |
Orthologous genes with 3′ sequencing reads. Orthologous groups, read counts, and footnotes displayed are as described in Table A1.
| Product | ||||
|---|---|---|---|---|
| Electron transfer flavoprotein, alpha subunit | 0721a (381,54) | 2372a (258,14) | 2132a (2145,21) | 0645a (22,6) |
| DNA-directed RNA polymerase, M/15 kDa subunit | 3480a (153,0) | 1847 (1,0) | 1177a (94,0) | 0776a (31,0) |
| NAD-dependent deacetylase | 3500a (83,2) | 1959a (6,0) | 1963 (1,0) | 0793a (26,1) |
| SMC domain protein | 2280a (2,7) | 1811a (9,0) | 1637a (7,1) | 0884a (10,0) |
| Triosephosphate isomerase | 1501 (2,1) | 0622 (1,0) | 0817 (13,0) | 1585 (1,0) |
| Metallophosphoesterase | 2243a (287,0) | 1913 | 0956a (26,0) | 0802a (63,0) |
| Resolvase, N-terminal domain | 3513a (250,0) | 1963a (66,0) | 1967 | 0797a (27,0) |
| Succinate dehydrogenase subunit D | 0719a (94,10) | 2361a (129,0) | 2130 | 0668a (46,0) |
| HhH-GPD family protein | 0880a (23,2) | 0101a (10,3) | 0066 | 1140a (235,0) |
| Elongation factor EF-2 | 0332a (183,0) | 2139 | 0213a (15,0) | 1957a (3,0) |
| Twin-arginine translocation protein, TatA/E family subunit | 1546a (32,16) | 0666 | 0534a (53,1) | 1615a (7,2) |
| Aldo/keto reductase | 2929a (66,14) | 1002a (1233,1) | 0966 | |
| Putative agmatinase | 2260 | 1823 | 1626a (13,2) | 0897a (611,2) |
| MazG nucleotide pyrophosphohydrolase | 1159 (133,1) | 0222 (352,2) | 0722 | |
| Ferric uptake regulator, Fur family | 2309a (128,0) | 1526a (141,1) | 1653 | 1023 |
| Seryl-tRNA synthetase | 3158a (50,0) | 1667 | 1802 | 0528a (39,36) |
| Uridylate kinase | 3159 | 1665 | 1804 (20,0) | 0530 (39,0) |
| Purine and other phosphorylases, family 1 | 1476 | 0610 | 0814a (24,0) | 1572 (23,0) |
| Isoleucyl-trna synthetase | 1617 | 1993a (5,0) | 0601a (2,1) | 1650 |
| Transcriptional regulator, Fis family | 3027s (4,10) | 1779a (3,0) | 0999s (0,5) | 0429s (0,47) |
| GCN5-related N-acetyltransferase | 3246 | 1807 | 1556a (4,0) | 0488 (2,0) |
| Putative circadian clock protein, kaic | 0729 | 2366a (5,0) | 0010 | 0651 (1,0) |
| Conserved protein (RNA polymerase related?) | 1975 | 2051 (1,0) | 1587 | 1800 (1,0) |
| Lysine exporter protein (LYSE/YGGA) | 2077 | 2018 | 0708a (5260,0) | 1858 |
| Translation initiation factor IF-2 subunit gamma | 0064 | 1171 | 0242a (162,0) | 0078 |
| Alpha- | 1818 | 0723 | 0506a (116,0) | 1747 |
| Oxidoreductase, molybdopterin binding | 0389 | 0833 | 1263 | 1366a (84,0) |
| Ribosomal protein L25/L23 | 1972 | 2048 | 1585a (66,8) | 1798 |
| Proliferating-cell nuclear antigen-like protein | 0720 | 2362 | 2131a (36,0) | 0667 |
| 3-dehydroquinate synthase | 1685 | 1827 | 0566a (25,0) | 1689 |
| DNA polymerase, beta domain protein region | 1153 | 1067 | 0856 (23,0) | |
| Haloacid dehalogenase domain protein hydrolase | 1785 | 0739 | 0554a (20,0) | 1734 |
| Mn2+-dependent serine/threonine protein kinase | 2192 | 1948 | 0924 | 0825 (12,0) |
| Radical SAM domain protein | 2153 (0,1) | 0818 | 1068 | 0189a (9,0) |
| DNA polymerase I | 2180 | 0798 | 1087 | 0816 (6,0) |
| Ribonuclease HII | 1216 | 0312 | 0780 | 1305 (5,0) |
| Bifunctional GMP synthase/glutamine amidotransferase protein | 3369 | 1772 | 1723 | 0600 (4,0) |
| Band 7 protein | 0750 | 0015 | 2166 | 1055 (4,0) |
| Alpha- | 0645 (4,0) | 2302 | 2074 | 0721 |
| Ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent | 3155 (4,0) | 1670 | 1797 | 0525 |
| Thermosome | 3273 (0,3) | 1704 (0,1) | 1771 | 0501 (3,5) |
| Metallophosphoesterase | 1087 (3,0) | 0270 | 1512 | 1254 |
| Peptidase M24 | 2025 | 2086 | 1010 | 1836 (3,0) |
| Pyruvate/ketoisovalerate oxidoreductase, gamma subunit | 3279 | 1708 | 1767 | 0497 (3,0) |
| DNA polymerase, beta domain protein region | 0045 | 1137 (3,0) | 0385 | |
| Creatininase | 2983 | 1350 | 1124 | 0383 (2,0) |
| Acetyl-CoA acetyltransferase | 1220 | 0309 | 0781 | 1301 (2,2) |
| Indole-3-glycerol-phosphate synthase | 0570 (2,0) | 2240 | 1213 | 0124 (0,2) |
| Regulatory protein, ArsR | 0731 | 2364 | 0008 (2,0) | 0653 |
| PaREP1 domain containing protein | 0002 | 1095 | 1373a (2,0) | |
| Putative signal-transduction protein with CBS domains | 3588 | 1394 | 0254 (2,0) | |
| Exosome complex exonuclease Rrp41 | 2207 (2,1) | 1937 | 0933 | 0836 |
| ABC transporter related | 3413 (2,0) | 1175 | 1045 | 0976 |
| Uroporphyrinogen III synthase HEM4 | 0589 | 2250 | 1712 | 0116 (2,0) |
| Potassium transport membrane protein, conjectural | 2422 (2,0) | 1446 | 0314 | 0883 |
| Undecaprenyl diphosphate synthase | 2942 (2,0) | 1319 | 1157 | 0348 |
| Nucleotidyl transferase | 0837 | 0080 | 0043 | 1119 (2,0) |
| Carbon starvation protein CstA | 1423 (2,0) | 0894 | 0860 | |
| Carboxypeptidase Taq | 0885 (1,2) | 0104 | 0069 (0,1) | 1143 |
| Leucyl-tRNA synthetase | 1107 | 0260 | 0691 | 1246 (1,0) |
| HEPN domain protein | 1894 | 0820 (1,0) | 1501 | |
| DNA-directed RNA polymerase subunit E, RpoE2 | 3563 (1,0) | 2230 | 1991 | 0921 |
| 5-carboxymethyl-2-hydroxymuconate Δ-isomerase | 2688 | 0535 | 1503 (1,0) | |
| ATPase | 1789 | 0736 | 1446 (1,0) | |
| Oligosaccharyl transferase, STT3 | 3030 (1,0) | 1781 | 0997 | 0431 |
| paREP7 | 0906 | 0492 | 0185 (1,0) | |
| Haloacid dehalogenase domain protein hydrolase | 2017s (0,12) | 2080s (0,15) | 1016s (1,4) | 1830 |
| Egghead-like protein | 0042 | 1076 | 2043 | 0056 (1,0) |
| Putative transcriptional regulator, CopG family | 1443 (0,2) | 0399 | 0796 | 1563 (1,1) |
| Asparaginyl-tRNA synthetase | 2973 (1,0) | 1342 | 1133 | 0375 |
| Succinate dehydrogenase iron-sulfur subunit | 0717 | 2359 | 2128 | 0670 (1,0) |
| Peptidase T2, asparaginase 2 | 3083 (1,0) | 1892 | 0970 | 0908 |
| Radical SAM domain protein | 0596 | 2255 | 1716 | 0113 (1,0) |
| 30S ribosomal protein S25e | 2188 (0,1) | 0790 (0,1) | 1079 | 0808 (1,0) |
| Ribosomal-protein-alanine acetyltransferase | 2246 (1,0) | 0958 | 1001 | |
| PilT protein domain protein | 3561 (1,0) | 1614 | 1989 | 0923 |
| Peptidase S8 and S53, subtilisin, kexin, sedolisin | 0712 | 2355 | 2124 (1,0) | 0674 |
| Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase | 2075s (0,49) | 2019 | 0203 | 1857s (1,8) |
| Beta-lactamase domain protein | 2160 | 0810 | 1074 | 0803 (1,0) |
| Xanthine dehydrogenase accessory factor | 2669 | 0253 | 1324 (1,0) | |
| ABC transporter related | 3269 | 1702 (1,0) | 1774 | 0503 |
| tRNA CCA-pyrophosphorylase | 3325 | 1740 | 1737 | 0570 (1,0) |
| Starch synthase | 3429 | 1878 (1,0) | 1038 | 0968 |
| Dual specificity protein phosphatase | 1536 | 0675 (1,0) | 0541 | 1603 |
| Putative endoribonuclease L-PSP | 3003s (1,62) | 1258s (0,210) | 1096s (0,81) | 0414s (0,65) |
| Sulfite reductase, dissimilatory-type beta subunit | 2597 | 1212 (1,0) | 1456 | |
| Methyltransferase small | 0261 | 2199 | 0236 | 0747 (1,0) |
| Putative transcriptional regulator, AsnC family | 1507 (1,0) | 0627 (0,1) | 0822 | 1590 |
| Methyltransferase type 11 | 1165 | 0216 (1,0) | 1364 | 1338 |
| Serine/threonine protein kinase | 0815 | 0059 (1,0) | 0025 | 1102 |
| Transcriptional regulator, PadR-like family | 0013 | 1087 (1,0) | 0038 | |
| Inner-membrane translocator | 3350 (1,0) | 1761 | 0445 | 0591 |
| Geranylgeranyl reductase | 2989 | 1355 | 1119 | 0388 (1,0) |
| Extracellular solute-binding protein, family 5 | 2391 | 1494 | 0422 | 0602 (1,0) |
| 2-methylcitrate synthase/citrate synthase II | 1689 (1,0) | 2234 | 0563 | 1692 |
| 30S ribosomal protein S6e | 1505 (0,1) | 0626 (1,0) | 0821 | 1589 |
C/D box sRNA genes in each .
| Species | C/D box sRNAs |
|---|---|
| 88 | |
| 83 | |
| 88 | |
| 84 | |
| 83 | |
| 74 |
All loci are manually curated.
Figure 2Conservation of C/D box sRNA genes among six . C/D box sRNA genes were organized by homolog family based on the location targeted by the encoded guide regions. These homolog families were then compared among the six studied species to verify conservation. While each individual species encodes more than 74 C/D box sRNA genes, 70 of those are conserved among each of the six studied Pyrobaculum spp. (group 6).
Figure 3Northern analysis of the 3′ UTR of the electron transport flavoprotein (. P. aerophilum total RNA, exponential phase (left lane) and stationary phase (right lane). The probe was designed to anneal beyond the stop codon of the terminal gene in the etf operon, in the region of the convergently oriented PAEsR53 C/D box sRNA. Multiple bands at 3400, 2250, and 1040 nt are consistent with the etf operon and suboperon transcripts. The band near 50 nt, consistent with the RNA sequencing data, shows an apparent antisense transcript to PAEsR53 (sense relative to the 3′ UTR of the etf operon).
Figure 4. The antisense sRNA genes (indicated as asR1, asR2, and asR3) are defined over the region of antisense-oriented sequencing reads (shown in red) with the associated gene (sense orientation) shown in blue. (A) Sequencing reads map to the 5′ terminus of transcription initiation factor B in both sense and antisense orientations (Pars_1976 is shown from Pyrobaculum arsenaticum). Sense oriented transcripts (blue) are present in two populations, consistent with the two conserved promoters, Pu and Pd. (B) The antisense sRNA (asR2) gene is defined over the region of antisense-oriented reads that map to the 5′ terminus of the ferric uptake regulator gene (fur; PAE2309 from Pyrobaculum aerophilum is shown). (C) Antisense-oriented reads map to the 3′ end of the triose-phosphate isomerase (tpi) locus (Pcal_0817 from Pyrobaculum calidifontis). Sequence conservation [lower graphic (C)] extends beyond the tpi stop codon (UCSC Archaeal Browser; Chan et al., 2012).
Terminal cis-antisense encoded sRNA in .
| sRNA | Len | Sequence | |
|---|---|---|---|
| Pae | 59 | ||
| Par | 79 | ||
| Pca | 33 | ||
| Pis | 63 | ||
| Pae | 60 | ||
| Pca | 54 | ||
| Pae | 65 | ||
| Par | 58 | ||
| Pca | 63 | ||
| Pis | 59 | ||
Position of start codon (on coding strand) shown underlined for asR1 and asR2 (.
Figure 5Northern blot analysis of tfb1 from . Sense transcripts of tfb1 occur at both full length (~1000 nt) and at 50 nt. Antisense transcripts occur at ~50 nt. Lanes 1–15 (upper panels, left to right); total RNA across five respiratory growth conditions in three time series. Lanes 1–3 stationary phase, Lanes 4–6 growth with O2, 7–9 growth with NO3, 10–12 growth with As(V), 13–15 Fe(III). Each set of three lanes extracted from a time series for all five respiratory conditions at T = (2.5, 4.5, 7.5 h) with indicated terminal electron acceptor. Sense and antisense sRNA transcript abundance, inferred from band density, is positively correlated across growth conditions (C), while no significant correlation is found between full length tfb1 mRNA and either sRNA population (D,E). Full length tfb1 transcripts [1000 nt] remain nearly constant under all conditions tested (A). Band density established using imagej (http://rsb.info.nih.gov/ij/).
Figure 6The . In all sequenced Pyrobaculum species, a highly conserved primary sequence forms a predicted secondary structure element (upper panel) at the 3′ end of the triose-phosphate isomerase gene (tpi). The depicted secondary structure contains the stop codon (blue box) in P. aerophilum (PAE1501), P. islandicum (Pisl_1585), and P. calidifontis (Pcal_0817). In P. oguniense (Pogu_1730), P. arsenaticum (Pars_0622), Pyrobaculum sp.1860 (P186_2792), and P. neutrophilum (Tneu_0616), the stop codon is immediately upstream of the tpi-element. asR3 (red line) is encoded on the opposite strand, and has potential to compete/interfere with the tpi-element secondary structure. The genomic alignment of the 3′ portion of tpi and 3′ UTR is shown with the consensus secondary structure (nested parentheses). Base paired columns with one or more substitutions that maintain secondary structure are highlighted (green).
Hypothetical genes with 5′ sequencing reads. Orthologous groups, read counts, and footnotes displayed are as described in Table A1.
| Product | ||||
|---|---|---|---|---|
| Hypothetical protein | 3282 (3,3) | 1710 | 1765 | 0495 (11,6) |
| Hypothetical protein | 0301 | 2159a (673,0) | 0225 | 2002 |
| Hypothetical protein | 3499 | 1958 | 1962 | 0792a (26,1) |
| Hypothetical protein | 0432 (6,0) | 0474 | 0140 | |
| Hypothetical protein | 1798 | 0175 (4,0) | 0509 | 1742 |
| Protein of unknown function DUF107 | 0749 | 0014 | 2167 | 1056 (4,0) |
| Hypothetical protein | 1503 (3,0) | 0624 | 0819 | 1587 |
| Hypothetical protein | 2934 | 1279 (3,0) | 1165 | 0340 (0,1) |
| Hypothetical protein | 1517 | 0632 (3,0) | 0877 | |
| Hypothetical protein | 3546 | 1625 (3,0) | 1977 | 0933 |
| Hypothetical protein | 0433 (3,6) | 0479 | 0139 | |
| Hypothetical protein | 3051s (0,284) | 1797 | 0981s (0,152) | 0447s (3,129) |
| Hypothetical protein | 0838 (2,2) | 0081 | 0044 | 1120 |
| Hypothetical protein | 1710 (0,3) | 0785 (2,3) | 0620 | 1698 |
| Hypothetical protein | 0728 | 2367 (2,0) | 2137 | 0650 |
| Hypothetical protein | 1147 | 0229 | 0706 | 1284 (2,0) |
| Hypothetical protein | 1522 | 0636 | 0874 | 1594 (2,0) |
| Hypothetical protein | 2941 (2,0) | 1318 | 1158 | 0347 |
| Hypothetical protein | 2338 | 1549 | 1677 | 0634 (2,0) |
| Hypothetical protein | 2822 | 0279 | 1187 | 1388 (2,1) |
| Hypothetical protein | 1943 | 2025 (2,0) | 1297 | |
| Hypothetical protein | 2416 (2,0) | 1479 | 0319 | 0879 (0,1) |
| Hypothetical protein | 0746 (2,0) | 0012 | 2168 | 1006 |
| Hypothetical protein | 1069 (0,13) | 0281 | 0680 | 1257 (2,0) |
| Hypothetical protein | 3081 (2,0) | 1891 | 0969 | 0907 (0,1) |
| Hypothetical protein | 0800 | 0050 | 0016 (0,1) | 1092 (2,4) |
| Protein of unknown function DUF77 | 1158 | 0223 (2,0) | 0711 | 1327 |
| Hypothetical protein | 3135 (0,2) | 1683 | 1783 | 0512 (2,0) |
| Hypothetical protein | 1683 | 1828 (1,0) | 0567 (0,1) | 1688 |
| Hypothetical protein | 0789 | 0040 (1,0) | 2142 | 1079 |
| Protein of unknown function DUF72 | 2078 (1,0) | 2017 | 0205 | 1859 |
| Hypothetical protein | 1641 (0,3) | 1979 (1,1) | 0587 | 1664 (0,1) |
| Protein of unknown function DUF54 | 2213 | 1931 | 0938 | 0842 (1,0) |
| Protein of unknown function DUF437 | 0638 (1,0) | 2296 | 2068 | 0727 |
| Hypothetical protein | 3550 | 1622 | 1974 | 0930 (1,0) |
| Hypothetical protein | 3467 | 1857 | 1188 | 0993 (1,0) |
| Hypothetical protein | 1318 | 0471 | 1365 (1,1) | |
| Hypothetical protein | 3556 (1,0) | 1619 | 1969 | 0927 |
| Hypothetical protein | 2598 | 1211 (1,0) | 1455 | |
| Hypothetical protein | 1643 (1,0) | 1977 | 0585 | 1666 |
| Hypothetical protein | 2824 | 0884 (1,0) | 0867 | 1387 |
| Hypothetical protein | 2322 | 1537 | 1664 | 1037 (1,0) |
| Hypothetical protein | 2177 | 0799 | 1088 | 0817 (1,0) |
| Hypothetical protein | 1449 (0,1) | 0601 (0,1) | 0800 (1,0) | 1567 (0,1) |
| Protein of unknown function DUF52 | 0818 | 0062 (1,0) | 0028 | 1105 |
| Hypothetical protein | 1173 | 0212 | 0743 | 1320 (1,0) |
| Hypothetical protein | 3004s (1,62) | 1259 | 1095 | 0415s (0,65) |
| Protein of unknown function DUF72 | 3079 (0,1) | 1889 (1,3) | 0967 | 0905 |
| Hypothetical protein | 2268 (1,0) | 1820 | 1629 | 0894 |
| Hypothetical protein | 3324 | 1739 | 1738 (0,1) | 0569 (1,0) |
| Protein of unknown function UPF0027 | 0998 | 0172 | 0141 (1,0) | 1219 |
| Hypothetical protein | 2210 | 1934 (1,0) | 0941 | 0839 |
| Hypothetical protein | 1797 | 0174 (1,1) | 0508 | 1741 |
| Hypothetical protein | 0718 | 2360 | 2129 | 0669 (1,0) |
| Hypothetical protein | 1448 | 0600 | 0799 | 1566 (1,0) |
| Hypothetical protein | 1613 | 2156 | 0603 | 1648 (1,10) |
| Hypothetical protein | 1018 | 0184 | 0656 (0,1) | 1203 (1,0) |
| Hypothetical protein | 2429 | 1441 | 0309 | 1892 (1,0) |
| Hypothetical protein | 3148 | 1674 | 1792 | 0521 (1,0) |
| Hypothetical protein | 1676 | 1833 | 1682 (1,0) | |
| Hypothetical protein | 2403 (1,0) | 1470 | 0327 | 0871 (0,3) |