| Literature DB >> 19969537 |
Jonathan Livny1, Matthew K Waldor.
Abstract
Regulatory 5' untranslated regions (r5'UTRs) of mRNAs such as riboswitches modulate the expression of genes involved in varied biological processes in both bacteria and eukaryotes. New high-throughput sequencing technologies could provide powerful tools for discovery of novel r5'UTRs, but the size and complexity of the datasets generated by these technologies makes it difficult to differentiate r5'UTRs from the multitude of other types of RNAs detected. Here, we developed and implemented a bioinformatic approach to identify putative r5'UTRs from within large datasets of RNAs recently identified by pyrosequencing of the Vibrio cholerae small transcriptome. This screen yielded only approximately 1% of all non-overlapping RNAs along with 75% of previously annotated r5'UTRs and 69 candidate V. cholerae r5'UTRs. These candidates include several putative functional homologues of diverse r5'UTRs characterized in other species as well as numerous candidates upstream of genes involved in pathways not known to be regulated by r5'UTRs, such as fatty acid oxidation and peptidoglycan catabolism. Two of these novel r5'UTRs were experimentally validated using a GFP reporter-based approach. Our findings suggest that the number and diversity of pathways regulated by r5'UTRs has been underestimated and that deep sequencing-based transcriptomics will be extremely valuable in the search for novel r5'UTRs.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19969537 PMCID: PMC2836559 DOI: 10.1093/nar/gkp1121
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 3.Expression of GFP fused to indicated 5′UTRs. Escherichia coli strains carrying the indicated fusions were grown in defined media lacking both glycine and l-leucine (blue diamonds) or supplemented with either glycine (red squares) or l-leucine (green triangles). Results from a representative experiment are shown.
Figure 4.Effect of increased amino acid concentration on the expression of GFP fused to known or candidate r5′UTRs. Results from representative experiments in E. coli and V. cholerae are shown.
Figure 1.Results of in silico mining of V. cholerae cDNA datasets for r5′UTRs. The values shown correspond to the percentage of all transcripts in the 454 datasets remaining after addition of each filter.
Candidate r5′UTRs sharing conserved genomic context with known families of r5′UTRs or with putative RibEx RLEsa
| No./ namea | ORF No. | ORF product | 3′CGC (Rfam)b | 3′CGC and/or |
|---|---|---|---|---|
| 6* | VC0894 | Thiamine biosynthesis protein ThiI | SAM-IV(1) | RLE0079(7) |
| 7 | VC1091 | Oligopeptide ABC transporter, periplasmic oligopeptide-binding protein | T-box(4), SAM(8) | RLE0210(6) |
| 10* | VC2108 | Erythronate-4-phosphate dehydrogenase | T-box(3), SAM(1) | |
| 11* | VC2334 | Hypothetical protein | ykoK(3) | RLE0310(6) |
| 14 | VC2522 | Hypothetical protein | yybP-ykoY(9) | RLE0334(5) |
| 15 | VC2645 | Acetylornithine deacetylase | TPP(4) | |
| 17 | VCA0142 | C4-dicarboxylate transport transcriptional regulatory protein | MOCO RNA motif(1), GEMM RNA motif(2) | RLE0123(2) |
| 19 | VCA0278 | Serine hydroxymethyltransferase | Glycine(8) | RLE0085(7) |
| – | ||||
| – | ||||
| – | ||||
| – | ||||
| – | ||||
| – | ||||
| – | ||||
| – | ||||
| 29 | VCA0819 | Co-chaperonin GroES | – | RLE0003(75) |
| 30 | VCA1075 | Hypothetical protein | – | |
| 31 | VCA0518 | Bifunctional fructose-specific PTS protein | – | RLE0062(8) |
| 32 | VC2431 | DNA topoisomerase IV subunit B | – | |
| 33 | VC2738 | Phosphoenolpyruvate carboxykinase | – | RLE0239(5) |
| 34* | VC1923 | Trigger factor | – | RLE0241(7) |
| 35* | VC1046 | 3-ketoacyl-CoA thiolase | – | RLE0244(2) |
| 36 | VC1258 | DNA gyrase, subunit A | – | RLE0300(4) |
| 37 | VC0633 | Outer membrane protein OmpU | – | RLE0331(5) |
| 38 | VC0972 | Porin, putative | – | RLE0331(5) |
| 39 | VC1130 | DNA-binding protein H-NS | – | RLE0337(5) |
aAsterisk denote candidates containing or directly upstream of a putative transcription terminator.
bThe number in parentheses denotes the number of r5′UTR in each family or of RLEs found to share 3′CGC with the candidate r5′UTR.
cRLEs predicted to be encoded by the candidate are underlined, RLEs with 3′CGC are not.
Figure 2.Features of putative V. cholerae (A) LeuL and (B) PheL leader peptides. The two numbers in bold denote the relative positions of the 5′- and 3′-ends of each transcript based on the 454 data. The third number indicates the relative position of the downstream ORF as annotated by NCBI. Cognate clusters of codons for each leader peptide are shown in red. The ‘+’ and ‘#’ symbols denote stop codons.
Candidate for novel r5′UTR lacking conserved genomic context with Rfam r5′UTRs or RibEx RLEs
| Name | ORF | ORF product | Gene Ontology (GO) role category |
|---|---|---|---|
| 40 | VC2656 | Fumarate reductase flavoprotein subunit | Anaerobic respiration |
| 41 | VCA0013 | Maltodextrin phosphorylase | Carbohydrate metabolism |
| 42 | VC2188 | Flagellin | Ciliary or flagellar motility |
| 43 | VC2678 | Primosome assembly protein PriA | DNA replication, synthesis of RNA primer |
| 44 | VC1442 | Cytochrome | Electron transport |
| 45 | VC2295 | Na( + )-translocating NADH-quinone reductase subunit A | Electron transport |
| 46* | VC2701 | Thiol:disulfide interchange protein precursor | Electron transport, protein thiol-disulfide exchange, cytochrome complex assembly |
| 47 | VCA1067 | Aldehyde dehydrogenase | Fermentation |
| 48 | VC0374 | Glucose-6-phosphate isomerase | Glycolysis |
| 49 | VCA0843 | Glyceraldehyde-3-phosphate dehydrogenase | Glycolysis |
| 50 | VC0986 | Adenylate kinase | Nucleobase, nucleoside and nucleotide interconversion |
| 51 | VCA0623 | Transaldolase B | Pentose-phosphate shunt |
| 52 | VCA0870 | Peptidoglycan biosynthetic | |
| 53 | VC2421 | N-acetyl-anhydromuranmyl- | Peptidoglycan metabolism |
| 54 | VC0322 | Preprotein translocase subunit SecE | Protein secretion |
| 55 | VC2748 | Nitrogen regulation protein NR(II) | Regulation of nitrogen utilization |
| 56* | VC1796 | Middle operon regulator-related protein | Regulation of transcription, DNA-dependent |
| 57 | VC1901 | Sodium/proton antiporter | Sodium ion transport, hydrogen transport |
| 58 | VC0347 | RNA-binding protein Hfq | Targeting of mRNA for destruction, involved in RNA interference |
| 59 | VC0038 | Hypothetical protein | – |
| 60 | VC0381 | Hypothetical protein | – |
| 61 | VC1576 | Hypothetical protein | – |
| 62 | VC1613 | Hypothetical protein | – |
| 63 | VC1891 | Hypothetical protein | – |
| 64 | VC2002 | Hypothetical protein | – |
| 65 | VC2264 | Hypothetical protein | – |
| 66 | VC2647 | Hypothetical protein | – |
| 67 | VCA0327 | Hypothetical protein | – |
| 68 | VCA0363 | Hypothetical protein | – |
| 69 | VCA0743 | Hypothetical protein | – |
Asterisk denotes candidates containing or directly upstream of a putative transcription terminator.