| Literature DB >> 17158156 |
Jonathan L Jacobs1, Ashton T Belew, Rasa Rakauskaite, Jonathan D Dinman.
Abstract
In viruses, programmed -1 ribosomal frameshifting (-1 PRF) signals direct the translation of alternative proteins from a single mRNA. Given that many basic regulatory mechanisms were first discovered in viral systems, the current study endeavored to: (i) identify -1 PRF signals in genomic databases, (ii) apply the protocol to the yeast genome and (iii) test selected candidates at the bench. Computational analyses revealed the presence of 10 340 consensus -1 PRF signals in the yeast genome. Of the 6353 yeast ORFs, 1275 contain at least one strong and statistically significant -1 PRF signal. Eight out of nine selected sequences promoted efficient levels of PRF in vivo. These findings provide a robust platform for high throughput computational and laboratory studies and demonstrate that functional -1 PRF signals are widespread in the genome of Saccharomyces cerevisiae. The data generated by this study have been deposited into a publicly available database called the PRFdb. The presence of stable mRNA pseudoknot structures in these -1 PRF signals, and the observation that the predicted outcomes of nearly all of these genomic frameshift signals would direct ribosomes to premature termination codons, suggest two possible mRNA destabilization pathways through which -1 PRF signals could post-transcriptionally regulate mRNA abundance.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17158156 PMCID: PMC1802563 DOI: 10.1093/nar/gkl1033
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Typical −1 PRF signals consist of a heptameric slippery site that fits the motif N NNW WWH (spaces indicate zero frame codons), a short spacer region of 5–12 nt, and an mRNA pseudoknot with two stem and three loop regions (S1, S2 and L1–L3, respectively). See Materials and Methods for the pseudoknot motif criteria used in this study.
The yeast genome has a significant number of putative programmed −1 ribosomal frameshift signals compared to randomized genomes created using any one of seven different randomization strategies
| RNAMotif | SD | ||
|---|---|---|---|
| 6016 | — | — | |
| noBias | 3044 | 64.07 | <0.01 |
| nShuffle | 4567 | 70.84 | <0.01 |
| nBias | 4660 | 65.89 | <0.01 |
| cShuffle | 6551 | 85.13 | 0.02 |
| sBias | 6580 | 82.13 | 0.02 |
| cBias | 6639 | 86.52 | 0.02 |
| dnBias | 6774 | 88.16 | 0.01 |
RNAMotif, the number of motif hits using our descriptor of functional −1 PRF signals (22); stdev, standard deviation of for each randomization strategy. Seven methods for randomization were used. These were: NoBias, randomized ORFs with unbiased nucleotide bias; ntShuffle, nucleotides from each natural ORF are shuffled by triplicate mononucleotide permutations; ntBias, randomized ORFs using the CDS single-nucleotide frequency; cdnShuffle, codons from each natural ORF are shuffled by triplicate monocodon permutation; SilentBias, a silent bias where the codons are randomized in place so as to maintain protein coding sequence; cdnBias, randomized ORFs using the observed CDS codon usage bias; diNuc, randomized ORFs generated using the observed CDS dinucleotide frequency.
Figure 2Scatterplot of MFE values (predicted using pknots, (24) versus z scores for 10 340 candidate −1 PRF signals demonstrates the weak correlation between these two feature statistics (see Supplementary Table 2B). The red diamonds and associated labels indicate the location and parental gene of nine sequences empirically tested for frameshifting. The hypothetical distributions were created using summary statistics from Supplementary Table 2B.
Figure 3Nine examples of candidate −1 PRF signals chosen to generally represent the diversity of features present in the PRFdb. Gene names are shown with RNA sequence and corresponding CDS nucleotide start and stop locations. The predicted structure is shown for each empirically tested candidate signal. See Supplementary Table 3 for statistical data.
Figure 4Measurement of −1 PRF efficiency for nine candidate signals. (A) High-efficiency frameshifting including the frameshift signal from the endogenous yeast L-A virus. (B) Medium- and low-efficiency frameshifting including the sequence from the FKS1 gene that did not promote −1 PRF above background levels. The parental genes of each candidate signal are indicated with the percentage of −1 PRF efficiency as was measured using a dual-luciferase reporter assay system (36,37).
Figure 5(A) The CDS of S.cerevisiae is not prone to lengthy out-of-frame translation. The relative positions of candidate −1 PRF signals from the start codon of each ORF compared to the expected overall change in peptide length if a frameshifting event were to occur. (B) Fraction of ORFs containing high probability −1 PRF signals represented as mRNAs stabilized in strains deficient in NMD (upf1Δ, upf2Δ or upf3Δ) (60), No-go decay (xrn1Δ or dcp1Δ) (60), or having half-lives less than the yeast transcriptome average (t1/2) (70).