| Literature DB >> 15629717 |
Michaël Bekaert1, Jean-Pierre Rousset.
Abstract
By using a sensitive search program based on hidden Markov models (HMM), we identified 74 viruses carrying frameshift sites among 1500 fully sequenced virus genomes. These viruses are clustered in specific families or genera. Sequence analysis of the frameshift sites identified here, along with previously characterized sites, identified a strong bias toward the two nucleotides 5' of the shifty heptamer signal. Functional analysis in the yeast Saccharomyces cerevisiae demonstrated that high frameshifting efficiency is correlated with the presence of a Psi39 modification in the tRNA present in the E site of the ribosome at the time of frameshifting. These results demonstrate that an extended signal is involved in eukaryotic frameshifting and suggest additional interactions between tRNAs and the ribosome during decoding.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15629717 PMCID: PMC7129640 DOI: 10.1016/j.molcel.2004.12.009
Source DB: PubMed Journal: Mol Cell ISSN: 1097-2765 Impact factor: 17.970
Figure 1Unrooted Neighbor-Joining Phylogeny Based on the Amino Acid Sequence of the Pol Protein
Labels at branch tips represent taxa as provided by the Universal Virus Database, ICTVdB (http://phene.cpmc.columbia.edu; Büchen-Osmond, 2003). Numbers at branch nodes indicate the percentage of bootstrap support for that node based on 1000 replications. Color codes are according to the clade, with the following scheme: ssRNA-RT viruses (red), positive-sense ssRNA viruses (blue), and dsRNA viruses (green). Abbreviations: PRRSV, Porcine reproductive and respiratory syndrome virus.
Nucleic Acid Alignment of Heptameric Sequence
| Virus Acronym | FS-1 | Slippery Region | Genbank | ||
|---|---|---|---|---|---|
| BChV | 15.8% ± 2% | cacaucugcC | gGacuGaGcG | NC_002766 | |
| BLV gag/pro | 8.1% ± 1% | cccucaaaUC | UaauAGaGGG | NC_001414 | |
| BWYV | 12.0% ± 1% | ccaagagcUC | gGGagaGcGG | NC_004756 | |
| BYDV | 12.2% ± 1% | uugacucugu | UaGagGGGcu | NC_002160 | |
| CABYV | 17.5% ± 1% | aauacgagUC | gGGcAGGcGG | NC_003688 | |
| EIAV | 7.0% ± 1% | gaaguguucC | gGGagcaaGG | NC_001450 | |
| FIV | 9.0% ± 1% | gaaagaauUC | UGGaAGGcGG | NC_001482 | |
| HIV1 | 6.0% ± 1% | gacaggcuaa | gGGaAGaucu | NC_001802 | |
| IBV | 19.3% ± 1% | auaagaauUa | gGGuAcGGGG | NC_001451 | |
| L-A | 10.0% ± 1% | guacucagca | gGaguGGuaG | NC_003745 | |
| L-BC | 13.0% ± 2% | cugagaagUu | cGuguaGcaG | NC_001641 | |
| LDV | 13.1% ± 1% | aggcaucggC | UGcuAGccac | NC_002534 | |
| MMTV gag/pro | 20.2% ± 2% | cugaaaauUC | UuGuAaaGGG | NC_001503 | |
| PEMV1 | 31.0% ± 2% | ccagacgcUC | gGauuauucc | NC_003629 | |
| PLRV | 19.0% ± 1% | caaacaagcC | gGGcAaGcGG | NC_001747 | |
| PLRV-W | 17.8% ± 2% | caaacaagcC | gGGcgaGcGG | ||
| PRRSV | 15.7% ± 1% | aggagcagUg | UGcuAGccGc | NC_001961 | |
| SARS | 10.3% ± 1% | caucaacgUu | gGGuuuGcGG | NC_004718 | |
| ScYLV | 0.7% ± 0% | cuccagacca | gaGccaaGuG | NC_000874 | |
| SRV1 gag/pro | 13.0% ± 2% | caccccauca | gGacuGaGGG | NC_001551 | |
| xxxxxxxxUC | UGGxAGGGGG | ||||
Nucleotides in agreement with the functional pseudoconsensus inferred from the HMM profile are in uppercase. Acronyms are as follows: BChV, Beet chlorosis virus; BLV, Bovine leukemia virus (gag/pro junction); BWYV, Beet western yellows virus; BYDV, Barley yellow dwarf virus; CABYV, Cucurbit aphid-borne yellows virus; EIAV, Equine infectious anemia virus; FIV, Feline immunodeficiency virus; HIV1, Human immunodeficiency virus 1; IBV, Avian infectious bronchitis virus; L-A, Saccharomyces cerevisiae virus L-A; L-BC, Saccharomyces cerevisiae virus L-BC; LDV, Lactate dehydrogenase-elevating virus; MMTV, Mouse mammary tumor virus (gag/pro junction); PEMV1, Pea enation mosaic virus 1; PLRV, Potato leafroll virus; PLRV-W, Potato leafroll virus, Germany strain (Wageningen); PRRSV, Porcine reproductive and respiratory syndrome virus; SARS, SARS coronavirus; ScYLV, Sugarcane yellow leaf virus; and SRV1, Simian type D virus 1 (gag/pro junction).
Detailed Distributions of the HMM Profile Results from GenBank Compared to the Initial Data Set (RECODE database)
| Status | GenBank (1500 virus) | RECODE (35 viruses) |
|---|---|---|
| New viruses | 10 | – |
| New annotations | 32 | 15 |
| Frameshift localized | 12 | 8 |
| Already annotated | 20 | 12 |
| 74 | 35 | |
Figure 2Estimation of Dinucleotide Usage Bias
χ2 values were calculated at each position upstream of the slippery heptamer. Black line represents the threshold score for a p value of 1% with 15 degrees of freedom. Higher values are more indicative of a significant bias. The first dinucleotide is strongly significant (p value = 6.4 10−11).
Effect of the Upstream Dinucleotide on −1 Frameshifting Efficiency
| Plasmids | Modified Sequence | tRNA (Anticodon loop) | Frameshift |
|---|---|---|---|
| pAC.5.AA | aau AAu | ACU | 18.0% ± 2% |
| pAC.5.AC | aau ACu | Am5CU | 22.0% ± 1% |
| pAC.5.UA | aau UAu | ACU | 19.3% ± 1% |
| pAC.5.UC | aau UCu | AΨU | 22.1% ± 2% |
| pAC.5.UG | aau UGu | AΨU | 19.0% ± 2% |
| pAC.5.UU | aau UUu | ACmU | 21.0% ± 1% |
| pAC.5.AG | aau AGu | CCU | 9.3% ± 1% |
| pAC.5.AU | aau AUu | GCU | 7.0% ± 1% |
| pAC.5.CA | aau CAu | GΨU | 9.0% ± 1% |
| pAC.5.CC | aau CCu | CUU | 9.0% ± 1% |
| pAC.5.CG | aau CGu | GCU | 7.5% ± 1% |
| pAC.5.CU | aau CUu | GUC | 10.0% ± 2% |
| pAC.5.GA | aau GAu | CΨU | 9.4% ± 1% |
| pAC.5.GC | aau GCu | CUU | 8.0% ± 1% |
| pAC.5.GG | aau GGu | GΨU | 8.5% ± 1% |
| pAC.5.GU | aau GUu | CΨU | 6.7% ± 1% |
The FY strain was transformed with one of the plasmids harboring the test sequence as indicated (from 5′ to 3′). Frameshifting efficiencies were measured at 30°C, and the data are expressed as percentages. Codons including the dinucleotides are underlined and anticodon of tRNA anticodon loops are in bold (Lecointe, 2002); heptamers are in bold and dinucleotides in uppercase.
Effect of PUS3 Gene Deletion on −1 Frameshifting Efficiency
| Plasmids | Wt | ||||
|---|---|---|---|---|---|
| pAC.5.CG | 5.3% ± 1% | 5.9% ± 1% (1.1) | 5.8% ± 0% (1.1) | 5.5% ± 1% (1.0) | 6.0% ± 1% (1.1) |
| pAC.5.GA | 7.6% ± 1% | 8.0% ± 1% (1.1) | 8.1% ± 1% (1.1) | 7.2% ± 0% (0.9) | 7.3% ± 1% (1.0) |
| pAC.5.UA | 21.7% ± 1% | 12.8% ± 1% (0.6) | 11.5% ± 1% (0.5) | 19.3% ± 1% (0.9) | 11.8% ± 1% (0.5) |
| pAC.5.UC | 19.5% ± 1% | 10.3% ± 1% (0.5) | 10.2% ± 1% (0.5) | 18.7% ± 2% (1.0) | 12.7% ± 1% (0.7) |
Wild-type (wt) and pus3Δ mutants of 74-D694 strains were transformed with the test plasmids. The 74-D694 pus3Δ strain was also transformed with empty pRS315 or the same plasmid containing the PUS3 gene or the mutant pus3[D151A] gene, as indicated. −1 frameshifting efficiencies were measured at 30°C, and the data are expressed as percentages. Numbers in parentheses correspond to ratios of recoding efficiency in the wt strain over the recoding efficiency in the pus3Δ derivative strain. No significant difference can be expected by a Mann-Whitney statistical test, except between pAC.5.UA/UC in wt or pus3Δ + PUS3 compared to other transformed strains (p value < 0.005).