| Literature DB >> 19292925 |
Andrew E Firth1, John F Atkins.
Abstract
The plant-infecting members of the genus Fijivirus (family Reoviridae) have linear dsRNA genomes divided into 10 segments, two of which contain two substantial and non-overlapping ORFs, while the remaining eight are apparently monocistronic. However, one of these - namely segment 5 - contains a second long ORF (approximately 200+ codons) that overlaps the 3' end of the major ORF (approximately 920-940 codons) in the +1 reading frame. In this report, we use bioinformatic techniques to analyze the pattern of base variations across an alignment of fijivirus segment 5 sequences, and show that this 3' ORF has a strong coding signature. Possible translation mechanisms for this unusually positioned ORF are discussed.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19292925 PMCID: PMC2666654 DOI: 10.1186/1743-422X-6-32
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Coding potential statistics for an alignment of four plant fijivirus segment 5 sequences. (1)-(3) The positions of stop codons in each of the four sequences in each of the three forward reading frames (frame defined by alignment to the reference sequence [GenBank: NC_003736]). The FDV sequence has been put into the same frame as the other sequences via the arbitrary insertion of an extra 'U' into the run of five 'U's at NC_007160 nucleotides 2714–2718, as discussed in the text. Note the conserved absence of stop codons in the +0 frame within ORF5-1 and in the +1 frame within ORF5-2. (4)-(6) MLOGD sliding-window plots (window size 75 codons; step size 25 codons). Each window is represented by a small circle (showing the likelihood ratio score for that window), and grey bars showing the width (ends) of the window. See [9] for further details of the MLOGD software. In (4) the null model, in each window, is that the sequence is non-coding, while the alternative model is that the sequence is coding in the +0 (i.e. ORF5-1) frame. Positive scores favour the alternative model and, as expected, there is a strong coding signature throughout ORF5-1. In (5)-(6) the null model, in each window, is that only ORF5-1 is coding, while the alternative model is that both ORF5-1 and the window frame are coding. Scores are generally negative with some scatter into low positive scores, except for the ORF5-2 region which has consecutive high-positively scoring windows (5). Note that the generally lower MLOGD signal within the overlap region itself (4)-(5), and also at the 5' end of ORF5-1 (4), is due to there being fewer substitutions with which to discrimate the null model from the alternative model in these regions of above-average nucleotide conservation. (7) Map of the reference sequence [GenBank: NC_003736]. (8) Conservation at synonymous sites within ORF5-1 (see text and [18] for details). Note that the relatively large window size (75 codons) – used here for improved statistical power – explains the broad smoothing of the conservation peak at the edges of the region where ORF5-2 and ORF5-1 overlap.
Figure 2Alignment extract showing the region around the 5' end of fijivirus ORF5-2. The high nucleotide conservation in row 1 (mostly '*'s, only 2 'X's) can potentially be a result of amino acid constraints on the protein encoded by ORF5-1. In contrast, the high nucleotide conservation from the middle of row 3 to the end of the alignment extract is indicative of overlapping features (many 'X's).