| Literature DB >> 18492230 |
Andrew E Firth1, John F Atkins.
Abstract
Members of the genus Cypovirus (family Reoviridae) are common pathogens of insects. These viruses have linear dsRNA genomes divided into 10-11 segments, which have generally been assumed to be monocistronic. Here, bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the cypovirus genome segment encoding the major core capsid protein VP1, overlapping the 5'-terminal region of the VP1 ORF in the +1 reading frame. In Cypovirus type 1 (CPV-1), a 62-codon AUG-initiated open reading frame (hereafter ORFX) is present in all four available segment 1 sequences. The pattern of base variations across the sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP1 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP1. The genomic location of ORFX is consistent with translation via leaky scanning. A 62-64 codon AUG-initiated ORF is present in a corresponding location and reading frame in other available cypovirus sequences (2 CPV-14, 1 CPV-15) and an 87-codon ORFX homologue may also be present in Aedes pseudoscutellaris reovirus. The ORFX amino acid sequences are hydrophilic and basic, with between 12 and 16 Arg/Lys residues in each though, at 7.5-10.2 kDa, the putative ORFX product is too small to appear on typical published protein gels.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18492230 PMCID: PMC2409309 DOI: 10.1186/1743-422X-5-62
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1MLOGD statistics for the alignment of four CPV-1 segment 1 sequences. The four sequences were aligned with code2aln [15]; the alignment is gapless within the VP1 ORF. (1)–(3) The positions of stop codons in each of the four sequences in each of the three forward reading frames (frame defined by alignment to the reference sequence [GenBank: AF389462]). Note the conserved absence of stop codons in the +0 frame within the VP1 ORF and in the +1 frame in the ORFX region. (4)–(7) MLOGD sliding-window plots. Window size = 20 codons. Step size = 10 codons. Each window is represented by a small circle (showing the likelihood ratio score for that window), and grey bars showing the width (ends) of the window. See [6] for further details of the MLOGD software. In (4)–(5) the null model, in each window, is that the sequence is non-coding, while the alternative model is that the sequence is coding in the window frame. Positive scores favour the alternative model. There is a strong coding signature in the +0 frame (4) throughout the VP1 ORF, except where it overlaps ORFX. In this region there is a strong coding signature in the +1 frame (5) indicating that ORFX is subject to stronger functional constraints than the overlapping section of VP1. In (6)–(7) the null model, in each window, is that only the VP1 frame is coding, while the alternative model is that both the VP1 frame and the window frame are coding. Only the +1 (6) and +2 (7) frames are shown because the +0 frame is the VP1 frame which is included in the null model. Scores are generally negative with occasional random scatter into low positive scores, except for the ORFX region which has consecutive high-positively scoring windows (6). (8) Map of the reference sequence [GenBank: AF389462].