| Literature DB >> 29516246 |
Max L Nibert1,2, Austin R Manny3, Humberto J Debat4, Andrew E Firth5, Laura Bertini6, Carla Caruso6.
Abstract
Because so few viruses in the family Barnaviridae have been reported, we searched for more of them in public sequence databases. Here, we report the complete coding sequence of Colobanthus quitensis associated barnavirus 1, mined from a transcriptome of the Antarctic pearlwort Colobanthus quitensis. The 4.2-kb plus-strand sequence of this virus encompasses four main open reading frames (ORFs), as expected for barnaviruses, including ORFs for a protease-containing polyprotein, an RNA-dependent RNA polymerase whose translation appears to rely on - 1 ribosomal frameshifting, and a capsid protein that is likely to be translated from a subgenomic RNA. The possible derivation of this virus from a fungus associated with C. quitensis is discussed.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29516246 PMCID: PMC5999160 DOI: 10.1007/s00705-018-3794-x
Source DB: PubMed Journal: Arch Virol ISSN: 0304-8608 Impact factor: 2.574
Fig. 1Genome organizations. Diagrams are drawn to scale and aligned to the proposed frameshifting (fs) site in each. For RsBV1, diagrams for both reported and newly revised sequences are included. Main ORFs are shown as gray rectangles, along with first and last nt positions (not including stop codons). The reading frame that includes ORF2 is defined as frame 0 in each genome. ORF3 is defined by bracketing stop codons; the other ORFs are defined as spanning from the first in-frame AUG codon to the first downstream stop codon. In MBV, three small ORFs with potential to encode proteins of 51–65 aa each [19] are shown as rectangles with dashed outlines; a few similarly sized small ORFs, with potential to encode proteins 52–100 aa each, are also found in CqABV1 and RsBV1 but are not located in the same genomic positions and are considered unlikely to be expressed. TM regions in encoded proteins, as predicted by SOSUI, are shown as diamonds. Regions of similarity in encoded proteins to viral proteases (pro), RdRps, or CPs, as determined by HHpred, are shown as thick lines within the respective ORFs. Known cleavage products of the ORF2-encoded polyprotein of representative sobemovirus SBMV are labeled. The ORF spanning nt 421–684 in SBMV encodes the essential sobemovirus protein Px [11], which may be analogous to barnavirus P1 based on the coding organizations of these viruses. The other 5′-proximal ORF in SBMV, spanning nt 93–533, has been reported to encode a movement protein (P1 [3]), which is not expected to be present in the fungal barnaviruses
Fig. 2Signals for −1 programmed ribosomal frameshifting. The stop codon defining the maximal possible 5′ extent of ORF3 in each virus is shaded. The proposed −1 slippery sequence in each is underlined. Stems in the RNA structures predicted directly downstream of the slippery sequence in each are overlined; interactions between the half stems are indicated by brackets. The predicted two-stem structure in CqABV1 is a compact pseudoknot. The nt position of the 3′-most residue shown for each virus is indicated at right. A folding diagram of the proposed CqABV1 pseudoknot is shown at far right (generated using PseudoViewer 3.0 as implemented at http://pseudoviewer.inha.ac.kr/)
Fig. 3Unrooted radial phylogram. Deduced protein sequences for the P3 (RdRp) region of P2+3 of barnaviruses (black) and sobemoviruses (gray) were aligned using MAFFT 7.3 (G-INS-i) and then subjected to maximum-likelihood phylogenetic analyses using ModelFinder, IQ-TREE, and UFBoot [10, 13, 14] as implemented with the “Find best and apply” option at https://www.hiv.lanl.gov/content/sequence/IQTREE/iqtree.html. The following were found to apply: best-fit model according to BIC, LG+F+I+G4; model of rate heterogeneity, Invar+Gamma with 4 categories; proportion of invariable sites, 0.0743; and gamma shape alpha, 0.8970. Branch support values (from 1000 bootstrap replicates) are shown in %; branches with < 50% support have been collapsed to the preceding node. The scale bar indicates the average number of substitutions per alignment position. See Table S4 for a summary of abbreviations and GenBank numbers