| Literature DB >> 29628822 |
Dongbin Park1, Chul Jun Goh1, Hyein Kim1, Yoonsoo Hahn1.
Abstract
The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass (Zostera marina) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae. They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses.Entities:
Keywords: +1 programmed ribosomal frameshifting; Amalgavirus; common eelgrass
Year: 2018 PMID: 29628822 PMCID: PMC5880360 DOI: 10.5423/PPJ.NT.11.2017.0243
Source DB: PubMed Journal: Plant Pathol J ISSN: 1598-2254 Impact factor: 1.795
Summary of amalgaviruses identified in this study
| Acronym | Full name | Accession | Length (nt) | ORF | Position | Length (aa) |
|---|---|---|---|---|---|---|
| ZmAV1 | Zostera marina amalgavirus 1 | KY783316 | 3383 | Fusion protein (RNA-dependent RNA polymerase) | 149–940, 942–3299 | 1049 |
| Putative replication factory matrix-like protein | 149–1279 | 382 | ||||
| ZmAV2 | Zostera marina amalgavirus 2 | KY783317 | 3316 | Fusion protein (RNA-dependent RNA polymerase) | 111–941, 943–3300 | 1062 |
| Putative replication factory matrix-like protein | 111–1298 | 395 |
Fig. 1(A) The predicted +1 programmed ribosomal frameshifting (PRF) motifs of ZmAV1 and ZmAV2. Both UUU and UUC codons can interact with the anticodon 3′-AAG-5′ of the phenylalanyl-tRNA (tRNAPhe). The tRNAPhe positioned on UUU is thought to slip forward by one nucleotide, causing a +1 frameshift for continued translation. The codon-anticodon base pairs are marked using dots. (B) Sequence comparison of the +1 PRF motif of amalgaviruses. The ZmAV1 and ZmAV2 +1 PRF motif sequences are marked in boldface letters in the last two rows. Sequence logo representation at the bottom clearly shows the conserved UUU_CGN motif, with uracil (U) being slightly preferred over the other bases before the motif and at the position N of the motif.
Identities among the RdRp motif-containing ORF2 protein sequences of ZmAV1, ZmAV2, and related viruses
| Acronym | Full name | Accession number | Identity with ZmAV1 | Identity with ZmAV2 |
|---|---|---|---|---|
| BLV | Blueberry latent virus | NC_014593.1 | 369/744 (50%) | 368/720 (51%) |
| FpAV2 | Festuca pratensis amalgavirus 2 | GBXZ01002308.1 | 357/722 (49%) | 371/733 (51%) |
| FpAV3 | Festuca pratensis amalgavirus 3 | GBXZ01009138.1 | 381/743 (51%) | 375/729 (51%) |
| LpAV1 | Lolium perenne amalgavirus 1 | GAYX01076418.1 | 367/717 (51%) | 373/744 (50%) |
| AcAV1 | Allium cepa amalgavirus 1 | GAAO01011981.1 | 340/719 (47%) | 344/733 (47%) |
| AcAV2 | Allium cepa amalgavirus 2 | GAAN01008476.1 | 362/769 (47%) | 370/776 (48%) |
| PeAV1 | Phalaenopsis equestris amalgavirus 1 | GDHJ01028335.1 | 341/749 (46%) | 343/727 (47%) |
| CaAV1 | Capsicum annuum amalgavirus 1 | JW101175.1 | 334/766 (44%) | 334/759 (44%) |
| STV | Southern tomato virus | NC_011591.1 | 324/705 (46%) | 326/737 (44%) |
| AoAV1 | Anthoxanthum odoratum amalgavirus 1 | GBIE01024896.1 | 327/731 (45%) | 332/715 (46%) |
| FpAV1 | Festuca pratensis amalgavirus 1 | GBXZ01049574.1 | 315/716 (44%) | 343/720 (48%) |
| RHV-A | Rhododendron virus A | NC_014481.1 | 345/717 (48%) | 356/727 (49%) |
| MsAV1 | Medicago sativa amalgavirus 1 | GAFF01077243.1 | 318/723 (44%) | 335/725 (46%) |
| VCV-M | Vicia cryptic virus M | EU371896.1 | 329/715 (46%) | 330/725 (46%) |
| CdAV1 | Cleome droserifolia amalgavirus 1 | GDRJ01026949.1 | 338/727 (46%) | 347/729 (48%) |
| CoAV1 | Camellia oleifera amalgavirus 1 | GEFY01004381.1 | 358/759 (47%) | 372/777 (48%) |
| GaAV1 | Gevuina avellana amalgavirus 1 | GEAC01063629.1 | 350/712 (49%) | 372/769 (48%) |
| EbAV1 | Erigeron breviscapus amalgavirus 1 | GDQF01098448.1 | 358/732 (49%) | 352/729 (48%) |
| EbAV2 | Erigeron breviscapus amalgavirus 2 | GDQF01120453.1 | 338/721 (47%) | 360/741 (49%) |
| ScAV1 | Secale cereale amalgavirus 1 | GCJW01039808.1 | 331/716 (46%) | 339/728 (47%) |
| SpAV1 | Spinach amalgavirus 1 | KY695011.1 | 356/722 (49%) | 359/740 (49%) |
| PpAV1 | Pinus patula amalgavirus 1 | GECO01025317.1 | 329/719 (46%) | 335/746 (45%) |
| ZbV-Z | Zygosaccharomyces bailii virus Z | KU200450.1 | 127/512 (25%) | 118/516 (23%) |
Accession numbers of viral genome sequences.
Amino acid sequence identities have been described in the following format: identical residues/aligned length (% identity).
Fig. 2Phylogenetic tree of ZmAV1, ZmAV2, and related plant amalgaviruses. Multiple sequence alignment of the RdRp-motif containing ORF2 protein sequences was performed for inference of the phylogenetic tree. The fungus-infecting virus ZbV-Z was used as an outgroup. The bootstrap values calculated from 100 replicates are shown at the nodes. The +1 PRF site position labels used in Fig. 3A are marked in parentheses.
Fig. 3(A) The +1 PRF sites and surrounding regions. Multiple sequence alignment of regions encompassing the +1 PRF sites was performed using the 24 full-length amalgavirus ORF1+2 fusion protein sequences (Supplementary Fig. 1). The +1 PRF occurred at three positions which were labelled as #1, #2, and #3 and marked using red, green, and blue arrows, respectively. Number of viruses are shown in the parenthesis. The first aa residues of ORF2 are shown in colored boxes. (B) Sequence logo representations generated from all viruses (top), and those generated from viruses with the +1 PRF sites at position #1 (middle) and position #3 (bottom). Notably, the +1 PRF sites are embedded in a segment with diverse aa residues and surrounded by conserved regions.