| Literature DB >> 36139088 |
Abstract
Many viruses from the realm Riboviria infecting eukaryotic hosts encode protein domains with sequence similarity to S-adenosylmethionine-dependent methyltransferases. These protein domains are thought to be involved in methylation of the 5'-terminal cap structures in virus mRNAs. Some methyltransferase-like domains of Riboviria are homologous to the widespread cellular FtsJ/RrmJ-like methyltransferases involved in modification of cellular RNAs; other methyltransferases, found in a subset of positive-strand RNA viruses, have been assigned to a separate "Sindbis-like" family; and coronavirus-specific Nsp13/14-like methyltransferases appeared to be different from both those classes. The representative structures of proteins from all three groups belong to a specific variety of the Rossmann fold with a seven-stranded β-sheet, but it was unclear whether this structural similarity extends to the level of conserved sequence signatures. Here I survey methyltransferases in Riboviria and derive a joint sequence alignment model that covers all groups of virus methyltransferases and subsumes the previously defined conserved sequence motifs. Analysis of the spatial structures indicates that two highly conserved residues, a lysine and an aspartate, frequently contact a water molecule, which is located in the enzyme active center next to the methyl group of S-adenosylmethionine cofactor and could play a key role in the catalytic mechanism of the enzyme. Phylogenetic evidence indicates a likely origin of all methyltransferases of Riboviria from cellular RrmJ-like enzymes and their rapid divergence with infrequent horizontal transfer between distantly related viruses.Entities:
Keywords: FtsJ; Riboviria; Rossmann fold; RrmJ; S-adenosylmethionine; SN2 nucleophilic substitution; coronavirus; methyltransferase; tobacco mosaic virus
Mesh:
Substances:
Year: 2022 PMID: 36139088 PMCID: PMC9496149 DOI: 10.3390/biom12091247
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Multiple sequence alignment of SAM-dependent methyltransferases encoded by representative genomes of Riboviria. Sequence identifiers in GenBank and PDB, virus species names and the family names, as of 2021, are shown on the left (truncated in the lower pane). The distance from the N-termini of the proteins, in amino acids, is shown before each sequence, and the lengths of less-conserved inserts are also shown. In the consensus secondary structure lines above the alignment, s stands for a β-strand and h stands for an α-helix; the positions are marked if the residue is found in a strand or a helix in more than a half of the known spatial structures, and the numbering of the strands within the Rossmann fold is provided. Conserved hydrophobic residues (I, L, M, V, F, Y, W) are indicated by yellow highlight, conserved small-side-chain or turn/kink-prone residues (A, G, S, P) are indicated by bold type and red color, conserved positively charged residues (K, R) are in bold type and blue color, and conserved acidic or amine side chains (D, E, N, Q) are highlighted in cyan. In Motif I, blue highlight is used to mark additional residues with potential negative charge (C, H, S, T and Y). Prior motif nomenclature (see text) is shown under the alignment. The order in which the sequences are sorted is explained in the Section 3.3.
Figure 2Bound water molecules in the active centers of diverse RrmJ-like methyltransferases. The cyan sphere represents water, and the green dotted sphere represents the methyl group of the bound S-adenosylmenthionine rendered as licorice sticks. The side chains of the residues that ligate the water molecule are shown and marked; the numbering is from the beginning of a pdb entry, not from the beginning of the virus (poly)protein as in Figure 1. (Left), NS5 methyltransferase of Dengue virus 3 (pdb 3P97); center, methyltransferase RmlE of E. coli (pdb 1EIZ); (Right), coronavirus methyltransferase Nsp16 (pdb 6WKS).
Figure 3The distribution of methyltransferases among the families within the realm Riboviria tree. The tree is based on the most recent update of the megataxonomy of Riboviria and is constructed from the alignment of more than 6000 virus RNA-directed RNA polymerases (71); for the purpose of this visualization, the clades have been collapsed at the family level or occasionally at the order level. Some colors used for coding the clades (the inner circle) and the text labels (the outer circle) are slightly similar, but the coding schemes are different, and are as follows. The clades, counterclockwise from 10 o’clock: golden, Branch I; dark gray, Branch V; light gray, Branch IV; magenta, Branch II; blue, Branch III. The text labels: maroon, virus families that do not encode methyltransferases; dark gray, virus families encoding RrmJ-like methyltransferases; blue, virus families encoding Sindbis-like methyltransferases; black on blue, virus families encoding two different methyltransferases; black on red, unconfirmed methyltransferase domains in Bornaviridae and Matonaviridae. The concentric grid and the axis pointing at 10 o’clock indicate the tree scale, measured in the average number of substitutions per site.