| Literature DB >> 31507560 |
Hengxia Jia1,2, Peng Gong1.
Abstract
The RNA-dependent RNA polymerases (RdRPs) encoded by the RNA viruses are a unique class of nucleic acid polymerases. Each viral RdRP contains a 500-600 residue catalytic module with palm, fingers, and thumb domains forming an encircled human right hand architecture. Seven polymerase catalytic motifs are located in the RdRP palm and fingers domains, comprising the most conserved parts of the RdRP and are responsible for the RNA-only specificity in catalysis. Functional regions are often found fused to the RdRP catalytic module, resulting in a high level of diversity in RdRP global structure and regulatory mechanism. In this review, we surveyed all 46 RdRP-sequence available virus families of the positive-strand RNA viruses listed in the 2018b collection of the International Committee on Virus Taxonomy (ICTV) and chose a total of 49 RdRPs as representatives. By locating hallmark residues in RdRP catalytic motifs and by referencing structural and functional information in the literature, we were able to estimate the N- and C-terminal boundaries of the catalytic module in these RdRPs, which in turn serve as reference points to predict additional functional regions beyond the catalytic module. Interestingly, a large number of virus families may have additional regions fused to the RdRP N-terminus, while only a few of them have such regions on the C-terminal side of the RdRP. The current knowledge on these additional regions, either in three-dimensional (3D) structure or in function, is quite limited. In the five RdRP-structure available virus families in the positive-strand RNA viruses, only the Flaviviridae family has the 3D structural information resolved for such regions. Hence, future efforts to solve full-length RdRP structures containing these regions and to dissect the functional contribution of them are necessary to improve the overall understanding of the RdRP proteins as an evolutionarily integrated group, and our analyses here may serve as a guideline for selecting representative RdRP systems in these studies.Entities:
Keywords: RNA-dependent RNA polymerase; catalytic motif; genome replication; positive-strand RNA virus; structure
Year: 2019 PMID: 31507560 PMCID: PMC6713929 DOI: 10.3389/fmicb.2019.01945
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Virus taxonomy assignments, virus name abbreviations, GenBank accession numbers, and related Protein Data Bank (PDB) entries of the representative positive-strand RNA virus RdRPs chosen for analyses in this study.
| EAV ( | NC_002532 | |||||
| SARS-CoV ( | NC_004718 | |||||
| AAbV | GBBW01007738 | |||||
| CharNV | KX883628 | |||||
| TurrNV | KX883629 | |||||
| NDiV | MH520106 | |||||
| PSCNV | MH933735 | |||||
| GAV | NC_010306 | |||||
| WBV | NC_008516 | |||||
| IAPV ( | NC_009025 | |||||
| IFV ( | AB000906 | |||||
| PV-1 ( | NC_002058 | 1RA6 | ||||
| CPMV ( | X00206 | |||||
| SINV-2 | EF428566 | |||||
| HaRNAV | NC_005281 | |||||
| TYMV ( | NC_004063 | |||||
| PVX | NC_011620 | |||||
| GVA | AF007415 | |||||
| SsDFV1 | NC_038977 | |||||
| BotV-F | NC_002604 | |||||
| /f | NβV ( | NC_001990 | ||||
| / | HAstV-8 ( | AF260508 | ||||
| / | MBV ( | NC_001633 | ||||
| / | BNYVV ( | NC_003514 | ||||
| / | OuMV ( | EU770623 | ||||
| / | BMV ( | NC_002027 | ||||
| / | NV ( | AJ583672 | 1SH0 | |||
| / | PrV ( | NC_014126 | ||||
| / | LIYV ( | U15440 | ||||
| / | YFV ( | X03700 | 4K6M | |||
| / | HCV ( | AF011751 | 1C2P | |||
| / | HGV ( | NC_001710 | ||||
| / | BVDV ( | NC_001461 | 5YF5 | |||
| / | CTV ( | NC_015521 | ||||
| / | BNRBV ( | JN651149 | ||||
| / | Qβ ( | AWN02713 | 3MMP | |||
| / | BYDV-PAV ( | NC_004750 | ||||
| / | ScNV-20S ( | NC_004051 | ||||
| / | NoV ( | AF174533 | ||||
| / | TaV ( | AF282930 | 4XHI | |||
| / | PVY ( | U09509 | ||||
| / | SBMV ( | NC_004060 | ||||
| / | SINV-3 ( | FJ528584 | ||||
| / | SINV ( | J02363 | ||||
| / | TBSV ( | NC_001554 | ||||
| / | GORV ( | NC_024501 | ||||
| / | HcRNAV01 | NC_007518 | ||||
| / | RUB | RUBCG | ||||
| / | CHV3 | AF188515 |
FIGURE 1The primary structure comparison of RNA-dependent RNA polymerases (RdRPs) from representative positive-strand RNA viruses. Orders, families, genera, and species assignments based on the ICTV 2018b collection are listed in Table 1. The virus species names are listed in alphabetical order giving the priority to virus order, then to virus family, and then to virus genus. The conserved motif C aspartic acid (magenta, corresponding to PV RdRP D328) is used as the origin in the scale bar. Conserved residues in motifs A, B, and F are also labeled: motif F lysine (corresponding to PV RdRP K159) in yellow; motif F arginine (corresponding to PV RdRP R174) in purple; motif A aspartic acid (corresponding to PV RdRP D233) in green; motif B glycine (corresponding to PV RdRP G289) in red. The orange rectangle indicates that 3D structures are available in that virus family (or virus genus in case of the Flaviviridae). The boundaries of the RdRP catalytic module defined by reported 3D structures are indicated by the white bars for the Flaviviridae and Permutotetraviridae RdRPs. Numbers on the right side of individual RdRP indicate the amino acids numbers for the full-length RdRPs. The question mark (?) indicates undefined boundaries of the RdRP proteins. The yellow and green arrows (150 and 230 residues in length, respectively) are used to estimate the N-terminal boundary of the RdRPs. The two long vertical bars (130 and 250 residues to the origin) indicate the C-terminal boundary of the RdRP catalytic module of the primer-dependent PV 3Dpol and the de novo HCV NS5B and are used to help predict additional functional regions. Wherever available in literature, the name of additional functional regions are labeled. Circles placed at the RdRP termini indicate predicted additional regions. The numbers following the “N” and “C” simply refer to the number of families possibly having additional regions at the RdRP N- and C-termini, respectively.
FIGURE 2Sequence alignment of motifs A-C of RdRPs from representative positive-strand RNA viruses. The order of motif sequences from top to bottom are consistent with that in Figure 1 and Table 1. Highly conserved residues, including three absolutely conserved residues (labeled by asterisks), are shown in red. Numbers within the brackets indicate the number of residues not shown.
FIGURE 3Global views of seven representative RdRPs 3D structures. RdRP structures are shown in cartoon representations. If available, order/family/genus/species assignments are shown on top of each structure. PV, poliovirus, PDB entry 1RA6 (chain A); NV, norovirus, PDB entry 1SH0 (chain A); Qβ, bacteriophage Qβ, PDB entry 3MMP (chain G); TaV, Thosea asigna virus, PDB entry 4XHI (chain A); HCV, hepatitis C virus, PDB entry 1C2P (chain A); JEV, Japanese encephalitis virus, PDB entry 4K6M (chain A); CSFV, classical swine fever virus, PDB entry 5YF5 (chain A). Coloring scheme: RdRP palm in gray, thumb in blue, fingers in pink, and signature sequence XGDD in magenta. The α-carbon atom of the three absolutely conserved amino acid residues (labeled by asterisks in Figure 2) are shown as green spheres. The N-terminal additional regions, if present, are shown in cyan.