| Literature DB >> 32289120 |
François Ferron1,2, Humberto J Debat3, Ashleigh Shannon1, Etienne Decroly1, Bruno Canard1,2.
Abstract
The order Nidovirales is a diverse group of (+)RNA viruses, classified together based on their common genome organisation and conserved replicative enzymes, despite drastic differences in size and complexity. One such difference pertains to the mechanisms and enzymes responsible for generation of the proposed viral 5' RNA cap. Within the Coronaviridae family, two separate methytransferases (MTase), nsp14 and nsp16, perform the RNA-cap N7-guanine and 2'-OH methylation respectively for generation of the proposed m7GpppNm type I cap structure. For the majority of other families within the Nidovirales order, the presence, structure and key enzymes involved in 5' capping are far less clear. These viruses either lack completely an RNA MTase signature sequence, or lack an N7-guanine methyltransferase signature sequence, obscuring our understanding about how RNA-caps are N7-methylated for these families. Here, we report the discovery of a putative Rossmann fold RNA methyltransferase in 10 Tobaniviridae members in Orf1a, an unusual genome locus for this gene. Multiple sequence alignments and structural analyses lead us to propose this novel gene as a typical RNA-cap N7-guanine MTase with substrate specificity and active-site organization similar to the canonical eukaryotic RNA-cap N7-guanine MTase.Entities:
Year: 2019 PMID: 32289120 PMCID: PMC7107822 DOI: 10.1093/nargab/lqz022
Source DB: PubMed Journal: NAR Genom Bioinform ISSN: 2631-9268
Figure 1.Presence, type and loci of signature-sequences of Nidovirales RNA MTases: RF-MTase of unknown specificity (light red), non-RF « nsp14-like » MT (yellow) and RF « nsp16-like » MTase (green). Both light red and yellow MTases map to the nsp14 C-ter locus, ie. immediately downstrean the ExoN domain, while « nsp16-like » MTase map to the nsp16 locus, at the end of Orf1b. The tree was made based on MAFFT v7.427 multiple sequence alignment with BLOSUM62 scoring matrix and G-INS-i iterative refinement method. The alignments were used as input for maximum likelihood trees generated with the FasTtree v2.1.5 software (best-fit model = JTT-Jones-Taylor-Thorton with single rate of evolution for each site = CAT). Local support values were computed using the Shimodaira-Hasegawa test (SH) with 1000 replicates. Numbers at the nodes represent FasTtree support values and scale var substitutions per site. The tree included two novel Ronivirus-like and Mesonivirus-like genome sequences: Western Flower Thrips Mesonivirus, and Palaemon Nidovirus, respectively (WFTV, PAN, unpublished, see the ‘Materials and Methods’ section and Supplementary Table S1). When present in the NCBI viral genomes database in the 92 Nidovirales complete genomes repository (https://www.ncbi.nlm.nih.gov/genomes/GenomesGroup.cgi?taxid=76804), no accession number is indicated. When an accession number is given in parenthesis, it is referring to the GenBank accession number (https://www.ncbi.nlm.nih.gov/genbank/). The genome dataset used in this study is given in Supplementary Table S1. From top to bottom of the figure: Arteriviridae: HHPAV: Hainan hebius popei arterivirus (MG600021); NGSAV: Nanhai ghost shark arterivirus (MG600024); SHFV: simian hemorrhagic fever virus; EAV: Equine arteritis virus; LDV, Lactate elevating virus; PRRSV-1 and 2, porcine reproductive and respiratory syndrome virus. Tobaniviridae: BToV: Breda virus; GToV: Goat torovirus; EToV: Berne virus (CAA36747); PToV: Porcine torovirus; CSV: Chinook salmon bafinivirus; FHMNV: Fathead Minnow nidovirus 1; WBV: white bream virus; BoNV: Bovine nidovirus TCH5; INToV: Xinzhou nido-like virus 6; SECToV; Xinzhou toro-like virus 1; GRBS-ToV: Guangdong red banded snake torovirus (MG600030); ShNV: shingleback nidovirus 1 (KX184715); BPNV: Ball python nidovirus 1; MVNV: Morelia viridis nidovirus; BRV: Bellinger River virus (MF685025); Coronaviridae: IBV: Infectious Bronchitis Virus; BWCoV: Beluga Whale coronavirus SW1; HCoV_HKU1: Human coronavirus HK1; MHV: Mouse hepatitis virus; BatCoV_HKU9: Rousettus bat Coronavirus HKU9; SARS-CoV: Severe acute respiratory syndrome coronavirus; MERS_CoV: middle-east respiratory syndrome coronavirus; Ty-BatCoV_HKU4: Tylonycteris bat coronavirus HKU4; Pi-BatCoV_HKU5: pipistrellus bat coronavirus HKU5; FIPV: Feline infectious peritonitis virus; Rh-BatCoV-HKU2: Rhinolophus bat coronavirus HKU2; HCoV_229E: Human coronavirus 229E; HCoV_OC43: Human coronavirus OC43 (YP_003766); HCoV_NL63: Human coronavirus NL63; PEDV: Porcine epidemic diarrhea virus; Sc-BatCoV_512: Scotophilus bat coronavirus 512; Mi-BatCoV-1A: bat coronavirus 1A; Mi-BatCoV_HKU8: Miniopterus bat coronavirus HKU8. Medioniviridae: BlNV: Botrylloides leachii nidovirus; TurrNV: Turrinivirus 1; WFTV: Western Flower Thrips virus. Mesoniviridae: MenoV: Meno virus; NseV: Nse virus; HanaV: Hana virus; KSaV: Karang sari virus; DKNV: Dak Nong virus; CASV: Casuarina virus; NDiV: Nam Dinh virus; CavV: Cavally virus; DMV: Dianke mesonivirus. Roniviridae: PAN: Palaemon nidovirus; GAV: Gill-associated virus; YHV: Yellow head virus (EU487200). Euroniviridae: PagRV: Paguronivirus 1; CharNV: Charybnivirus 1; DecNV: Decronivirus 1. Abyssoviridae: AAbV: Aplysia abyssovirus 1. Mononiviridae: PSCNV: Planidovirus 1
Figure 3.(A) Structural alignment of Tobaniviridae MTases together with various prokaryotic, eukaryotic and viral members; Pectobacterium Atrosepticum ECA1738 (PDB ID: 2P7I), Vaccinia virus D1:D12 heterodimer (PDB ID: 4CKB), Encephalitozooan cuniculi EcmI (PDB ID: 1Z3C), human RNA N7-guanine MTase (PDB ID: 3BGV). (B) Vaccinia Virus N7-guanine MTase structure (PDB: 4CKB) with highlighted motifs 1 to 5 and their corresponding amino-acid frequencies as determined by WebLogo analysis.
Distribution and putative substrate specificities of MTases in the four families of the Nidovirales Order
| Virus | Genome size | MTase | N7-MTase | 2′-O MTase |
|---|---|---|---|---|
| (nt) | Orf1a | nsp14-like | nsp16 | |
|
| ∼125–157 000 | − | − | (+/−)* |
|
| ||||
|
| ||||
| Porcine torovirus | 28 301 | − | − | − |
| Bovine torovirus | 28 479 | − | − | |
| Equine torovirus | 27 992 | − | − | − |
|
| ||||
| Bovine nidovirus 1 | 20 261 | − | − | + |
|
| ||||
| White bream virus | 26 660 | + | − | + |
| Fathead minnow | ||||
| nidovirus 1 | 27 318 | + | + | + |
|
| ||||
| Chinook | ||||
|
| 27 004 | + | − | + |
|
| ||||
| Infratovirus 1 | 30 353 | + | − | + |
|
| ||||
| Sectovirus 1 | 25 960 | + | − | + |
|
| ||||
| Shingleback | ||||
| nidovirus 1 | 23 832 | + | − | + |
|
| ||||
| Ball python | ||||
| nidovirus 1 | 33 452 | + | − | + |
|
| ||||
| Bellinger river virus | 30 742 | + | − | + |
| Guangdong | ||||
| red-banded snake | ||||
| torovirus | 30 859 | + | − | + |
| Python nidovirus | 32 399 | + | − | + |
| Goat torovirus | 28 487 | + | − | + |
|
| ∼27–32 000 | − | + | + |
|
| ||||
|
| ∼20–25 000 | − | −/+ | + |
|
| ∼26 000 | − | − | + |
|
| 35 906 | − | +** | + |
|
| 41 178 | − | +** | + |
*: Arteriviruses do not usually carry any MTase, except the two newly identified Hainan Hebius Popei arterivirus and Nanhai gost shark arterivirus (see text).**: although in the nsp14 locus immediately downstream the ExoN gene, the MTase has a Rossmann fold. Virus species names are indicated and genome size corresponds to exemplar virus sequence used in this study.
Only the Tobaniviridae is expanded and genera indicated in italic plus unassigned viruses (see Figure 1).
Figure 2.Distribution of the Orf1a MTase along the (A) Tobaniviridae phylogenetic tree (see Figure 1). (B) approximate genome organisation and gene content along the family described in A). The genomes are not drawn to scale. Torovirinae: Bovine, Goat, Equine and Procine toroviruses; Bovine nidovirus 1 is the Bostovirus (Remotovirinae), whose genome is represented smaller than its fellow members to account for its ∼20 kb genome size; Piscavirinae and Serpentovirinae are the viruses boxed in green. See ref (1).