| Literature DB >> 35781557 |
Heather L Wells1, Elizabeth Loh2,3, Alessandra Nava4, Mónica Romero Solorio5, Mei Ho Lee2,6, Jimmy Lee2,6, Jum R A Sukor7, Isamara Navarrete-Macias8, Eliza Liang9, Cadhla Firth2, Jonathan H Epstein2, Melinda K Rostal2, Carlos Zambrana-Torrelio10, Kris Murray11,12, Peter Daszak2, Tracey Goldstein13, Jonna A K Mazet14, Benhur Lee15, Tom Hughes2,6, Edison Durigon16, Simon J Anthony17.
Abstract
As part of a broad One Health surveillance effort to detect novel viruses in wildlife and people, we report several paramyxovirus sequences sampled primarily from bats during 2013 and 2014 in Brazil and Malaysia, including seven from which we recovered full-length genomes. Of these, six represent the first full-length paramyxovirid genomes sequenced from the Americas, including two that are the first full-length bat morbillivirus genome sequences published to date. Our findings add to the vast number of viral sequences in public repositories, which have been increasing considerably in recent years due to the rising accessibility of metagenomics. Taxonomic classification of these sequences in the absence of phenotypic data has been a significant challenge, particularly in the subfamily Orthoparamyxovirinae, where the rate of discovery of novel sequences has been substantial. Using pairwise amino acid sequence classification (PAASC), we propose that five of these sequences belong to members of the genus Jeilongvirus and two belong to members of the genus Morbillivirus. We also highlight inconsistencies in the classification of Tupaia virus and Mòjiāng virus using the same demarcation criteria and suggest reclassification of these viruses into new genera. Importantly, this study underscores the critical importance of sequence length in PAASC analysis as well as the importance of biological characteristics such as genome organization in the taxonomic classification of viral sequences.Entities:
Mesh:
Year: 2022 PMID: 35781557 PMCID: PMC9402765 DOI: 10.1007/s00705-022-05500-z
Source DB: PubMed Journal: Arch Virol ISSN: 0304-8608 Impact factor: 2.685
Fig. 1Histograms of paramyxovirid sequences submitted to GenBank by submission year (left) and sequence length (right). Trends are shown for all Paramyxoviridae (top) and Orthoparamyxovirinae only (bottom). The number of sequences submitted is increasing each year, but a significant majority of sequences submitted are of short PCR fragments (~500 bp), and very few are of full genomes (~15-16 kb).
Full-length viral genome sequences described in this study and their suggested taxonomic classification, host species, geographic origin (plus latitude/longitude, where available), collection date, and associated GenBank accession numbers
| Virus | Classification | Host species | Geographic origin | Collection date | GenBank accession no. |
|---|---|---|---|---|---|
| PDF-3137 | Brazil (Amazon) -2.39, -60.05 | 12/03/2013 | MW557651 MW554523 (host COI) MW557650 (host CytB) | ||
| PBZ-1381 | Brazil (Atlantic Forest) -22.25, -52.18 | 06/10/2013 | MZ312422 MZ312423 (host CytB) | ||
| PDF-0699 | Malaysia 5.53, 118.08 | 12/13/2012 | MZ312421 | ||
| PDF-3308 | Brazil (Amazon) -2.41, -59.41 | 01/03/2014 | MZ312420 | ||
| PBZ-1672 | Brazil (Atlantic Forest) -22.57, -52.30 | 03/05/2013 | MZ312428 MZ312429 (host CytB) | ||
| PBZ-3205 | Brazil (Atlantic Forest) -22.25, -52.18 | 10/01/2013 | MZ312424 MZ312425 (host CytB) | ||
| PBZ-2282 | Brazil (Atlantic Forest) -22.49, -52.39 | 09/12/2013 | MZ312426 MZ312427 (host CytB) |
Fig. 2Nucleotide-sequence-based phylogeny with maximum-likelihood bootstrap values of the orthoparamyxovirins for which full- or nearly full-length RdRp gene sequences are available. The genus Respirovirus is represented by a single sequence, that of Sendai virus, as an outgroup. Branches are colored according to their current genus classifications, which are also indicated at the right. Colored highlighting indicates the taxonomic suggestions discussed here, including reclassifying Tupaia virus and Mòjiāng virus each into its own new genus and establishing the subclades A-D within the genus Jeilongvirus.
Fig. 3Genome organization of representative members of each genus included in this study. Sequences are organized according to their RdRp phylogeny, which is shown on the left. The new genome sequences described here are labeled with red font, with the exception of the bat morbilliviruses (PDF-3137 and PBZ-1381), which share the same genome arrangement with other morbilliviruses. Where more than one genome arrangement is present in a single genus, a representative of each type is shown. ORFs are represented by colored polygons, and the black lines represent the entire length of the genome, both of which are drawn to scale. ORFs are colored as follows: nucleocapsid (N gene), blue; phosphoprotein (P gene), yellow; matrix (M gene), green; fusion (F gene), peach; ATUs, magenta; receptor binding protein (RBP gene), purple; RNA-dependent RNA polymerase (RdRp gene), orange. Asterisks on RBP ORFs indicate premature stop codons.
Fig. 4Nucleotide-sequence-based phylogenies for all genes, excluding ATUs. Clades are color-coded with the same scheme as in Fig. 2: Morbillivirus, blue; Henipavirus, orange; Jeilongvirus “subclade A”, green; Jeilongvirus “subclades B” and “C”, yellow; Jeilongvirus “subclade D”, red; Narmovirus, purple. Where sequences are incomplete, no sequence is shown in the tree for that gene. For clades that are not monophyletic by genus in one or more of the trees (i.e., for paraphyletic sequences previously proposed as “shaanviruses” and for Tupaia virus), branches are shown as dotted lines.
Fig. 5PAASC histograms of pairwise amino acid comparisons based on the RdRp gene. Scores on the x-axis were calculated by scoring alignments with the BLOSUM62 matrix and dividing by the total possible score (100% identity). Values on the y-axis indicate frequency. Gold boxes correspond to PIDs between sequences that do not conform to the identified cutoff values based on their established classifications. The cutoff values used are shown as dotted lines, with red for full-length RdRp sequences and blue for short fragments (Tong-PanPMV and Tong-RMH).
Genera of the paramyxovirus subfamily Orthoparamyxovirinae, with the exception of Salemvirus and Ferlavirus, which have only one classified member, and Respirovirus and Aquaparamyxovirus, which are outside the scope of this study. For each genus, the host taxa, host cellular receptor (if known), and the number of ATUs are shown. In addition, if any classified member of that genus represents an exception to those traits, it is listed along with the differing trait(s) in the last column
| Genus | Host | Receptor | ATUs | Exceptions |
|---|---|---|---|---|
| Mammals | SLAMF1/NECTIN4 | None | ||
| Bats | ENFB2/3 | None | Mòjiāng virus: host is a rodent, does not use ENFB2/3, falls below PAASC cutoff | |
| Rodents | Unknown | Two | ||
| Bats | Unknown | Two | ||
| Bats | Unknown | One | Belerina virus: host is a hedgehog | |
| Bats | Unknown | One | ||
| Rodents | Unknown | None | Tupaia virus: does not share monophyly, falls below PAASC cutoff |