| Literature DB >> 27692039 |
Donald B Smith1, Paul Becher2, Jens Bukh3,4, Ernest A Gould5, Gregor Meyers6, Thomas Monath7,8, A Scott Muerhoff9, Alexander Pletnev10, Rebecca Rico-Hesse11,12, Jack T Stapleton13,14,15, Peter Simmonds1,16.
Abstract
Proposals are described for the assignment of recently reported viruses, infecting rodents, bats and other mammalian species, to new species within the Hepacivirus and Pegivirus genera (family Flaviviridae). Assignments into 14 Hepacivirus species (Hepacivirus A-N) and 11 Pegivirus species (Pegivirus A-K) are based on phylogenetic relationships and sequence distances between conserved regions extracted from complete coding sequences for members of each proposed taxon. We propose that the species Hepatitis C virus is renamed Hepacivirus C in order to acknowledge its unique historical position and so as to minimize confusion. Despite the newly documented genetic diversity of hepaciviruses and pegiviruses, members of these genera remain phylogenetically distinct, and differ in hepatotropism and the possession of a basic core protein; pegiviruses in general lack these features. However, other characteristics that were originally used to support their division into separate genera are no longer definitive; there is overlap between the two genera in the type of internal ribosomal entry site and the presence of miR-122 sites in the 5' UTR, the predicted number of N-linked glycosylation sites in the envelope E1 and E2 proteins, the presence of poly U tracts in the 3' UTR and the propensity of viruses to establish a persistent infection. While all classified hepaciviruses and pegiviruses have mammalian hosts, the recent description of a hepaci-/pegi-like virus from a shark and the likely existence of further homologues in other non-mammalian species indicate that further species or genera remain to be defined in the future.Entities:
Mesh:
Year: 2016 PMID: 27692039 PMCID: PMC5770844 DOI: 10.1099/jgv.0.000612
Source DB: PubMed Journal: J Gen Virol ISSN: 0022-1317 Impact factor: 3.891
Fig. 1.Amino acid divergence across the Hepacivirus polyprotein. Mean amino acid p-distances were calculated for 26 aligned Hepacivirus polyprotein sequences that differed by >0.1 of amino acid positions using a sliding window of 50 amino acids incremented by 10 residues and plotted against the amino acid position of the start of the fragment. The X-axis scale is uneven because of gaps in the reference sequence (M62321). Two regions with mean p-distances consistently <0.6 are indicated by bars. A schematic representation of the Hepacivirus polyprotein is shown to scale below.
Fig. 2.Analysis of Hepacivirus conserved regions. Maximum-likelihood trees were produced using mega6 for (a) positions 1123–1566 and (b) 2536–2959 of the virus polyprotein using the Le and Gascuel model and a gamma distribution of variation with invariant sites. Branches observed in >70 % of bootstrap replicates are indicated. Proposed Hepacivirus species assignments are indicated by single letters to the right of each branch. (c) Frequency histograms of amino acid p-distance between Hepacivirus sequences in the region 1123–1566 and (d) the region 2536–2959. The range of distances between different genotypes of Hepacivirus C (HCV) is indicated by an open arrow, between Hepacivirus G and Hepacivirus H by a shaded arrow and between Hepacivirus A and Hepacivirus C by a black arrow. The distance that demarcates different species is indicated by a dotted line.
Proposed Hepacivirus species
| Species | Previous identifier(s) | Type isolate | Accession number | Host/region | Source | 5′ UTR | 5′ UTR miR-122 sites | K/R aa before E1 | E1/E2 | 3′ UTR | Genome | Tissue | Chronic | Disease |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Canine hepacivirus | NZP1 | KP325401 | Horse (Dog?) | Serum, | IV IRES | 1 | 34 | 4/9 | 86 | 12.8–15.2 | Liver | Yes | ? | |
| GBV-B | T-1053 | U22304 | New World primate? | Plasma | IV IRES | 2 | 20 | 2/5 | 27 | 8.7 | Liver | Rarely | Hepatitis | |
| HCV | HCV-1 | M62321 | Human | Serum, | IV IRES | 2 | 31 | 4/11 | 10–108‡ | 7.3–9.9 | Liver | Yes | Hepatitis, cirrhosis, | |
| Guereza hepacivirus | BWC08 | KC551800 | Old World primate | Plasma | IV IRES? | 1 | 26 | 4/4 | ? | 11.6–11.9 | ? | ? | ? | |
| Rodent hepacivirus | RHV-339 | KC815310 | New World rodent | Plasma | IV IRES | 1 | 24 | 2/4 | 3 | 4.8–5.1 | ? | ? | ? | |
| Rodent hepacivirus | NLR07-oct70 | KC411784 | Old World rodent | Serum | Pegi-like | 1 | 23† | 2/3 | ? | 6.8 | Liver | ? | ? | |
| Norway rat hepacivirus 1 | NrHV-1/NYC-C12 | KJ950938 | Global rodent | Serum, multiple organs | ? | ? | 22† | 3/3 | ? | 5.6 | Liver | ? | ? | |
| Norway rat hepacivirus 2 | NrHV-2/NYC-E43 | KJ950939 | Global rodent | Serum, multiple organs | ? | ? | 27† | 3/4 | ? | 4.5 | Liver | ? | ? | |
| Rodent hepacivirus | SAR-3 | KC411806 | Old World rodent | Serum | IV IRES | 0 | 29 | 1/2 | 5 | 1.5–1.9 | ? | ? | ? | |
| Rodent hepacivirus | RMU10-3382 | KC411777 | Old World rodent | Serum, | Pegi-like | 2 | 29 | 2/7 | ? | 9.8–13.6 | Liver | ? | ? | |
| Bat hepacivirus | PDB-829 | KC796074 | Old World bat | Serum | ? | ? | 34 | 1/5 | ? | 9.5 | ? | ? | ? | |
| Bat hepacivirus | PDB-112 | KC796077 | Old World bat | Serum | ? | ? | 25 | 2/4 | ? | 7.7 | ? | ? | ? | |
| Bat hepacivirus | PDB-491.1 | KC796078 | Old World bat | Serum | ? | ? | 32 | 1/4 | ? | 9.6–10.8 | ? | ? | ? | |
| Bovine hepacivirus | 463 | KP641127 | Cow | Serum | IV IRES | 1* | 25 | 2/6 | 4 | 9.5–10.8 | Liver | Yes | ? |
?, Unknown or uncertain.
*0 in KP641127 but 1 in NC_026797.
†Position of AUG initiation codon uncertain.
‡M62321 has an incomplete 3′ UTR; other Hepacivirus C sequences have a run of 10–108 U residues.
Fig. 3.Amino acid divergence across Pegivirus polyproteins. Mean amino acid p-distances were calculated for 26 aligned Pegivirus polyprotein sequences that differed by >0.11 of amino acid positions using a sliding window of 50 amino acids incremented by 10 residues and plotted against the amino acid position of the start of the fragment. Increments on the X-axis scale are uneven because of unnumbered gaps in the reference sequence (U22303). Two regions with distances consistently <0.6 are indicated by bars. A schematic representation of the Pegivirus polyprotein is shown to scale below.
Fig. 4.Analysis of Pegivirus conserved regions. Maximum-likelihood trees were produced using mega6 for (a) amino acid positions 888–1635 using the Le and Gascuel model with frequencies and a gamma distribution of variation with invariant sites, and for (b) amino acid positions 2398–2916 using the Le and Gascuel model with a gamma distribution of invariant sites. Branches observed in >70 % of bootstrap replicates are indicated. Frequency histograms of amino acid p-distance between Pegivirus sequences in the region 888–1635 (c) and the region 2398–2916 (d) Amino acid p-distances between Pegivirus C sequences derived from different primate species are indicated by an open arrow, while those between primate- and bat-derived Pegivirus A sequences are indicated by a shaded arrow. The distance that demarcates different species is indicated by a broken line.
Proposed Pegivirus species
| Species | Previous identifier | Type isolate | Accession number | Host/region | Source | 5′ UTR | 5′ UTR | K/R aa before E1 | E1/E2 N-X-S/T sites | 3′ UTR | Genome | Tissue | Chronic | Disease |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GBV-A | T-1053 | New World primate, | Plasma | Pegi-like | 0 | 0 | 1/4 | 2 | 10.4–14.1 | ? | Yes | ? | ||
| GBV-D | GBV-D | Old World bat | Serum | ? | ? | 5* | 1/9 | ? | 9.5 | ? | ? | ? | ||
| GBV-C | PNF2161 | Human, | Plasma | Pegi-like | 0 | 4 | 1/3 | 3 | 9.4–13.1 | Lymphocytes | Yes | No | ||
| Theiler’s disease-associated virus | Horse_A1 | Horse | Serum | ? | 0 | 1 | 2/7 | 4 | 13.1 | ? | Yes | Serum | ||
| Equine pegivirus | C0035 | Horse | Serum | Pegi-like | 0 | 8* | 1/4 | 4 | 11.3 | ? | Yes | ? | ||
| Bat pegivirus | PDB-1698 | New World bat | Serum | ? | 1 | 7* | 2/9 | ? | 9.3 | ? | ? | ? | ||
| Bat pegivirus | PDB-620 | Old World bat | Serum | ? | ? | 7* | 0/6 | ? | 8.6–12.0 | ? | ? | ? | ||
| Human hepegivirus Human pegivirus 2 | AK-790 | Human | Serum | IV IRES | 0 | 5 | 2/9 | 3 | 8.5 | ? | Yes | ? | ||
| Bat pegivirus | PDB-1715 | New World bat | Serum | ? | ? | 0 | 1/2 | ? | 9.4 | ? | ? | ? | ||
| Rodent pegivirus | CC61 | New World rodent | Serum, multiple organs | ? | 0 | 21* | 2/9 | 4 | 9.3 | ? | ? | ? | ||
| Porcine pegivirus | PPgV_903 | Pig | Serum | Pegi-like | 0 | 0 | 4/4 | 2 | 9.1 | ? | Yes | ? |
?, Unknown or uncertain.
*Position of AUG initiation codon uncertain.
Fig. 5.Predicted number of N-glycosylation sites. Scatter plots show the number of Asn-X-Ser/Thr glycosylation sites predicted for E1 (X-axis) or E2 (Y-axis) using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/) for individual species of Pegivirus (■), Hepacivirus (○) and for Wenling shark virus (∆). Points have been jittered on the X-axis to improve legibility.
Fig. 6.Comparative analysis of Wenling shark virus. (a) Mean amino acid p-distance across the virus polyprotein between single representatives of each Hepacivirus and Pegivirus species with Wenling shark virus was plotted against polyprotein position (numbered relative to M62321). The positions of two regions where mean p-distances were <0.6 are indicated by solid bars. A schema of the boundaries between virus proteins is given below. (b) Phylogenetic analysis of representatives of each Hepacivirus (filled circles) and Pegivirus (open circles) species was analysed together with Wenling shark virus (filled triangle) using amino acid sequences between positions (b) 1216–1534 and (c) 2560–2745. Trees were constructed from maximum-likelihood distances using a Le and Gascuel model with a gamma distribution of variation with a class of invariant sites. Branches supported by >70 % of bootstrap replicates are indicated.