| Literature DB >> 35215964 |
Xue-Lian Zhang1, Xin-Yan Yao1, Yu-Qian Zhang1, Zhi-Hang Lv1, Hong Liu1, Jing Sun1, Jian-Wei Shao1.
Abstract
Hepaciviruses represent a group of viruses that pose a significant threat to the health of humans and animals. During the last decade, new members of the genus Hepacivirus have been identified in various host species worldwide, indicating the widespread distribution of genetically diversified hepaciviruses among animals. By applying unbiased high-throughput sequencing, a novel hepacivirus, provisionally designated Hepacivirus Q, was discovered in duck liver samples collected in Guangdong province of China. Genetic analysis revealed that the complete polyprotein of Hepacivirus Q shares 23.9-46.6% amino acid identity with other representatives of the genus Hepacivirus. Considering the species demarcation criteria for hepaciviruses, Hepacivirus Q should be regarded as a novel hepacivirus species of the genus Hepacivirus within the family Flaviviridae. Phylogenetic analyses also indicate the large genetic distance between Hepacivirus Q and other known hepaciviruses. Molecular detection of this novel hepacivirus showed an overall prevalence of 15.9% in duck populations in partial areas of Guangdong province. These results expand knowledge about the genetic diversity and evolution of hepaciviruses and indicate that genetically divergent hepaciviruses are circulating in duck populations in China.Entities:
Keywords: China; domestic duck; genetic diversity; novel hepacivirus
Mesh:
Substances:
Year: 2022 PMID: 35215964 PMCID: PMC8879383 DOI: 10.3390/v14020371
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Geographic maps showing the location of sampling sites where the duck liver samples in this study were collected. This map was plotted with a combination of Surfer software version 4 (Golden Software, Golden, CO, USA) and Adobe illustrator version CC2017 (Adobe, San Jose, CA, USA).
Figure 2Mapped read count plot of the Hepacivirus Q genome. The histograms show the coverage depth per base of the Hepacivirus Q genome, and the mean sequencing depth was 8.5×.
Amino acid sequence similarity of mature Hepacivirus Q proteins compared to those of other hepaciviruses. Sequences were aligned using the ClustalW method and the calculated sequence similarity implemented in MegAlign.
| Value for the Indicated Virus in Relation to | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Protein | A | B | C | D | E | F | G | H | I | J | K | L | M | N | RHV | JgV | HCL1 | GD61 | BeHV |
| Polyprotein | 23.9 | 24.6 | 23.7 | 24.6 | 24.8 | 24.0 | 24.0 | 24.0 | 24.1 | 24.7 | 24.6 | 25.5 | 25.4 | 24.1 | 25.0 | 39.1 | 35.5 | 35.6 | 46.6 |
| Core | 26.5 | 24.1 | 23.9 | 25.2 | 24.7 | 28.1 | 26.5 | 26.5 | 23.9 | 28.6 | 25.8 | 24.8 | 27.5 | 23.1 | 24.1 | 9.4 | 28.8 | 28.2 | 47.6 |
| E1 | 24.6 | 20.8 | 19.4 | 19.7 | 22.5 | 19.3 | 18.5 | 17.6 | 18.7 | 26.8 | 22.9 | 16.0 | 23.9 | 16.1 | 18.2 | 45.2 | 38.6 | 38.6 | 55.6 |
| E2 | 11.6 | 13.2 | 15.0 | 15.0 | 15.9 | 16.5 | 17.0 | 15.2 | 12.8 | 9.3 | 12.1 | 15.0 | 12.5 | 13.7 | 14.8 | 29.7 | 30.9 | 31.7 | 44.6 |
| p7 | 21.0 | 17.5 | 27.4 | 15.6 | 11.1 | 10.9 | 15.1 | 20.0 | 17.3 | 18.6 | 33.9 | 15.3 | 27.4 | 15.0 | 19.6 | 31.1 | 31.5 | 31.5 | 47.3 |
| NS2 | 15.6 | 15.7 | 17.0 | 16.1 | 15.8 | 15.8 | 16.8 | 17.9 | 17.8 | 17.8 | 17.5 | 18.8 | 17.5 | 15.3 | 14.3 | 32.2 | 29.9 | 29.9 | 41.9 |
| NS3 | 37.2 | 38.9 | 37.5 | 39.2 | 35.9 | 37.6 | 36.7 | 35.6 | 35.6 | 38.8 | 38.1 | 41.2 | 37.6 | 37.3 | 40.2 | 46.6 | 48.1 | 48.3 | 53.0 |
| NS4A | 17.6 | 25.5 | 21.6 | 26.9 | 23.5 | 25.5 | 19.6 | 17.3 | 21.2 | 15.4 | 21.2 | 27.5 | 25.0 | 23.5 | 29.4 | 42.3 | 28.8 | 28.8 | 46.2 |
| NS4B | 21.6 | 21.8 | 22.0 | 21.8 | 20.2 | 16.8 | 17.5 | 22.1 | 22.4 | 24.1 | 23.5 | 19.4 | 24.8 | 23.2 | 19.3 | 35.0 | 32.4 | 32.4 | 41.6 |
| NS5A | 12.0 | 11.9 | 10.4 | 11.7 | 15.7 | 11.6 | 10.8 | 12.3 | 12.3 | 13.6 | 11.4 | 9.2 | 12.1 | 15.5 | 9.8 | 19.7 | 12.1 | 11.7 | 27.2 |
| NS5B | 31.6 | 31.6 | 31.1 | 34.0 | 31.4 | 30.9 | 32.1 | 32.1 | 33.0 | 34.4 | 34.2 | 34.6 | 35.1 | 32.0 | 32.2 | 56.3 | 43.1 | 43.3 | 54.8 |
A to N indicate Hepacivirus A to Hepacivirus N, and RHV, JgV, HCL1, GD61, and BeHV indicate RHV-GS2015, Jogalong virus, DuHV-HCL1, DuHV-GD61, and Bald eagle hepacivirus, respectively. GenBank accession numbers for the sequences are as follows: Hepacivirus A, NC038425; Hepacivirus B, NC001655; Hepacivirus C, NC038882; Hepacivirus D, NC031950; Hepacivirus E, KC815310; Hepacivirus F, NC038427; Hepacivirus G, NC025672; Hepacivirus H, NC025673; Hepacivirus I, NC038428; Hepacivirus J, NC038429; Hepacivirus K, NC038430; Hepacivirus L, NC031916; Hepacivirus M, NC038431; Hepacivirus N, NC038432; RHV-GS2015, NC040815; Jogalong virus, MN133813; Bald eagle hepacivirus, MN062427; DuHV-HCL1, MK737640; DuHV-GD61, MT135177.
Comparison of predicted cleavage sites of hepacivirus polyprotein.
| Cleavage Site at: | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Virus | C/E1 | E1/E2 | E2/p7 | P7/NS2 | NS2/NS3 | NS3/NS4A | NS4A/NS4B | NS4B/NS5A | NS5A/NS5B |
|
| GEA/SVV | VSC/DNY | AEA/YLS | AWA/FDN | RLL/SPI | TQT/NAW | EEC/FDH | QNC/DFT | ESC/SLS |
|
| CSG/ARV | IEA/TSG | MAA/GLP | ASA/FDT | AIT/APF | VNT/SGT | EEC/ASF | DDC/GLI | FSC/SMS |
|
| ASA/YQV | VDA/ETH | AEA/ALE | AYA/LDT | RLL/API | VVT/STW | EEC/SQH | TPC/SGS | VCC/SMS |
|
| GAS/CVV | VTS/TSL | AAA/AAM | AVG/FDD | MLN/PFS | NDC/SLV | EEC/SFG | AQC/DGG | AKC/ASW |
|
| ATA/VSN | AAA/AAP | AYA/FTP | AYA/ISL | KYT/IPF | FFA/SGY | EEC/YNW | DLC/TPP | HSC/SMS |
|
| GGA/VTN | VKA/LAL | AFA/FTP | SAY/SLN | ERT/APF | YFA/STT | EEC/YQW | EDC/SCR | HEC/SSW |
|
| ASA/GIF | VAA/PVS | VGA/LEV | EAY/EGG | RFT/APF | YFA/ETV | EEC/STQ | DVC/TSP | TDC/SWS |
|
| AEA/NLL | SAV/AVP | SEA/VPT | RAE/QFD | QLT/KPF | YYC/GLV | EEC/ANE | EIC/DGS | SSC/SKS |
|
| VEP/KPL | SVA/APV | YAQ/PPL | VEA/FSS | QLS/SPV | ELA/STW | EEC/ALD | EPC/TDS | ETC/TYS |
|
| AVS/HWC | AEG/LPF | ANA/LVL | AQG/GCL | RLT/APF | EEM/TDG | EEC/GFD | AEC/AGG | TSC/NYS |
|
| GEA/SYA | AQA/NPI | ADA/ALT | AVG/GPY | RHC/SPI | DDT/STG | EEC/LSY | SEC/AFF | DEC/SAS |
|
| AES/VPA | AAA/MPV | AWG/WPA | AQA/ASL | ERN/APM | YSA/GGL | EEC/MQT | AEC/DGM | SKM/SRS |
|
| VDA/SFA | SQA/AEH | AEG/AME | VGG/GPV | RHC/SPI | TPT/SAW | EEC/ADY | RNC/SCS | SPC/SAS |
|
| VSG/YRQ | VEA/TTT | ATA/ALL | VTA/LDF | APC/SPI | LDV/WGA | EEC/WGF | VPC/GFN | KEC/SYS |
| JgV | AVA/FSD | AQA/GTH | IEG/AVN | VAG/FWF | KLA/API | SAG/LTV | EEC/AST | TNC/TSP | VCC/GES |
| DuHV-HCL1 | ASA/DHI | GMA/DRS | AEG/MLS | VLG/ASV | QYT/API | NCS/AAY | EEC/SAE | YEC/NSE | ESC/SFS |
| DuHV-GD61 | ASA/DHI | GMA/DRS | AEG/MLS | VLG/ASV | QYT/API | NCS/AAY | EEC/SAE | YEC/NSE | ESC/SFS |
| BeHV | ADS/SHD | VEG/GLQ | VTA/AVN | VSG/TEV | NWS/APL | VSC/SLY | EEC/AGN | YDC/ANS | HSC/SAS |
| Hepacivirus Q | EPA/THL | VEG/GLV | VGA/AIN | VAG/SEI | RWS/APF | VNC/SML | EEC/SSS | QLC/SSN | CSC/SMS |
Cleavage is indicated by a slash (/). The GenBank accession numbers for the corresponding sequences are same as shown in Table 1.
Figure 3Genomic characterization and phylogenetic analysis based on the complete polyprotein of Hepacivirus Q and other reference hepaciviruses. (A) Genome organization of Hepacivirus Q identified in this study. Red arrows indicate N-linked glycosylation sites. (B) Phylogenetic analysis based on the amino acid sequence of the complete polyprotein of Hepacivirus Q. The trees were constructed based on the maximum likelihood method implemented in PhyML v3.0 and mid-point rooted for clarity; the scale bar represents the number of nucleotide substitutions per site. Bootstrap values were calculated with 100 replicates of the alignment, and only bootstrap values > 70% are shown at relevant nodes.
Figure 4Phylogenetic analysis based on amino acid sequence of NS3 and NS5B proteins of hepaciviruses. The trees were constructed based on the maximum likelihood method implemented in PhyML v3.0 and mid-point rooted for clarity; the scale bar represents the number of nucleotide substitutions per site. Bootstrap values were calculated with 100 replicates of the alignment, and only bootstrap values >70% are shown at relevant nodes.