| Literature DB >> 32283837 |
Ngoc Minh Hien Phan1,2, Helen Faddy1,2,3, Robert Flower1,2, Kirsten Spann1,4, Eileen Roulis1,2.
Abstract
The extent of whole genome diversity amongst hepatitis B virus (HBV) genotypes is not well described. This study aimed to update the current distribution of HBV types and to investigate mutation rates and nucleotide diversity between genotypes in Southeast Asia, Australia and New Zealand. We retrieved 930 human HBV complete genomes from these regions from the NCBI nucleotide database for genotyping, detection of potential recombination, serotype prediction, mutation identification and comparative genome analyses. Overall, HBV genotypes B (44.1%) and C (46.2%) together with predicted serotypes adr (36%), adw2 (29%) and ayw1 (19.9%) were the most commonly circulating HBV types in the studied region. The three HBV variants identified most frequently were p.V5L, c.1896G>A and double mutation c.1762A>T/c.1764G>A, while genotypes B and C had the widest range of mutation types. The study also highlighted the distinct nucleotide diversity of HBV genotypes for whole genome and along the genome length. Therefore, this study provided a robust update to HBV currently circulating in Southeast Asia, Australia and New Zealand as well as an insight into the association of HBV genetic hypervariability and prevalence of well reported mutations.Entities:
Keywords: HbsAg subtypes; genetic diversity; genotypes; human hepatitis B virus; mutations; recombination; serotypes
Mesh:
Substances:
Year: 2020 PMID: 32283837 PMCID: PMC7232418 DOI: 10.3390/v12040427
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Flowchart of genotype assignment for each sequence retrieved from the NCBI nucleotide database.
Figure 2Selected mutations across RT, HBsAg, NRE, BCP, PC, C and X ORFs for analysis. Nucleotide position 1 on an HBV genome corresponds to the HBV EcoR1 restriction site.
Figure 3Number of HBV complete genomes retrieved through each step of data collection.
Identified HBV genotypes by country.
| Genotype | A | B | C | D | G | I | B/BC or C/CB | G/GC | Total | |
|---|---|---|---|---|---|---|---|---|---|---|
|
| ||||||||||
| Australia | 1 | 66 | 67 (7.2%) | |||||||
| Cambodia | 4 | 24 | 28 (3%) | |||||||
| Indonesia | 82 | 51 | 9 | 142 (15.3%) | ||||||
| Laos | 14 | 14 (1.5%) | ||||||||
| Malaysia | 2 | 170 | 116 | 4 | 4 | 296 (31.8%) | ||||
| Myanmar | 18 | 18 (1.9%) | ||||||||
| New Zealand | 6 | 36 | 42 (4.5%) | |||||||
| Philippines | 9 | 7 | 10 | 26 (2.8%) | ||||||
| Thailand | 16 | 96 | 1 | 1 | 1 | 2 | 117 (12.6%) | |||
| Vientam | 131 | 43 | 6 | 180 (19.4%) | ||||||
|
| 12 (1.3%) | 410 (44.1%) | 430 (46.2%) | 49 (5.3%) | 1 (0.1%) | 21 (2.3%) | 5 (0.5%) | 2 (0.2%) | 930 (100%) | |
Figure 4Distribution of HBV genotypes and recombinants by country in Southeast Asia, Australia and New Zealand. Pie charts indicate the percentage of HBV genotypes with the diameters. Countries with human HBV complete genomes collectable from NCBI nucleotide database were highlighted in bold and uppercase.
HBV recombination detection by five automatic methods built in RDP4.
| Detection Method | RDP | 3SEQ | Chimaera | Geneconv | Maxchi | |
|---|---|---|---|---|---|---|
| Events/Recombinants | ||||||
| Recombination events | 83 | 80 | 49 | 87 | 41 | |
| Potential recombinants (possibly present in different events) | 467 | 613 | 744 | 1199 | 1355 | |
| Potential recombinants (after removing replicate recombinants) | 380 | 566 | 510 | 857 | 548 | |
| Recombinants identified by all methods: 83 (65 sequences from our data and 18 reference sequences) | ||||||
| + with overlapping breakpoint positions: 35 (31 sequences from our data and 4 reference sequences) | ||||||
| + with at least one same major parent: 10 (8 sequences from our data and 2 reference sequences) | ||||||
Predicted HBV serotypes by genotype and country in Southeast Asia, Australia and New Zealand.
| Serotype | Unassigned ( | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Genotype/ | |||||||||
|
|
|
|
| ||||||
| Australia | 1 | ||||||||
| Malaysia | 1 | 1 | |||||||
| Philippines | 7 | 2 | |||||||
|
|
|
|
|
|
|
| |||
| Cambodia | 2 | 2 | |||||||
| Indonesia | 46 | 1 | 34 | 1 | |||||
| Malaysia | 141 | 4 | 20 | 1 | 4 | ||||
| Philippines | 1 | 6 | |||||||
| Thailand | 12 | 4 | |||||||
| Vietnam | 16 | 106 | 6 | 3 | |||||
|
|
|
|
|
|
|
| |||
| Australia | 6 | 60 | |||||||
| Cambodia | 23 | 1 | |||||||
| Indonesia | 45 | 3 | 3 | ||||||
| Malaysia | 107 | 5 | 1 | 3 | |||||
| Myanmar | 17 | 1 | |||||||
| New Zealand | 6 | ||||||||
| Philippines | 1 | 9 | |||||||
| Thailand | 86 | 7 | 3 | ||||||
| Vietnam | 40 | 3 | |||||||
|
|
|
|
| ||||||
| Indonesia | 1 | 8 | |||||||
| Malaysia | 4 | ||||||||
| New Zealand | 36 | ||||||||
|
|
| ||||||||
| Thailand | 1 | ||||||||
|
|
|
| |||||||
| Laos | 4 | 10 | |||||||
| Thailand | 1 | ||||||||
| Vietnam | 6 | ||||||||
|
|
|
|
| ||||||
| Thailand | 1 | ||||||||
| Malaysia | 1 | 1 | |||||||
|
|
|
| |||||||
| Malaysia | 1 | 1 | |||||||
|
|
| ||||||||
| Thailand | 2 | ||||||||
DNA polymorphism of HBV genotypes identified in Southeast Asia, Australia and New Zealand.
| Genotype | Number of Polymorphic (Segregating) Sites | Average Number of Nucleotide Differences (k) | Nucleotide Diversity (Pi) |
|---|---|---|---|
| Genotype A ( | 44 | 10.788 | 0.00775 |
| Genotype B ( | 454 | 11.515 | 0.00827 |
| Genotype C ( | 446 | 12.548 | 0.00901 |
| Genotype D ( | 68 | 7.770 | 0.00558 |
| Genotype I ( | 37 | 8.148 | 0.00585 |
| Total | 676 | 20.944 | 0.01505 |
Figure 5Nucleotide diversity (Pi) over HBV genome. The MAFF-aligned genomes had 3434 bp in length; genotypes were colour-coded (sites with gaps in the complete data file were excluded; window length 100 bp, step size 25 bp).
Number and percentage of selected mutations by HBV genotype 1.
| Mutation Type (Region) | A ( | B ( | C ( | D ( | G ( | I ( | B/BC, C/CB, G/GC ( | Total ( |
|---|---|---|---|---|---|---|---|---|
| Drug resistance mutations (polymerase) | 1 (8%) | 15 (3.7%) | 22 (5%) | 2 (4%) | 0 | 0 | 0 | 40 (4.3%) |
| Escape mutations (pre-S/S) | 2 (16%) | 41 (10%) | 79 (18%) | 8 (16%) | 0 | 2 (10%) | 2 (27%) | 134 (14.4%) |
| c.1613G>A (NRE) | 2 (16%) | 44 (11%) | 67 (15%) | 0 | 1 (100%) | 4 (19%) | 1 (14%) | 119 (12.8%) |
| Only c.1762A>T (BCP) | 0 | 7 (2%) | 3 (0.7%) | 0 | 0 | 0 | 0 | 10 (1.1%) |
| Only c.1764G>A (BCP) | 1 (8%) | 3 (0.7%) | 20 (5%) | 1 (2%) | 0 | 0 | 0 | 25 (2.7%) |
| Double c.1762A>T/ c.1764G>A (BCP) | 5 (42%) | 60 (15%) | 159 (37%) | 7 (14%) | 1 (100%) | 5 (24%) | 2 (27%) | 239 (25.7%) |
| c.1896G>A (PC) | 0 | 193 (47%) | 77 (18%) | 13 (27%) | 0 | 4 (19%) | 2 (27%) | 289 (31.1%) |
| p.P130T (C) | 0 | 28 (6.8%) | 60 (14%) | 0 | 1 (100%) | 0 | 0 | 89 (9.6%) |
| p.V5M (X) | 0 | 8 (2%) | 11 (2.6%) | 0 | 0 | 0 | 1 (14%) | 20 (2.2%) |
| p.V5L (X) | 10 (83%) | 401 (97%) | 395 (92%) | 48 (97%) | 1 (100%) | 21 (100%) | 6 (86%) | 882 (95%) |
1 Note: One genome can acquire one or more variations.