| Literature DB >> 33058031 |
Tipsuda Chanmanee1, Pravech Ajawatanawong2, Suda Louisirirotchanakul1, Watcharasak Chotiyaputta3, Siwaporn Chainuvati3, Patimaporn Wongprompitak4.
Abstract
Hepatitis E virus (HEV) is a causative agent of acute viral hepatitis globally. Evolutionary phylogeny classifies the HEV into eight genotypes that correlate with the viral transmission. Only four genotypes have been proven to be responsible for transmission in humans. However, there has been no report on the genomics and genotyping of HEV in Thailand during the past ten years. Here, we identified the genotype distributions of the Thai isolates of HEV and we sequenced two HEV genomes. We screened for 18 Thai isolates of HEV from Siriraj Hospital in Bangkok, from 2014-2016. The HEV genomes were sequenced from the serum and feces of a patient. The results showed that all Thai isolates of HEV were identified as genotype 3 (HEV-3). The ORF2 and genome phylogenies suggested two subgenotypes, called 3.1 and 3.2. The Thai isolates of HEV were frequently found in the subgenotype 3.1. The genome sequences of the two Thai isolates of HEV from the serum and fecal samples of the same patient showed 91% nucleotide similarity with the HEV genotype 3. Comparisons between the HEV genome and the ORF2 phylogenies illustrated that the ORF2 tree can be used to identify HEV genotypes, but it has less phylogenetic power for the HEV evolution. The two new genome sequences of HEV-3 from Thailand could contribute valuable information to the HEV genome study. (226 words).Entities:
Keywords: Complete genome sequence; HEV genotype; HEV genotype 3; Hepatitis E virus; Phylogeny
Mesh:
Substances:
Year: 2020 PMID: 33058031 PMCID: PMC7674359 DOI: 10.1007/s11033-020-05908-3
Source DB: PubMed Journal: Mol Biol Rep ISSN: 0301-4851 Impact factor: 2.316
A list of the 12 oligonucleotide primer sets used for hepatitis E virus (HEV) whole-genome-sequence amplification
| Set | Outer/inner | Primer | Oligonucleotide primer sequence | Region | Position | Product size (bp) | Ta (°C) |
|---|---|---|---|---|---|---|---|
| 1 | Outer | HE5’ | GTC GAT GCC ATG GAG GCC | 5′UTR/ORF1 | 1–18 | 543 | 54 |
| HE52 | CCG AGG GCC AAA GGT CAT G | 525–543 | 53 | ||||
| Inner | HE5’ | GTC GAT GCC ATG GAG GCC | 1–18 | 473 | 54 | ||
| HE452 | CCC ATC GAA GCA GTA TGT GCG | 452–473 | 53 | ||||
| 2 | Outer | HE01 | AAG GCT CCT GGC ATT ACT ACT | ORF1 | 48–70 | 955 | 53 |
| HE02 | AAR AGC ATR AGC CGR TCT CA | 984–1003 | 51 | ||||
| Inner | HE01 | AAG GCT CCT GGC ATT ACT ACT | 48–70 | 901 | 53 | ||
| HE03 | GTA GAG CAR GCT GAK GGR AA | 930–949 | 51 | ||||
| 3 | Outer | HE741F | TGG ATC CGC ACC ACT AAA ATA | ORF1 | 741–761 | 1173 | 52 |
| HE1895Rst | CGG TRC AAT CCA RGC CAT TA | 1895–1914 | 52 | ||||
| Inner | HE741F | TGG ATC CGC ACC ACT AAA ATA | 741–761 | 1000 | 52 | ||
| HE1722Rn | AAG RCC TTA TTC CCG AGC AC | 1722–1741 | 56 | ||||
| 4 | Outer | HE1498Fst | TGG TTG GTG ACT GTG GCC AT | ORF1 | 1498–1517 | 877 | 55 |
| HE2293Rst | TGG TGT RGG CGG CTT ACG G | 2357–2375 | 56 | ||||
| Inner | HE1560Fn | CCC GCC CAC CTT GAT GTT TC | 1560–1579 | 815 | 55 | ||
| HE2293Rn | TGG TGT RGG CGG CTT ACG G | 2357–2375 | 56 | ||||
| 5 | Outer | HE1990F | ACA ATA GGT TCA CCC AGC GCC | ORF1 | 1990–2010 | 1085 | 55 |
| HE3054Rst | GTC GCC AAC TAT TGC GGA GCT C | 3054–3075 | 55 | ||||
| Inner | HE1990F | ACA ATA GGT TCA CCC AGC GCC | 1990–2010 | 1016 | 55 | ||
| HE2985Rn | TAG ACT TTC CGG AGC CGG GAA C | 2985–3006 | 55 | ||||
| 6 | Outer | FfrR4st | GTC GAT GCC ATG GAG GCC | ORF1 | 2795–2815 | 1007 | 54 |
| FfrR4n’ | CCG AGG GCC AAA GGT CAT G | 3783–3802 | 55 | ||||
| Inner | FfrR4st | GTC GAT GCC ATG GAG GCC | 2795–2815 | 944 | 54 | ||
| RfrR4 | CCC ATC GAA GCA GTA TGT GCG | 3783–3802 | 55 | ||||
| 7 | Outer | HE3560Fst | TGC GCA TGC TAT CGT TGC ACT | ORF1 | 3560–3580 | 956 | 55 |
| HE4497R | GGC ATG CCA CAY TCC TCC AT | 4497–4516 | 55 | ||||
| Inner | HE3664Fn | ACT TCT TCC TGG CTG GTG GTG A | 3664–2685 | 852 | 55 | ||
| HE4497R | GGC ATG CCA CAY TCC TCC AT | 4497–4516 | 55 | ||||
| 8 | Outer | HE4263Fst | GCA TAT CGG CTT GGA GTA AGA C | ORF1 | 4263–4285 | 816 | 52 |
| HE5058R | AGT GGG CCT TTC CAT CAG CAA T | 5058–5079 | 55 | ||||
| Inner | HE4332Fn | ATT CTA GCT TTG CTC CCG CCC | 4332–4352 | 747 | 55 | ||
| HE5058R | AGT GGG CCT TTC CAT CAG CAA T | 5058–5079 | 55 | ||||
| 9 | Outer | HE4974F | CGC AGG TTT GTG TTG ATG TTG T | ORF1/ORF2 | 4974–4996 | 677 | 54 |
| HE5631Rst | GGG GCG GCA TAC AAG ACA AGA | 5631–5651 | 55 | ||||
| Inner | HE4974F | CGC AGG TTT GTG TTG ATG TTG T | 4974–4996 | 610 | 54 | ||
| HE5563Rn | TAT ACT GGC GGC GCA AGA TAG C | 5563–5584 | 55 | ||||
| 10 | Outer | HE5204F | TCT TCG TGC TTC TGC CTA TGC TG | ORF2 | 5204–5226 | 972 | 55 |
| HE6159Rst | CCA CGA CGC AAA CGG TGT | 6159–6176 | 54 | ||||
| Inner | HE5204F | TCT TCG TGC TTC TGC CTA TGC TG | 5204–5226 | 855 | 55 | ||
| HE6040Rn | CCG GTA TAG GGC GTG TTG GT | 6040–6059 | 55 | ||||
| 11 | Outer | ORF2F2 | CHA TYT CTA TYT CYT TYT GGC C | ORF2 | 5890–5912 | 955 | 56 |
| ORF2R2 | GTA GTC TGR TCA TAY TCA GCV GC | 6703–6725 | 55 | ||||
| Inner | ORF2F2 | CHA TYT CTA TYT CYT TYT GGC C | 5890–5912 | 901 | 56 | ||
| Cona1M | TCT TGT TCR TGY TGG TTR TCR TAR TC | 6524–6499 | 55 | ||||
| 12 | Outer | HE6209Fst | CAG CCA CAC GTT TYA TGA AGG A | ORF2/3′UTR | 6209–6230 | 1036 | 56 |
| HE3prime | TTT TTT TTT TTT TTT TTT TCC AGG GAG CG | 7220–7245 | 55 | ||||
| Inner | HE6326Fn | CGA CAG AAT TGA TTT CGT CGG C | 6326–6347 | 919 | 56 | ||
| HE3prime | TTT TTT TTT TTT TTT TTT TCC AGG GAG CG | 7220–7245 | 55 |
Samples and sequences used in this study: each sample ID referrs to individual hepatitis patients
| Sample ID | Anti-HEV | HEV RNA | Accession number | ||
|---|---|---|---|---|---|
| IgM | IgG | In serum | In feces | ||
| TH-hu-FJN | + | ND | − | + | MH427099 |
| TH-hu-FNP | + | + | + | + | MH427105 |
| TH-hu-FYS | + | + | − | + | MH427098 |
| TH-hu-FUD | + | + | − | + | MH427100 |
| TH-hu-FKP | + | + | + | + | MH427106 |
| TH-hu-FSN | + | + | + | + | MH427095 |
| TH-hu-FOG | + | + | + | + | MH427103 |
| TH-hu-FTM | + | ND | + | + | MH427096 |
| TH-hu-FLR | + | + | − | + | MH427104 |
| TH-hu-F40 | + | + | + | + | MH427091 |
| TH-hu-FPP | - | + | + | + | MH427092 |
| TH-hu-FKS | + | + | + | + | MH427090 |
| TH-hu-FSP | + | ND | + | + | MH427097 |
| TH-hu-F45-1 | + | ND | + | + | KY232312, KY232313 |
| TH-hu-FTK | + | + | − | + | MH427094 |
| TH-hu-FTS | + | + | + | + | MH427102 |
| TH-hu-FVS | + | + | + | + | MH427093 |
| TH-hu-F47 | + | + | + | + | MH427101 |
The anti-HEV, both IgM and IgG, were detected in the serum specimens. The HEV RNA was detected from serum and fecal samples. HEV in the serum and fecal specimens from patient TH-hu-F45-1 was chosen to perform genome sequencing. The accession number KY232312 and KY232313 refer to the HEV from serum and feces, respectively
ND not done
Fig. 1Bayesian inference (BI) phylogeny of 62 partial nucleotide sequences in the ORF2 (a capsid gene containing ORF) of HEV. The topology of the BI phylogeny was slightly different from the phylogeny reconstructed by the maximum likelihood method. The dashed lines show the differences in positions between those phylogenies. The statistical support is shown at the deep branches that connect to the specific nodes. The number above the branch represent the bootstrap value calculated from 1000 replications of the maximum-likelihood method, while the number underneath the same branch represents the posterior probability calculated by the Bayesian inference-phylogenetic method. The operational taxonomic units (OTUs) with a red color show the accession numbers of the HEV sequences in this study, and the isolate identification numbers are shown in the brackets. The isolates that were selected to perform the genome sequences are marked with asterisks (**). The black OTUs show the accession number of HEV sequences from the other study [18]. The HEV genotypes and subgenotypes are labeled in different colors. Scale bars indicate the distance in nucleotide substitution per site
Fig. 2Genome organization of the Thai isolates of HEV (KY232312 and KY232313) and mutation events between those two genomes: the HEV genomes from serum and feces were identical in size and organization (a). The total length of the HEV genomes was 7142 bp, organized in three ORFs—1, 2, and 3. The ORF1 encoded a nonstructural polyprotein which contained at least eight regions, including a methyltransferase (Met), a Y domain (Y), a papain-like cysteine protease (PCP), a hypervariable region (HVR), a proline-rich region (Pro), a protein with an unknown function (X), a helicase (Helicase), and an RNA-dependent RNA polymerase (RdRp). The ORF2 encoded a single capsid gene (Capsid), and the ORF3 also carried a single protein with unknown function. The numbers on the genome diagram represent the boundary of each protein in the genome. Nucleotide substitution sites on the two genomes are shown by arrows and are labeled in black numbers (b). The labels at each arrow tail represent the nucleotide substitution site, pattern of nucleotide-substitution, and amino acid. The nucleotide in the HEV genome isolated from serum (KY232312), and feces (KY232313) are separated by a slash (/). The amino acids encoded from the mutation position are shown in red. Regarding the non-synonymous substitution, the amino acids in the different genomes are separated by a slash (/)
Fig. 3The Bayesian inference (BI) phylogeny of 97 partial genome sequences of HEV, including the two isolates from this study. The phylogenetic analyses were reconstructed by ML and BI methods. The tree topologies were identical between both the ML and BI phylogenies. Statistical supports of the branches are shown on the deep branches. The number above the branch represents the bootstrap value calculated from 1000 replications of the maximum-likelihood method, while the number underneath the same branch represents the posterior probability calculated by the Bayesian inference phylogenetic method. The OTUs in red show the accession numbers of the HEV genome sequences in this study, and the isolate identification numbers are shown in brackets, whereas the OTUs in black show the accession number of HEV sequences from the other study [15, 24–73]. The HEV genotypes and subgenotypes are labeled with different-colored boxes. Scale bars indicate the distance in nucleotide substitutions per site