| Literature DB >> 32523313 |
Marçal Yll1, Maria Francesca Cortese1, Mercedes Guerrero-Murillo2, Gerard Orriols1, Josep Gregori3, Rosario Casillas1, Carolina González1, Sara Sopena1, Cristina Godoy1, Marta Vila1, David Tabernero1, Josep Quer3, Ariadna Rando1, Rosa Lopez-Martinez1, Rafael Esteban3, Mar Riveiro-Barciela3, Maria Buti3, Francisco Rodríguez-Frías1.
Abstract
BACKGROUND: Since it is currently not possible to eradicate hepatitis B virus (HBV) infection with existing treatments, research continues to uncover new therapeutic strategies. HBV core protein, encoded by the HBV core gene (HBC), intervenes in both structural and functional processes, and is a key protein in the HBV life cycle. For this reason, both the protein and the gene could be valuable targets for new therapeutic and diagnostic strategies. Moreover, alterations in the protein sequence could serve as potential markers of disease progression. AIM: To detect, by next-generation sequencing, HBC hyper-conserved regions that could potentially be prognostic factors and targets for new therapies.Entities:
Keywords: Amino acid substitution; Gene therapy; Genetic conservation; Hepatitis B core gene; Hepatitis B virus; Next-generation sequencing; Small interfering RNA
Mesh:
Substances:
Year: 2020 PMID: 32523313 PMCID: PMC7265140 DOI: 10.3748/wjg.v26.i20.2584
Source DB: PubMed Journal: World J Gastroenterol ISSN: 1007-9327 Impact factor: 5.742
Figure 1Schematic summary of the 3 amplification steps. In the first amplification step (PCR1), a large region was amplified. In the following step (PCR2), the region was divided into 2 amplicons that overlapped in a 112 nucleotide-long portion. In the third step (PCR 3) a sample identifier (MID) was added. PCR: Polymerase chain reaction; MID: Multiplex identifier.
Primer design and polymerase chain reaction protocols for each amplified region
| 1st step | PCR1 | Forward | TAGGAGGCTGTAGGCATA | 1774-2930 | 95 ºC 5 min; (95ºC 20 s, 49 ºC 20 s, 72 ºC 15 s) × 35 cycles; 72 ºC 3 min |
| Reverse | GGAAAGAATCCCAGAGG | ||||
| 2nd step | PCR2 A.1 | Forward A.1 | 1863-2317 | 95 ºC 2 min; (95 ºC 20 s, 58 ºC 20 s, 72 ºC 15s ) × 35 cycles; 72 ºC 3 min | |
| Reverse A.1 | |||||
| PCR2 A.2 | Forward1 A.2 | 2205-2483 | 95 ºC 2 min; (95 ºC 20 s, 50 ºC 20 s, 72 ºC 15 s) × 35 cycles; 72 ºC 3 min | ||
| Forward2 A.2 | |||||
| Forward3 A.2 | |||||
| Forward4 A.2 | |||||
| Reverse A.2 | |||||
| 3rd step | PCR3 | Forward (specific per sample) | 95 ºC 2 min; (95 ºC 20 s, 60 ºC 20 s, 72 ºC 15 s) × 20 cycles; 72 ºC 3 min | ||
| Reverse (specific per sample) |
Bold nucleotides indicate the M13 sequence. Forward primers in PCR2-A2 were multiplexed at the same concentration to cover all HBV genotypes. The protocols of amplification are reported. A.1: Amplicon 1; A.2: Amplicon 2; PCR: Polymerase chain reaction; MID: Multiplex identifier.
Main clinical and viral characteristics of hepatitis B virus-infected patients enrolled in the study
| Age | 38.5 [33.5-46.5] | 67 [58-69] | 56 [48-66] | 0.002 |
| Viral load (log IU/mL) | 6.8 [5.7-8.0] | 5.5 [4.7-6.7] | 5.7 [4.8-6.2] | NS |
| ALT | 56.5 [41.25-180.5] | 70 [47-212] | 46 [43-79] | NS |
| AST | 56 [34.75-124] | 120.5 [59-163.5] | 66.45 [48.675-84.225] | NS |
| Platelets (109/L) | 183 [161.5-226] | 136 [98.5-255] | 81.5 [61.25-101.75] | NS |
| Proportion | ||||
| Gender (male) | 11/16 | 15/17 | 3/5 | |
| HBeAg (positive) | 8/16 | 3/17 | 0/5 | |
| Genotype, % ( | ||||
| A | 18.8 (3) | 5.9 (1) | 20.0 (1) | |
| C | 37.5 (6) | 5.9 (1) | 20.0 (1) | |
| D | 25.0 (4) | 64.7 (11) | 40.0 (2) | |
| D/A | (0) | 5.9 (1) | 0.0 (0) | |
| D/E | 6.3 (1) | 11.8 (2) | 0.0 (0) | |
| E | 6.3 (1) | 0.0 (0) | 20.0 (1) | |
| F | 6.3 (1) | 5.9 (1) | 0.0 (0) |
D/E and D/A indicate mixtures of the 2 genotypes. The frequency of each genotype within the clinical groups is reported as percentage (%) and number of patients (n). CHB: Chronic hepatitis B infection without liver damage; HCC: Hepatocellular carcinoma; LC: Liver cirrhosis; ALT: Alanine aminotransferase (normal value < 40 IU/mL); AST: Aspartate aminotransferase (normal value < 40 IU/mL); IQR: Interquartile range; NS: No-statistical P value.
Figure 2Information content analysis at nucleotide level. A: Sliding window analysis of Hepatitis B core gene performed by aligning the quasispecies haplotypes for all 38 patients with and without considering their relative frequency. Each point on the graph represents the mean information content (in bits) of the 25-nucleotides windows, with forward displacement of 1 nucleotide step between windows. The purple line shows the analysis by haplotype (By hpl), which is the mean information content obtained from the multiple alignments of all quasispecies haplotypes. The orange line represents the analysis by haplotype frequency (By hpl freq), which is the mean information content from the multiple alignments of all the patients’ quasispecies haplotypes considering their relative frequency. The dashed lines indicate the 3 common hyper-conserved regions observed, with reporting of their positions. B: Representation of detected hyper-conserved regions as sequence logos (with reporting of nucleotide positions). The relative sizes of the letters in each stack indicate their relative frequencies at each position within the multiple alignments of nucleotide haplotypes. The total height of each stack of letters depicts the information content of each nucleotide position, measured in bits (Y-axis): from minimum (0) to maximum conservation (2). By hpl: Analysis by haplotype; By hpl freq: Analysis by haplotype frequency; nt: Nucleotide.
Figure 3Information content analysis at nucleotide level by clinical stage group. A: By-haplotype sliding window analysis of the Hepatitis B core gene according to different clinical groups (HCC in blue, CHB in red, and LC in green). The portions and positions where CHB showed lower levels of conservation than the others (P < 0.05) are shown in red. B: Representation of the information content of CHB- and LC-specific conserved nucleotide regions as sequence logos. Positions are reported at the top of each logo. CHB: Chronic hepatitis B infection without liver damage; HCC: Hepatocellular carcinoma; LC: Liver cirrhosis; nt: Nucleotide; P: P value.
Figure 4Information content analysis at amino acid level. A: Sliding window analysis of the Hepatitis B core protein sequence for all 38 patients with and without consideration of relative frequency. Each point on the graph is the result of the mean information content (in bits) of the 10-amino acid in size windows, with forward displacement between them of 1 amino acid step. The purple line represents the information content of all the quasispecies haplotypes (By hpl) whereas the orange line indicates the information content considering haplotype frequency (By hpl freq). The dashed lines show the 2 common amino acid hyper-conserved regions observed, with reporting of their positions. B: Representation of amino acid hyper-conserved regions detected as sequence logos (with reporting of amino acid positions). The relative sizes of the letters in each stack indicate their relative frequencies at each position within the multiple alignments of amino acid haplotypes. The total height of each stack depicts the information content of each amino acid position, measured in bits (Y-axis); range: 0 bits (0% conservation) to 4.32 bits (100% conservation). By hpl: Analysis by haplotype; By hpl freq: Analysis by haplotype frequency; aa: Amino acid.
Figure 5Information content analysis at amino acid level by clinical group. A: Sliding window analysis of the Hepatitis B core protein by haplotype between the different clinical groups (HCC in blue, CHB in red, and LC in green). The green horizontal line corresponds to the region where LC group is less conserved compared to the CHB and HCC groups (P < 0.05). B: Representation of CHB- and LC-specific conserved amino acid regions as sequence logos. Positions are reported at the top of each logo. CHB: Chronic hepatitis B infection without liver damage; HCC: Hepatocellular carcinoma; LC: Liver cirrhosis; aa: Amino acid; P: P value.
Relative frequencies of nucleotide insertions/deletions detected
| 1951 (1 nt: T) | 2085 (1 nt: G) | ||
| CHB (8/16) | 1 | 8.36 (8.7) | |
| 2 | 17.12 (40) | ||
| 3 | 3.19 (5) | ||
| 4 | 0.37 (5.9) | ||
| 9 | 2.02 (8.82) | ||
| 10 | 1.34 (50) | ||
| 12 | 1.04 (10) | ||
| 13 | 2.79 (22.22) | ||
| HCC (2/17) | 28 | 0.78 (4) | |
| 33 | 2.42 (4.8) | ||
| LC (1/5) | 34 | 17.42 (19.2) | |
The table shows the relative frequency of insertions/deletions, together with the percentage (%) of mutated haplotypes per patient. Only patients carrying these mutations were included in the table. CHB: Chronic hepatitis B infection without liver damage; HCC: Hepatocellular carcinoma; LC: Liver cirrhosis; T: Thymine; G: Guanine; nt: Nucleotide.
Figure 6Relative frequency of P79Q substitution in the 3 clinical groups. Each dot represents a patient. The Bonferroni-corrected P value was calculated by Kruskal-Wallis test with posthoc Dunn multiple comparison test. (aP < 0.05). CHB: Chronic hepatitis B infection without liver damage; HCC: Hepatocellular carcinoma; LC: Liver cirrhosis; P: P value; P79Q: Proline to glutamine in position 79.