| Literature DB >> 35625929 |
Selene García-García1,2,3, Andrea Caballero-Garralda1,4, David Tabernero1,2, Maria Francesca Cortese1,2, Josep Gregori2,5, Francisco Rodriguez-Algarra6, Josep Quer2,3,5, Mar Riveiro-Barciela2,7, Maria Homs8,9, Ariadna Rando-Segura1,10, Beatriz Pacin-Ruiz1,2,3, Marta Vila1, Roser Ferrer-Costa1, Tomas Pumarola10, Maria Buti2,7, Francisco Rodriguez-Frias1,2,3.
Abstract
Deletions in the 3' end region of the hepatitis B virus (HBV) X open reading frame (HBX) may affect the core promoter (Cp) and have been frequently associated with hepatocellular carcinoma (HCC). The aim of this study was to investigate the presence of variants with deletions and/or insertions (Indels) in this region in the quasispecies of 50 chronic hepatitis B (CHB) patients without HCC. We identified 103 different Indels in 47 (94%) patients, in a median of 3.4% of their reads (IQR, 1.3-8.4%), and 25% (IQR, 13.1-40.7%) of unique sequences identified in each quasispecies (haplotypes). Of those Indels, 101 (98.1%) caused 44 different altered stop codons, the most commonly observed were at positions 128, 129, 135, and 362 (putative position). Moreover, 39 (37.9%) Indels altered the TATA-like box (TA) sequences of Cp; the most commonly observed caused TA2 + TA3 fusion, creating a new putative canonical TATA box. Four (8%) patients developed negative clinical outcomes after a median follow-up of 9.4 (8.7-12) years. In conclusion, we observed variants with Indels in the HBX 3' end in the vast majority of our CHB patients, some of them encoding alternative versions of HBx with potential functional roles, and/or alterations in the regulation of transcription.Entities:
Keywords: HBX 3′ end region; deletions; hepatitis B X open reading frame; hepatitis B virus; insertions; next-generation sequencing; quasispecies
Year: 2022 PMID: 35625929 PMCID: PMC9139148 DOI: 10.3390/biomedicines10051194
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Characteristics of patients on obtaining the sample analyzed.
| Patients ( | |
|---|---|
| Age (median years, IQR) | 43 (35–55) |
| Gender ( | 42 (84) |
| HBV-DNA (median logIU/mL, IQR) | 5.9 (5.3–7.9) |
| HBV genotype | |
| 39 (78) | |
| 8 (16) | |
| 3 (6) | |
| HBeAg-negative ( | 33 (66) |
| ALT (median IU/L, IQR) | 80 (57–115) |
| Liver fibrosis 1 | |
| 45 (90) | |
| 5 (10) | |
| Treatment history | |
| 17 (34) | |
| 2 (4) |
1 Ishak fibrosis stage (F), assessed by liver biopsy. Abbreviations: N indicates number; IQR, interquartile range; HBV, hepatitis B virus; IU, international units; HBeAg, hepatitis B e antigen; ALT, alanine aminotransferase; NA, nucleoside/nucleotide analogs; IFN, interferon.
Figure 1Fragment of hepatitis B virus (HBV) genome analyzed in this study. It encompasses parts of the Polymerase, X (HBX), and pre-core/core open reading frames (ORF). The fragment of HBX analyzed encodes most of hepatitis B X protein (HBx) transactivating C-terminal domain, including its essential α-helical motif (Hbox) [27]. In addition, the nucleotide (nt) sequence of that fragment (nts 1596 to 1912) contains most of the core promoter sequence, including the core upstream regulatory sequence (CURS) and the basic core promoter (BCP) with the TATA-like boxes 1–4 (TA1–TA4); nt positions are shown as described in [6]. It also includes the Enhancer II and the direct repeat 1 (DR1) sequences; nt positions are shown as described in [7].
Description and frequencies of reads and haplotypes of the insertions, deletions or combinations of both, identified in more than 10% of the 50 patients studied.
| ID | Deletions | Insertions | N Patients (%) | Median % Reads/Patient (IQR) | Median % HPL/Patient (IQR) |
|---|---|---|---|---|---|
| 11 | 1646 | - | 6 (12) | 0.5 (0.4–0.6) | 6.5 (2.8–10.4) |
| 30 | 1692 | 1697TT | 6 (12) | 0.5 (0.4–0.6) | 7.5 (6.1–14.6) |
| 37 | - | 1739G | 9 (18) | 0.3 (0.3–0.5) | 8.3 (7.1–14.3) |
| 38 | - | 1746G/T | 7 (14) | 0.4 (0.3–0.4) | 10 (6.7–12.7) |
| 40 | 1749 | - | 7 (14) | 0.4 (0.3–0.5) | 6.7 (5.2–11.3) |
| 51 | 1763–1770 | - | 10 (20) | 1.7 (0.9–2.1) | 5.5 (3.0–8.2) |
| 59 | - | 1781C | 5 (10) | 0.6 (0.6–0.7) | 8.3 (4.8–8.3) |
| 74 | - | 1820C | 7 (14) | 0.4 (0.4–0.9) | 4.3 (1.9–4.8) |
| 84 | - | 1825T | 19 (38) | 1.5 (0.7–2.1) | 4.8 (2.8–6.9) |
| 85 | 1825 | - | 10 (20) | 0.4 (0.4–0.8) | 4.1 (2.8–5.1) |
| 88 | - | 1826C/T | 9 (18) | 0.6 (0.4–0.9) | 4.8 (2.6–8.3) |
| 103 | - | 1838A | 5 (10) | 1.2 (0.9–2.7) | 2.3 (2.2–5.9) |
Abbreviations: ID indicates code to identify single insertion or deletion, or combinations of them (see Table S3); N, number; IQR, interquartile range; HPL, haplotypes.
Description and frequencies of reads and haplotypes of the altered stop codons identified in more than 10% of the 50 patients studied.
| IDs | N Patients (%) | Median % Reads/Patient (IQR) | Median % HPL/Patient (IQR) | |
|---|---|---|---|---|
| 95 | 2, 3, 7, 8 | 5 (10) | 0.4 (0.3–0.5) | 5.9 (5.3–8.3) |
| 109 | 28, 29, 30, NA | 9 (18) | 0.5 (0.4–0.6) | 6.7 (4.3–16.7) |
| 125 | 18, NA | 8 (16) | 1.9 (0.6–3.8) | 3.8 (2.2–5.8) |
| 128 | 37, 38, 41 | 15 (30) | 0.4 (0.3–0.6) | 8.3 (5.6–18.3) |
| 129 | 6, 11, 23, 25, 34, 35, 36, 39, 40 | 21 (42) | 0.5 (0.4–0.6) | 7.1 (4.8–11.1) |
| 132 | 45, 46, 49, 57 | 5 (10) | 0.5 (0.5–0.8) | 2.6 (2.6–4.3) |
| 135 | 14, 15, 22, 47, 48, 51, 53, 55, 80, 81, 87, 90, 100 | 14 (28) | 2.0 (0.5–4.9) | 6.1 (3.0–11.8) |
| 138 | 59 | 5 (10) | 0.6 (0.6–0.7) | 8.3 (4.8–8.3) |
| 149 | 63, 72 | 5 (10) | 0.7 (0.4–1.0) | 4.5 (4.2–4.8) |
| 156 | 12, 13, 27, 58, 76, 82, 89 | 7 (14) | 2.6 (0.7–7.3) | 7.1 (5.5–7.7) |
| 179 | 85 | 10 (20) | 0.4 (0.4–0.8) | 4.1 (2.8–5.1) |
| 180 * | 75, 86, 98 | 5 (10) | 0.9 (0.6–1.2) | 5.1 (3.6–5.9) |
| 207 * | 101, NA | 6 (12) | 0.5 (0.4–0.6) | 3 (2.4–4.4) |
| 360 * | 70, 92 | 5 (10) | 0.6 (0.6–1.1) | 2.6 (2.3–4.3) |
| 362 * | 74, 78, 84, 88 | 24 (48) | 1.6 (0.8–2.1) | 5.8 (3.1–10.7) |
Abbreviations: HBX indicates hepatitis B X open reading frame; IDs, code to identify single insertion or deletion, or combinations of them, described in Table S3; N, number; IQR, interquartile range; HPL, haplotypes; NA, haplotype/s without insertions and/or deletions. * Putative stop codons of the HPL without stop codon in HBX, obtained by extension of those HPL with reference sequences V01460 (for genotype D HPL) and X02763 (for genotype A HPL) and continue translation.
Figure 2Comparison between the predicted local RNA folding of the region between nucleotides 1820 and 1880 of the hepatitis B virus (HBV) genome, including the polyT homopolymeric region between positions 1821–1825 (A); the region between nucleotides 1625 to 1691 of the human immunodeficiency virus-1 (HIV), including the Programmed −1 ribosomal frameshifting mRNA signal (B). The HBV sequence has been modeled from the genotype A reference sequence used to report Indels, included in Table S2. The HIV sequence has been modeled from the Genbank pattern with accession number NC_001802.1.
Alterations in the core promoter TATA-like boxes caused by insertions and deletions identified between nucleotides 1751 and 1787 in the 50 patients studied.
| TA Alteration | Cause of Alteration and Indels Involved | N Patients (%) | Median % Reads/Patient (IQR) | Median % HPL/Patient (IQR) |
|---|---|---|---|---|
| Insertion in TA1 (nt 1750–1755) | 3 (6) | 0.3–0.4 * | 3.4–5.9 * | |
| Partial or total TA1 or TA2 (nt 1758–1762) elimination | 7 to 10 nt Del between nt 1754 and 1767: | 5 (10) | 0.5 (0.4–2.0) | 3.6 (2.2–4.5) |
| Partial or total TA2 + TA3 (nt 1758–1775) elimination | Dels between nt 1756 and 1787: | 3 (6) | 0.9–71.2 * | 2.3–61.8 * |
| TA2 + TA3 Fusion | 8 nt Del between nt 1763 and 1770: | 10 (20) | 1.8 (0.9–4.3) | 5.5 (3.0–17.5) |
| Partial or total TA3 (nt 1771–1775) elimination | 8 to 10 nt Del between nt 1763 and 1776 | 7 (14) | 0.4 (0.4–2.1) | 2.6 (2.4–5.6) |
| No TA affected | 10 (20) | 0.6 (0.4–0.9) | 6.5 (4.3–8.3) |
Abbreviations: TA indicates TATA-like boxes; N, number; IQR, interquartile range; HPL, haplotypes; nt, nucleotides; Ins, insertions; Del, deletions; IDs, code to identify single Ins or Del, or combinations of them, described in Table S4. * No median was calculated due to low number of patients, instead the maximum and minimum percentage of reads and haplotypes per patient are shown.
Figure 3Comparison between the predicted local RNA folding between nucleotides 1741 and 1912 encompassing the TA region of the core promoter, of a WT variant (A) versus a variant including the deletion 1763–1770 (ID: 51) (B). Both variants have been modeled from the genotype A reference sequence used to report Indels, included in Table S2. In red, TA1 sequence (AGAUUA); in green, TA2 sequence (UUAAA); in blue, TA3 sequence (UAUUA); in purple, TA4 sequence (CAUAAAUU); in grey, DR1 sequence (UUCACCUCUGC); in orange, TA23 sequence (UUAAAUAUUA).
Description of the insertions, deletions, or combinations of both, identified by cloning/sequencing analysis in the 4 patients selected.
| Patient | Deletions | Insertions | N Clones (%) | ID | N Reads (%) |
|---|---|---|---|---|---|
| 2 (24) | 1627 + 1758 − 1777 | 1647 TCTTACATAAGAGGACTCTTGGAC | 12 (50) | 12 | 9554 (51.9) |
| - | 1820 C | 1 (4.2) | 74 | 75 (0.4) | |
| 1627 + 1726 + 1758 − 1777 | 1647 TCTTACATAAGAGGACTCTTGGAC | 2 (8) | - | - | |
| 1627 + 1758 − 1777 | 1647 TCTTACATAAGAGGACTCTTGGAC + 1822 ATTCAA + 1825 T | 1 (4.2) | - | - | |
| 1627 | 1600T + 1647 TCTTACATAAGAGGACTCTTGGAC | 1 (4.2) | - | - | |
| - | - | 7 (29.2) | - | 4692 (25.5) | |
| 17 (29) | - | 1825 T | 2 (6.9) | 84 | 3140 (14.2) |
| - | 1909 TG | 1 (3.4) | * | * | |
| - | - | 26 (89.7) | - | 18736 (84.8) | |
| 20 (18) | 1825 | - | 1 (5.6) | 85 | 3011 (15.5) |
| - | 1826 TTC | 6 (33.3) | 89 | 2171 (11.2) | |
| - | - | 11 (61) | - | 14064 (72.5) | |
| 39 (19) | - | 1605 T | 1 (5.3) | - | - |
| - | 1825 T | 7 (36.8) | 84 | 3242 (22.9) | |
| - | 1895 T | 1 (5.3) | * | * | |
| - | - | 10 (52.6) | - | 9701 (68.5) |
Abbreviations: N Clones indicates number of clones showing a specific insertion, deletion or combination of both; ID, code to designate single insertion or deletion, or combinations of them identified by next-generation sequencing (see Table S3); N Reads, number of next-generation sequencing reads showing a specific single insertion or deletion, or a combination of both. * Insertions that have not been assessed in next-generation sequencing reads since they are located out of hepatitis B X gene open reading frame.