| Literature DB >> 29785078 |
Carolina González1, David Tabernero1, Maria Francesca Cortese1, Josep Gregori2, Rosario Casillas1, Mar Riveiro-Barciela3, Cristina Godoy1, Sara Sopena1, Ariadna Rando1, Marçal Yll1, Rosa Lopez-Martinez1, Josep Quer3, Rafael Esteban3, Maria Buti3, Francisco Rodríguez-Frías1.
Abstract
AIM: To detect hyper-conserved regions in the hepatitis B virus (HBV) X gene (HBX) 5' region that could be candidates for gene therapy.Entities:
Keywords: Gene therapy; HBV conserved regions; Hepatitis B X gene; Hepatitis B X protein; Hepatitis B virus; Next-generation sequencing; Small interference RNA
Mesh:
Substances:
Year: 2018 PMID: 29785078 PMCID: PMC5960815 DOI: 10.3748/wjg.v24.i19.2095
Source DB: PubMed Journal: World J Gastroenterol ISSN: 1007-9327 Impact factor: 5.742
Figure 1Hepatitis B virus genome and transcripts. The figure shows the HBV genome (in grey), with the DR1 and DR2 (direct repeat) regions, necessary for viral DNA synthesis. The viral ORFs are highlighted with colored arrows, and nucleotide positions are reported. Wavy lines show the various HBV transcripts: the 3.5-kb transcript, corresponding to the pregenomic RNA, which is translated to the core and polymerase and later subjected to reverse-transcription in the viral capsid, or to the precore/core transcript, which is translated to the precore protein; the 2.4-kb and 2.1-kb transcripts, which are translated, respectively, to large and medium/small HBsAg); and the 0.7-kB transcript, which is translated to the HBx protein. The region analyzed in this study and its corresponding nt positions are indicated by red dashed lines. Note that the region of interest is included in all the viral transcripts. HBV: Hepatitis B virus; ORF: Open reading frame.
Main clinical and virological characteristics of the hepatitis B virus infected patients enrolled
| 1 | 27 | M | Sub-Saharan | Chronic hepatitis | Negative | 7.8 | 170 | E |
| 2 | 31 | M | Asian | Chronic hepatitis | Negative | 6.6 | 103 | C |
| 3 | 51 | M | Caucasian | Chronic hepatitis | Negative | 6.6 | 262 | D |
| 4 | 28 | F | Caucasian | Chronic hepatitis | Negative | 7.9 | 126 | D |
| 5 | 47 | F | Caucasian | Chronic hepatitis | Negative | 6.3 | 170 | D |
| 6 | 37 | F | Caucasian | Chronic hepatitis | Negative | 4.5 | 53 | D |
| 7 | 37 | M | Caucasian | Chronic hepatitis | Negative | 4.5 | 33 | F |
| 8 | 38 | M | Caucasian | Chronic hepatitis | Negative | 5.0 | 29 | D |
| 9 | 46 | M | Caucasian | Chronic hepatitis | Negative | 5.8 | 88 | F |
| 10 | 46 | F | Caucasian | Chronic hepatitis | Negative | 5.3 | 23 | H |
| 11 | 71 | F | Caucasian | Chronic hepatitis | Negative | 6.2 | 87 | F |
| 12 | 51 | M | Asian | Chronic hepatitis | Negative | 5.7 | 435 | C |
| 13 | 52 | M | Caucasian | Chronic infection | Negative | 4.4 | 18 | A |
| 14 | 40 | F | Caucasian | Chronic infection | Negative | 4.2 | 29 | D |
| 15 | 33 | M | Asian | Chronic infection | Negative | 4.3 | 25 | D |
| 16 | 63 | M | Hispanic | Cirrhosis | Negative | 4.0 | 16 | A |
| 17 | 53 | M | Caucasian | Cirrhosis/HCC | Negative | 3.7 | 36 | A |
| 18 | 35 | M | Sub-Saharan | Chronic hepatitis | Positive | 5.7 | 36 | E |
| 19 | 37 | M | Caucasian | Chronic hepatitis | Positive | 8.4 | 32 | C |
| 20 | 45 | M | Caucasian | Chronic hepatitis | Positive | 5.6 | 35 | A |
| 21 | 29 | F | Asian | Chronic hepatitis | Positive | 6.9 | 355 | B |
| 22 | 28 | M | Asian | Chronic hepatitis | Positive | > 8.0 | 341 | C |
| 23 | 28 | M | Asian | Chronic infection | Positive | 8.7 | 24 | C |
| 24 | 28 | F | Asian | Chronic infection | Positive | 8.8 | 22 | C |
| 25 | 55 | F | Caucasian | Cirrhosis | Positive | 5.4 | 73.9 | A |
| 26 | 82 | F | Caucasian | Cirrhosis/HCC | Positive | 4.8 | 24 | C |
| 27 | 64 | M | Caucasian | Cirrhosis/HCC | Positive | 6.3 | 45 | D |
Genotype determined by Sanger sequencing of the X region (same region as was analyzed by next-generation sequencing). ALT: Alanine aminotransferase; HBV: Hepatitis B virus; HBeAg: Hepatitis B e antigen; M: Male; F: Female; HBV: Hepatitis B virus.
Math 1Math(A1).
Results of genotyping of nucleotide haplotypes obtained in each patient, extracted by next-generation sequencing based on ultra-deep pyrosequencing analysis %
| 1 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
| 2 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
| 3 | 7.1 | 0 | 0 | 92.9 | 0 | 0 | 0 |
| 4 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
| 5 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
| 6 | 1.7 | 0 | 0 | 98.3 | 0 | 0 | 0 |
| 7 | 0 | 0 | 0.3 | 51.3 | 0 | 48.4 | 0 |
| 8 | 7.1 | 0 | 0 | 92.9 | 0 | 0 | 0 |
| 9 | 0 | 0 | 0.3 | 7 | 0 | 92.7 | 0 |
| 10 | 0.9 | 0 | 0 | 50.9 | 0 | 0 | 48.2 |
| 11 | 0 | 0 | 0.3 | 7 | 0 | 92.7 | 0 |
| 12 | 95.1 | 0 | 4.4 | 0.5 | 0 | 0 | 0 |
| 13 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
| 14 | 46.6 | 0 | 8.2 | 33.2 | 0 | 12 | 0 |
| 15 | 89.7 | 0 | 4.4 | 8 | 0 | 1.5 | 0 |
| 16 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
| 17 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
| 18 | 0 | 0 | 0 | 0 | 100 | 0 | 0 |
| 19 | 0 | 0 | 95.3 | 3,6 | 0.9 | 0 | 0 |
| 20 | 100 | 0 | 0 | 0 | 0 | 0 | 0 |
| 21 | 0 | 99.6 | 0.4 | 0 | 0 | 0 | 0 |
| 22 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
| 23 | 0 | 0 | 87.8 | 12.2 | 0 | 0 | 0 |
| 24 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
| 25 | 97.9 | 0 | 0 | 2.1 | 0 | 0 | 0 |
| 26 | 0 | 0 | 100 | 0 | 0 | 0 | 0 |
| 27 | 0 | 0 | 0 | 100 | 0 | 0 | 0 |
%A to %H indicates the percentage of nucleotide haplotypes from each patient, classified as HBV genotype A, B, C, D, E, F or H. HBV: Hepatitis B virus.
Figure 2Sliding window analysis of the nucleotide region of interest in the hepatitis B virus X gene (nt 1255-1611). Each point on the graph is the result of the mean information content (IC, bits) of the windows 25-nt in size, with displacement between them in 1-nt steps. The purple line represents the mean IC from the multiple alignments of all haplotypes (in abundance > 0.25%) in the quasispecies (QS) of all patients (n = 720), whereas the blue line shows the mean IC obtained from the multiple alignments of the consensus obtained for each patient (n = 27).
Figure 3Representation by sequence logos of the information content of the most conserved regions detected in the multiple alignment of all nucleotide haplotypes obtained (quasispecies), nts 1255-1286 and nts 1519-1603. The relative sizes of the letters in each stack of nt sequence logos indicate their relative frequencies at each position within the multiple alignments of nt haplotypes. The total height of each stack of letters depicts the IC of each nt position, measured in bits (Y-axis): 0 bits being the minimum and 2, the maximum conservation. Nucleotide positions with an IC between 1.6 and 1.8 (80%-90% of maximum conservation) are indicated by light blue circles and those with an IC between 1.4 and 1.6 bits (70%-80% of maximum conservation) by pink circles. IC: Information content.
Figure 4Sliding window of the coded the core and X protein amino acid sequence (aa 1-79). Each point on the graph is the result of the mean information content (IC, bits) between windows of 10-aa size with displacement between them in 1-aa steps. The purple line represents the mean IC from the multiple alignments of all haplotypes (in abundance > 0.25%) in the quasispecies (QS) of all patients (n = 330), whereas the blue line shows the mean IC obtained from the multiple alignments of the consensus obtained for each patient (n = 27).
Figure 5Representation by sequence logos of the information content of the conserved region in the the core and X protein amino acid sequence (aa 63-76). The relative sizes of the letters in each stack in the sequence logos indicate their relative frequencies at each position within the multiple alignments of aa haplotypes obtained. The total height of each stack of letters depicts the IC of each aa position, measured in bits (Y-axis): 0 bits being the minimum and 4.32 bits the maximum conservation. Amino acids belonging to the Kunitz-like domain portion are framed in green. IC: Information content.