| Literature DB >> 15731337 |
Augustine Chen1, Y F Kao, Chris M Brown.
Abstract
The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15731337 PMCID: PMC549565 DOI: 10.1093/nar/gki251
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The conservation of the C0 ORF in all HBV genotypes and other members of the orthohepadnaviridae. (A) Schematic representation of the HBV pregenomic RNA transcript. The numbering in the scheme is such that nucleotide 1 is the pgRNA start site, indicated by the arrow (this number corresponds to nucleotide number 1816 in the ayw subtype). The pgRNA contains an epsilon structure, near the 5′ and 3′ end of the RNA. This transcript also contains a redundant sequence of 117 bases in its 3′ end, found in the 5′ end of the transcript. Vertical bars with dots represent the initiation codons within the pgRNA labeled C0, C, C1, J, C2 and P. The height of each vertical bar with black dots represents the match to the ‘ideal’ initiation context of Kozak's consensus. ORFs encoded by each initiation codon are represented by filled boxes while the poly(A) tail is denoted by An. (B) Schematic representation of the three different ORFs in the pgRNA in which the C0, C, J and P protein are made. The initiation codons (vertical bars with black dots) and termination codons (long vertical bars without dots) are marked in each of the three possible reading frames with the corresponding ORFs represented by filled boxes. (C) The comparison of DNA sequences in the region of the C0 ORF from representative members of the orthohepadnaviruses. The alignment of these sequences revealed a well-conserved ‘weak’ initiation codon (C0) and its 19 amino acid ORF which overlaps the C ORF by 8 bases.
PCR primers used to generate parent constructs
| Primers | Sequence |
|---|---|
| APH3T7F | 5′-gggcccaa |
| H3T7PCRNAF | 5′-gcg |
| H3T7PGRNAF | 5′-gcgat |
| AatIILUCR | 5′-gtttttggcgtcttc |
| XbaIF | 5′-gc |
| XbaIR | 5′-gc |
| GLR | 5′-CCAGGGCGTATCTCTTCATAGCC-3′ |
The italicized sequence represents the T7 promoter and the underlined sequences represent the restriction enzyme site.
Figure 2Translation initiation from the C0 initiation codon compared with that from C and P initiation codons. (A) Schematic representation of the DNA constructs used in this study (diagram not to scale). It contains the SV40 and T7 promoter at the 5′ end followed by DNA sequences corresponding to the HBV pregenomic RNA leader fused to the luciferase gene. The nucleotide sequence for the hepatitis B virus pregenomic RNA leader has been deposited in the DNA Data Bank of Japan under DDBJ accession no. AB037684. Immediately downstream of the luciferase gene is the 117 bases of repeated sequence to mimic the terminally redundant pgRNA which also contains the epsilon structure denoted by ε. At the 3′ end, An denotes the poly(A) tract while the SV40 poly(A) signal is denoted by an enclosed A. Vertical bars with dots represent the initiation codons within the pgRNA labeled C0, C in a light and dark font. The darker font represents the initiation codon that is in-frame to the luciferase reporter gene. ORFs encoded by each initiation codon are represented by filled boxes. Any mutation introduced into the leader sequence is denoted with an asterisk (*). Other labels are as in Figure 1. (B) Normalized expression level from the C0, C and P initiation codons after transfection into HepG2 cells. Expression from each initiation codon is expressed as a percentage of normalized CLUC expression. These results are averages of three replicates from two independent experiments. (C) Luciferase expression was confirmed via western blot using anti-luciferase antibody on HepG2 transfected lysates separated on 5–20% SDS–PAGE. (D) Fluorograph of the luciferase fusion proteins translated off capped transcript in rabbit reticulocyte. Proteins were separated on 5–20% SDS–PAGE. Lane 1, CLUC RNA; lane 2, C0LUC RNA; lane 3, PLUC RNA; M, protein or DNA marker. The integrity and quantitation of the capped RNA used for cell-free translation is shown on the RNA gel below. (E) Detection of the C0LUC fusion protein in HepG2 transfected lysates via immunoblot analysis using anti-C0 antibody. Lane 4 contains 50 ng of synthetic C0.
Plasmids used in this study
| Plasmid name | Template PCR | Mutations | Position | Effect |
|---|---|---|---|---|
| P series | Determine effect of uATGs on P expression | |||
| PLUC | ATCC40103 | NcoI to AatII | 496 | Introduction of adw HBV leader with AatII |
| C0KOPLUC | PLUC | ATG to AaG | 36 | Removal of C0 ATG codon |
| C0stopKOPLUC | PLUC | TGA to cGA, TAA to cAA | 92, 101 | Extended C0 ORF to 38 codons from 20 codons |
| C0CPLUC | PLUC | A deletion | 141 | C0 ORF removed via fusion to C ORF |
| CKOPLUC | PLUC | ATG to AaG | 85 | Removal of C ATG codon |
| C1KOPLUC | PLUC | ATG to AaG | 280 | Removal of C1 ATG codon |
| C1D3PLUC | PLUC | TTGATGA to aTcATGA | 276, 278 | Optimized C1 ATG context |
| JKOPLUC | PLUC | ATG to ggG | 347, 348 | Removal of J ATG codon |
| C2KOPLUC | PLUC | ATG to AaG | 361 | Removal of C2 ATG codon |
| PKOLUC | PLUC | ATG to AaG | 491 | Removal of P ATG codon |
| uaugkopluc | PLUC | ATGs to AaGs or ggG | 36, 85, 280, 347,348, 361 | Removal of all uATGs except P ATG codon |
| UKOLUC | PKOLUC | ATGs to AaG or ggG | 36, 85, 280, 347,348, 361, 491 | Removal of all uATGs including P ATG |
| PcPLUC | PLUC | 34 bases insertion | Upstream of pgRNA | 34 base extension at 5′ end to mimic the pcRNA leader |
| PCKOpcPLUC | pcPLUC | ATG to AaG | 32 (pcRNA numbering) | Removal of PC ATG codon |
| pg series | Initiation level from each ATG codons in respective leader sequence and uATGs | |||
| C0LUC | PLUC | C0 ORF fused to luciferase ORF | ||
| CLUC | C0LUCX | C ATG directs luciferase expression | ||
| C1LUC | PLUC | C1 ATG directs luciferase expression | ||
| JLUC | PLUC | J ATG directs luciferase expression | ||
| C2LUC | PLUC | C2 ATG directs luciferase expression | ||
| C0KO series | Derivative of pg series | Determine the effect of C0 on downstream expression | ||
| COKOCLUC | CLUC | ATG to AaG | 36 | C0 ATG codon abolished |
| COKOC1LUC | C1LUC | ATG to AaG | 36 | C0 ATG codon abolished |
| COKOJLUC | JLUC | ATG to AaG | 36 | C0 ATG codon abolished |
| COKOC2LUC | C2LUC | ATG to AaG | 36 | C0 ATG codon abolished |
| C0CJLUC | JLUC | ATG to AaG | 36 | C0 ORF abolished |
| C0stopKOJLUC | JLUC | ATG to AaG | 36 | longer C0 ORF 38 codons from 20 codons |
| C0D3 series | Derivative of pg series | Determine the effect of an optimal C0 context on downstream expression | ||
| C0D3CLUC | CLUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| C0D3C1LUC | C1LUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| C0D3JLUC | JLUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| C0D3C2LUC | C2LUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| C0D3PLUC | PLUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| C0D3ORF | PLUC | GTACATGT to caACATGT | 31 and 32 | C0 ATG context optimized |
| AUG series | Determine the strength of respective ATG context | |||
| COAUG | C0LUC | C0 ATG context directs luciferase expression | ||
| CAUG | COAUG | C ATG context directs luciferase expression | ||
| C1AUG | COAUG | C1 ATG context directs luciferase expression | ||
| JAUG | COAUG | J ATG context directs luciferase expression | ||
| C2AUG | COAUG | C2 ATG context directs luciferase expression | ||
| PAUG | COAUG | P ATG context directs luciferase expression | ||
| CAUGKO | CAUG | C ATG abolished | ||
Figure 4The effect of C0 initiation codon removal on expression at downstream initiation. (A) Schematic diagram of the HBV test constructs used to study the effect of C0 on downstream expression. The initiation codon with a darker font is in-frame to the luciferase reporter gene and an asterisk (*) indicates the position where a mutation is introduced. (B) Result of DNA transfection of the above constructs into HepG2 cells. The normalized luciferase activity of each construct with C0 mutation is compared with its respective parent construct without the C0 mutation. Normalized expression levels are presented as a percentage of the CLUC construct, assigned 100%. These results are averages of replicates from two independent experiments. Results from the C, C1 and C2 series represent sums of the activities of fusion proteins with similar specific activities.
PCR primers used for quantitative PCR
| Primers | Sequence |
|---|---|
| Taqman lucfwd | 5′-TTCTAAAACGGATTACCAGGGATT-3′ |
| Taqman lucrev | 5′-CCGGGAGGTAGATGAGATGTG-3′ |
| MGB lucprobe | 5′-CAGTCGATGTACACGTTC-3′ |
Strength of each initiation context within the HBV pgRNA in HepG2 cells
| Sequence context surrounding ATG codon (−6 to +6) | Predicted initiation strength | Strength of each context tested in HepG2 cells | |
|---|---|---|---|
| Human RefSeq Consensus | GCC | 1.00 | |
| C0AUG | TTG | 0.41 | 0.32 ± 0.05 |
| CAUG | TGG | 0.79 | 0.87 ± 0.08 |
| C1AUG | GAA | 0.33 | 0.44 ± 0.05 |
| JAUG | TCA | 0.16 | 0.69 ± 0.06 |
| C2AUG | ACT | 0.77 | 1.00 ± 0.02 |
| PAUG | CAC | 0.51 | 0.56 ± 0.04 |
| CAUGKO | TGG | 0.00 | 0.00 ± 0.00 |
The key nucleotides at positions −3 and +4 relative to the start codon (underlined) are represented in bold. Matches to an information content scoring matrix derived from 16 232 human RefSeq initiation codons are scored 0–1. When tested in HepG2 cells, the relative strength of each initiation context are represented in arbitary units relative to the C2 AUG context (designated 1.00). Note that sequences within the coding region could also affect protein stability or activity.
Figure 3The expression levels from each respective initiation codon within the HBV pregenomic RNA. (A) A schematic diagram of the HBV test constructs used to study the levels of initiation from C0, C, J and P initiation codons. Each initiation codon is fused in-frame to the luciferase gene. The initiation codon with a darker font is in-frame to the luciferase reporter gene and an asterisk (*) indicates the position where a mutation is introduced. (B) Normalized expression level from each initiation codon as compared with that from CLUC, assigned as 100%. These results are averages from two independent experiments.
Figure 5The effect of upstream AUG mutations on the expression at P initiation codon. (A) Schematic diagram of the HBV test constructs used to study the effect of upstream mutation on P expression. The initiation codon with a darker font is in-frame to the luciferase reporter gene and an asterisk (*) indicates the position where a mutation is introduced. (B) Result of DNA transfection of the above constructs into HepG2 cells. The normalized luciferase activity of each construct with upstream mutations is compared with the parent PLUC construct. Average luciferase counts for PLUC were 12 545 ± 141 RLU (relative light unit) in 103 HepG2 cells, which were normalized against respective RNA levels as determined by real-time PCR. These results are averages of two replicates from two independent experiments. (C) Fluorograph of the proteins translated in wheat germ extract in the presence of [35S]methionine from representative in vitro synthesized cap RNA. Proteins were separated on a 5–20% SDS–polyacrylamide gel. (D) Fluorograph of the proteins translated in rabbit reticulocyte lysate. Proteins were separated on a 5–20% SDS–polyacrylamide gel.
Figure 6The effect of an optimal C0 initiation context on the expression at downstream initiation. (A) Schematic diagram of the HBV test constructs used to study the effect of an optimal C0 initiation context. The initiation codon with a darker font is in-frame to the luciferase reporter gene while an asterisk (*) indicates the position where a mutation is introduced. (B) Result of DNA transfection of the above constructs into HepG2 cells. The normalized luciferase activity of each construct with the improved C0 initiation context is compared with their corresponding counterpart without C0 context enhancement. Normalized expression levels are presented as a percentage of the CLUC construct, assigned 100%. These results are averages of two replicates from two independent experiments.