| Literature DB >> 25674439 |
Ken-Ichiro Tatematsu1, Keiro Uchino1, Hideki Sezutsu1, Toshiki Tamura1.
Abstract
The context sequence motif surrounding the ATG initiation codon influences mRNA translation efficiency and affects protein production; however, the optimal sequence differs among species. To determine the optimal sequence for production of recombinant proteins in a transgenic silkworm, we compared 14-nucleotide context motifs around the ATG (ATG-context) in 50 silkworm genes and found the following consensus: (A/T)AN(A/T)ATCAAAatgN. We were also able to define the least-common motif: CCN(C/G)CGN(C/T/G)(G/C/T)(T/G)atgC, which served as a negative control. To examine the regulatory role of these motifs in protein expression, we constructed reporter plasmids containing different ATG-context motifs together with either the luciferase gene or an enhanced green fluorescent protein (EGFP) gene. These constructs were then used for comparison of luciferase reporter activity and EGFP production in BmN4 cells in vitro as well as in transgenic silkworms in vivo. We detected 10-fold higher luciferase activity in BmN4 cells transfected with the consensus ATG-context motif construct, compared to the negative control plasmid. ELISA measurements of EGFP translation products with the corresponding constructs in BmN4 cells showed consistently similar results. Interestingly, the translation efficiency of the novel consensus ATG-context motif did not show the highest activity in the transgenic silkworms in vivo, except for the fat body. The highest efficiency in the middle and posterior silk glands was produced by the sericin 1 context. Our results show that the ATG-context motifs differ among silkworm tissues. This result is important for the further improvement of the transgenic silkworm system for the production of recombinant proteins.Entities:
Keywords: Bioreactor; Kozak sequence; Recombinant protein; Silkworm; Transgenic; Translation
Year: 2014 PMID: 25674439 PMCID: PMC4320137 DOI: 10.1186/2193-1801-3-136
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Figure 1Compilation of the 14-base sequence context of the ATG initiation codon of genes. Sequences surrounding the ATG initiation codon were compiled from 50 genes in GenBank. For reference, the ATG initiation codon corresponds to +1 through +3. The frequency of each nucleotide at each position is presented as a percentage. Solo consensus (italicized letter) and co-consensus nucleotides (underlined letter) were determined by the 50/75 rule (Cavener 1987). The consensus and least-common nucleotides determined by a binominal test are indicated by red and blue boxes, respectively. The consensus and least-common nucleotides were determined using Cavener’s 50/75 rule and/or by a binominal test and are shown at the bottom. The accession numbers of genes used for the analysis are as follows: AY769299, BMU06073, D10953, D90454, DQ311154, DQ311189, DQ311242, DQ311250, DQ311264, DQ311306, DQ311321, DQ311322, DQ311328, DQ311332, DQ311333, DQ311340, DQ311341, DQ311350, DQ311356, DQ311358, DQ311360, DQ311363, DQ311365, DQ311378, DQ311384, DQ311386, DQ311388, DQ311397, DQ311402, DQ311407, DQ311412, DQ311418, DQ311430, DQ311436, DQ311438, DQ311439, DQ343760, DQ358079, DQ424947, M64336, NM_001044023, NM_001044041, NM_001113262, NM_001145941, S74376, S77508, U30289, U94993, X74320, and X95604.
Figure 2Construction of expression vectors. a. 14-base context motifs used in the experiment. Red and blue letters indicate the most- and least-frequent nucleotides at each position, respectively. Capital letters indicate the nucleotide of the consensus or the least-common nucleotide at each position. b. Physical maps of plasmids including a sequence around the translation initiation site of luciferase and EGFP constructs. X represents the newly designed context motif and the arrow points to translation initiation.
Effects of the ATG initiation codon sequence context of luciferase constructs on luciferase expression in BmN4 cells
| Context name |
|
|
| Fold activity (%) |
|---|---|---|---|---|
| NC-Luc | 8.3 ± 1.1 | 4.26 ± 0.7 | 196.5 ± 15.0 | 100 |
| PC-Luc | 88.3 ± 26.6 | 4.34 ± 0.5 | 2,008.7 ± 393.1 | 1,022 |
| 7N3P-Luc | 55.9 ± 20.4 | 4.23 ± 0.5 | 1,307.5 ± 393.6 | 665 |
| 7P3N-Luc | 56.8 ± 10.2 | 4.49 ± 0.8 | 1,272.5 ± 184.9 | 648 |
| AAT-Luc | 38.7 ± 27.4 | 4.44 ± 0.6 | 834.6 ± 539.9 | 425 |
| AGT-Luc | 39.4 ± 6.7 | 4.28 ± 0.6 | 941.6 ± 252.9 | 479 |
| CAA-Luc | 15.0 ± 2.7 | 4.80 ± 0.4 | 316.8 ± 84.3 | 161 |
| CGA-Luc | 11.1 ± 2.4 | 4.82 ± 0.4 | 228.2 ± 30.8 | 116 |
| Ser1-Luc | 45.0 ± 24.1 | 4.68 ± 0.6 | 928.2 ± 373.2 | 472 |
| Vert3-Luc | 25.5 ± 3.1 | 4.66 ± 0.3 | 547.9 ± 61.0 | 279 |
| Vert9-Luc | 36.9 ± 6.6 | 4.81 ± 0.3 | 775.4 ± 173.8 | 395 |
| atgA-Luc | 24.3 ± 16.9 | 4.47 ± 0.5 | 550.6 ± 413.8 | 280 |
| atgC-Luc | 34.3 ± 2.1 | 4.56 ± 0.5 | 759.1 ± 102.1 | 386 |
| atgT-Luc | 27.9 ± 12.5 | 4.58 ± 0.8 | 594.0 ± 171.2 | 302 |
Values represent the mean ± standard deviation (SD) obtained from three individual experiments.
The fold efficiency is indicated as the activity of NC equal to 100. Measurement of each luciferase activity was performed at least three times. The value of Photinus luciferase activity/Renilla luciferase activity was calculated in each measurement.
Effects of the ATG initiation codon sequence context of EGFP constructs on translational efficiency of EGFP in cultured cells
| Context name | EGFP protein (μg/well) (A) | EGFP mRNA (×10 −6 ) (pmol/ng) (B) | rp49 mRNA (×10 −7 ) (pmol/ng) (C) | EGFP/rp49 mRNA (B/C) | Translational efficiency (A/{B/C}) | Fold efficiency (%) |
|---|---|---|---|---|---|---|
| NC-EGFP | 1.60 ± 0.5 | 4.31 ± 1.2 | 6.99 ± 0.17 | 6.15 ± 1.5 | 0.28 ± 0.13 | 100 |
| PC-EGFP | 6.37 ± 1.7 | 3.97 ± 1.5 | 6.68 ± 0.57 | 5.92 ± 1.9 | 1.20 ± 0.60 | 424 |
| 7N3P-EGFP | 4.15 ± 0.6 | 3.47 ± 1.3 | 7.00 ± 0.17 | 4.92 ± 1.7 | 0.94 ± 0.41 | 341 |
| 7P3N-EGFP | 3.54 ± 0.7 | 3.50 ± 1.1 | 6.81 ± 0.23 | 5.14 ± 1.6 | 0.76 ± 0.34 | 274 |
| AAT-EGFP | 4.58 ± 1.2 | 4.32 ± 1.2 | 6.89 ± 0.47 | 6.22 ± 1.5 | 0.79 ± 0.34 | 286 |
| AGT-EGFP | 2.72 ± 0.7 | 3.43 ± 1.3 | 6.82 ± 0.25 | 5.00 ± 1.7 | 0.61 ± 0.29 | 216 |
| CAA-EGFP | 4.04 ± 1.0 | 4.26 ± 1.2 | 6.63 ± 0.14 | 6.43 ± 1.9 | 0.68 ± 0.30 | 248 |
| CGA-EGFP | 1.20 ± 0.2 | 3.21 ± 1.4 | 6.93 ± 0.50 | 4.60 ± 1.8 | 0.30 ± 0.14 | 106 |
| Ser1-EGFP | 3.17 ± 0.7 | 3.40 ± 1.0 | 6.74 ± 0.31 | 5.03 ± 1.4 | 0.68 ± 0.28 | 248 |
| Vert3-EGFP | 2.87 ± 0.7 | 3.67 ± 1.2 | 7.20 ± 0.27 | 5.09 ± 1.6 | 0.62 ± 0.26 | 224 |
| Vert9-EGFP | 2.84 ± 0.3 | 3.30 ± 1.5 | 7.02 ± 0.49 | 4.65 ± 1.8 | 0.68 ± 0.27 | 251 |
Values represent the mean ± SD obtained from more than three individual experiments. The fold efficiency is indicated as the activity of NC equal to 100. Measurements of the amount of protein and mRNA were performed at least three times. Values of relative EGFP mRNA and translational efficiency were calculated in each measurement.
Figure 3Expression of EGFP in the transgenic silkworm. a. Stemmata of the fifth instar larva. UAS-EGFP constructs with NC, PC, AAT, and Ser1 sequence contexts of the initiation codon were marked with a 3 × P3-EGFP marker construct allowing the expression of EGFP in the stemmata. The GAL4 driver construct under the control of the sericin 1, fibroin H chain, or 30 k protein gene promoter was linked with a marker construct, 3 × P3-DsRed, allowing the expression of DsRed in the stemmata. The stemmata of a larva harboring PC-EGFP, Ser1-GAL4, or both PC-EGFP and Ser1-GAL4 are shown. The images were taken under white light or under a fluorescence microscope equipped with an EGFP or DsRed filter. Arrows indicate the stemmata with EGFP or DsRed fluorescence. Scale bar: 1 mm. b. EGFP expression in the MSG, PSG, and the fat body of the transgenic silkworm with different GAL4 driver constructs. MSG, PSG, and the fat body on the sixth day of the fifth instar are shown. Images were taken under a fluorescence microscope equipped with an EGFP filter. Expression of UAS-EGFP with different context motifs was only observed in silkworms with GAL4 drivers. The numbers above the tissue photographs indicate the transgenic silkworm lines. Scale bar: 10 mm.
Effects of sequence context of the ATG initiation codon on translational efficiency of EGFP expressed in MSG
| Line no. | EGFP protein (μg/larva) (A) | EGFP mRNA (×10 −7 ) (pmol/ng) (B) | rp49 mRNA (×10 −7 ) (pmol/ng) (C) | EGFP/rp49 mRNA (B/C) | Translational efficiency (A/{B/C}) | Fold efficiency (%) | |
|---|---|---|---|---|---|---|---|
| NC | 1 | 31.4 ± 5.8 | 70.8 | 7.83 | 9.05 | 3.47 | |
| -EGFP | 2 | 16.8 ± 2.4 | 44.1 | 7.26 | 6.07 | 2.76 | |
| 3 | 12.6 ± 1.7 | 49.9 | 7.70 | 6.48 | 1.95 | ||
| Average | 20.3 | 54.9 | 7.60 | 7.23 | 2.73 | 100 | |
| PC | 1 | 130.2 ± 9.9 | 19.8 | 6.96 | 2.85 | 45.7 | |
| -EGFP | 2 | 158.9 ± 12.1 | 29.6 | 6.71 | 4.40 | 36.1 | |
| Average | 144.5 | 24.7 | 6.84 | 3.61 | 40.9 | 1,498 | |
| AAT | 1 | 76.8 ± 7.2 | 48.7 | 9.04 | 5.39 | 14.2 | |
| -EGFP | 2 | 89.0 ± 12.9 | 34.4 | 8.14 | 4.23 | 21.0 | |
| Average | 82.9 | 41.6 | 8.59 | 4.84 | 17.6 | 645 | |
| Ser1 | 1 | 276.3 ± 36.4 | 40.8 | 8.14 | 5.01 | 55.1 | |
| -EGFP | 2 | 261.4 ± 18.4 | 37.3 | 7.76 | 4.80 | 54.4 | |
| Average | 268.9 | 39.0 | 7.95 | 4.91 | 54.8 | 2,007 |
Values obtained from each line and averages of lines are shown. The fold efficiency is indicated as the efficiency of the average of NC equal to 100.
Effects of sequence context of the ATG initiation codon on translational efficiency of EGFP expressed in PSG
| Line no. | EGFP protein (μg/larva) (A) | EGFP mRNA (×10 −7 ) (pmol/ng) (B) | rp49 mRNA (×10 −7 ) (pmol/ng) (C) | EGFP/rp49 mRNA (B/C) | Translational efficiency (A/{B/C}) | Fold efficiency (%) | |
|---|---|---|---|---|---|---|---|
| NC | 1 | 12.9 ± 0.8 | 4.20 | 3.95 | 1.06 | 12.1 | |
| -EGFP | 2 | 9.9 ± 0.3 | 1.73 | 3.54 | 0.49 | 20.2 | |
| 3 | 12.6 ± 0.7 | 0.48 | 3.60 | 0.14 | 93.0 | ||
| Average | 11.8 | 2.14 | 3.70 | 0.56 | 41.8 | 100 | |
| PC | 1 | 80.6 ± 15.3 | 1.35 | 2.93 | 0.46 | 175.5 | |
| -EGFP | 2 | 104.8 ± 24.2 | 2.14 | 3.65 | 0.59 | 178.1 | |
| Average | 92.7 | 1.75 | 3.29 | 0.52 | 176.8 | 423 | |
| AAT | 1 | 25.5 ± 3.6 | 1.64 | 3.23 | 0.51 | 50.3 | |
| -EGFP-EGFP | 2 | 53.5 ± 15.8 | 0.84 | 3.50 | 0.24 | 220.5 | |
| Average | 39.5 | 1.24 | 3.37 | 0.37 | 135.4 | 324 | |
| Ser1 | 1 | 94.4 ± 10.4 | 1.53 | 3.52 | 0.44 | 216.9 | |
| -EGFP | 2 | 106.2 ± 34.7 | 1.15 | 3.09 | 0.37 | 284.9 | |
| Average | 100.3 | 1.34 | 3.31 | 0.40 | 250.9 | 600 |
Values obtained from each line and averages of lines are shown. The fold efficiency is indicated as the efficiency of the average of NC equal to 100.
Effects of sequence context of the ATG initiation codon on translational efficiency of EGFP expressed in the fat body
| Line no. | EGFP protein (μg/larva) (A) | EGFP mRNA (×10 −7 ) (pmol/ng) (B) | rp49 mRNA (×10 −7 ) (pmol/ng) (C) | EGFP/rp49 mRNA (B/C) | Translational efficiency (A/{B/C}) | Fold efficiency (%) | |
|---|---|---|---|---|---|---|---|
| NC | 1 | 4.9 ± 0.6 | 25.8 | 5.15 | 5.01 | 0.98 | |
| -EGFP | 2 | 4.4 ± 0.1 | 13.7 | 3.14 | 4.37 | 1.01 | |
| 3 | 1.3 ± 0.1 | 21.6 | 3.86 | 5.61 | 0.23 | ||
| Average | 3.5 | 20.4 | 4.05 | 5.04 | 0.70 | 100 | |
| PC | 1 | 51.8 ± 3.0 | 11.5 | 6.13 | 1.87 | 27.6 | |
| -EGFP | 2 | 70.2 ± 17.3 | 12.4 | 6.89 | 1.80 | 39.0 | |
| Average | 61.0 | 11.9 | 6.51 | 1.83 | 33.3 | 4,757 | |
| AAT | 1 | 11.9 ± 0.1 | 15.6 | 4.20 | 3.72 | 3.20 | |
| -EGFP | 2 | 4.2 ± 0.2 | 11.8 | 4.83 | 2.44 | 1.71 | |
| Average | 8.0 | 13.7 | 4.52 | 3.03 | 2.65 | 379 | |
| Ser1 | 1 | 44.6 ± 3.8 | 35.3 | 6.20 | 5.70 | 7.82 | |
| -EGFP | 2 | 50.6 ± 2.5 | 11.5 | 6.08 | 1.90 | 26.7 | |
| Average | 47.6 | 23.4 | 6.14 | 3.82 | 12.5 | 1,785 |
Values obtained from each line and averages of lines are shown. The fold efficiency is indicated as the efficiency of the average of NC equal to 100.
List of primers and adapters used in the experiments
| Name | Sequences |
|---|---|
| Primer | |
| EGFP-BstI-U | 5′-GGGATCCGCTAGCACCATGGTTTCGAAGGGCGAG-3′ |
| EGFP-Stop(+)-L | 5′-GGAATTCGGACCGCTAGCTTACTTGTACAGCTCG-3′ |
| pGL3-U | 5′-GGATCGATCAAAAACATAAAGAAAGGCCCGGCG-3′ |
| pGL3-L | 5′-GGGCTAGCTTACACGGCGATCTTTCCGCCCTTC-3′ |
| EGFPLCU2 | 5′-AACTTCAAGATCCGCCACAACATCGAGGAC-3′ |
| EGFPLCL2 | 5′-AGGACCATGTGATCGCGCTTCTCGT-3′ |
| rp49LCF2 | 5′-GGATCGCTATGACAAACTTAAGAGGA-3′ |
| rp49LCR1 | 5′-TGCTGGGCTCTTTCCACGA-3′ |
| Adapter | |
| NC | 5′-GATCCGCTAGCCCGGCGGCGTATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCGCATACCAAAGC-5′ | |
| PC | 5′-GATCCGCTAGCAAAAATCAAAATGGTTT-3′ |
| 3′-GCGATCGTTTTTAGTTTTACCAAAGC-5′ | |
| 7N3P | 5′-GATCCGCTAGCCCGGCGGAAAATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCTTTTACCAAAGC-5′ | |
| 7P3N | 5′-GATCCGCTAGCAAAAATCCGTATGGTTT-3′ |
| 3′-GCGATCGTTTTTAGGCATACCAAAGC-5′ | |
| AAT | 5′-GATCCGCTAGCCCGGCGGAATATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCTTATACCAAAGC-5′ | |
| AGT | 5′-GATCCGCTAGCCCGGCGGAGTATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCTCATACCAAAGC-5′ | |
| CAA | 5′-GATCCGCTAGCCCGGCGGCAAATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCGTTTACCAAAGC-5′ | |
| CGA | 5′-GATCCGCTAGCCCGGCGGCGAATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCGCTTACCAAAGC-5′ | |
| Ser1 | 5′-GATCCGCTAGCGACCGCCAACATGGTTT-3′ |
| 3′-GCGATCGCTGGCGGTTGTACCAAAGC-5′ | |
| Vert3 | 5′-GATCCGCTAGCCCGGCGGACCATGGTTT-3′ |
| 3′-GCGATCGGGCCGCCTGGTACCAAAGC-5′ | |
| Vert9 | 5′-GATCCGCTAGCCGCCGCCACCATGGTTT-3′ |
| 3′-GCGATCGGCGGCGGTGGTACCAAAGC-5′ | |
| atgA | 5′-GATCCGCTAGCCCGGCGGAATATGATTT-3′ |
| 3′-GCGATCGGGCCGCCTTATACTAAAGC-5′ | |
| atgC | 5′-GATCCGCTAGCCCGGCGGAATATGCTTT-3′ |
| 3′-GCGATCGGGCCGCCTTATACGAAAGC-5′ | |
| atgT | 5′-GATCCGCTAGCCCGGCGGAATATGTTTT-3′ |
| 3′-GCGATCGGGCCGCCTTATACAAAAGC-5′ | |
| IB-BlnI | 5′-AGCTTCCTAGGGTCGACGGTAC-3′ |
| 3′-AGGATCCCAGCTGC-5′ |