| Literature DB >> 31664102 |
Aikaterini Alexaki1, Gaya K Hettiarachchi1, John C Athey1, Upendra K Katneni1, Vijaya Simhadri1, Nobuko Hamasaki-Katagiri1, Puja Nanavaty2, Brian Lin1, Kazuyo Takeda1, Darón Freedberg1, Dougald Monroe3, Joseph R McGill1, Robert Peters4, Jacob M Kames1, David D Holcomb1, Ryan C Hunt1, Zuben E Sauna1, Amy Gelinas5, Nebojsa Janjic5, Michael DiCuccio6, Haim Bar7, Anton A Komar2, Chava Kimchi-Sarfaty8.
Abstract
Synonymous codons occur with different frequencies in different organisms, a phenomenon termed codon usage bias. Codon optimization, a common term for a variety of approaches used widely by the biopharmaceutical industry, involves synonymous substitutions to increase protein expression. It had long been presumed that synonymous variants, which, by definition, do not alter the primary amino acid sequence, have no effect on protein structure and function. However, a critical mass of reports suggests that synonymous codon variations may impact protein conformation. To investigate the impact of synonymous codons usage on protein expression and function, we designed an optimized coagulation factor IX (FIX) variant and used multiple methods to compare its properties to the wild-type FIX upon expression in HEK293T cells. We found that the two variants differ in their conformation, even when controlling for the difference in expression levels. Using ribosome profiling, we identified robust changes in the translational kinetics of the two variants and were able to identify a region in the gene that may have a role in altering the conformation of the protein. Our data have direct implications for codon optimization strategies, for production of recombinant proteins and gene therapies.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31664102 PMCID: PMC6820528 DOI: 10.1038/s41598-019-51984-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Properties of wild-type and codon-optimized F9 genes’ sequences. (a) CO F9 utilizes more common codons and common codon pairs. The Relative Synonymous Codon Usage (RSCU) and Relative Synonymous Codon Pair Usage (RSCPU) were calculated based on codon usage frequencies obtained from https://hive.biochemistry.gwu.edu/review/codon. The 7 codon and codon pair average of RSCU and RSCPU values were plotted for the WT and CO sequences of F9. (b) In silico analyses of mRNA equilibrium base-pairing probabilities were calculated based on RNAfold webserver (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi).
Figure 2Factor IX high expression in HEK293T cells stably transfected with wild-type vs. codon-optimized F9. (a) Cell DNA was isolated and plasmid DNA copy numbers were determined by qPCR. The average of 3 experiments was plotted, s.e.m. are shown. (b) mRNA was isolated and analyzed by qPCR. The average of 3 replicates was plotted, s.e.m. are shown. (c) Cell supernatant (concentrated medium) and lysate was collected and analyzed by immunoblotting. Membranes were stained with anti-V5 or anti-GAPDH. (d) Cells were stained with anti-V5 primary antibody and Alexa488 conjugated donkey anti-mouse IgG (green) and examined by confocal microscopy. HEK293T cells stably transfected with empty vector were also included as a control.
Figure 3Generation of HEK293T cell clones expressing similar levels of wild-type and codon-optimized factor IX: (a) mRNA was isolated and analyzed by qPCR. The average of 3 replicates was plotted, s.e.m. are shown. (b) cell supernatant was collected and analyzed by immunoblotting. Membranes were stained with anti-V5 antibody. The average band intensity was plotted, s.e.m. are shown.
Figure 4Codon-optimization leads to altered characteristics of factor IX protein. (a) WT and CO FIX activity were assessed with chromogenic assay. The average of 3 replicates was plotted, s.e.m. are shown. (b) WT and CO FIX were incubated with increasing amounts of plasma containing inhibitory antibodies, resulting in a dose dependent reduction of activity. Results from one (of three) representative experiment are shown. The EC50 was significantly different between WT and CO (WT EC50 = 0.059, CO EC50 = 0.039, p = 3.9e-09). (c) Binding affinities (KD) of WT and CO FIXs to the FIX-specific SOMAmer were significantly (p = 0.015) different. The anti-V5 antibody showed comparable (p = 0.555) affinities for the WT and CO FIX variants. (d) WT and CO FIX were exposed to the indicated concentrations of cathepsin. Samples were electrophoresed and silver stained to assess their pattern of digestion. The box highlights the area where most of the changes are observed. Band density was quantitated with ImageJ software. Arrow (1) points to a band that is present in the WT (cathepsin 0.2 μg/ml) lane and absent from the corresponding CO lane. Arrow (2) points to a band that is absent from the WT (cathepsin 0.2 μg/ml) lane and present the corresponding CO lane.
Figure 5Ribosome profiling of wild-type and codon-optimized F9 demonstrates drastically altered patterns of translation kinetics, but similar translation efficiency, in the two variants. (a) F9 ribosome profiling data, shaded areas indicate the major FIX protein domains, Gla: γ-carboxyglutamic acid-rich, EGF: epidermal growth factor-like. Inserts show magnifications of the Gla-EGF1-EGF2 and His-V5 domains. (b) Translation efficiency (TE) of F9, GAPDH and ACTB in the WT and CO FIX expressing cell lines was calculated based on transcript mRNA and RPF abundance of thee independent experiments. (c) Representative [35S]-autoradiogram of the WT and CO FIX in vitro translation products in rabbit reticulocyte lysate (RRL) system (bottom panel) and quantitation analysis of the band intensities of three independent experiments (top panel). The average of 3 experiments was plotted, s.e.m. are shown.
In a linear stepwise selection model, the ribosome occupancy of adjacent codons is the only consistently significant predictor of ribosome occupancy of a codon.
| Estimate | Std. Error | t-value | Pr( > |t|) | Significance | |
|---|---|---|---|---|---|
|
| |||||
| (Intercept) | −0.89325 | 0.38834 | −2.3 | 0.0218 | * |
| RSCPU_Genome | −0.50998 | 0.27813 | −1.834 | 0.0673 | . |
| RSCPU_HEK_ERX | 0.40442 | 0.27254 | 1.484 | 0.1385 | |
| tRNA | −4.77239 | 2.88107 | −1.656 | 0.0983 | . |
| MFE_151 | −0.02178 | 0.01039 | −2.096 | 0.0366 | * |
| Lag3 | 0.06721 | 0.04018 | 1.673 | 0.095 | . |
| Lag1 | 0.48761 | 0.04027 | 12.109 | <2e-16 | *** |
|
| |||||
| (Intercept) | −0.29018 | 0.05481 | −5.295 | 1.78E-07 | *** |
| Lag2 | 0.16889 | 0.04401 | 3.838 | 0.00014 | *** |
| Lag1 | 0.39237 | 0.0439 | 8.939 | <2e-16 | *** |
|
| |||||
| (Intercept) | −0.99862 | 0.23398 | −4.268 | 2.51E-05 | *** |
| RSCPU_HEK_ERX | 0.10396 | 0.06776 | 1.534 | 0.12586 | |
| Pair-prob | 0.66064 | 0.25935 | 2.547 | 0.01126 | * |
| Lag3 | 0.12719 | 0.04649 | 2.736 | 0.00652 | ** |
| Lag1 | 0.46591 | 0.04713 | 9.887 | <2e-16 | *** |
|
| |||||
| (Intercept) | −0.55555 | 0.30251 | −1.836 | 0.067199 | . |
| RSCU_Genome | 1.39465 | 0.61119 | 2.282 | 0.023139 | * |
| RSCU_HEK_SRR | −1.33592 | 0.73108 | −1.827 | 0.068558 | . |
| RSCPU_Genome | −0.66171 | 0.27595 | −2.398 | 0.017048 | * |
| RSCPU_HEK_ERX | 0.75627 | 0.28138 | 2.688 | 0.00756 | ** |
| Lag2 | 0.19206 | 0.05397 | 3.559 | 0.000428 | *** |
| Lag1 | 0.38437 | 0.05391 | 7.13 | 6.46E-12 | *** |
Figure 6Statistical analysis of translation kinetics highlights differences in the two F9 variants at the Gla-EGF1-EGF2 domains. Cumulative sum of (a) ln(RPF)F9 values. (b) ln(RPF)ACTB values and (c) ln(RPF)GAPDH from the WT FIX and CO FIX expressing clones. The average of 3 experiments are shown.