In vitro display technologies such as mRNA and cDNA display are powerful tools to create and select functional peptides. However, in some cases, efficiency of mRNA-protein fusion is very low, which results in decreased library size and lower chance of successful selection. In this study, to improve mRNA-protein fusion efficiency, we prepared an mRNA display library of a protein with random N- and C-terminal coding regions consisting of 12 nucleotides (i.e. four amino acids), and performed an electrophoresis mobility shift assay (EMSA)-based selection of successfully formed mRNA display molecules. A single-domain antibody (Nanobody, or VHH) was used as a model protein, and as a result, a pair of sequences was identified that increased mRNA-protein fusion efficiency of this protein by approximately 20%. Interestingly, enhancement of the fusion efficiency induced by the identified sequences was protein-specific, and different results were obtained for other proteins including VHHs with different CDRs. The results suggested that conformation of mRNA as a whole, rather than the amino acid sequence of the translated peptide, is an important factor to determine mRNA-protein fusion efficiency.
In vitro display technologies such as mRNA and cDNA display are powerful tools to create and select functional peptides. However, in some cases, efficiency of mRNA-protein fusion is very low, which results in decreased library size and lower chance of successful selection. In this study, to improve mRNA-protein fusion efficiency, we prepared an mRNA display library of a protein with random N- and C-terminal coding regions consisting of 12 nucleotides (i.e. four amino acids), and performed an electrophoresis mobility shift assay (EMSA)-based selection of successfully formed mRNA display molecules. A single-domain antibody (Nanobody, or VHH) was used as a model protein, and as a result, a pair of sequences was identified that increased mRNA-protein fusion efficiency of this protein by approximately 20%. Interestingly, enhancement of the fusion efficiency induced by the identified sequences was protein-specific, and different results were obtained for other proteins including VHHs with different CDRs. The results suggested that conformation of mRNA as a whole, rather than the amino acid sequence of the translated peptide, is an important factor to determine mRNA-protein fusion efficiency.
Entities:
Keywords:
Nanobody; electrophoresis mobility shift assay; in vitro translation; variable domain of heavy chain of heavy chain antibody
In vitro display technology has been a powerful tool to discover functional peptides and proteins by in vitro selection methods. Although cell-based display/selection systems such as phage display or yeast surface display are quite popular, in vitro display using a cell-free translation system has advantages in terms of library size, throughput, and feasibility of chemical modifications [1,2]. In particular, we have focused on mRNA [3-5] and cDNA [6] display technology where genotype (i.e. mRNA or cDNA) and phenotype (i.e. its encoding polypeptide) are linked by an antibiotic, puromycin. These technologies have been successfully used in the selection of peptide aptamers against some target molecules [7-9]. However, low efficiency of translation and mRNA-protein fusion for some proteins is problematic for practical use because such low efficiency results in decreased library size and lower chance of successful selection. To overcome this problem, we previously performed an EMSA (electrophoresis mobility shift assay) to find novel translation enhancer sequences at 5′-untranslated region (UTR) [10]. However, although optimization of 5′-UTR should improve the initiation efficiency of translation, it would not directly enhance the formation efficiency of mRNA-protein conjugates. At present, factors determining this efficiency is not well known, and the N- and C-terminal region of structural gene might affect overall efficiency of mRNA-protein fusion. In this study, to further increase the final yields of mRNA-protein fusions in mRNA display, we performed a selection of mRNA display using a library with additional random sequences at 5′- and 3′-translated regions. Specifically, we used a single-domain antibody (i.e., variable domain of heavy chain of heavy chain antibody; Nanobody, or VHH) [11,12], a small antibody derived from camelids, as a model protein for selection that needs improvement of mRNA-protein fusion efficiency, considering its wide range of applications and relatively small size.
Materials and Methods
General
Primers used in this study are shown in Supplementary Table S1. DNA sequences of the constructs are described in Supplementary Table S2 and schematically shown in Supplementary Figure S1. Detailed protocols are described in Supplementary Text S1.
DNA construction
The wild type VHH DNA sequences were shown in Supplementary Table S2. The B domain of protein A (BDA) [13] and Pou-specific DNA-binding domain of Oct1 (PDO) [14]-coding DNAs were prepared as described previously [15]. Insertion of the selected sequence to these proteins was performed by polymerase chain reaction (PCR). After purification of PCR products, second PCR was performed to add a T7 promoter. The full construct DNAs were subject to gel-purification. T12 and A12 sequences were inserted to 3′-terminus of VHH in a similar way.
Library construction
A VHH library was constructed by adding random nucleotides (N12) to the N- and C-termini of the wild type VHH1 gene by PCR. After adding T7 promoter, the full construct DNA library was purified and checked by polyacrylamide gel electrophoresis (PAGE) and DNA direct sequencing (see Supplementary Fig. S1).
EMSA selection
The DNA library was in vitro transcribed into mRNA. The mRNA was ligated with a puromycin DNA linker by T4 ligase as described previously [15]. The ligation products were added to a reaction mixture containing rabbit reticulocyte lysate. The cell free translation mixture was incubated to generate mRNA display molecules. The mRNA display formation was analyzed by 8 M urea containing 4% stacking-6% separating SDS-PAGE and detected with a fluorescence image analyzer. The band containing mRNA display was excised from the gel, and mRNA display molecules were eluted from the gel. The mRNA display molecules were ethanol precipitated and reverse transcribed to synthesize cDNA display. Synthesized cDNA display molecules were amplified by PCR to add a T7 promoter for the next round. The selected DNAs at final round were cloned and their sequences were analyzed.
Evaluation of mRNA-protein fusion efficiency by SDS-PAGE
mRNA-protein fusion efficiency of each clone was evaluated by SDS-PAGE. Efficiency of mRNA-protein fusion with inserted A12 and T12 sequences at 3′-terminus of VHH1 gene was also confirmed. Each DNA was transcribed into mRNA and the mRNA was ligated with puromycin linker as described above. mRNA-protein fusion was formed by cell free translation. The mRNA display molecules were analyzed by SDS-PAGE and fluorescently detected using FITC attached to the linker moiety. The mRNA-protein fusion efficiency was calculated from band intensity. mRNA displays of BDA, PDO, and VHH2-4 inserted with selected sequence were also formed by the method described above, and their mRNA-protein fusion efficiency was evaluated.
Results
Sample preparation
Random sequences of 12 nucleotides were inserted into both ends of DNA coding a model VHH protein (named as VHH1 in Supplementary Fig. S1) with a His × 6 tag at C-terminus by PCR. For efficient transcription and translation, T7 promoter, tobacco mosaic virus Ω, and Kozak sequences were attached as 5′-UTR (see Supplementary Table S2 for the sequence) [8]. The created library (7 × 1012 molecules at the first round) was transcribed into mRNA, and ligated to our original puromycin linker incorporating fluorescein and biotin (SBP-linker) [15]. The ligation product was translated in vitro using rabbit reticulocyte lysate, and incubation under high salt concentration was performed to promote mRNA-protein fusion [8]. Thus-obtained sample was used for EMSA selection. Schematic representation of the overall selection cycle is shown in Figure 1.
Figure 1
Overall schematic representation of this study. DNA library coding VHH with 12 nt random sequences at both termini were constructed by PCR. After transcription, ligation of puromycin linker was performed. Following cell-free translation, correctly formed mRNA-protein fusion, or mRNA display, was separated from non-fused mRNA by SDS-PAGE. The fusion was collected by dissecting gel, and reverse-transcribed cDNA was amplified by PCR, which was used as a template for the next round. LHR: linker hybridization region.
Selection by EMSA
To iteratively concentrate sequences that had high efficiency of mRNA-protein fusion, four rounds of EMSA selection was performed according to our previous literature [10]. In particular, mRNA-protein conjugate was separated from non-conjugated mRNA by denaturing SDS-PAGE, and the band containing the conjugate was carefully excised. Extracted RNA was reverse-transcribed and the cDNA was used as a template for the next round after addition of UTR. Reduced time of translation and high-salt incubation was used as a selection pressure. After the fourth selection, mRNA-protein fusion efficiency of the initial and selected library was compared directly by SDS-PAGE (Fig. 2). The band of the conjugate had increased intensity after the selection (6.6% of total mRNA) than before (1.2% of total mRNA), which indicated that the sequences featuring high mRNA-protein fusion efficiency were successfully enriched by this approach. Then, a portion of the selected library was cloned into E. coli and the colonies were randomly picked up for DNA sequencing.
Figure 2
SDS-PAGE analysis of mRNA-protein fusion efficiency. (a) Initial library (lane 2, 3) and selected library (lane 4, 5) were translated under the same conditions and electrophoresed. mRNA-linker conjugate only was applied as a reference (lane 1). Gel was imaged using fluorescence of FITC, which was attached to the linker moiety (see Fig. 1). Samples were prepared duplicate and run on the same gel to confirm reproducibility. (b) Quantification of bands shown in (a). mRNA-protein fusion efficiencies were calculated and are shown as mean of two lanes.
Analysis of selected sequences
Twelve clones in total were successfully sequenced. As shown in Supplementary Table S3, no sequences were found duplicate or more, and we could not see clear homology among the identified DNA sequences or the peptides they coded, suggesting low convergence of selection at this stage. Even so, because the increase of average fusion efficiency of library was clear from Figure 2, mRNA-protein fusion efficiency of each clone was evaluated (Supplementary Fig. S2). Contrary to our expectation, however, only a few clones including #4 showed increase of efficiency compared with the original wt-VHH1 (i.e. protein without additional terminal sequences). This would be mainly because the resolution of EMSA selection is not high due to small difference of mobility. Indeed, when we prepared cDNA library from the dissected gel and analyzed by SDS-PAGE, majority of the sample was found to be non-conjugated cDNA (Supplementary Fig. S3). We note, however, that the population of cDNA-protein conjugate (15% of total cDNA) became much higher after four rounds of selection, and that the ratio of conjugate was increased by dissection (see Fig. 2 for comparison). This experiment also suggested that we might increase the library size of cDNA display for functional selection, if the selected sequences could be applied to other proteins as well.Next, we focused on clone #4, which was the most promising clone identified in our selection (Supplementary Fig. S2). This clone had a DNA sequence of GTCGGTGTTTTT (translated into VGVF amino acids) at the 5′-terminal random region and CTAATTAGAAAT (translated into LIRN amino acids) at the 3′-terminal region. First, we examined which of the two regions contributed to the enhancement of mRNA-protein fusion efficiency, by translating mRNAs that had only one of the terminal sequences. As shown in Figure 3, introduction of one of the sequences alone to wt-VHH1 increased the mRNA-protein fusion efficiency by approximately 10%. This value was smaller than that of clone #4 (approximately 20% increase), indicating that both 5′- and 3′-terminal RNA (or amino acid) sequences played positive roles for the enhancement. We also attached T12 (translated to FFFF) or A12 (translated to KKKK) sequences to the 3′-terminus of wt-VHH1 mRNA, and compared their fusion efficiency with that of clone #4 (Supplementary Fig. S4). Only clone #4 showed significant enhancement, implying that the sequence of clone #4 had some special characteristics and that the achieved enhancement should not be explained by hydrophobicity or basicity of the selected C-terminal amino acids.
Figure 3
mRNA-protein fusion efficiency of mRNAs having both or only one of the identified 5′- and 3′-terminal sequences. (a) SDS-PAGE of translated samples. Lane 1, reference (non-translated mRNA of wt-VHH1). Lane 2, wt-VHH1 without any additional sequences. Lane 3, VHH1 with the additional 5′-sequence only. Lane 4, VHH1 with the additional 3′-sequence only. Lane 5, clone #4 (i.e. VHH1 with both additional sequences). (b) Quantification of bands shown in (a). mRNA-protein fusion efficiencies were normalized to that of wt-VHH1, and shown as mean± S.D. (n= 3 independent experiments)
Introduction of the identified sequence to other proteins
We then asked if the selected sequence could enhance mRNA-protein fusion efficiency of other genes in general. As model proteins, the B domain of protein A (BDA) [13] and the Pou-specific DNA-binding domain of Oct-1 (PDO) [14] were chosen. The additional sequences of clone #4 was attached to the both termini of DNA coding these proteins (see Supplementary Table S2 for complete sequences). The DNAs were converted to mRNA-display as described above for VHH1, and the formation efficiencies of mRNA-protein conjugates were quantified by SDS-PAGE. As shown in Figure 4, significant increase of the efficiency was not observed for these proteins, implying that the effect of terminal sequences on mRNA-protein fusion efficiency is protein-dependent. To further test this idea, we attached T12 and A12 sequences to the 3′-terminus of BDA and PDO and checked their mRNA-protein fusion efficiencies (Supplementary Fig. S5). T12 did not change the efficiency, and A12 decreased mRNA-protein fusion efficiency of PDO only. These results also suggest the protein-dependent nature of the effect of terminal sequences on mRNA-protein fusion efficiency.
Figure 4
Effect of additional sequence of clone #4 on the mRNA-protein fusion efficiencies of BDA and PDO mRNAs. (a) SDS-PAGE of mRNAs before (odd lanes) and after (even lanes) translation reaction. Lane 1, 2: wt-BDA. Lane 3, 4: mRNA of BDA attached with 5′- and 3′-additional sequences. Lane 5, 6; wt-PDO. Lane 7, 8: mRNA of PDO attached with 5′- and 3′-additional sequences. (b) Quantification of bands shown in (a). mRNA-protein fusion efficiencies were normalized to those of wild types, and shown as mean ± S.D. (n = 3 independent experiments)
Introduction of the identified sequence to VHHs with different CDRs
The above results motivated us to further examine the scope of limitation of the identified sequence. So, we investigated whether the additional sequences of clone #4 had positive effects on mRNA-protein fusion efficiencies of other VHHs (i.e. antibodies with different complementarity determining regions (CDRs)). In particular, we randomly picked up three clones from a recombinant VHH library in our laboratory (named wt-VHH2-4), and inserted the identified sequences at the termini of coding regions (see Supplementary Table S2 for complete sequences). The DNAs were converted to mRNA display and the formation efficiencies of mRNA-protein conjugates were quantified as above (Fig. 5). To our surprise, the effect of the additional sequences was different among even VHHs, and significant increase of the efficiency was not observed for VHH2 or VHH3. The results suggested that the identified sequences herein could only enhance the fusion efficiency of a part of the VHH library, against our initial expectation.
Figure 5
Effect of additional sequence of clone #4 on the mRNA-protein fusion efficiencies of other VHHs. (a) SDS-PAGE of translated mRNAs without (wild type, odd lanes) and with (even lanes) 5′- and 3′-additional sequences. Lane 1, reference (non-translated mRNA of wt-VHH1). Lane 2, 3: VHH2. Lane 4, 5: VHH3. Lane 6, 7: VHH3. (b) Quantification of bands shown in (a). mRNA-protein fusion efficiencies were normalized to those of wild types, and shown as mean ± S.D. (n= 3 independent experiments)
Discussion
Control of translation efficiency is a long-standing question in biology, and several elegant theoretical models have been proposed mainly on the initiation process [16,17]. Also, selections of 5′-UTR sequences that enhanced translation efficiency were conducted earlier [10,18,19]. However, for mRNA/cDNA display technologies, not only translation initiation rate but the efficiency of puromycin incorporation are important. To our knowledge, the latter factor is rarely studied so far. In this work, by our original selection we identified a pair of sequence at the termini of coding region that increase this efficiency of a VHH, and demonstrated that the enhancing effect of the found sequences was highly protein-specific.In the course of this study, Doi group addressed a similar problem [20]. They performed a selection to improve formation efficiency of mRNA-protein conjugates of single-chain variable fragment (scFv) using PURE system [21], and discussed that destabilization of mRNA secondary structure increased the efficiency of the formation of mRNA protein conjugates. Although the aim was the same as that of our paper, we think the following three points are different. First, the translation system and the translated protein are different. Second, in their work, a random sequence of 7 or 12 amino acid residues was inserted before FLAG tag at the C-terminus, unlike this study where the C-terminus of the translated region without FLAG tag was randomized. Our strategy should be more direct analysis of the effect of C-terminal sequences. Third, their selection method was dependent not only on the mRNA-protein fusion efficiency but also on the function (i.e. binding ability to the target protein) of displayed molecules. Though such strategy is meaningful to increase the yield of mRNA display that has a known function, we believe our EMSA selection should be more advantageous for creating a library of mRNA/cDNA display for subsequent functional selection.Although the detailed mechanism of altered fusion efficiency by additional terminal sequences remains to be studied, the results in this work as well as those reported by Doi and others [20] suggested that mRNA as a whole is more important than the nature of translated polypeptides or protein scaffolds, at least in the case of antibodies. Perhaps, structures of mRNA (thermodynamic stability of secondary structures, interaction with ribosomes/tRNAs, and others) are a key factor that determines mRNA-protein fusion efficiency. Other factors may play roles, but further study is necessary. Also, it is currently unknown which part of the selected 12-nt sequences is important for the enhancement of efficiency. High-level in silico calculation or structural analysis of mRNA may provide clues to these remaining questions.The fact that only the difference at CDR of VHH changed the enhancement effect of the added sequence was not what we expected, and it suggests that the identified sequence cannot be regarded as a universal and unbiased tool to increase the library size of VHH and other proteins in mRNA/cDNA display. However, we believe the obtained results provided fruitful insights into our basic understanding on the efficiency of mRNA-protein fusion as well as on the mRNA-ribosome complex.
Conclusion
In vitro display technologies are powerful tools to create and select functional peptides, but efficiency of mRNA-protein fusion is sometimes low, which needs to be addressed to improve the chance of successful selection. Also the factors that affect this efficiency is not well understood. Here, we inserted random sequences at both termini of the coding regions of a VHH, and performed our original EMSA-based selections. As results, we identified specific sequences that enhanced formation of mRNA-protein conjugate of the VHH. The degree of enhancement was only about 20%, but the difference from the original VHH was significant. Also, the 5′- and 3′-sequences seemed to have additive effects for the enhancement. Although the found sequences should not be generally used to increase the formation of mRNA-protein conjugates, this strategy itself will be applicable to other proteins. Moreover, the results gave us important suggestion that conformation of mRNA as a whole is a critical factor to determine mRNA-protein fusion efficiency.mRNA display is a robust technology to identify functional proteins from large libraries by in vitro selection, which involves covalently fusing the mRNA to its coding protein via a puromycin-linker. However, low fusion efficiency for some proteins, in particular antibodies, remained to be solved. Here, we describe improvement of the mRNA-protein fusion efficiency of a single-domain antibody by an electrophoresis mobility shift assay-based selection of mRNA display molecules. We believe the work makes the in vitro selection of affinity reagents using mRNA display more user friendly and practical.
Authors: Ross W Cheloha; Thibault J Harmand; Charlotte Wijne; Thomas U Schwartz; Hidde L Ploegh Journal: J Biol Chem Date: 2020-08-31 Impact factor: 5.157