| Literature DB >> 29420788 |
Arie Ryvkin1, Haim Ashkenazy1, Yael Weiss-Ottolenghi1, Chen Piller1, Tal Pupko1, Jonathan M Gershoni1.
Abstract
Peptide-expressing phage display libraries are widely used for the interrogation of antibodies. Affinity selected peptides are then analyzed to discover epitope mimetics, or are subjected to computational algorithms for epitope prediction. A critical assumption for these applications is the random representation of amino acids in the initial naïve peptide library. In a previous study, we implemented next generation sequencing to evaluate a naïve library and discovered severe deviations from randomness in UAG codon over-representation as well as in high G phosphoramidite abundance causing amino acid distribution biases. In this study, we demonstrate that the UAG over-representation can be attributed to the burden imposed on the phage upon the assembly of the recombinant Protein 8 subunits. This was corrected by constructing the libraries using supE44-containing bacteria which suppress the UAG driven abortive termination. We also demonstrate that the overabundance of G stems from variant synthesis-efficiency and can be corrected using compensating oligonucleotide-mixtures calibrated by mass spectroscopy. Construction of libraries implementing these correctives results in markedly improved libraries that display random distribution of amino acids, thus ensuring that enriched peptides obtained in biopanning represent a genuine selection event, a fundamental assumption for phage display applications.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29420788 PMCID: PMC5961013 DOI: 10.1093/nar/gky077
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Oligonucleotides used for Illumina sequencing
| # | Name | Sequence (5′→ 3′) | Use |
|---|---|---|---|
| 1 | fth1-DP_F | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCT | PCR for sample preparation for Illumina |
| 2 | fth1-DP_R | CAAGCAGAAGACGGCATACGAGCTCTTCCGATCT | sequencing—fth1-DP libraries |
| 3 | fth1_Fp8_BC | AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCTNNNNNCAACGTGGC | PCR for sample preparation for Illumina |
| 4 | fth1_Rp8 | CAAGCAGAAGACGGCATACGAGCTCTTCCGATCTGGCCCCAGAGGC | sequencing—fth1 libraries |
Figure 1.Distribution analysis of phage display peptide libraries. Oligonucleotides were synthesized and inserted into the fth1-DP (A and B) or fth1 (C and D) phage display systems and sequenced using Illumina NGS. (1) Parameters relevant to library construction. (2) Pie chart depicting the proportion of in-frame peptides in phage display libraries. The percentage of phages containing at least one of the UGA or UAA stop codons in their recombinant Protein 8 sequence is shown in red. Phages completely devoid of stop codons in their recombinant Protein 8 are shown in blue and peptides containing at least one UAG codon translated as glutamine are colored green. (3) The nucleotides distribution for five representative NNK codons. The oligonucleotides were constructed in an NNK format, where N position should contain equal 25% for each nucleotide and K position 50% of G and T. (4) The normalized amino acid distribution. The red horizontal line denotes the expected value (1/21 = 0.048, expected for 20 amino acids plus the additional glutamine which results from UAG suppression). (E) These data were collected from the commercially available Ph.D. 7 NEB library as described in text. (1) The nucleotide distribution for five representative NNK codons. (2) The normalized amino acid distribution.
Libraries characteristics
| Library | Type | Theoretical complexity | Total number of peptides | Number of unique peptides |
|---|---|---|---|---|
| First generation | 7 | 1.28E+09 | 116,516 | 112,153 |
| Second generation | C8C | 2.56E+10 | 57,871 | 57,845 |
| Third generation | C10C | 1.02E+13 | 2,475,659 | 1,898,920 |
| Fourth generation | C10C | 1.02E+13 | 2,048,929 | 1,484,655 |
| NEB Ph.D.-7 | 7 | 1.28E+09 | 3,960,150 | 3,137,986 |
Note: The library titers for all four generations are similar. The differences in the ‘total number of peptides’ are the result of different number of phages sequenced for each library.
Figure 2.Comparison of UAG bearing phages in supE44- versus supE44+ bacteria. A C10C phage library was grown in supE44+ (DH5α) and supE44− (MC1061) in two independent repeats (red and cyan). Each library was sequenced three times after 1 and 19 h and the percentage of peptides carrying at least a single UAG codon is shown. Each boxplot represents three libraries taken from the same bacterial strain at the same time.
Figure 3.Oligonucleotide library calibration pipeline. (A) Baseline calibration. Four oligonucleotides comprised of 12 consecutive thymidines and a single additional oligonucleotide were prepared. Each oligo was HPLC purified, and mixed in the desired 1:1:1:1 molar ratio (for random N position in future library). The mixture was analyzed using mass spectrometry to be used as a correct molar ratio reference point, post synthesis. (B) Discovering the stock ACGT molar ratio that yields the desired equimolar post synthesis ratio. Thymidine 12mers were subjected to a single synthesis cycle with various ACGT ratio stock vials. To find the stock vial ratio giving equimolar synthesized ratio, the resulting oligonucleotides were Mass Spec analyzed to locate the one with the same profile discovered in (A), indicating a synthesized 1:1:1:1 molar ratio. The ratio found in (B) was used for future library production.