| Literature DB >> 15341664 |
Ichiro Tabuchi1, Sayaka Soramoto, Shingo Ueno, Yuzuru Husimi.
Abstract
BACKGROUND: We developed a method to make a various high quality random peptide libraries for evolutionary protein engineering based on a combinatorial DNA synthesis.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15341664 PMCID: PMC520752 DOI: 10.1186/1472-6750-4-19
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Figure 1Scheme of the MLSDS method. The case of three-lines is shown. Uppermost three boxes indicate the state in the DNA synthesizers after the first three synthetic cycles, that is, partial mixtures of triplet codons attached to the CPGs. 1, 2 and 3 denote 1st, 2nd and 3rd letter mixture in the line-1 DNA synthesizer, respectively. Their A:T:G:C mixing ratios are designed with the GA program so that 123 gives a partial mixture of triplet codons without any stop codons. In the same way, 456 and 789 denote corresponding partial mixtures of triplet codons in the line-2 and the line-3 DNA synthesizers, respectively. The equimolar mixture of 123, 456 and 789, which is obtained after split-and-mix procedure, can be designed to give approximately the target amino acid composition without stop codons. Examples of 1,2,.....,9 are shown in Table 1.
Examples of molar mixing ratio of bases for MLSDS libraries
| Line-1 | Line-2 | Line-3 | Line-1 | Line-2 | Line-3 | |||||||||||||
| 2 | 3 | 5 | 5 | 0 | 0 | 2 | 5 | 0 | 5 | 4 | 2 | 2 | 0 | 0 | 3 | 4 | 0 | |
| 0 | 0 | 3 | 4 | 3 | 0 | 3 | 5 | 2 | 0 | 1 | 1 | 3 | 2 | 3 | 4 | 2 | 5 | |
| 3 | 0 | 0 | 4 | 3 | 1 | 1 | 0 | 0 | 5 | 0 | 2 | 5 | 5 | 2 | 4 | 0 | 0 | |
| 3 | 1 | 0 | 0 | 1 | 0 | 1 | 3 | 5 | 3 | 1 | 0 | 2 | 2 | 1 | 2 | 4 | 0 | |
Figure 2Dependence of the final fitness on the number of DNA synthesizers for various libraries. Ordinate: final fitness (=correlation coefficient). Abscissa: line number of DNA synthesizers. Libraries are "Uniform", "Natural", "No Met" and "No Cys".
Comparison of the amino acid compositions of the actually synthesized DNA libraries (the full length library) with the target composition
| Target | Actual | Target | Actual | Target | Actual | |
| Ala | 8.9% | 11.4% | 5.3% | 6.7% | 0.0% | 0.0% |
| Arg | 4.7 | 3.5 | 5.3 | 5.9 | 30.0 | 29.7 |
| Asn | 4.4 | 3.5 | 5.3 | 4.3 | 0.0 | 0.0 |
| Asp | 5.5 | 8.6 | 5.3 | 5.9 | 0.0 | 0.0 |
| Cys | 2.8 | 2.5 | 0.0 | 0.0 | ||
| Gln | 3.9 | 2.1 | 5.3 | 5.5 | 20.0 | 19.6 |
| Glu | 6.2 | 6.9 | 5.3 | 5.5 | 20.0 | 25.0 |
| Gly | 7.8 | 12.0 | 5.3 | 8.2 | 0.0 | 0.0 |
| His | 2.0 | 1.2 | 5.3 | 2.7 | 0.0 | 0.0 |
| Ile | 4.6 | 3.9 | 5.3 | 5.9 | 0.0 | 0.0 |
| Leu | 7.5 | 3.7 | 5.3 | 4.3 | 0.0 | 0.0 |
| Lys | 7.0 | 6.2 | 5.3 | 4.7 | 30.0 | 27.9 |
| Met | 1.7 | 1.1 | 5.3 | 3.5 | 0.0 | 0.0 |
| Phe | 3.5 | 4.0 | 5.3 | 5.1 | 0.0 | 0.0 |
| Pro | 4.6 | 2.5 | 5.3 | 2.4 | 0.0 | 0.0 |
| Ser | 7.1 | 8.1 | 5.3 | 1.6 | 0.0 | 0.0 |
| Thr | 6.0 | 5.8 | 5.3 | 6.7 | 0.0 | 0.0 |
| Trp | 1.1 | 1.8 | 5.3 | 8.2 | 0.0 | 0.0 |
| Tyr | 3.5 | 4.4 | 5.3 | 4.7 | 0.0 | 0.0 |
| Val | 6.9 | 6.9 | 5.3 | 8.2 | 0.0 | 0.0 |
| Term | ||||||
| Number of sequenced codon | - | 568 | - | 255 | - | 219 |
| Correlation coefficient | - | 0.85 | - | 0.66 | - | 1.00 |
| Sum of absolute errors | - | 26.9% | - | 28.1% | - | 5.66% |
"Term" denotes stop codons. In each target composition, [Term] = 0. "Natural": the library with the natural abundance (the average composition in 207 natural proteins) [18]; "No Cys": the library with uniform composition except [Cys] = 0; "c-Fos e', g'-": a c-Fos mutant library containing only 4 kinds of amino acid as same as in Ref.[26].
Figure 3Production of 2×, 4×, and 8× length library. Lane-M: Size marker. 10 base ladder (left) and 100 base ladder (right). Lane-S: Initial synthesized library (87 mer). Lane-1U: Initial single-unit library (94 mer) made by PCR amplification. Lane-2U: Double-unit library (138 mer) made by ligation product of digested 1U. Lane-4U: quadruple-unit library (240 mer) Lane-8U: octuple-unit library (444 mer). PAGE was done with 5 % polyacrylamide, TAE buffer (pH 8.0), 8 M urea, 65°C and stained with SybrGreen I and visualized with a fluorescence imager (Bio-Rad FX).
The comparison between the target amino acids composition and the actual composition of various libraries.
| A. Uniform Type | |||
| Method | 3NPs | mRNA display | MLSDS |
| Reference, Library | [29], 96T λ 6 | [18], Random | This work, No Cys |
| Target value | Equimolar 19 amino acids | Equimolar 20 amino acids | Equimolar 19 amino acids |
| Correlation coefficient a | 0.43 | 0.56 | 0.66 |
| Sum of absolute errors b [%] | 54.5 | 23.7 | 28.1 |
| Percentage of stop codon | ND c | 0.46 | 0.00 |
| Percentage of cassettes containing stop codon | ND c | 8.33 | 0.00 |
| B. Doping Type | |||
| Method | 3NPs | mRNA display | MLSDS |
| Reference, Library | [29], 100z | [18], β-Cassette polar | This work, c-Fos, e', g'- |
| Target value | Biased | Polar amino acid | Biased |
| Correlation coefficient a | 0.94 | 0.56 | 1.00 |
| Sum of absolute errors b [%] | 42.4 | 73.7 | 5.66 |
| Percentage of stop codon | ND c | 0.39 | 0.00 |
| Percentage of cassettes containing stop codon | ND c | 6.25 | 0.00 |
The values were calculated using the data of full-length (without deletion) libraries. Values of 3NPs and mRNA display method were calculated with some assumptions. In mRNA display method, the value of stop codons were estimated from data described in the article and some assumptions; one cassette did not have two or more stop codons and stop codons appeared equally in every position. The target composition of β-Cassette polar is assumed that polar amino acids are equi-molar and the others are 0%.
aCorrelation coefficient is calculated using target and actual amio acids composition including stop codons contribution. And it is assumed that stop codons were not appeared in full-length libraries of 3NPs method.
b Sum of absolute values of difference between each target and actual amino acid.
c ND is abbreviation of no data.