| Literature DB >> 26150973 |
Zhiwu Xu1, Jieyu Wu1, Fan Feng1, Xiaoxiao Zhang2, Xiaoqian Ma1, Man Tang1, Yan Huang1, Ying Zhang3, Yongchang Cao1, Weiguo Cao4, Ran He1, Ye Gao5, Qiuyun Liu1.
Abstract
Rapid evolution is a hallmark of the viral kingdom and a major concern for developing universal vaccines. The isolation of substantial numbers of viral sequence variants at highly variable viral protein domains remains a major challenge. We previously developed a combinatorial method for the isolation of novel sequences to cope with rapid viral variations at the G-H loop of Foot and Mouth Disease virus VP1 protein [1]. Here we present a modification of that method in its application in the randomization of the hemagglutinin gene from a H5N2 virus, namely: •removal of potentially stressful region which harbored a stretch of basic amino acids to increase the success rates of gene cloning, and to streamline the process of future engineering of novel viral variants.•clustered randomization in a full-length gene, as the positive rate for partial gene fragment libraries was extremely low before enrichment in the previous FMDV studies.•the use of fusion partner was avoided, which was used previously for protein expression, stabilization of clones and reduction of stresses on host cells.•the use of Poisson distribution is proposed to approximate sequencing output to achieve cost effectiveness.Entities:
Keywords: Full length gene; H5N1; Hemagglutinin gene; Mutations; Novel sequences; Viral variations
Year: 2015 PMID: 26150973 PMCID: PMC4487337 DOI: 10.1016/j.mex.2015.02.005
Source DB: PubMed Journal: MethodsX ISSN: 2215-0161
Oligos and primers used in this study.a
| Oligos | DNA sequences |
|---|---|
| Oligo 1 | GCTCTAGAAATAAACCACTTTGAGAAADHSSDWATKW TCCCAAAGTCTTCCTGGTCC |
| Oligo 2 | CCAAAGTCTTCCTGGTCCARTCATRDSKCTWCTKYAG GTGTCTCTTCTGCTTGTY |
| Oligo 3 | CCGAATTCACATTTCTGAAAAAGGAAGRSBYACCSDD GTATRRACAAGCAGAAGACA |
| Primer 4 | ATGGATCCAAAAATGGATCAGATTTGCATTGGTTTCC |
| Primer 5 | TCTCAAAGTGGTTTATTCTGCTCAATAGGTG |
| Primer 6 | CTCCTTTTTCAGAAATGTGGTATGGCTC |
| Primer 7 | AAATAGTCCTCTGGTTTCTCTTTGAGGGGTATTTCTGAGTCCAGT |
| Primer 8 | AAAGAGAAACCAGAGGACTATTTGGAGCTATAGCAGGGTTTATAGAG |
| Primer 9 | CCCAAGCTTTCATATTTGGTAAGTTCCCATTGATTCCAATTTTACTCCAC |
| Primer 10 | GATGAGCCATACCACATTTCTGAAAAAGGAG |
Representations for degenerate bases: M = A/C, R = A/G, W = A/T, S = G/C, Y = C/T, K = G/T, V = A/G/C, H = A/C/T, D = A/G/T, B = G/C/T, N = A/G/C/T.
Fig. 1Polymorphisms at 121–165 aa regions of the H5N1 hemagglutinin proteins.
Fig. 2HA gene assembly via PCR amplifications. DNA fragments of different sizes were generated in a stepwise fashion, and assembled via overlapping PCR.
Fig. 3Colony PCR yielded 1550 bp products from E. coli transformants. M: molecular standard. Lanes 1–22 were PCR amplifications of 22 colonies.
Fig. 4The sequencing chromatograms of the first (A), second (B) and third (C) variable regions of the transformant library pool.
Comparisons between our method and other methods for discovering novel viral gene variants.
| Our method | Our previous method, or in vivo SElex approach, or other directed evolution methods |
|---|---|
| Working with the full length gene has eliminated the obligatory enrichment step and the use of antioxidant. Consequently the positive rate was about 26 fold higher than the previous FMDV gene fragment libraries before the enrichment procedure. Sequences obtained were ready for making DNA vaccines for animals. | Chimerization of the gene fragments was performed to allow gene expression and reduce stresses on host cells in our previous method. A post-electroporation enrichment procedure was required to increase the discovery rate of positive clones. Immunogenicity may be a problem since the peptides were only 63 residues in length. |
| The adoption of Poisson distribution is proposed to approximate sequencing output to achieve cost effectiveness | Our previous method did not use statistical method for projection of sequencing output. |
| Our method targeted protein coding gene. Protein coding regions could be under higher selection pressure than non-coding regions. | The in vivo SElex approach, or other directed evolution methods mostly targeted untranslated viral regions |
| Our approach can target both highly variable domains and full length proteins. It can generate both clustered and dispersed mutations. Mutations can be radical or conservative. | Error-prone PCR, DNA shuffling and pertinent approaches generated mostly dispersed point mutations |
| Hemagglutinin gene in this study was from a highly virulent H5N2 virus, which shared very high homology with its counterparts in H5N1 viruses. | To our knowledge, directed evolution has only been performed on H5N1 receptor specificity with limited number of mutations introduced |
| The removal of a stretch of basic amino acids may be advantageous, as cationic amino acids frequently feature in antimicrobial peptides, which are also detrimental to animal cells. Vaccine development process can be streamlined with such a prior consideration | To our knowledge, no such concern has been addressed in directed evolution experiments related to Avian influenza. |
Troubleshooting.
| Problems encountered | Solution |
|---|---|
| Fully random design did not work | Try semi-random design which is based on sequence alignment |
| Unable to form clones | Stressful peptides at protein terminals need to be avoided to confer less strain to the host cells. It is best that residues generally enriched in terminals of natural proteins are present in the terminals of randomized proteins and peptides. |
| Too many background clones harboring insert free vector | Try triple restriction digests as reported |
| Plasmid yield could be very low | Use PCR to amplify genes for long term storage |
| Sequences may not have adapted to host perfectly, and expression in | Make protein fusions to reduce stresses, and use some other viral proteins as adjuvant when preparing protein vaccines |
| No colonies after electroporations | Do a calibration of electroporation efficiency; try commercially available competent cells or use highly efficient protocols |
| α complementation for easy visualization or protein induction during cloning | Not recommended for high expression may increase stresses to host cells |
| Difficulty in plasmid sequencing | Try sequencing PCR amplicons; grow clones at lower temperature or in the presence of 1–4% glucose to repress gene expression prior to PCR. |
| Reducing workload | Next generation sequencing can be performed, followed by primer design, PCR of the library pools and cloning. |