Literature DB >> 28336957

Next-generation sequencing enables the discovery of more diverse positive clones from a phage-displayed antibody library.

Wonjun Yang1,2,3, Aerin Yoon1,3, Sanghoon Lee4, Soohyun Kim1,2,3, Jungwon Han1,3, Junho Chung1,2,3.   

Abstract

Phage display technology provides a powerful tool to screen a library for a binding molecule via an enrichment process. It has been adopted as a critical technology in the development of therapeutic antibodies. However, a major drawback of phage display technology is that because the degree of the enrichment cannot be controlled during the bio-panning process, it frequently results in a limited number of clones. In this study, we applied next-generation sequencing (NGS) to screen clones from a library and determine whether a greater number of clones can be identified using NGS than using conventional methods. Three chicken immune single-chain variable fragment (scFv) libraries were subjected to bio-panning on prostate-specific antigen (PSA). Phagemid DNA prepared from the original libraries as well as from the Escherichia coli pool after each round of bio-panning was analyzed using NGS, and the heavy chain complementarity-determining region 3 (HCDR3) sequences of the scFv clones were determined. Subsequently, through two-step linker PCR and cloning, the entire scFv gene was retrieved and analyzed for its reactivity to PSA in a phage enzyme immunoassay. After four rounds of bio-panning, the conventional colony screening method was performed for comparison. The scFv clones retrieved from NGS analysis included all clones identified by the conventional colony screening method as well as many additional clones. The enrichment of the HCDR3 sequence throughout the bio-panning process was a positive predictive factor for the selection of PSA-reactive scFv clones.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28336957      PMCID: PMC5382563          DOI: 10.1038/emm.2017.22

Source DB:  PubMed          Journal:  Exp Mol Med        ISSN: 1226-3613            Impact factor:   8.718


Introduction

One of the most important products on the therapeutic recombinant protein market is the monoclonal antibody. More than 54 therapeutic antibodies have been approved for various indications, including cancer and autoimmune diseases.[1] Traditionally, therapeutic antibodies have been generated by mouse B-cell hybridoma technology followed by chimerization or humanization.[2] In the past few decades, technologies such as transgenic mice encompassing human antibody gene repertoires, and phage display of antibody libraries, have become available, facilitating the rapid flourishing of therapeutic antibodies in the drug discovery field.[3] Phage display technology frequently allows the creation of libraries containing up to 1011 different variants, which can be used to screen antibody clones by bio-panning.[4] Despite the development of alternative display technologies such as bacterial display, yeast display and ribosome display, phage display remains the most widely used display technology due to the robustness of the filamentous bacteriophage M13.[5] Several therapeutic antibodies that are currently either approved or in clinical trials have been developed by phage display technology.[6, 7] Recently, next-generation sequencing (NGS) technology has allowed a massive increase in capacity to sequence genomes at relatively low cost and in a short time frame.[8] It has revolutionized multiple aspects of biological research[5] and is also actively being adopted into antibody phage display technology. Several NGS platforms are currently available, with average read lengths of 75–8500 bp and different error rates.[9] The CDR3 sequence of the VH and VL genes has been effectively determined by the MiSeq system;[10] a single-domain antibody gene was successfully determined by the MiSeq system using a 2 × 250 paired-end module;[11] and the entire VH gene was successfully sequenced using the 454 pyrosequencing system.[12] However, sequencing of the entire single-chain variable fragment (scFv) gene, which contains 750–800 bases, could not be achieved using any of these NGS platforms, to the extent of the authors' knowledge. In one study, to obtain the whole scFv gene sequence, HCDR3 sequences were first determined by the MiSeq system; the entire scFv gene was then generated by two-step linker PCR using primers based on the heavy chain complementarity-determining region 3 (HCDR3) sequences, and its sequence was determined by Sanger sequencing analysis.[6] In another similar study, HCDR3/FR4 sequences were determined from Ion Torrent PGM sequence analysis using the 318 chip. Then, the entire scFv gene was retrieved by inverse PCR using primers based on the HCDR3/FR4 sequences.[13] Following NGS analysis, the antibody gene is typically cloned and expressed. And the binding reactivity of the antibody to its target as well as its biological activity are tested. However, this may prove to be unproductive when the fraction of positive clones is not high following bio-panning. It has been extensively reported that positive clones tend to be enriched through bio-panning and negative clones show the opposite tendency. Therefore, NGS analysis of clones after each round of bio-panning could provide insights on which clones are more likely to be positive. Furthermore, it is unknown whether there is a difference between scFv clones identified by conventional colony screening methods[14] and those obtained from NGS. In this study, we have attempted to answer these questions. We performed four rounds of bio-panning using three scFv libraries constructed from prostate-specific antigen (PSA)-immunized chickens. We then performed NGS analysis of scFv clones focusing on HCDR3 in the initial scFv library and in four enriched scFv libraries obtained from subsequent rounds of bio-panning. scFv clones were obtained after the last round of bio-panning using the conventional colony screening method from the output titer plate, or from phagemid DNA prepared following a previously reported procedure.[6] The reactivity of these scFv clones was measured using a phage enzyme immunoassay. Based on these experiments, the sequences obtained using NGS and the conventional colony screening method were compared. We also classified scFv clones obtained from NGS into 3–4 clusters based on their enrichment or impoverishment patterns, and analyzed these patterns for clues regarding the reactivity of scFv clones.

Materials and methods

Library construction and bio-panning

Three white leghorn chickens were immunized and boosted four times with recombinant human PSA (Fitzgerald, Acton, MA, USA). After the final booster injection, total RNA was extracted from the spleen, bone marrow, and bursa of Fabricius using the TRI Reagent (Invitrogen, Grand Island, NY, USA). First-strand cDNA was synthesized using SuperScript reverse transcriptase with oligo (dT) priming (Invitrogen). Using this cDNA, three phage-displayed libraries of chicken scFvs were constructed using the pComb3XSS phagemid vector, as described previously.[14] Four rounds of bio-panning were performed to screen scFv clones from the library following a previously reported procedure.[15] For each round of bio-panning, 5 × 106 magnetic beads (Dynabeads M-270 epoxy) (Invitrogen) coated with 1.5 μg recombinant PSA protein were used.

Phage enzyme immunoassay

The scFv-displaying phages were rescued from titer plates after transformation and subjected to phage enzyme immunoassay as described previously.[14] The microtiter plates (Corning, NY, USA) were coated overnight at 4 °C with 20 μl recombinant human Fc-tagged PSA (5 μl ml−1) dissolved in phosphate-buffered saline (PBS). After blocking with 3% bovine serum albumin dissolved in PBS (w/v, PBS-B), the plates were then sequentially incubated with scFv-displaying phages in the culture supernatant, horseradish peroxidase (HRP)-conjugated mouse anti-M13 monoclonal antibody (GE Healthcare, Pittsburg, PA, USA) in PBS-B, and then finally with 2,2′-Azinobis [3-ethylbenzothiazoline-6-sulfonic acid]-diammonium salt (ABTS) substrate solutions (Amresco LLC, Solon, OH, USA), with intermittent washing using 0.05% Tween-20 in PBS (PBST). After incubating the plates at 37 °C for 10 min, the optical density was measured at 405 nm using a microtiter plate reader (Labsystems AiG SL, Barcelona, Spain).

Sanger sequencing analysis

Phagemid DNA from selected clones identified by phage enzyme immunoassays was prepared with a small-scale plasmid preparation kit (Qiagen, Hilden, Germany). The OmpSeq primer (5′-AAGACAGCTATCGCGATTGCAG-3′) and HRML-F primer (5′-GGTGGTTCCTCTAGATCTTCC-3′) were used to sequence the VH and VL chains of the antibody.[14] Sequence analysis of positive clones (O.D.405nm>0.3) was performed by Macrogen (Seoul, Korea).

Next-generation sequencing analysis

NGS analysis was performed as described previously.[16] A total of 15 sets of phagemid DNA including three initial chicken scFv libraries and three libraries obtained after each of four rounds of bio-panning were analyzed using a MiSeq system (Illumina Inc., San Diego, CA, USA). The MiSeq library for DNA sequencing was prepared using Illumina Nextera XT chemistry (Illumina) following the protocol provided by the manufacturer. The genes from the chicken library were amplified using the forward primer (pre-adaptor, 5′-TCGTCGGCAGCGTC-3′ sequencing primer, 5′-AGATGTGTATAAGAGACAG-3′ specific locus primer, 5′-TCAGCCTCGTCTGCAAGG-3′), and reverse primer (pre-adaptor, 5′-GTCTCGTGGGCTCGG-3′ sequencing primer, 5′-AGATGTGTATAAGAGACAG-3′ specific locus primer, 5′-AGTGGAGGAGACGATGACTTC-3′), respectively. The final libraries were normalized by quantification with LightCycler 480 II (Roche Applied Science, Indianapolis, IN, USA) and qualification with Bioanalyzer (Agilent, Palo Alto, CA, USA). The final loading concentration was adjusted to 11 pM following the MiSeq loading protocol. The MiSeq reagent kit v3 (Illumina) was used for long paired-end reads (2 × 300 bp) sequencing reactions. The sequencing data was processed by CLC Genomics Workbench version 5 (CLC Bio, Aarhus, Denmark) software. Low-quality sequencing data were first trimmed depending on quality scores using PHRED with the minimum quality score of 20 and reads with less than 150 bases in length were discarded.[17] The cleaned-up sequencing data were processed by merging the paired-end sequence reads using fast length adjustment of short reads to obtain complete sequences of the chicken scFv libraries.[18] Sequencing data were further cleaned up using PRINSEQ (San Diego State University, San Diego, CA, USA), setting the minimum quality score at 20 and read length at 150.[17] EMBOSS Needle 6.5.0.0 (The European Bioinformatics Institute (EMBL-EBI), UK) was used to map sequence read in the HCDR3 region, with a threshold score of 300.[19] Subsequently, a custom Perl script was used to summarize and count sequence reads in 15 sets of phagemid DNA. We merged the read counts across all the panning rounds, but for computational and statistical analysis, we only counted the reads existing in the phagemid DNA after the fourth bio-panning round.

Clustering analysis

An optimized number of clusters in the merged sequence read counts was estimated using the clValid algorithm, to facilitate pattern analysis of NGS data for population shifts in antibody clones throughout the bio-panning process.[20] The clValid algorithm validated number of clusters by assessing intra-cluster homogeneity and inter-cluster separation, and the assessment for each and every clustering is represented in the Dunn index.[20] A higher Dunn index indicates better clustering. The ‘Internal' cluster validation metrics were chosen, which consider only the data set and the clustering partition, and the intrinsic properties of the data were used to evaluate the quality of the clustering results in designated clustering algorithms such as hierarchical clustering and k-mean clustering.[21] Unsupervised hierarchical clustering analysis was used to cluster HCDR3 sequences according to the number of clusters estimated by clValid. Ward's method was used to measure distances between sequence reads based on read counts throughout the bio-panning, and a heat map visualizing the sequence read changes in each cluster was generated using Gene Pattern v3.9.2 software.[22] Line charts representing the pattern of sequence read changes in each cluster across all the bio-panning rounds were then generated as in a previous study.

Cloning to retrieve scFvs

To rebuild real scFv clones from the virtual HCDR3 sequences in the clusters, we performed two-step linker PCR. In the first PCR step, primers targeting both LFR1-HCDR3 (LFR1_F primer, 5′-GTGGCCCAGGCGGCCCTG-3′) and HCDR3-HFR4 fragments (HFR4_R primer, 5′-CTGGCCGGCCTGGCCACT-3′) were synthesized, based on HCDR3 sequences determined in NGS analysis and phagemid DNA obtained after the 4th round of bio-panning. The second PCR step linked these two gene fragments into a single scFv gene using primers annealing to LFR1 and HFR4 (LFR1_F primer, 5′-GTGGCCCAGGCGGCCCTG-3′ HFR4_R primer, 5′-CTGGCCGGCCTGGCCACT-3′). The scFv gene was ligated into the pComb3XSS phagemid vector and rescued as scFv-displaying phages, as described previously.[14] To measure the binding reactivity of these scFv-displaying phages, we rescued more than 15 clones per HCDR3 sequence, and performed phage enzyme immunoassay as described earlier. We regarded the clone providing the highest optical density at 405 nm as the retrieved clone.

Statistical analysis

Statistical analysis was performed with GraphPad Prism 5 software. Specific P-values and statistical methods are provided in the figure legends.

Results

Generation of antibody library and screening for positive clones using the conventional colony screening method

Using mRNA prepared from spleen, bone marrow, and bursa of Fabricius from three PSA-immunized chickens, we generated scFv libraries with complexities of 6.09 × 1010, 3.64 × 1010 and 5.16 × 1010 clones, respectively, referred to as chicken libraries 1, 2 and 3. Next, we performed four rounds of bio-panning, rescued phage clones from the output titer plate of the fourth round, and performed a phage enzyme immunoassay to screen for positive clones. A total of 300 clones (100 clones in each library) exhibiting an optical density of >0.3 at 405 nm were considered to be positive, and their scFv gene sequence was determined by Sanger sequencing analysis. We finally obtained 22 clones with unique HCDR3 sequences (Table 1).
Table 1

HCDR3 amino-acid sequences selected using the conventional colony screening method, and binding reactivity measurement of the antibody clones

LibraryCluster labelSequence of HCDR3Proportion of NGS (%)Proportion of conventional method (%)Binding reactivity (O.D.405 nm)
Library 1Cluster 1DFGSGVGEIDA3.811.041.010
  GIESDSDGYMTAEEIDA0.131.040.977
 Cluster 2AAHSTYIWGGYEAGSIDA6.494.170.669
  SAVSSCSSGSCSASWIDA1.162.080.873
  TADDGFSCGGYGLCADRIDA0.391.040.723
  ESGNGGWITAARIDA0.081.040.767
  SSHSTYIWGAYEAGSIDA0.032.080.651
 Cluster 4APGTGSGYCGIWTYTTAGCIDA0.031.040.964
  GRISYICADYDAGCIDA0.025.211.063
  SSHSTYIWGGYEAGSIDA0.012.080.916
Library 2Cluster 2SSYSDGATVIYNIDA0.691.040.870
 Cluster 3GRISYICADYDAGCIDA0.046.251.063
  AAGSWCAWGTGSCAGSIDA0.025.211.067
  AAGSWCAWGTGSCAGNIDA0.011.040.985
  TTGGDFYSGIDTAGYIDA0.015.210.938
  APGTGSGYCGIWTYTTAGCIDA0.013.130.964
Library 3Cluster 2AAGSGYIYSGSAGWIDA1.073.130.941
 Cluster 3AAGSWCAWGTGSCAGSIDA0.034.170.918
  GRISYICADYDAGCIDA0.028.331.063
  TTGGDFYSGIDTAGYIDA0.022.080.889
  AAGSWCAWGAGSCAGSIDA0.011.040.914
  AAGSGYVYSGSAGWIDA0.012.081.021

Abbreviations: HCDR3; heavy chain complementarity-determining region 3; NGS, next-generation sequencing; O.D., optical density.

Diversity analysis of antibody clones using next-generation sequencing

A total of 15 sets of phagemid DNA (three chicken libraries from bio-panning rounds 0, 1, 2, 3, and 4) were used for NGS analysis. After the NGS experiment, we obtained 60,000–180,000 VH sequences. Raw paired-end nucleotide sequences were merged, filtered, aligned and trimmed by uniformly applying pre-specified criteria to remove low-quality and meaningless short sequences. The numbers of nucleotide sequences remaining after each preprocess are summarized in Table 2; 44–53% of the original sequences were retained after aligning with OmpSeq primer sequence[14] by Needle, and were used in subsequent analyses. From the NGS results, the total population of VH fragment nucleotides decreased as the bio-panning rounds proceeded. To analyze HCDR3 diversity and frequency, we used HCDR3 sequences existing only in the fourth bio-panning round. clValid predicted that 2–6 clusters would be the most dependable in the HCDR3 sequence count profile data (Table 3). The sequence reads in chicken library 1 showed the maximum Dunn index (0.1048) with 4–6 clusters, and chicken libraries 2 and 3 had maximum Dunn indices with 2–3 clusters. We clustered HCDR3 sequences into 2–6 clusters using hierarchical clustering, and generated heat maps for each cluster to examine the patterns of HCDR3 sequence enrichment and population shift throughout the bio-panning rounds. The pattern of HCDR3 sequence enrichment and population shift in chicken library 1 showed four clear clusters, and the patterns in chicken libraries 2 and 3 showed three clear clusters (Figure 1).
Table 2

Sequence read counts by preprocessing raw sequencing data

    Trimmed by Prinseq  
 Panning roundRaw sequencing read count (paired-end FASTQ)Read count after merging paired-end sequences by FLASHRead ount of qualified sequencesRead count of disqualified sequencesSequence read count aligned with HCDR3 region by NEEDLE (percentage in sequences merged by FLASH)Unique nucleotide sequence count
Library 1R0664 955393 749393 624125310 589 (78.9)205 255
 R1663 061377 630377 484146298 474 (79)198 150
 R2391 118229 873229 773100181 430 (78.9)128 513
 R3673 875388 341388 179162314 517 (81)148 787
 R4621 174379 630379 61119334 387 (88.1)27 141
Library 2R0432 274256 268256 19969193 262 (75.4)148 862
 R1661 248417 426417 323103316 150 (75.7)221 423
 R2608 850363 553363 46093274 100 (75.4)197 190
 R3547 353342 189342 12366289 287 (84.5)66 545
 R4455 119290 741290 72219274 635 (94.5)22 763
Library 3R0616 410360 830360 78347279 996 (77.6)164 869
 R1608 045370 090370 03357288 172 (77.9)167 249
 R2619 731373 093373 03855290 056 (77.7)168 084
 R3690 602419 796419 75739343 996 (81.9)74 611
 R4568 948354 314354 30113287 126 (81)21 884

Abbreviations: FLASH, fast length adjustment of short reads; HCDR3, heavy chain complementarity-determining region 3.

Table 3

Dunn index on hierarchical clustering to estimate optimal number of clusters in scFv nucleotide sequence profile data

 Number of clusters
 23456
Library 10.08630.07230.10480.10480.1048
Library 20.23310.23310.05640.05640.0845
Library 30.15080.18600.15440.08930.0893

Abbreviation: scFv, single-chain variable fragment. Bold numbers indicate the largest Dunn index in each library.

Figure 1

Heat map representing the population of heavy chain complementarity-determining region 3 (HCDR3) sequences in each cluster through bio-panning rounds. Red and blue denote high and low proportions of the HCDR3 sequence, respectively. (a) scFv library 1, (b) scFv library 2 and (c) scFv library 3.

Population shift in HCDR3 sequences throughout bio-panning rounds

The diversity of the antibody clones is represented by the number of HCDR3 sequences that belong to each cluster (Figure 1). The abundance of the HCDR3 sequences in each cluster is represented by heat map color; high and low populations are indicated in red and blue, respectively. HCDR3 sequences in cluster 1 were highly abundant before bio-panning and up to the second bio-panning round. However, there was a sudden impoverishment in rounds 3 and 4 of bio-panning. In contrast, HCDR3 sequences that belonged to clusters 2 and 3 (including cluster 4 of library 1) showed the opposite pattern. Their populations were very low before bio-panning, remained low after the second round of bio-panning, and started to enrich from the third round of bio-panning. The increase continued in the fourth round of bio-panning. This population shift of HCDR3 sequences throughout bio-panning is represented in Figure 2. All 22 HCDR3 sequences in clones found via the conventional colony screening method existed among the HCDR3 sequences obtained from NGS analysis of phagemid DNA prepared after the fourth round of bio-panning (Table 1). Two out of the 22 unique HCDR3 sequences belonged to cluster 1, and the other 20 HCDR3 sequences belonged to clusters 2, 3 or 4.
Figure 2

Line graph representing population shifts in HCDR3 sequences through bio-panning rounds. (a) scFv library 1, (b) scFv library 2 and (c) scFv library 3.

Reactivity of scFv clones identified in NGS analysis

For each cluster, 1–5 HCDR3 sequences newly identified from the fourth round of bio-panning via NGS analysis were selected arbitrarily (Table 4). These selected sequences were used to synthesize the primers to retrieve the whole scFv gene from the phagemid DNA. The scFv gene was prepared in two-step linker PCR using the primers and cloned into a phagemid vector (Figure 3). After transformation of the phagemid vector-encoding scFv gene and rescue with helper phage, scFv-displaying phage was used to test their binding reactivity against PSA (Figure 4). In cluster 1, across the three libraries, 12 out of 14 antibody clones (85.7%) had negligible binding reactivity against PSA (O.D.450nm<0.2; Table 4, blue). In contrast, 21 out of 26 antibody clones (80.8%) in clusters 2~4 across the three libraries had significant binding reactivity (O.D.450nm>0.3; Table 4, red). These results imply that antibody clones with low reactivity tend to be impoverished throughout bio-panning (cluster 1), in contrast to the antibody clones with high reactivity, which showed enrichment throughout bio-panning (clusters 2~4).
Table 4

HCDR3 amino-acid sequences selected in each cluster from NGS and binding reactivity measurement of antibody clones

LibraryCluster labelHCDR3 SequenceProportion of the sequence in R4Read count of identified sequences
Binding reactivity (O.D.405 nm)
    R0R1R2R3R4 
Library 1Cluster 1GVYSGSPDGYDIDA0.32%502550289113312350.454
  TTCVGSSYCGGENIDA0.16%80618199478662736030.173
  GAYSDWGAGFIDA0.08%20162033123718093010.161
  DGDSGWGVYLNSAGNIDA0.03%392519761330.153
 Cluster 2YAGSGWTYYSSDVGSIDA2.16%012149883140.620
  GVYSASGCCDSIDT1.93%002144574431.032
  SAHSTYIWGGYEAGSIDA1.41%010104954201.075
  GGGAGYGAPSIDT1.05%00086640340.871
  DVYSGLITANTIDA0.67%01132526070.639
 Cluster 3SSHSTYIWGAYEAGCIDA0.02%5005640.757
  RAYGGGYCGCIEDIDA0.01%00012440.323
  AASTWSFYGSAEDIDA0.01%0003310.725
 Cluster 4APGTGSGYCGIWTYTTAGSIDA0.04%0001390.323
  GRISYICADYEAGSIDA0.02%0000610.407
Library 2Cluster 1GAYGHCDGWCAVDSIDT0.07%1673261024308231960.175
  AAGSGYCGWGDCIAGSIDA0.07%1081591391841930.167
  GIYGYSGGDYAAAEIDA0.06%1145181517126211670.179
  GAGGSCDGGSWCSPGIIDA0.04%1423217919645951210.187
  TRGGAGSGWYWYSGIAGIIDA0.03%78211721118399960.180
 Cluster 2TAGCGPWSYITAGCIDA0.21%0069696041.119
  DAAYGYCGTWAGCAGRIDA0.21%12223754046061.187
  CAYSGCTGGWSTSSIDA0.20%18231910465921.007
  DVYGCNSYGCPYIGNTIDA0.09%0231902591.254
  RAFSGCCDADSIDA0.07%4532751950.845
 Cluster 3SSSGTTYYSSGVISAGGIDA0.17%000624880.167
  GRISYICVDYDAGCIDA0.07%000592090.706
  NAYTSAYITDIDS0.06%0111031880.944
  SAYSDSCCAEDIDA0.04%001531060.876
  SAFGGGACCYTAGTIDA0.03%040151030.165
Library 3Cluster 1DGSGCGWSAAGCIDA0.35%9970103851043846399240.160
  AATYSWLHSGIDA0.29%11210410324610450.728
  DGSDCGWSAAGCIDA0.06%24302498247611642220.146
  GTGSWCYSGADSIDT0.06%22062381236710062070.167
  SAAGYWYAGSIDA0.05%108121211940.138
 Cluster 2TAGGDFYSGVDTAGYIDA4.79%1143070171871.064
 Cluster 3GSGYSCWSYAGCIDA0.66%111103421321.083
  GRIYYICADYDAGCIDA0.53%00142918901.052
  TADSGFGCGGYGLCAAFIDA0.09%2227433030.907
  TADIGYCFGGGIGCIDA0.08%000862890.984
  SAGGSYGYRYMDTAAAIDA0.07%2111952690.861

Abbreviations: HCDR3; heavy chain complementarity-determining region 3; NGS, next-generation sequencing.

Figure 3

Schematic representation of next-generation sequencing and two-step linker PCR. The structure of scFv gene, CDRs and frameworks of variable regions are indicated by colored boxes. (a) For NGS analysis, most of VH region including HCDR3 was amplified and sequenced using specific primers as described in materials and methods. The sequencing coverage is indicated with dashed lines. (b) To retrieve scFv gene, two-step linker PCR was performed using primers annealing to HCDR3, LFR1 and HFR4. The first step of PCR was performed using LFR1_F and HCDR3_R primers and HCDR3_F and HFR4_R primers. The linker PCR was performed using LFR1_F and HFR4_R primers.

Figure 4

Binding reactivity of scFv antibodies retrieved from selected HCDR3 amino-acid sequences in each cluster using NGS. (a) scFv library 1, (b) scFv library 2 and (c) scFv library 3. ANOVA with Turkey's multiple-comparison test was used to compare cluster 1 with other clusters. In library 3, the P-value was calculated using the Mann–Whitney U-test. *P-value <0.05; **P-value <0.01; ***P-value <0.001. ANOVA, analysis of variance.

Discussion

Phage display technology is commonly employed in the development of therapeutic antibodies.[23] One of the major drawbacks of phage display technology lies in the fact that some clones become dominant through the bio-panning process, and frequently, only limited numbers of clones became available at the end of the screening.[16] In this study, we showed that NGS analysis provided not only all of the HCDR3 sequences of clones identified using the conventional colony screening method but also new HCDR3 sequences. The proportion of HCDR3 sequences found by the conventional colony screening method varied from 0.01 to 6.49% of the HCDR3 sequences identified by NGS analysis (Table 1). There was no significant correlation between these two proportions. After we successfully retrieved the entire scFv gene via two-step linker PCR using PCR primers designed based on the HCDR3 sequences from NGS, we measured the binding reactivity of these antibody clones. Of the 40 clones retrieved from the phagemid DNA pool prepared after the fourth round of bio-panning, positive binding reactivity was confirmed in 26 clones. Four clones in library 1 and one clone in library 3 exhibited a proportion of >1% among the HCDR3 sequences obtained after the fourth round. The proportion of two positive clones in library 1 and library 3 were 2.16 and 4.79%, respectively. All these clones successfully formed colonies after the retrieval process. Why these clones with such high proportions were not identified in the conventional colony screening method is not clear; however, it might be caused by either inherent toxicity with phage assembly or interference from bacterial growth.[6, 24] Among the clones retrieved, thirteen clones with proportions less than 0.1% showed binding reactivity. Two of the positive clones were present at a proportion of 0.01%. Theoretically, screening for a clone with such low proportions via the conventional colony screening method requires either 1000 or 10000 positive colonies and Sanger sequencing analysis, which would require significant resources of time and cost. Many scFv clones prove difficult to identify by the conventional colony screening method from the phage pool obtained after bio-panning, and a method for retrieving these scFv clones in a high-throughput way has not yet been developed. Using currently available NGS tools, it is not possible to sequence the entire scFv gene, which is about 750 bp in length, without error. Our study and the other previous studies[6, 13] have proved that the scFv gene can be amplified by PCR primers designed based on HCDR3/FR4 sequences. However, there is always the possibility of cross-priming between clones during the PCR process. In this study, we also confirmed that clones retrieved via two-step linker PCR are typically a mixture of both negative and positive clones. We also proved that when a certain HCDR3 sequence was enriched through the bio-panning process, the clone with the HCDR3 is more likely to be positive; in this study over 80% (21 out of 26 clones) (Table 4). Monitoring the enrichment or impoverishment pattern of HCDR3 during the bio-panning process might increase the efficiency of retrieving clones from NGS analysis. However, this two-step linker PCR and cloning process is very difficult to perform in a high-throughput manner and there is definitely a need for better way to retrieve whole scFv gene. In summary, NGS analysis of the HCDR3 sequence, and two-step linker PCR using PCR primers based on this sequence, provide an effective way to retrieve antigen-specific scFv clones that are difficult to identify by the conventional colony screening method. Enrichment of the HCDR3 sequence over the bio-panning process is a positive predictive factor in the selection of scFv clones harboring binding reactivity.
  21 in total

1.  Concepts in antibody phage display.

Authors:  Sara Carmen; Lutz Jermutus
Journal:  Brief Funct Genomic Proteomic       Date:  2002-07

Review 2.  Using GenePattern for gene expression analysis.

Authors:  Heidi Kuehn; Arthur Liberzon; Michael Reich; Jill P Mesirov
Journal:  Curr Protoc Bioinformatics       Date:  2008-06

Review 3.  The therapeutic monoclonal antibody market.

Authors:  Dawn M Ecker; Susan Dana Jones; Howard L Levine
Journal:  MAbs       Date:  2015       Impact factor: 5.857

Review 4.  Next-Generation Sequencing: A Review of Technologies and Tools for Wound Microbiome Research.

Authors:  Brendan P Hodkinson; Elizabeth A Grice
Journal:  Adv Wound Care (New Rochelle)       Date:  2015-01-01       Impact factor: 4.730

Review 5.  Phage display--a powerful technique for immunotherapy: 1. Introduction and potential of therapeutic applications.

Authors:  Justyna Bazan; Ireneusz Całkosiński; Andrzej Gamian
Journal:  Hum Vaccin Immunother       Date:  2012-08-21       Impact factor: 3.452

6.  Identification of antigen-specific human monoclonal antibodies using high-throughput sequencing of the antibody repertoire.

Authors:  Ju Liu; Ruihua Li; Kun Liu; Liangliang Li; Xiaodong Zai; Xiangyang Chi; Ling Fu; Junjie Xu; Wei Chen
Journal:  Biochem Biophys Res Commun       Date:  2016-03-12       Impact factor: 3.575

7.  The EMBL-EBI bioinformatics web and programmatic tools framework.

Authors:  Weizhong Li; Andrew Cowley; Mahmut Uludag; Tamer Gur; Hamish McWilliam; Silvano Squizzato; Young Mi Park; Nicola Buso; Rodrigo Lopez
Journal:  Nucleic Acids Res       Date:  2015-04-06       Impact factor: 16.971

8.  A phosphorylation pattern-recognizing antibody specifically reacts to RNA polymerase II bound to exons.

Authors:  Jungwon Han; Jong-Hyuk Lee; Sunyoung Park; Soomin Yoon; Aerin Yoon; Do B Hwang; Hwa K Lee; Min S Kim; Yujean Lee; Won J Yang; Hong-Duk Youn; Hyori Kim; Junho Chung
Journal:  Exp Mol Med       Date:  2016-11-18       Impact factor: 8.718

Review 9.  The role of phage display in therapeutic antibody discovery.

Authors:  Conrad E Z Chan; Angeline P C Lim; Paul A MacAry; Brendon J Hanson
Journal:  Int Immunol       Date:  2014-08-18       Impact factor: 4.823

Review 10.  Next generation deep sequencing and vaccine design: today and tomorrow.

Authors:  Fabio Luciani; Rowena A Bull; Andrew R Lloyd
Journal:  Trends Biotechnol       Date:  2012-06-20       Impact factor: 19.536

View more
  24 in total

1.  High-throughput retrieval of physical DNA for NGS-identifiable clones in phage display library.

Authors:  Jinsung Noh; Okju Kim; Yushin Jung; Haejun Han; Jung-Eun Kim; Soohyun Kim; Sanghyub Lee; Jaeseong Park; Rae Hyuck Jung; Sang Il Kim; Jaejun Park; Jerome Han; Hyunho Lee; Duck Kyun Yoo; Amos C Lee; Euijin Kwon; Taehoon Ryu; Junho Chung; Sunghoon Kwon
Journal:  MAbs       Date:  2019-02-12       Impact factor: 5.857

Review 2.  Integrating high-throughput screening and sequencing for monoclonal antibody discovery and engineering.

Authors:  Cristina Parola; Daniel Neumeier; Sai T Reddy
Journal:  Immunology       Date:  2017-10-30       Impact factor: 7.397

3.  High-Throughput Generation of In Silico Derived Synthetic Antibodies via One-step Enzymatic DNA Assembly of Fragments.

Authors:  Eugenio Gallo
Journal:  Mol Biotechnol       Date:  2020-02       Impact factor: 2.695

4.  Reconstruction of full antibody sequences in NGS datasets and accurate VL:VH coupling by cluster coordinate matching of non-overlapping reads.

Authors:  Jorge Moura-Sampaio; André F Faustino; Remi Boeuf; Miguel A Antunes; Stefan Ewert; Ana P Batista
Journal:  Comput Struct Biotechnol J       Date:  2022-05-31       Impact factor: 6.155

5.  Identification of New Antibodies Targeting Malignant Plasma Cells for Immunotherapy by Next-Generation Sequencing-Assisted Phage Display.

Authors:  Steffen Krohn; Ammelie Svea Boje; Carina Lynn Gehlert; Sebastian Lutz; Nikos Darzentas; Henrik Knecht; Dietrich Herrmann; Monika Brüggemann; Axel J Scheidig; Katja Weisel; Martin Gramatzki; Matthias Peipp; Katja Klausz
Journal:  Front Immunol       Date:  2022-06-17       Impact factor: 8.786

6.  Multifaceted antibodies development against synthetic α-dystroglycan mucin glycopeptide as promising tools for dystroglycanopathies diagnostic.

Authors:  Thais Canassa-DeLeo; Vanessa Leiria Campo; Lílian Cataldi Rodrigues; Marcelo Fiori Marchiori; Carlos Fuzo; Marcelo Macedo Brigido; Annamaria Sandomenico; Menotti Ruvo; Andrea Queiroz Maranhão; Marcelo Dias-Baruffi
Journal:  Glycoconj J       Date:  2019-12-10       Impact factor: 2.916

Review 7.  Current state of in vivo panning technologies: Designing specificity and affinity into the future of drug targeting.

Authors:  Heather H Gustafson; Audrey Olshefsky; Meilyn Sylvestre; Drew L Sellers; Suzie H Pun
Journal:  Adv Drug Deliv Rev       Date:  2018-06-28       Impact factor: 15.470

8.  CellectSeq: In silico discovery of antibodies targeting integral membrane proteins combining in situ selections and next-generation sequencing.

Authors:  Abdellali Kelil; Eugenio Gallo; Sunandan Banerjee; Jarrett J Adams; Sachdev S Sidhu
Journal:  Commun Biol       Date:  2021-05-12

9.  Pacific Biosciences Sequencing and IMGT/HighV-QUEST Analysis of Full-Length Single Chain Fragment Variable from an In Vivo Selected Phage-Display Combinatorial Library.

Authors:  Audrey Hemadou; Véronique Giudicelli; Melissa Laird Smith; Marie-Paule Lefranc; Patrice Duroux; Sofia Kossida; Cheryl Heiner; N Lance Hepler; John Kuijpers; Alexis Groppi; Jonas Korlach; Philippe Mondon; Florence Ottones; Marie-Josée Jacobin-Valat; Jeanny Laroche-Traineau; Gisèle Clofent-Sanchez
Journal:  Front Immunol       Date:  2017-12-20       Impact factor: 7.561

Review 10.  How B-Cell Receptor Repertoire Sequencing Can Be Enriched with Structural Antibody Data.

Authors:  Aleksandr Kovaltsuk; Konrad Krawczyk; Jacob D Galson; Dominic F Kelly; Charlotte M Deane; Johannes Trück
Journal:  Front Immunol       Date:  2017-12-08       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.