| Literature DB >> 34806047 |
Luis Antonio Rodriguez Carnero1, André Azevedo Reis Teixeira1, Fenny Hui Fen Tang1,2, Andréia Kuramoto3, Maria Júlia Manso Alves1, Walter Colli1, João Carlos Setubal1, Edécio Cunha-Neto3,4,5, Renata Pasqualini2,6, Wadih Arap2,7, Ricardo José Giordano1,5.
Abstract
This protocol describes the genomic phage (gPhage) display platform, a large-scale antigen and epitope mapping technique. We constructed a gPhage display peptide library of a eukaryotic organism, Trypanosoma cruzi (causative agent of Chagas disease), to map the antibody response landscape against the parasite. Here, we used an organism with a relatively large but intronless genome, although future applications could include other prevalent or (re)emerging infectious organisms carrying genomes with a limited number of introns. For complete details on the use and execution of this protocol, please refer to Teixeira et al. (2021).Entities:
Keywords: Antibody; Genomics; High Throughput Screening; Immunology; Microbiology; Model Organisms; Molecular Biology; Molecular/Chemical Probes; Sequence analysis; Sequencing
Mesh:
Substances:
Year: 2021 PMID: 34806047 PMCID: PMC8590080 DOI: 10.1016/j.xpro.2021.100936
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Constructing a gPhage library
(A) To build the gPhage library, genomic DNA is fragmented and cloned into a phagemid vector such as pG8SAET. Escherichia coli is transformed with the vector and subsequently used to produce the gPhage library displaying the epitopes that can be recognized by the corresponding donor-derived and control-derived immunoglobulins.
(B) Detail of the fusion transcript encoded by modified pG8SAET. The construct is formed by a signal peptide, the epitope-encoding DNA and the rpVIII gene. When transformed into bacteria, pG8SAET will produce a fusion rpVIII protein, as described.
(C) After fragmentation and cloning, only 1 in 18 inserts will be in the correct frame (i.e., frame 2) to yield a phage particle displaying a T. cruzi-derived peptide. In the correct frame, the mature constructed protein will be formed by the epitope fused in-frame to the rpVIII. For using the phagemid system, co-infection with a helper phage allows the production of native pVIII protein and the other bacteriophage proteins while the pG8SAET phagemid encodes the rpVIII displaying the epitope. These two pVIII proteins (i.e., native and recombinant) will be packed into a hybrid phage construct.
Figure 2Building and quality control of a Trypanosoma cruzi genomic DNA library
(A) T. cruzi genomic DNA can be fragmented with the COVARIS S2 ultrasonicator. The image shows an agarose gel electrophoresis comparing T. cruzi DNA before and after fragmentation. The obtained fragments were mainly distributed between 100-500 bp.
(B) The efficiency of ligation depends on the vector:insert ratio. In the example with T. cruzi fragmented DNA and pG8SAET, the 1:30 ratio yielded the best results. Error bars represent the standard error of the mean (SEM, N=3)
(C) After a PCR amplification of individual clones with primers flanking the insert, each clone will show a particular product size. When using a molecular ladder to calculate the molecular weight of the products, this value serves to estimate the coverage of the library. As a technical cautionary note (shown), a minor subset of clones will have an empty vector (non-insert), with the same size as pG8SAET.
Ligation reaction
| Reagent | Quantity |
|---|---|
| pG8SAET phagemid digested | 100 ng |
| Blunt-end genomic fragments | 50, 100, 200 or 300 ng |
| 10 | To 1 |
| PEG 50% | To 5% |
| T4 DNA Ligase | 5 U |
Primers used for pG8SAET colony sequencing and PCR
| Primer | Sequence |
|---|---|
| pG8SAET Sequencing Forward Primer | CAGGGGGTATTAATTTGAAAAGG |
| pG8SAET Sequencing Reverse Primer | TATTCGGTCGCTGAGGCTTG |
Recommended sequence of primers for first PCR (pre-indexation)
| Primer | Sequence | ||
|---|---|---|---|
| Partial Illumina index | Degenerated nt | pG8SAET annealing region | |
| Fw-0N | 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG- | -CTGCGCAACACGATGACC-3′ | |
| Fw-1N | 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG- | -N- | -CTGCGCAACACGATGACC-3′ |
| Fw-2N | 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG- | -NN- | -CTGCGCAACACGATGACC-3′ |
| Fw-3N | 5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG- | -NNN- | -CTGCGCAACACGATGACC-3′ |
| Rv-0N | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG- | -CTGCGCAACACGATGACC-3′ | |
| Rv-1N | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG- | -N- | -CTGCGCAACACGATGACC-3′ |
| Rv-2N | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG- | -NN- | -CTGCGCAACACGATGACC-3′ |
| Rv-3N | 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG- | -NNN- | -CTGCGCAACACGATGACC-3′ |
Figure 3Clustering procedure
Representation of the clustering algorithm to be used for antigen identification from third-round phage sequences (obtained by NGS) selected by the patient immunoglobulins (IgG).
Variables that can be customized at the script dna_processing.py
| Variable | Description |
|---|---|
| BLASTN_EXE | path/command to BlastN executable |
| BLASTN_DB | path to BlastN database |
| BLASTP_EXE | path/command to BlastP executable |
| BLASTP_DB | path to BlastP database |
| dict_sample_fastq | sample names and path to respective FastQ to be processed. To add a sample, add a new line with the following code: dict_sample_fastq["Sample Name"] = "path_to_fastq.fastq" |
| cutoff_dna_identity | minimum relative identity to reference genome for a DNA insert to be considered a hit (default is 0.9) |
| cutoff_peptide_similarity | minimum relative similarity to reference proteome for a peptide to be considered a hit (default is 0.6) |
| flank_upstream | sequence found upstream of the DNA insert (default set for pG8SAET sequence) |
| flank_downstream | sequence found downstream of the DNA insert (default set for pG8SAET sequence) |
Output files generated by the script dna_processing.py
| Output file | Description |
|---|---|
| dna_dataframe.xlsx | table containing all DNA inserts identified in all samples, their respective frequencies in each sample and BlastN results against reference genomes. |
| dna_above_cutoff.xlsx | same as dna_dataframe.xlsx, but containing only the inserts that match the reference genomes above the BlastN identity cutoff |
| dna_below_cutoff.xlsx | same as dna_dataframe.xlsx, but containing only the inserts that are below the BlastN identity cutoff (or do not match reference genomes) |
| peptide_dataframe.xlsx | table containing all peptides identified in all samples peptide, including the frequency in each sample and BlastP results against reference proteomes. |
| norm_peptide_df.xlsx | same as peptide_dataframe.xlsx, but with counts normalized by total DNA (relative frequency) |
| peptide_above_cutoff.xlsx | same as peptide_dataframe.xlsx, but containing only the peptides that match reference proteomes above the BlastP similarity cutoff |
| peptide_below_cutoff.xlsx | same as peptide_dataframe.xlsx, but containing only the peptides that are below the BlastP similarity cutoff (or do not match reference proteomes) |
| statistics.xlsx | general processing statistics |
| peptides.txt | A list of all peptides identified by the script. This file can used as input for the clustering.py script |
Variables that can be customized at the script clustering.py
| Variable | Description |
|---|---|
| PEPTIDES_TXT | path to the file containing the peptides to be clustered (text file containing one peptide sequence per line) |
| MAFFT | path/command to MAFFT executable |
| HMMBUILD | path/command to hmmbuild executable |
| HMMEMIT | path/command to hmmemit executable |
| K | K-mer length for peptide search. Only peptides that share a K-mer are compared. Default is 4 amino acids. |
| min_len | minimum length of the peptides used for clustering. Peptides with length below this threshold are excluded. Default is 6 amino acids. |
| score_cutoff | minimum score (% partial identity) for peptides to be clustered together. Default is 80. |
Output files generated by the script clustering.py
| Output file | Description |
|---|---|
| cluster_consensus.xlsx | table containing cluster IDs, number of different peptides forming the clusters, and consensus sequence. |
| peptide_cluster.xlsx | table containing all peptide sequences and corresponding cluster. |
| align folder | folder containing the multiple sequence alignments, consensus sequences, and HMM profiles for each cluster. |
Figure 4A gPhage-based validation assay
(A) Scheme to illustrate the phage binding assay applicable to validate the specific binding of epitope-containing phage to patient immunoglobulins. The number of colonies obtained after plating and incubation can be multiplied by the dilution factor to obtain the Transducing Units (TU, viable phage particles) value. The validation can also be performed by immunoassays with an anti-bacteriophage antibody labeled with HRP or a fluorophore, for example.
(B) Phage binding assay for a specific phage obtained when performing the panning with a pool of severe cardiomyopathy Chagas disease patients. For this binding assay, two pools of sera from patients with severe cardiomyopathy (S1 IgG, S2 IgG) and two pools of control sera (C1 IgG, C2 IgG) were used. Phage particles (CCC phage) show strong binding to immunoglobulins of patients with severe cardiomyopathy in comparison to the corresponding binding to negative control insertless phage particles (pG8SAET).
(C) Relative phage binding value (quotient of the CCC and pG8 binding values in Transducing Units for each point) when performing the binding assay with individual patient serum samples. An index (S8) patient serum sample presents a 200-fold CCC/pG8 quotient, indicating a strong sensitivity when incubating the CCC phage particles with the IgG of this patient. The index patient serum was included in the S1 IgG and S2 IgG pools during the panning selection.
(D) Reactivity of the CCC phage with index patient serial serum dilutions. High-reactivity is observed until 106 dilution. In all cases, error bars represent SEM from biological (N=2) and technical (N=3) replicas.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Rabbit anti-bacteriophage IgG (1:400) | Sigma | Cat#B7786-5μL |
| Goat anti-rabbit IgG IRDye 680 LT (1:1000) | LI-COR Biosciences | Cat#926-68021 |
| Goat Anti-Human IgG, Fc fragment specific | Jackson ImmunoResearch | Cat#109-001-008 |
| Thermo Fisher Scientific | Cat#EC113 | |
| Lucigen | Cat#60502-1 | |
| M13K07 Helper Phage | New England BioLabs | Cat#N0315 S |
| Donor serum samples | Heart Institute, University of Sao Paulo. Approved by the Institutional Review Board at the University of São Paulo Medical School, Brazil. Approval number 0265/10. For more details of the subjects, refer to | N/A |
| Bovine blood | Certified commercial slaughterhouse or laboratory vendor. | N/A |
| Phosphate buffered saline | Thermo Fisher Scientific | Cat#21600010 |
| Sodium dodecyl sulphate | Bio-Rad Laboratories | Cat#1610301 |
| Proteinase K Solution | Thermo Fisher Scientific | Cat#AM2548 |
| Phenol:Chloroform:Isoamyl Alcohol 25:24:1 | Sigma | Cat#P3808 |
| Ethanol Absolute | Merk | Cat#1009831000 |
| UltraPure Tris Buffer | Thermo Fisher Scientific | Cat#15504-020 |
| EDTA | Sigma | Cat#E5134 |
| T4 DNA polymerase | Thermo Fisher Scientific | Cat#EP0062 |
| Tryptone | Thermo Fisher Scientific | Cat#LP0042 |
| Yeast extract | Thermo Fisher Scientific | Cat#LP0021 |
| Dipotassium hydrogen phosphate | Merck | Cat#5101 |
| Monopotassium phosphate | Sigma | Cat#V003710 |
| Disodium phosphate heptahydrate | Synth | Cat#F1031.01.AH |
| LB Broth | Sigma | Cat#L3022-1Kg |
| LB agar | Sigma | Cat#L2897-1KG |
| Triptose broth | Difco | Cat#262200 |
| Carbenicillin | Sigma | Cat#C1389-5G |
| Penicillin | Sigma | Cat#D7794-100MU |
| Streptomycin | Sigma | Cat#S9137-100G |
| CsCl | Sigma | Cat#V000561 |
| Ethidium bromide | Thermo Fisher Scientific | Cat#15585-011 |
| Eco105I | New England BioLabs | Cat#R0130L |
| FastAP | Thermo Fisher Scientific | Cat#EF0651 |
| Low-melting point agarose | Sigma | Cat#A9539-500G |
| T4 DNA ligase | Thermo Fisher Scientific | Cat#EL0011 |
| Polyethylene Glycol 8000 | Amresco | Cat#0159-1Kg |
| Sodium chloride | Merck | Cat#1064041000 |
| Potassium chloride | Synth | Cat#C1060.01.AH |
| Glycerol | Sigma | Cat#G5516 |
| Glucose | Sigma | Cat#S9137-100G |
| Protein G Sepharose 4 Fast Flow | GE Healthcare | Cat#17-0618-02 |
| Bovine Serum Albumin | Sigma | Cat#A2153-1kg |
| Liver Infusion Broth | Difco | Cat#226920 |
| Fetal Calf Serum | Vitrocell | Cat#SOROFETAL500 |
| Carbonate | Merck | Cat#106392 |
| Tween-20 | Sigma | Cat#V001280 |
| Synthetic epitope peptides | Chinese Peptide Company (China) | N/A |
| QIAGEN Plasmid Maxi Kit | Qiagen | Cat#12162 |
| QIAprep Spin Miniprep Kit | Qiagen | Cat#27104 |
| Taq DNA Polymerase (5 U/μL) | Thermo Fisher Scientific | Cat#EP0401 |
| KAPA HiFi HotStart ReadyMix Kit | Roche | Cat#7958935001 |
| QIAquick PCR Purification Kit | Illumina | Cat#FC-131-1024 |
| Nextera XT DNA Library Preparation Kit (24 samples) | Qiagen | Cat#28106 |
| KAPA Library Quantification Kit | Roche | Cat#KR0405 |
| MiSeq Reagent Kit v2 (500-cycles) | Illumina | Cat#MS-102-2003 |
| SIGMAFAST OPD | Sigma | Cat#P9187 |
| Dr. Bianca Zingales, Chemistry Institute, University of São Paulo, Brazil ( | N/A | |
| Illumina sequencing oligonucleotides | Exxtend (Brazil) | N/A |
| pG8SAET | Department of Microbiology Swedish University of Agricultural Sciences Uppsala University, Sweden ( | GenBank: AF130864.1 |
| gPhage library | This article | N/A |
| BLAST | v.2.12.0 | |
| PEAR | v 0.9.10 | |
| MAFFT | v 7.307 | |
| Jalview | v 2.11.0 | |
| MUSCLE | 3.8.31 | |
| Chimera | v 1.11 | |
| XSTREAM | v 1.73 | |
| FuzzyWuzzy | v 0.18 | |
| HMMER | 3.1b2 | |
| Sequence analysis script | This article | |
| Clustering script | This article | |
| 96 Well EIA/RIA High Binding Plate | Corning Incorporated Costar | Cat#3590 |
LIT medium
| Reagents | Final concentration | Amount |
|---|---|---|
| Triptose broth | 5 g/L | 5g |
| Sodium chloride | 4 g/L | 4g |
| Potassium chloride | 0.4 g/L | 0.4 g |
| Disodium phosphate heptahydrate | 15 g/L | 15 g |
| Penicillin G | 0.15 g/L | 0.15 g |
| Glucose | 2 g/L | 2 g |
| Streptomycin | 0.15 g/L | 0.15 g |
| Liver infusion broth | 5 g/L | 5 g |
| Bovine blood | - | 20 mL |
| Cell culture grade, heat inactivated FBS | 10% | 100 mL |
To prepare 1 L of medium. Dissolve all the reagents except the bovine blood in 800 mL of double-distilled water (ddH2O). Incubate at 37oC for 1 h. Cool the solution on ice. Centrifugate 20 mL of bovine blood at 12,000 × g for 10 min at 23°C–25°C and add the supernatant to the medium. Adjust pH to 7.3, complete volume to 900 mL with ddH2O and sterilize it by filtration (0.22 μm). Aliquot and store medium at 4oC. Before use, add 100 mL of heat-inactivated FBS under sterile conditions.
Lysis Buffer
| Reagents | Final concentration | Amount |
|---|---|---|
| Tris-HCl pH 8.0 stock solution | 100 mM | 50 mL |
| Sodium dodecyl sulfate 10% stock solution | 1% | 50 mL |
| Proteinase K Solution (20 mg/mL) | 100 μg/mL | 2.5 mL |
For 500 mL, mix 300 mL of ddH2O with 50 mL of Tris-HCl 1M pH 8.0 stock solution and 50 mL of 10% SDS stock solution. Adjust pH to 8.0 with NaOH or HCl 1M and complete with ddH2O to the final volume. Store at RT. Before use, add proteinase K to a final concentration of 100 μg/mL.
TB: phosphate broth
| Reagents | Final concentration | Amount |
|---|---|---|
| Yeast extract | 24 g/L | 24 g |
| Tryptone | 20 g/L | 20 g |
| Glycerol | 4 mL/L | 4 mL |
| Phosphate buffer (0.17 M KH2PO4, 0.72 M K2HPO4) | 0.017 M KH2PO4 | 100 mL |
Dissolve yeast extract, tryptone and glycerol in 900 mL of ddH2O and sterilize the solution (autoclave). After cooling the admixture, add the phosphate buffer under sterile technique. Store the solution at 23°C–25°C for 6 months.
PEG/NaCl
| Reagents | Final concentration | Amount |
|---|---|---|
| PEG 8000 | 166,7 g/L | 100 g |
| NaCl | 3,3 M | 116.9 g |
To obtain a 600 mL solution, dissolve polyethylene glycol (PEG) 8,000 and NaCl with 450 mL ddH2O. Once dissolved, complete to 600 mL final volume with ddH2O. Sterilize (autoclave) and mix thoroughly while cooling to avoid phase separation. Store solution at 4°C until use. Discard if any precipitation is observed.