| Literature DB >> 34568839 |
Cyrus M Haas1, Irene M Francino-Urdaniz1, Paul J Steiner1, Timothy A Whitehead1.
Abstract
Here, we describe a protocol to identify escape mutants on the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike (S) receptor-binding domain (RBD) using a yeast screen combined with deep mutational scanning. Over 90% of all potential single S RBD escape mutants can be identified for monoclonal antibodies that directly compete with angiotensin-converting enzyme 2 for binding. Six to 10 antibodies can be assessed in parallel. This approach has been shown to determine escape mutants that are consistent with more laborious SARS-CoV-2 pseudoneutralization assays. For complete details on the use and execution of this protocol, please refer to Francino-Urdaniz et al. (2021).Entities:
Keywords: Antibody; Immunology; Microbiology; Molecular Biology; Protein Biochemistry; Sequence analysis; Sequencing
Mesh:
Substances:
Year: 2021 PMID: 34568839 PMCID: PMC8455247 DOI: 10.1016/j.xpro.2021.100869
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
List of plasmids that can be used for yeast display screening
| Plasmid | Characteristics | Antibiotic resistance | AddGene collection # |
|---|---|---|---|
| pJS697 | Contains yeast display backbone | Kan | |
| pJS699 | Contains insert gene encoding Wuhan-Hu-1 S RBD N343Q (333-537) (“WT”) | Kan | |
| pIFU001 | Contains insert gene encoding Wuhan-Hu-1 S RBD N343Q, E484K (333-537) (“E484K”) | Kan | |
| pIFU002 | Contains insert gene encoding Wuhan-Hu-1 S RBD N343Q, N501Y (333-537) (“N501Y”) | Kan | |
| pJS699_L1 | Library of mutations on WT RBD between positions 333 and 437 | Kan | |
| pJS699_L2 | Library of mutations on WT RBD between positions 438 and 537 | Kan | |
| pIFU001_T1 | Library of mutations on E484K RBD between positions 333 and 437 | Kan | |
| pIFU001_T2 | Library of mutations on E484K RBD between positions 438 and 537 | Kan | |
| pIFU002_T1 | Library of mutations on N501Y RBD between positions 333 and 437 | Kan | |
| pIFU002_T2 | Library of mutations on N501Y RBD between positions 438 and 537 | Kan |
Figure 1Preparative steps for the escape mutant protocol
(A) Homologous recombination in yeast. Left to right: the two plasmids (one containing the backbone and the other one the S RBD insert) are digested independently and the segments are collected. The linearized plasmids are co-transformed into yeast. Yeast cells generate a single plasmid through homologous recombination.
(B) S RBD displayed on the yeast surface bound to antibody. A PE fluorescence signal increase is observed when an antibody binds to S RBD. Two independent populations are detected when screening using FACS. The IgGlow/RBD Displaylow population are yeast cells not containing plasmid.
(C) S RBD labeled with biotinylated ACE2.
(D) Antibodies may bind competitively or non-competitively. Incubation of unlabeled antibody followed by co-incubation with biotinylated ACE2 results in loss of ACE2 binding signal only for the competitive inhibition case. This protocol is designed to identify escape mutants for neutralizing antibodies that function by competitive inhibition of ACE2.
Figure 2Sort gates needed for FACS screening of potential S RBD escape mutants
(A) Competitive binding reaction cartoons between neutralizing antibody and ACE2. Top is the control where the displayed S RBD is not labeled, top right shows the SAPE+/FITC+ gate collecting the top 2%. Bottom shows escape mutants that no longer bind to nAb but keep the affinity for ACE2. Bottom right shows the SAPE+/FITC+ gate collecting the top 2%.
(B) Collection gates on FACS. FSC/SSC+ gate for isolation of yeast cells, FSC-H/FSC-A gate to discriminate single cells, an FSC-A/FITC+ gate selects the cells displaying the RBD on their surface and from this last gate, the top 2% by a PE+/FITC+ is collected.
Figure 3Schematic of deep sequencing library preparation
Barcodes and Illumina adapters are added to the DNA library through two rounds of PCR. The amplicon is sequenced using an Illumina MiSeq.
Figure 4An example of the analysis output from the software
The CSV file is first generated, adding the results of statistical calculations. The second output from the analysis module is the Microsoft Excel document containing a heatmap with escape mutant residues identified. In the heatmap, wild type residues and positions are listed in the top two rows of the file and each possible mutation is listed in the left column. Three colors exist in each Excel output file: gray cells indicate that the mutation was not observed in the population; white cells indicate mutations that were observed in the population but did not meet the criteria for escape mutant; and, blue cells represent mutations that meet the necessary criteria to be classified as potential escape mutants for the given antibody.
Figure 5The flow structure of the software
For each module (Deep mutational scanning; Analysis) the actions taken and output are listed.
Descriptions of sections in the configuration file
| Configuration file section | Description |
|---|---|
| Parameters | This section defines the basic parameters for the dms module. |
| Tile:Tx | This section defines a tile that will be analyzed. For every tile in an experiment, a new tile section should be created. To define a tile, change the name after the colon. |
| Samples | This section defines the different samples in the experiment, connecting the tiles with the file names for the paired-end reads. All samples are in this section, no other [Samples] section is necessary to define multiple samples. |
| Experiments | This section defines each experiment by giving it a name and identifying the reference and selected populations. All experiments are in this section, no other [Experiments] section is necessary to define multiple experiments. |
| Proteins | This section combines different tiles that correspond to the same protein. This is so the information across all tiles for a given protein can be consolidated into single concise files. |
| Analysis | This section defines statistical parameters for the analysis module. The information provided here results in the final escape mutant hits identified. |
Descriptions of options for each section in the configuration file
| Configuration file section | Configuration file options | Description |
|---|---|---|
| Parameters | max_mismatches | The maximum number of mismatches allowed when reading overlapping sequences of paired-end reads from FASTQ deep sequencing files. Reads with a higher number of mismatches are discarded. |
| Parameters | min_quality | The minimum sequencing quality score required to keep a read in the analysis. A read with a score lower than this minimum benchmark will be discarded. |
| Parameters | fastq_file_dir | This option allows the user to specify the directory that contains the FASTQ files to be analyzed. The file path should be a folder that contains FASTQ files with the file names provided in the configuration file. An absolute file path should be used. |
| Tile:Tx | wt_seq | The nucleotide DNA sequence of the tile which includes the entire sequence of a read from deep sequencing. This generally includes the primers used. |
| Tile:Tx | first_aa | The residue number of the first amino acid in the protein sequence. |
| Tile:Tx | cds_start | The nucleotide position in a merged read is where the coding sequence starts (after primers or any other miscellaneous nucleotides at the start of the read). Note: this index is given using Python slicing conventions, so it is zero-indexed. For example, the sequence ‘GATC’ is numbered 0123. If the sequence starts at ‘A’, cds_start would be 1. |
| Tile:Tx | cds_end | The nucleotide position in a merged read is where the coding sequence ends (before primers or any other miscellaneous nucleotides at the end of the read). Note: this index is given using Python slicing conventions, and so is the index of the first nucleotide that is not part of the coding sequence. For example, if the CDS is ‘AT’ in ‘GATC’, cds_end would be 3. |
| Tile:Tx | positions | The amino acid positions of interest in the experiment. If the user only desires certain positions in the output files, this is where those positions should be specified. |
| Analysis | control_filepath | The file path, including the file name, to the output file from the initial deep mutational scanning module for the control experiment. Absolute file paths should be used. |
| Analysis | antibody_filepath | The file path, including the file name, to the output from the initial deep mutational scanning module for the desired antibody experiment. Absolute file paths should be used. |
| Analysis | output_title | The prefix to be used in the processed file names at the end of running the analysis module. |
| Analysis | FDR | The target false discovery rate (FDR) for identifying an enrichment ratio threshold. This threshold is then used to determine escape mutant hits. |
| Analysis | significance | The p-value cutoff used to determine if a given enrichment ratio is significantly greater than the threshold value. |
Command line flags that can be used when running dms module
| Command Line flag | Description |
|---|---|
| ‘--use_multiprocessing’ [boolean] | This flag provides an option for the user to use the multiprocessing module or to turn it off. The multiprocessing module reads FASTQ files using multiple cores and can speed up the run time. |
| ‘--fastq_file_dir’ [file path] | This flag allows the user to specify the directory that contains the FASTQ files to be analyzed. The file path should be a folder that contains FASTQ files with the file names provided in the configuration file. |
| ‘--output_dir’ [file path] | This flag allows the user to specify the output directory where all CSV and Microsoft Excel files will be saved when the run is complete. |
| ‘--max_mismatches’ [number] | The maximum number of mismatches allowed when reading overlapping sequences of paired-end reads from FASTQ deep sequencing files. Reads with a higher number of mismatches are discarded. |
| ‘--min_quality’ [number] | The minimum deep sequencing quality score required to keep a read in the analysis. A read with a score lower than this minimum benchmark will be discarded. |
| ‘--min_ref_counts’ [number] | The cutoff used to discard variants that have a low number of counts in the reference population. Variants with reference counts below this cutoff will be discarded. |
| ‘--pseudocount’ [number] | The pseudocount used when a variant contains counts observed in the reference population but not in the selected population. This is required because observing no counts in the selected population while having counts in the reference population will lead to an undefined enrichment ratio. |
Figure 6Statistical approach used to determine potential escape mutants
(A) An example of an enrichment ratio calculation. This calculation uses the total counts in the selected population, , the total counts in the reference population, , and the total counts of the given N501Y mutation in the selected and reference populations, and , respectively.
(B) Demonstrating that the best fit for this data is a kernel density estimate (KDE). The KDE fits very closely to the data, while other common probability distributions, such as the normal and logistic curves, do not.
(C) Showing the final statistical step that considers the depth of coverage when deciding to identify an escape mutation. For mutations with high depth of coverage, the returned p-value will be much smaller than for mutations with lower depths of coverage.
| Reagent | Final concentration | Amount |
|---|---|---|
| pJS697 (pJS697 for library) | n/a | 1 μg (5 μg) |
| 10× CutSmart Buffer | n/a | 5 μL |
| BsaI-HFv2 | n/a | 1 μL |
| Nuclease-free water | n/a | to 50 μL |
| Reagent | Final concentration | Amount |
|---|---|---|
| pJS699 (or pJS699_L1, pJS699_L2) | n/a | 1 μg (5 μg) |
| 10× CutSmart Buffer | n/a | 5 μL |
| NotI-HF | n/a | 1 μL |
| Nuclease-free water | n/a | to 50 μL |
| Reagent | Final concentration | Amount |
|---|---|---|
| EBY100 cells | n/a | 70 μL |
| 50% w/v PEG 3350 | n/a | 240 μL |
| 1M lithium acetate | n/a | 36 μL |
| Salmon sperm DNA | n/a | 10 μL |
| Reagent | Final concentration | Amount |
|---|---|---|
| EBY100 cells | n/a | 210 μL |
| 50% w/v PEG 3350 | n/a | 720 μL |
| 1M lithium acetate | n/a | 108 μL |
| Salmon sperm DNA | n/a | 30 μL |
| Tube 1 | Tube 2 | Tube 3 | |
|---|---|---|---|
| 5μL | 5μL | 5μL | |
| 75nM | - | 75nM | |
| 10μg/mL | 10μg/mL | 10μg/mL | |
| To 50μL total volume | To 50μL total volume | To 50μL total volume | |
| Incubate for 30 min at 20°C–25°C. | |||
| - | - | 75nM | |
| - | - | Incubate for 30 min at 20°C–25°C. | |
For example: In tube 1, if your biotinylated ACE2 concentration was 1 μM, you would add 5 μL of cells, 41.2 μL PBSF, and 3.8 μL antibody.
| Tube 1 | Tube 2 | Tube 3 | |
|---|---|---|---|
| 0.6μL | 0.6μL | 0.6μL | |
| - | 0.25μL | - | |
| 0.25μL | - | 0.25μL | |
| 49.15 μL | 49.15 μL | 49.15 μL |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Anti-c-myc FITC, 0.6 μL per 1e5 cells (80× dilution) | Miltenyi Biotec | Cat# 130-116-485 |
| Goat anti-human IgG Fc secondary antibody, PE, eBiosciences, 0.25 μL per 1e5 cells (200× dilution) | Invitrogen | Cat# 12-4998-82 |
| Thermo Scientific | Cat# C862003 | |
| ACE2-Fc | Institute of Protein Design, Laboratory of Professor Neil King | |
| NotI-HF | NEB | Cat# R3189 |
| BsaI-HFv2 | NEB | Cat# R3733 |
| CutSmart® Buffer | NEB | Cat# B7204S |
| Nuclease-Free Water | IDT | Cat# 11-05-01-14 |
| Ultrapure Agarose | Invitrogen | Cat# 16500-500 |
| TAE Buffer (Tris-acetate-EDTA) (50×) | Thermo Scientific | Cat# B49 |
| SYBR Safe DNA Gel Stain | Invitrogen | Cat# S33102 |
| Gel Loading Dye, Purple | NEB | Cat# B7024 |
| UltraPure™ Salmon Sperm DNA Solution | Invitrogen | Cat# 15632011 |
| Glycerol | MacronTM Chemicals | Cat# 5092-16 |
| HEPES-free acid | Millipore | Cat# 391338 |
| HEPES sodium salt | Amresco | Cat# 0485 |
| PEG 3350 | Spectrum | Cat #P0125 |
| Lithium acetate dihydrate | Sigma-Aldrich | Cat# L6883 |
| PBS - phosphate-buffered saline (10×) pH 7.4 | Invitrogen | Cat# AM9624 |
| Streptavidin phycoerythrin (SAPE) | Invitrogen | Cat# S866 |
| Bovine Serum Albumina (BSA), Fraction V, Fatty Acid-Free | VWR | Cat# 7907-25 |
| EZ-Link NHS-Biotin | Thermo Fisher | Cat# 20217 |
| Zymolyase | Zymo Research | Cat# E1005 |
| Sodium chloride | Sigma-Aldrich | Cat# 746398 |
| Difico yeast nitrogen base without amino acids | Sigma-Aldrich | Cat#Y026 |
| Bacto casamino acids, technical grade | Fisher | Cat# 223120 |
| Sodium phosphate dibasic anhydrous | Fisher Chemical | Cat# BP3321 |
| Sodium phosphate monobasic monohydrate | Fisher Chemical | Cat# S369-500 |
| D-Galactose | Fisher Bioreagents | Cat# BP656-500 |
| Dextrose | Fisher Chemical | Cat# D19212 |
| Peptone | Fisher | Cat# 211677 |
| Yeast extract | Fisher | Cat# 212750 |
| Agar | BD Biosciences | Cat# 214010 |
| Pen/Strep | Fisher | Cat# 15140-122 |
| Kanamycin | GoldBio | Cat# K-120-25 |
| Exonuclease I | NEB | Cat# M0293S |
| Lambda exonuclease | NEB | Cat# M0262S |
| Lambda exonuclease reaction buffer 10× | NEB | Cat# B0262S |
| Q5 HotStart 2× MasterMix | NEB | Cat# M0494L |
| rSAP | NEB | Cat# M0371L |
| 70% v/v Denatured ethanol solution | Fisher Bioreagents | Cat# BP82031GAL |
| IDTE pH 8.0 (1× TE Solution) | IDT | Cat# 11-05-01-13 |
| Quant-iT™ PicoGreen™ dsDNA Reagent | Thermo Scientific | Cat# T7581 |
| Lambda DNA | Thermo Scientific | Cat# SD0011 |
| Monarch PCR & DNA Cleanup Kit | NEB | Cat# T1030 |
| Monarch DNA Gel Extraction Kit | NEB | Cat# T1020 |
| Monarch Plasmid Miniprep Kit | NEB | Cat# T1010 |
| Zymoprep Yeast Plasmid Miniprep II | Zymo Research | Cat# D2004 |
| Agencourt AMPure XP | Beckman Coulter | Cat# A63881 |
| PhiX | Illumina | Cat# FC-110-3001 |
| ATCC | MYA-4941TM | |
| IFU-104 | gttcagagttctacagtccgacgatctggaggaggctctgg | |
| IFU-105 | ccttggcacccgagaattccaccaagctataacgcagcc | |
| IFU-106 | gttcagagttctacagtccgacg | |
| IFU-107 | ccttggcacccgagaattccagccccctttgtttttaaccaa | |
| Forward outer primer | aatgatacggcgaccaccgaga | |
| Reverse outer primer | caagcagaagacggcatacgagat | |
| pJS697 | ||
| pJS699 | ||
| pIFU001 | ||
| pIFU002 | ||
| pJS699_L1 | ||
| pJS699_L2 | ||
| pIFU001_T1 | ||
| pIFU001_T2 | ||
| pIFU002_T1 | ||
| pIFU002_T2 | ||
| Python software (dms and analysis modules) | ||
| Python3 | N/A | |
| Benchling | N/A | |
| −20°C Freezer | VWR | N/A |
| −80°C Freezer | Fisher Scientific, model: REVCO EXF | N/A |
| Pipettes | N/A | N/A |
| Centrifuge (does not need to be refrigerated) | Eppendorf, model: 5810R | Cat# 05-413-113 |
| Microcentrifuge (does not need to be refrigerated) | Fisher, model: accuSpin microcentrifuge 17 | Cat# 13-100-675 |
| Vortex mixer | Thermo Scientific, model: M37165 | Cat# M16710-33Q |
| Static incubator | VWR, model: Gr Con 4CF | Cat# 89511-420 |
| Incubator shaker | Eppendorf, model: New Brunswick I26 Inc Shaker | Cat# M1324-0000 |
| UV-Vis absorbance plate reader | BioTek, model: Synergy H1M | N/A |
| Spectrophotometer | Thermo Fisher Scientific, model: 4001/4 | Cat# 4001 |
| Cell sorter with a 488 nm laser | SONY, model: LE-SH800SAP | N/A |
| Thermal cycler | Eppendorf, model: MastercyclerTM pro | Cat# 950040025 |
| Horizontal Gel Electrophoresis System | Bio-Rad, model: Wide Mini-Sub Cell GT with PowerPac Basic Power Supply | Cat# 1640301 |
Yeast extract-peptone-dextrose (YPD) media
| Reagent | Final concentration | Amount |
|---|---|---|
| Yeast extract | 10 g/L | 10 g |
| Peptone | 20 g/L | 20 g |
| Dextrose | 20 g/L | 20 g |
| ddH2O | n/a | to 1 L |
Filter sterilize using a 0.22 μm filter. Store at 20°C–25°C, protected from light for up to one month.
Yeast storage buffer
| Reagent | Final concentration | Amount |
|---|---|---|
| Glycerol ≥ 99.5% | 20% v/v | 200 mL |
| HEPES | 10 mM | 2.38 g |
| HEPES sodium salt | 10 mM | 2.60 g |
| NaCl | 200 mM | 11.69 g |
| ddH2O | n/a | to 1 L |
Take to pH7.5 with NaOH.
Filter sterilize using a 0.22 μm filter and store at 20°C–25°C for up to one year.
SCAA media
| Reagent | Final concentration | Amount |
|---|---|---|
| Difico yeast nitrogen base | 6.7 g/L | 3.35 g |
| Bacto casamino acids | 5 g/L | 2.5 g |
| NaH2PO4·H2O | 62 mM | 4.28 g |
| Na2HPO4 | 38 mM | 2.7 g |
| ddH2O | n/a | to 450 mL |
Filter sterilize using a 0.22 μm filter and store at 20°C–25°C for up to six months.
20% w/v dextrose
| Reagent | Final concentration | Amount |
|---|---|---|
| Dextrose | 20% w/v | 100 g |
| ddH2O | n/a | to 500 mL |
Filter sterilize using a 0.22μm filter and store at 20°C–25°C for up to six months.
20% w/v galactose
| Reagent | Final concentration | Amount |
|---|---|---|
| Galactose | 20% w/v | 100 g |
| ddH2O | n/a | to 500 mL |
Filter sterilize using a 0.22μm filter and store at 20°C–25°C for up to six months.
SDCAA+
| Reagent | Final concentration | Amount |
|---|---|---|
| 20% w/v dextrose | 20g/L | 10 mL |
| SCAA | n/a | 90 mL |
| 100× Pen/Strep | 1× | 1 mL |
| 50 mg/mL Kanamycin | 50 μg/mL | 100 μL |
Make on the same day of use and keep at 4°C.
SGCAA+
| Reagent | Final concentration | Amount |
|---|---|---|
| 20% w/v galactose | 20 g/L | 10 mL |
| SCAA | n/a | 90 mL |
| 100× Pen/Strep | 1× | 1 mL |
| 50 mg/mL Kanamycin | 50 μg/mL | 100 μL |
Make on the same day of use and keep at 4°C.
Solution A
| Reagent | Final concentration | Amount |
|---|---|---|
| NaH2PO4·H2O | 62 mM | 4.28 g |
| Na2HPO4 | 38 mM | 2.7 g |
| Agar | 15 g/L | 7.5 g |
| ddH2O | n/a | to 450 mL |
Autoclave 20 min at 121°C and 0.5 bar gauge.
Solution B
| Reagent | Final concentration | Amount |
|---|---|---|
| Difico yeast nitrogen base | 6.7 g/L | 3.35 g |
| Bacto casamino acids | 5 g/L | 2.5 g |
| Dextrose | 20% w/v | 10 g |
| ddH2O | n/a | to 50 mL |
Filter sterilize using a 0.22μm filter.
PBSF
| Reagent | Final concentration | Amount |
|---|---|---|
| bovine serum albumin frac V | 1 g/L | 100 mg |
| PBS | n/a | 100 mL |
Filter sterilize using a 0.22μm filter and store at 4°C for no longer than 3 days.
Fluorescent labeling mix
| Reagent | Final concentration | Amount |
|---|---|---|
| anti-c-myc FITC | n/a | 0.60 μL |
| SAPE | n/a | 0.25 μL |
| PBSF | n/a | 49.15 μL |
Make on same day of use and store on ice, protected from light. We recommend making a master mix, adding an extra 10% for each reaction.
| Tube 1 | Tube 2 | Tube 3 | |
|---|---|---|---|
| 5μL | 5μL | - | |
| - | - | 5μL | |
| - | 75nM | - | |
| - | - | 10μg/mL | |
| To 50μL total volume | To 50μL total volume | To 50μL total volume | |
| Incubate for 30 min at 20°C–25°C. | |||
| - | - | 75nM | |
| - | - | Incubate for 30 min at 20°C–25°C. | |
| Reagent | Final concentration | Amount |
|---|---|---|
| Lambda buffer 10× | n/a | 2 μL |
| Miniprep DNA | n/a | 15 μL |
| Exonuclease I | n/a | 2 μL |
| Lambda exonuclease | n/a | 1 μL |
| Name | Description | Sequence | Section of RBD amplification | Library compatibility |
|---|---|---|---|---|
| IFU-104 | L1_Inner_FWD | gttcagagttctacagtccgacgatctggaggaggctctgg | 333-437 | pJS699_L1, pIFU001_T1 and pIFU002_T1 |
| IFU-105 | L1_Inner_REV | ccttggcacccgagaattccaccaagctataacgcagcc | ||
| IFU-106 | L2_Inner_FWD | gttcagagttctacagtccg | 438-537 | pJS699_L2, pIFU001_T2 and pIFU002_T2 |
| IFU-107 | L2_Inner_REV | ccttggcacccgagaattcca |
| Reagent | Final concentration | Amount |
|---|---|---|
| PCR Products | n/a | 50 μL |
| Exonuclease I | n/a | 5 μL |
| rSAP | n/a | 10 μL |
| Description | Sequence |
|---|---|
| Illumina_FWD (RP1) | aatgatacggcgaccaccgagatcta |
| Illumina_REV | caagcagaagacggcatacgagat |
| Reagent | Final concentration | Amount |
|---|---|---|
| H2O | n/a | 18 μL |
| DNA (from cleaning step) | n/a | 2 μL |
| Fwd outer primer (10μM) | n/a | 2.5 μL |
| Rev outer primer (10μM) | n/a | 2.5 μL |
| 2× Q5 MM | n/a | 25 μL |