Coronavirus disease 2019 (COVID-19) is a global health crisis caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and there is a critical need to produce large quantities of high-quality SARS-CoV-2 Spike (S) protein for use in both clinical and basic science settings. To address this need, we have evaluated the expression and purification of two previously reported S protein constructs in Expi293F and ExpiCHO-S cells, two different cell lines selected for increased protein expression. We show that ExpiCHO-S cells produce enhanced yields of both SARS-CoV-2 S proteins. Biochemical, biophysical, and structural (cryo-EM) characterizations of the SARS-CoV-2 S proteins produced in both cell lines demonstrate that the reported purification strategy yields high-quality S protein (nonaggregated, uniform material with appropriate biochemical and biophysical properties), and analysis of 20 deposited S protein cryo-EM structures reveals conformation plasticity in the region composed of amino acids 614-642 and 828-854. Importantly, we show that multiple preparations of these two recombinant S proteins from either cell line exhibit identical behavior in two different serology assays. We also evaluate the specificity of S protein-mediated host cell binding by examining interactions with proposed binding partners in the human secretome and report no novel binding partners and notably fail to validate the Spike:CD147 interaction. In addition, the antigenicity of these proteins is demonstrated by standard ELISAs and in a flexible protein microarray format. Collectively, we establish an array of metrics for ensuring the production of high-quality S protein to support clinical, biological, biochemical, structural, and mechanistic studies to combat the global pandemic caused by SARS-CoV-2.
Coronavirus disease 2019 (COVID-19) is a global health crisis caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and there is a critical need to produce large quantities of high-quality SARS-CoV-2Spike (S) protein for use in both clinical and basic science settings. To address thisneed, we haveevaluated theexpression and purification of two previously reported S protein constructs in Expi293F and ExpiCHO-S cells, two different cell lines selected for increased protein expression. We show that ExpiCHO-S cells produceenhanced yields of both SARS-CoV-2S proteins. Biochemical, biophysical, and structural (cryo-EM) characterizations of theSARS-CoV-2S proteins produced in both cell lines demonstrate that the reported purification strategy yields high-quality S protein (nonaggregated, uniform material with appropriate biochemical and biophysical properties), and analysis of 20 deposited S protein cryo-EM structures reveals conformation plasticity in the region composed of amino acids 614-642 and 828-854. Importantly, we show that multiple preparations of these two recombinant S proteins from either cell lineexhibit identical behavior in two different serology assays. We also evaluate the specificity of S protein-mediated host cell binding by examining interactions with proposed binding partners in thehuman secretome and report no novel binding partners and notably fail to validate theSpike:CD147 interaction. In addition, the antigenicity of these proteins is demonstrated by standard ELISAs and in a flexible protein microarray format. Collectively, weestablish an array of metrics for ensuring the production of high-quality S protein to support clinical, biological, biochemical, structural, and mechanistic studies to combat the global pandemic caused by SARS-CoV-2.
Most humancoronavirusinfections are associated with mild symptoms,
but in the last two decades, threebeta coronaviruses, SARS-CoV, MERS,
and SARS-CoV-2, haveemerged that are able to infect humans and cause
severe disease.[1,2] The current pandemic of coronavirus
disease 19 (COVID-19) is caused by severe acute respiratory syndromecoronavirus 2 (SARS-CoV-2),[3] an enveloped
virus from theCoronaviridae family with a single
positively stranded RNA genome.[3] This RNA
virus, which likely originated in bats, has several structural components,
including Spike (S), Envelope (E), Membrane (M), and Nucleocapsid
(N) proteins.[2]TheS protein is a
class I viral fusion protein, which consists
of two subunits (S1 and S2) and forms a trimer on the viral membrane.[4] The S1 subunit contains the receptor binding
domain (RBD), which is responsible for host cell receptor binding,
while the S2 subunit facilitates membrane fusion between the viral
and host cell membranes.[4−7] Host cell proteases areessential for activating
theS protein for Coronaviridae cellular entry.[8] TheS protein in many Coronaviridae, including SARS-CoV-2, is cleaved between the S1 and S2 subunits
(S1/S2 cleavage site) and at an additional site present in S2 (S2’
cleavage site).[8−10] Similar to SARS-CoV, theSARS-CoV-2 trimeric S glycoprotein
mediates viral entry into the cell by utilizing angiotensin converting
enzyme 2 (ACE2) as a human cell surfaceentry receptor.[8] Processing of both SARS-CoV and SARS-CoV-2 S
proteins is dependent on theendosomal cysteine proteases cathepsin
B and cathepsin L and theserine proteaseTMPRSS2.[8] In many coronaviruses, theseevents lead to conformational
rearrangements in S2, which ultimately result in the fusion of the
host and viral cell membranes, and delivery of the viral genome into
thenewly infected cell.[11,12]Due to the global
COVID-19 pandemic, theSARS-CoV-2 S ectodomain
protein has become an important target for clinical, biological, and
structural investigations, and future studies will require theefficient
and streamlined production of this protein. Clinically, the production
of large amounts of S ectodomain protein enables testing of individuals
for SARS-CoV-2 seropositivity. Serological testing is important for
determining individuals who have beenexposed to the virus, and the
resulting antibody titer can facilitate identification of potential
donors of convalescent plasma.[13] Additionally,
theS proteinectodomain could be used to identify therapeutic monoclonal
antibodies (mAbs) through single B cell cloning from convalescent
patients. Furthermore, the development of small molecules and protein
therapeutics designed to inhibit viral infection by targeting theS proteinneeds to be tested and biochemically characterized. Biologically,
the complete mechanisms of viral host cell fusion and replication
remain to be completely understood. Structural studies will continue
to support these ongoing clinical, therapeutic, and biological investigations.To provide support for these investigations, weexamined theexpression
and purification of two recently reported recombinant versions of
theS protein (here termed OptSpike1 and OptSpike2, see Figure A). OptSpike1 was initially
reported by McLellan and co-workers and used to determine the cryo-EM
structure of theSpike protein in the prefusion conformation.[7] This construct has additionally been utilized
as an antigen for clinical ELISAs at Montefiore Medical Center (Bronx,
NY).[14] OptSpike2 was described by Krammer
and co-workers and has been successfully used to conduct serum ELISAs
to test patients for the presence of anti-S antibodies at Mount Sinai
Hospital (New York, NY).[15] The successful
use of both of these recombinant forms of theS protein in clinical
(OptSpike1 and OptSpike2) and structural (OptSpike1) applications
has made them attractive targets for futureCOVID-19 studies. Both
constructs are cloned into themammalian expression vector pCAGGS
and includeeither the majority of theectodomain (OptSpike1: AAs
1–1208) or the full-length ectodomain (OptSpike2: 1–1213)
of theSARS-CoV-2S protein (based on the Wuhan-Hu-1 sequence).[16] Both constructs include theK986P and K987P
stabilizing mutations and use a T4 Foldon motif (T4) to enhance trimerization.[17,18] Both constructs lack thefurin cleavage site: OptSpike1 contains
the mutation RRAR:GSAS, while OptSpike2 contains the mutation RRAR:A.[19] OptSpike1 contains the T4-HRV3C protease cleavage
sequence-8x HisTag and a TwinStrepTag at the C-terminus, while OptSpike2
contains a Thrombin cleavage sequence-T4-6x HisTag at the C-terminus.
Figure 1
Expression
and purification of the SARS-CoV-2 S protein. (A) Schematic
showing the characteristics of OptSpike1 (upper) and OptSpike2 (lower).
(B) Average yields of OptSpike1 and OptSpike2 produced in either Expi293FTM
or ExpiCHO-STM cells harvested on either day 6 (Expi293FTM) or day
12 (ExpiCHO-STM) and purified using nickel affinity chromatography
and size exclusion chromatography. (C) SDS-PAGE showing apparent molecular
mass and purity of OptSpike1 and OptSpike2 purifications. Lanes 1–4:
3 μg of OptSpike1-ExpiCHO-STM, OptSpike1-Expi293FTM, OptSpike2-ExpiCHO-STM,
OptSpike2-Expi293FTM in the presence of 100 μM DTT, Lanes 5–8:
same as 1–4 without DTT. (D,E) Representative HiLoadTM 16/600
SuperdexTM 200 purification of (D) OptSpike1-Expi293FTM and (E) OptSpike2-Expi293FTM.
(F) OptSpike1-ExpiCHO-STM and (G) OptSpike2-CHO after nickel affinity
purification (the second peak represents imidazole). (H) Yield from
1 mL crude nickel purification of OptSpike1 or OptSpike2 on indicated
day post transfection in Expi293F cells. (I) Yield from 1 mL crude
nickel purification of OptSpike1 or OptSpike2 on indicated day post
transfection in ExpiCHO-STM cells.
Expression
and purification of theSARS-CoV-2S protein. (A) Schematic
showing the characteristics of OptSpike1 (upper) and OptSpike2 (lower).
(B) Average yields of OptSpike1 and OptSpike2 produced in either Expi293FTM
or ExpiCHO-STM cells harvested on either day 6 (Expi293FTM) or day
12 (ExpiCHO-STM) and purified using nickel affinity chromatography
and sizeexclusion chromatography. (C) SDS-PAGE showing apparent molecular
mass and purity of OptSpike1 and OptSpike2 purifications. Lanes 1–4:
3 μg of OptSpike1-ExpiCHO-STM, OptSpike1-Expi293FTM, OptSpike2-ExpiCHO-STM,
OptSpike2-Expi293FTM in the presence of 100 μM DTT, Lanes 5–8:
same as 1–4 without DTT. (D,E) Representative HiLoadTM 16/600
SuperdexTM 200 purification of (D) OptSpike1-Expi293FTM and (E) OptSpike2-Expi293FTM.
(F) OptSpike1-ExpiCHO-STM and (G) OptSpike2-CHO after nickel affinity
purification (the second peak represents imidazole). (H) Yield from
1 mL crudenickel purification of OptSpike1 or OptSpike2 on indicated
day post transfection in Expi293F cells. (I) Yield from 1 mL crudenickel purification of OptSpike1 or OptSpike2 on indicated day post
transfection in ExpiCHO-STM cells.Here, weexamine theexpression of OptSpike1 and OptSpike2 in Expi293F
and ExpiCHO-Sexpression systems[20] and
provide detailed protein production protocols, including a Standard
Operating Procedure (SOP) in the Supporting Information Appendix 1. We thenevaluate the biochemical and biophysical
properties of these proteins to define standards of protein quality
and evaluate the recognition specificity of theS protein within thehuman secretome. Furthermore, we demonstrate the functional reproducibility
of recombinant S protein in serum ELISAs and develop a multiantigenCOVID-19 protein microarray that can simultaneously test for antibodies
against multiple antigens, including the full-length S protein, the
receptor binding domain (RBD) of theS protein, and the full-length
N protein. Finally, we determine the 3.22 Å cryo-EM structure
of OptSpike1 produced in ExpiCHO-S cells and the 3.44 Å cryo-EM
structure of OptSpike1 produced in Expi293F cells, which arenearly
indistinguishable and in agreement with previously described structures.[7,21]This work provides a comprehensive strategy, with a widerange
of guiding standards and metrics, for production of large quantities
of high-quality recombinant S protein for use in future clinical,
diagnostic, biological, biochemical, structural, and mechanistic studies
that will beneeded to combat the global pandemic caused by SARS-CoV-2.
Results
Enhanced
Expression and Purification of SARS-CoV-2 S Proteins
To determine
conditions for theenhanced expression and purification
of theSARS-CoV-2S protein, we tested theexpression of two recently
reported recombinant variants (OptSpike1 and OptSpike2, Figure A) in both theExpi293F and
ExpiCHO-S cells.[20] TheExpiCHO-Sexpression
system reproducibly afforded the highest yield per liter (∼28
mg/L) of transfected cells for both versions of theS protein (Figure B). OptSpike1 and
OptSpike2 were readily produced at a high level of purity, via transient
transfection of both Expi293F and ExpiCHO-S cells, followed by purification
with nickel affinity and sizeexclusion chromatography (SEC) (Figure C). SEC revealed
that both S proteins ran at the appropriate size on a HiLoad 16/600
Superdex 200 gel filtration column (Figure D–G), without the presence of significant
aggregates. To identify optimal culture growth times, a time course
was performed for both constructs in both expression systems, and
the optimal day for harvest was found to be day 6 in Expi293F cells
and day 12 in ExpiCHO-S cells (Figure H,I). Additionally, theExpiCHO-S standard titer and
high titer protocols (Figure S1H) were
compared. Collectively, these data reveal that theExpiCHO-Sexpression
system is moreefficient than theExpi293Fexpression system for the
production of recombinant Spike protein. To provide further information
about the stability, quality, and aggregation state of OptSpike1 and
OptSpike2, their biophysical properties were characterized by a number
of approaches.
PNGase F Digestion of Recombinant S Protein
Produced in Expi293F
or ExpiCHO-S Cells
OptSpike1 and OptSpike2 proteins migrated
as slightly larger species on SDS-PAGE whenexpressed in theExpi293F
cells than in theExpiCHO-S system (Figure A). We hypothesized that this difference
in size of the proteins was due to differential N-linked glycosylation
associated with Expi293F and ExpiCHO-S cells. Heat-denatured OptSpike1
was digested with PNGase F, an amidase that cleaves between the innermost
GlcNAc and asparagine residues in N-linked glycoproteins, and analyzed
by SDS-PAGE. While undigested OptSpike1 produced in Expi293F cells
runs larger than OptSpike1 produced in ExpiCHO-S, the difference in
apparent molecular weight is abolished after PNGase F treatment (Figure A). It is known that
recombinant S proteins are heavily glycosylated,[22] and these results confirm that these proteins exist as
distinct N-linked glycoforms whenexpressed in different mammalianexpression systems, as has been previously reported with a number
of other glycoproteins.[23−25] Importantly, these differences
in glycosylation had no effect on reactivity with sera from convalescent
patients (see Figure B).
Figure 2
Biophysical characterization of recombinant SARS-CoV-2 S protein
indicates that it is stable, uniform, and has appropriate molecular
mass. (A) SDS-PAGE analysis of OptSpike1-Expi293F (lane 1) or OptSpike1-ExpiCHO-S
(lane 2) compared to the same protein denatured and treated with PNGase
F for 60 min (lanes 3 and 4, respectively), the lower MW band is PNGase
F. (B) Representative SEC traces of OptSpike1-Expi293F analyzed on
a Yarra 3 μm SEC-4000 LC column on day 1 post purification (black
– left Y axis) and day 14 post purification
(blue – right Y axis). (C) Representative
SEC traces of OptSpike1-ExpiCHO-S analyzed on a Yarra 3 μm SEC-4000
LC column on day 1 post purification (black – left Y axis) and day 14 post purification (blue – right Y axis). (D) Representative MALS analysis of OptSpike1-ExpiCHO-S
day 1 (red curve: light scattering, green curve: UV280, blue curve:
refractive index, black line: Mw). (E)
Representative MALS analysis of OptSpike1- ExpiCHO-S day 14. (F) Molecular
mass of OptSpike1-ExpiCHO-S and PDI (Mw:Mn) determined by MALS (the right peak
in panel (E)). (G) Results of SYPRO Orange thermal denaturation of
OptSpike1-ExpiCHO-S and OptSpike2-ExpiCHO-S showing a graph of first
derivative vs temperature and a table showing Tm1 (left peak) and
Tm2 (right peak) for each, all experiments are representative of three
individual replicates. (H) The c(s) distribution obtained by AUC analysis of OptSpike1-ExpiCHO-S at
4.8 μM total protein concentration. The asterisk denotes the
S protein trimeric species. The best fit molecular mass resolved for
the trimer is 455 kDa. The best fit ratio of f/fo is 1.63.
Figure 4
Highly reproducible data
in COVID-19 convalescent serum ELISA experiments
using OptSpike1 and OptSpike2. (A) Five different production batches
of OptSpike1-ExpiCHO-STM were tested by ELISA for the detection of
anti-S IgG antibodies from convalescent COVID-19 patients. Low batch-to-batch
variability in OptSpike1 was observed, with no statistically significant
differences in EC50 values between batches (P > .9999). (B) Different S protein constructs OptSpike1-ExpiCHO-STM,
OptSpike1Expi293FTM, and OptSpike2-ExpiCHO-STM were tested by ELISA
for the detection of anti-S IgG antibodies from confirmed COVID-19
convalescent patients. Very little variability was seen between different
constructs and expression cell lines used, with no statistically significant
differences in EC50 values (P > .9999).
Biophysical characterization of recombinant SARS-CoV-2S protein
indicates that it is stable, uniform, and has appropriate molecular
mass. (A) SDS-PAGE analysis of OptSpike1-Expi293F (lane 1) or OptSpike1-ExpiCHO-S
(lane 2) compared to the same protein denatured and treated with PNGase
F for 60 min (lanes 3 and 4, respectively), the lower MW band is PNGase
F. (B) Representative SEC traces of OptSpike1-Expi293F analyzed on
a Yarra 3 μm SEC-4000 LC column on day 1 post purification (black
– left Y axis) and day 14 post purification
(blue – right Y axis). (C) Representative
SEC traces of OptSpike1-ExpiCHO-S analyzed on a Yarra 3 μm SEC-4000
LC column on day 1 post purification (black – left Y axis) and day 14 post purification (blue – right Y axis). (D) RepresentativeMALS analysis of OptSpike1-ExpiCHO-S
day 1 (red curve: light scattering, green curve: UV280, blue curve:
refractive index, black line: Mw). (E)
RepresentativeMALS analysis of OptSpike1- ExpiCHO-S day 14. (F) Molecular
mass of OptSpike1-ExpiCHO-S and PDI (Mw:Mn) determined by MALS (the right peak
in panel (E)). (G) Results of SYPRO Orange thermal denaturation of
OptSpike1-ExpiCHO-S and OptSpike2-ExpiCHO-S showing a graph of first
derivative vs temperature and a table showing Tm1 (left peak) and
Tm2 (right peak) for each, all experiments are representative of three
individual replicates. (H) The c(s) distribution obtained by AUC analysis of OptSpike1-ExpiCHO-S at
4.8 μM total protein concentration. The asterisk denotes theS protein trimeric species. The best fit molecular mass resolved for
the trimer is 455 kDa. The best fit ratio of f/fo is 1.63.
Size Exclusion Chromatography and Multiangle Light Scattering
(SEC-MALS) Analysis of OptSpike1 and OptSpike2
The purified,
concentrated OptSpike1 and OptSpike2 proteins were analyzed by FPLC
on a Superose 6 Increase 10/300 GL column (Figure S1A–D) and by HPLC on a Yarra 3 μm SEC-4000 LC
column (Figure B,C, Figure S1E,F), both of which are appropriate
for resolving proteins in therange of, and greater than, the predicted
molecular mass (based on the amino acid sequence) of ∼420 kDa
of trimeric OptSpike1 and OptSpike2. On theSuperose 6 Increase 10/300
GL column, OptSpike1 and Optspike2eluted as a single peak within
the included volume, with an apparent molecular mass of ∼670
kDa, and did not form aggregates (Figure S1A–D). Interestingly, analysis on the Yarra 3 μm SEC-4000 LC column
showed that both OptSpike1 and OptSpike2, expressed in either Expi293F
or ExpiCHO-S cells, eluted as two partially overlapping peaks (elution
times of 10.35 and 11.00 mL) (Figure B,C, Figure S1E,F), which
could not be resolved on theSuperose 6 Increase 10/300 GL column
(Figure S1A–D). Furthermore, after
storage at 4 °C for 14 days, the distribution of theelution
volume shifted entirely to the faster migrating peak at 10.35 mL (Figure B,C). The OptSpike1
produced in ExpiCHO-S cells was analyzed with multiangle light scattering
(MALS) 1 day after purification, and the polydispersity index (PDI)
for species present across both peaks was 1.006, indicating that the
species across both peaks are uniform with respect to molecular mass
(Figure D,F). The
molecular mass of this species was calculated to be 486 kDa ±
2.1% using MALS. This measurement is larger than the predicated molecular
mass of 420 kDa (based on the amino acid sequence of trimeric OptSpike1
and not accounting for glycans). The PDI of OptSpike1 on day 14 (which
ran as one peak) was determined to be 1.005, which also indicated
that this peak is uniform, and the molecular mass from MALS was 493
kDa ± 1.08% (FigureE,F). OptSpike-2-Expi293F and OptSpike2-ExpiCHO-Sexhibited
a similar two-peak profile when analyzed on the Yarra 3 μm SEC-4000
LC column (Figure S1E,F) Together, these
data indicate theexistence of closely related species that likely
arise from conformational conversion and not from changes in the oligomeric
state (see below).
Thermal Denaturation of OptSpike1 and OptSpike2
Differential
scanning fluorimetry (DSF) was used to assess the relative thermal
stabilities of OptSpike1 and OptSpike2 produced in ExpiCHO-S cells.
Analysis of DSF data revealed no discernible difference in melting
behavior, with both OptSpike1 and 2exhibiting two similar melting
transitions (OptSpike1 Tm1 = 49.2 °
C, Tm2 = 63.4 ° C and OptSpike2 Tm1 = 48.7 ° C, Tm2 = 63.9 ° C) (Figure G). This consistent behavior suggests that differences in
expression yields of OptSpike1 and Optspike2 are likely not due to
inherent differences in protein stability.
Analytical Ultracentrifugation
(AUC) Analysis of OptSpike1
To confirm that purified OptSpike1-ExpiCHO-S
is a stable trimer
and further evaluate the molecular mass, we conducted a series of
sedimentation velocity experiments as a function of protein concentration.
Six concentrations of S protein were centrifuged spanning a range
from 136 nM to 4.8 μM. The distribution of sedimentation coefficients g(s*) and in particular the maxima of the
main peak are invariant over the concentration rangeexamined (Figure S1H). This behavior demonstrates that
the protein molecules present in the preparation are stable noninteracting
particles. The distribution of species present in a solution of noninteracting
particles can be deconvolved using a continuous distribution, c(s), analysis (Figure H, Figure S1H).
The dominant peak (68%) in the c(s) distribution has a mass of 455 kDa, consistent with theS protein
trimer being the dominant protein present in the solution. This calculated
molecular mass is comparable to both the predicted molecular mass
based on the amino acid sequence (420 kDa) and the molecular mass
determined by MALS (490 kDa). The best fit ratio of f/fo is 1.63, indicative of significant
geometric asymmetry. These data demonstrate that OptSpike1 is a stable
trimer in solution from nM to μM protein concentrations.
Biochemical
Characterization of OptSpike1 Interaction with Human
ACE2
Purified OptSpike1-ExpiCHO-S protein activity was demonstrated
by binding to humanACE2 (hACE2) but not mouseACE2 (mACE2) or humanCD26 (hCD26), theentry receptor for MERS. A 500 nM OptSpike1 trimer
was incubated with FreeStyle293-F cells expressing hACE2, mACE2,
or hCD26—all C-terminally tagged with GFP—and binding
was detected by flow cytometry with an anti-His PE labeled antibody
recognizing theHis8 tag on theS protein (see Figure S3A). Theexpression and cell surface
localization of hACE2, mACE2, and hCD26 were confirmed by antibody
staining (Figure S3E,F,K). Strong binding
to cells expressing hACE2 (Figure A,C) was observed, but not to cells expressing mACE2
or hCD26 (Figure B,C),
confirming that theS protein does not bind to CD26 (in agreement
with a previous report[21]) and confirming
specificity of theS protein for hACE2. Titration of hACE2-expression
FreeStyle293-F cells with recombinant OptSpike1 (0.0025–400
nM) yielded an EC50 of 2.0 ± 0.6 nM (Figure D).
Figure 3
SARS-CoV-2 S protein
binds to human ACE2 but not mouse ACE2 or
human CD26. (A) Representative flow plot of data quantified in panel
(C) showing that OptSpike1-ExpiCHOSTM binds to FreeStyleTM 293-F cells
expressing hACE2, binding was detected using an antibody against the
8x-HIS tag on OptSpike1. (B) Representative flow plot of data quantified
in panel (C) showing that OptSpike1-CHO does not bind to FreeStyleTM
293-F cells expressing mACE2, binding was detected using an antibody
against the 8x-HIS tag on OptSpike1. (C) 500 nM OptSpike1-CHO was
incubated with cells expressing human ACE2, mouse ACE2, or human CD26,
binding was detected with an anti-HIS antibody, and data was acquired
by flow cytometry, n = 4. (D) OptSpike1-ExpiCHO-STM
was titrated on FreeStyleTM 293-F cells expressing human ACE2 from
.0025–.400 nM, binding was quantified with an anti-HIS antibody
using flow cytometry, and a binding curve was fit in GraphPad using
the equation Y=Bmax*Xh/(Kdh+Xh), n = 4.
SARS-CoV-2S protein
binds to humanACE2 but not mouseACE2 or
humanCD26. (A) Representative flow plot of data quantified in panel
(C) showing that OptSpike1-ExpiCHOSTM binds to FreeStyleTM 293-F cells
expressing hACE2, binding was detected using an antibody against the
8x-HIS tag on OptSpike1. (B) Representative flow plot of data quantified
in panel (C) showing that OptSpike1-CHO does not bind to FreeStyleTM
293-F cells expressing mACE2, binding was detected using an antibody
against the 8x-HIS tag on OptSpike1. (C) 500 nM OptSpike1-CHO was
incubated with cells expressing humanACE2, mouseACE2, or humanCD26,
binding was detected with an anti-HIS antibody, and data was acquired
by flow cytometry, n = 4. (D) OptSpike1-ExpiCHO-STM
was titrated on FreeStyleTM 293-F cells expressing humanACE2 from
.0025–.400 nM, binding was quantified with an anti-HIS antibody
using flow cytometry, and a binding curve was fit in GraphPad using
theequation Y=Bmax*Xh/(Kdh+Xh), n = 4.
Screening OptSpike1 for
Binding to Members of the Human Secretome
To further evaluate
the specificity of theSARS-CoV-2S protein,
OptSpike1-ExpiCHO-S was screened for binding to 900 members (around
∼20%) of thehuman secretome (from the Ig superfamily, TNFR
superfamily, Integrin family, chemokine family, and GPCR family –
see Table S1). Each member of this library
(tagged with cytosolic GFP to confirm expression – see Table S1 and Figure S3) was individually transfected into FreeStyle293-F cells. Individual
transfections were then incubated with 200 nM OptSpike1 in the 96-well
plate format, and binding was analyzed by flow cytometry with an anti-HIS
PE labeled antibody as above. hACE2-GFP expressing cells were included
as a positive control on each plate. Integrins were screened both
as individual transfections and as alpha-beta pairs. While strong
binding to each replicate of hACE2 was detected, we did not detect
binding to any other members of the library (Figure S3, Table S1), including proteins
suggested to be targets of S protein binding, including CD147[26,27] and Siglec 9 and 10,[28] as well as the
host cell receptors for other coronaviruses, including Ceacam1, which
is the receptor for Murine Coronavirus[29] and CD26 and Ceacam5, which are both receptors for MERS.[30] Expression and cell surface presentation of
these putative receptors were validated using monoclonal antibodies
(Figure S3I–L).
OptSpike1 and
OptSpike2 Can Be Used Reproducibly and Interchangeably
in COVID-19 Convalescent Serum ELISA
Enzyme-linked immunosorbent
assays (ELISAs) are commonly used in clinical settings to detect the
presence of viral antibodies. Therefore, we assessed the reactivity
of serum antibodies from oneCOVID-19 convalescent patient toward
multiple independent preparations of the S ectodomain proteins (Figure A). We analyzed theEC50 and standard error by
one-way ANOVA and found that there is no statistical significance
in EC50 values when different production batches of OptSpike1
were used as the target (Figure A, p > .9999) or when the different
S protein constructs OptSpike1-ExpiCHO-S, OptSpike2-ExpiCHO-S, and
OptSpike1-Expi293F were used as the target (Figure B, p > .9999). These
data
demonstrate that theexpression and purification protocols reported
herein consistently yield OptSpike1 with reproducible behavior in
ELISAs detecting anti-S IgG antibodies. Additionally, we have shown
that reactivity of serum antibodies from convalescent patients toward
OptSpike1 and OptSpike2 is not distinguishable.Highly reproducible data
in COVID-19 convalescent serum ELISA experiments
using OptSpike1 and OptSpike2. (A) Five different production batches
of OptSpike1-ExpiCHO-STM were tested by ELISA for the detection of
anti-S IgG antibodies from convalescent COVID-19patients. Low batch-to-batch
variability in OptSpike1 was observed, with no statistically significant
differences in EC50 values between batches (P > .9999). (B) Different S protein constructs OptSpike1-ExpiCHO-STM,
OptSpike1Expi293FTM, and OptSpike2-ExpiCHO-STM were tested by ELISA
for the detection of anti-S IgG antibodies from confirmed COVID-19
convalescent patients. Very little variability was seen between different
constructs and expression cell lines used, with no statistically significant
differences in EC50 values (P > .9999).
Development of a COVID-19 Multiantigen Protein
Array
Protein microarray technology allows for the high-throughput,
multiplexed
screening of numerous parameters within a singleexperiment.[31] To simultaneously and rapidly screen the serum
from convalescent COVID-19patients against multipleSARS-CoV-2 antigens
at once, we developed a COVID-19 multiantigen protein array presenting
purified S protein, the RBD of theS protein (Figure S2C), and Nucleocapsid protein (Figure S2A,B,D) of SARS CoV-2 (printed with a Marathon Argus
piezoelectric printer from Arrayjet). This multiantigen array was
challenged with either convalescent sera from COVID-19patients that
had been previously diagnosed using RT-PCR and ELISA tests or serum
from control individuals that had been collected prior to theSARS-CoV-2
outbreak in November 2019. Seropositivity of COVID-19patients was
confirmed by ELISA (Figure S5). In total,
serum from eight individuals was screened. Individuals 2, 3, 5, 6,
and 7 are confirmed SARS-CoV-2-positive. Individuals 1, 4, and 8 had
serum collected beforeSARS-CoV-2 was reported (Figure and Figure S4).
Figure 5
SARS-CoV-2 multiantigen protein array. (A) Representative image
of antigen detection from SARS-CoV-2 positive serum 2. Serum 2 detects
all of the SARS-CoV-2 antigens printed on the protein array: S protein
(OptSpike1 and 2), receptor binding domain of S protein (RBD), and
Nucleocapsid protein (N). (B) Representative image of screening with
SARS-CoV2 negative serum 4. Serum 4 is negative for all SARS-CoV-2
antigens printed and only positive control, human IgG1 (hIgG1) is
detected. (C) Representative image of antigen detection from SARS-CoV-2
positive serum 5. Serum 5 does not detect all of the SARS-CoV-2 antigens
printed on the protein array and only detects S protein and N proteins.
All protein arrays contain negative buffer and 1× PBS controls
as well as negative protein controls (huAche). (E) Quantifications
of serums 2, 4, and 5 results with titrations of OptSpike1 produced
in ExpiCHO-S cells from 200 to 25 pg. (F) Quantifications of serums
2, 4, and 5 results with titrations of RBD from 200 to 25 pg. (G)
Quantifications of serums 2, 4, and 5 results with titrations of N
protein from 200 to 25 pg.
SARS-CoV-2 multiantigen protein array. (A) Representative image
of antigen detection from SARS-CoV-2 positive serum 2. Serum 2 detects
all of theSARS-CoV-2 antigens printed on the protein array: S protein
(OptSpike1 and 2), receptor binding domain of S protein (RBD), and
Nucleocapsid protein (N). (B) Representative image of screening with
SARS-CoV2negative serum 4. Serum 4 is negative for all SARS-CoV-2
antigens printed and only positive control, human IgG1 (hIgG1) is
detected. (C) Representative image of antigen detection from SARS-CoV-2
positive serum 5. Serum 5 does not detect all of theSARS-CoV-2 antigens
printed on the protein array and only detects S protein and N proteins.
All protein arrays contain negative buffer and 1× PBS controls
as well as negative protein controls (huAche). (E) Quantifications
of serums 2, 4, and 5 results with titrations of OptSpike1 produced
in ExpiCHO-S cells from 200 to 25 pg. (F) Quantifications of serums
2, 4, and 5 results with titrations of RBD from 200 to 25 pg. (G)
Quantifications of serums 2, 4, and 5 results with titrations of N
protein from 200 to 25 pg.Each array was printed with 16 identical subarrays. A range of
target protein concentrations was spotted (25, 50, 100, and 200 pg
per spot), and each serum was screened in duplicate (Figure A–C, Figure S4A,B). Recombinant human IgG1, which is recognized
by the anti-human-647 labeled secondary antibody, was included in
each array as a positive control for protein printing. Negative controls
(human acetyl cholinesterase (huAche), printing buffer, and PBS) were
used to determine nonspecific binding and background correction for
data analysis. All confirmed COVID-19patients had strong antibody
responses to theS protein in our multiantigen protein array, which
was in good agreement with theS proteinELISAs (Figure S4D–F, Figure S5).
Additionally, all confirmed COVID-19patient sera tested positive
for N protein in our protein multiantigen array (Figure S4H). Antibodies against the RBD were detected in serums
2, 3, 6, and 7. However, antibodies against the RBD from patients
5 (which additionally had relatively lower detectable levels of antibodies
against the S and N proteins) could not be detected in this assay
(Figure D–G, Figure S4C–H). These data demonstrate
the ability of theCOVID-19 multiantigen protein array to simultaneously
analyze antibody responses to multiple antigens in a high-throughput
format and again validate the use of our recombinant S protein in
an antigen detection platform.Understanding variable antibody
responses to the spectrum of SARS-CoV-2
antigens will substantially impact our understanding of population-wide
immune responses to COVID-19.
Cryo-EM Structure of the
SARS-CoV-2 S Protein Produced in ExpiCHO-S
Cells
Cryogenic electron microscopy (cryo-EM) single particle
reconstruction was conducted to assess the tertiary and quaternary
arrangements of theS proteinexpressed in theExpi293F and ExpiCHO-S
systems (Table S2). Aliquots of purified
OptSpike1 protein from each expression host (1 mg/mL in 50 mM Tris
and 250 mM NaCl, pH 8.0) were applied to cryo-EM grids, blotted, and
plunge-frozen in liquid ethane at LN2 temperature. Micrographs
collected on a Titan Krios at 300 kV, using a K3 camera, were aligned
and motion-corrected followed by CTF estimation and 2D classification
as described in the Materials and Methods.
Particle picking, 3D refinement, and reconstruction yielded high-quality
maps for samples arising from both Expi293F and ExpiCHO-S productions
(Figure S6, Table S2). Using the OptSpike1-ExpiCHO-S material, we obtained a 3.22 Å
resolution reconstruction of the symmetrical trimer in the closed
conformation (all three RBDs in the down position) (Figure A,B, Figure S6, Table S2). Similarly, theOptSpike1-Expi293F
material yielded a 3.44 Å reconstruction of the symmetrical trimer
in the closed conformation (Figure S6, Table S2). Rigid body refinement and manual adjustment
in COOT followed by RSR refinement against the reconstructed maps
with PHENIX[32] yielded models with good
stereochemistry and validation metrics (Table S2, Figure , Figure S6A–F). Comparison of
the molecular envelopes from the current work with the recently deposited
cryo-EM structure of the closed state of theSARS-CoV-2SPIKE protein
(PDB: 6VXX)
showed excellent agreement. For example, structural alignment of the
trimeric ExpiCHO-produced S protein coordinates with those from 6VXX
resulted in a core RMSD of 0.63 Å for 2889 aligned Cα pairs.
Likewise, comparison of the coordinates for theExpi293F and ExpiCHO-S
coordinates from the current work aligned with an RMSD of 0.60 Å
over 2696 aligned Cα pairs. This is consistent with proper tertiary
folding and trimeric quaternary organization of both samples.[21]
Figure 6
Cryo-EM structure of OptSpike1-ExpiCHO-STM in the closed
state.
(A) Side view of the SARS-CoV-2 OptSpike1 trimer in the prefusion
conformation. (B) Top view of the SARS-CoV-2 OptSpike1 trimer in the
prefusion conformation. Two protomers are displayed with the cryo-EM
density maps (dark gray and light gray), and the third protomer is
displayed as a ribbon structure (magenta) with glycans represented
on the ribbon structure (cyan).
Cryo-EM structure of OptSpike1-ExpiCHO-STM in the closed
state.
(A) Side view of theSARS-CoV-2 OptSpike1 trimer in the prefusion
conformation. (B) Top view of theSARS-CoV-2 OptSpike1 trimer in the
prefusion conformation. Two protomers are displayed with the cryo-EM
density maps (dark gray and light gray), and the third protomer is
displayed as a ribbon structure (magenta) with glycans represented
on the ribbon structure (cyan).
Analysis of Cryo-EM Structures of the SARS-CoV-2 S Protein Reveals
Two Distinct Conformations at Residues 614–642
Wenext compared all available structures of theSARS-CoV-2S protein
in the closed (all three RBDs pointing down, no additional binding
partners) conformation (Figure A,B). The structures were aligned pairwise to Cα atoms
in chain A of PDB 6VXX using PyMol, revealing general tertiary and quaternary agreement
among all models. It is apparent that theN-terminal domain and the
RBD of S1 show greater positional variability among the structures
than do the S2 domains (Figure A,B). One region, amino acid segment 614–642, adopted
two distinct conformations across different models (Figure C–F). In nine structures
that align well in this region (conformation 1: 6VXX, 6X29, 6X2C,
6X79, 6ZOX, 6ZOY, 6ZP0, 6ZP1, 6ZWV), the amino acid segment 621–640
was not modeled, presumably due to disorder (Figure C). Seven structures (conformation 2: 6XR8,
6ZGE, 6ZGH, 6ZGI, 6ZP2, 7JJI, 7JJJ) show a different chain trace in
this region (first reported elsewhere[33]) and are missing amino acids 619–631, except for 7JJJ, which
exhibits a continuous trace in this region (shown in green – Figure D) by virtue of stabilization
due to packing against theN-terminal domain of a second trimer. Additionally,
these conformation structures contain an ordered segment spanning
amino acids 828–854 (colored blue, Figure D) not observed in the above structures characterized
by conformation 1 (Figure C).
Figure 7
Comparison of 20 S protein structures in the RBD down conformation
reveals two distinct structural variations at amino acids 614–642.
(A) Side and (B) top views of superposition of 19 trimeric S structures
with RBDs in the down conformation (6VXX, 6X29, 6X2C, 6X6P, 6X79, 6XLU, 6XM5, 6XR8, 6ZGE, 6ZGH, 6ZGI, 6ZOX, 6ZOY, 6ZOZ, 6ZP0, 6ZP1, 6ZP2, 6ZWV, 7JJI) aligned pairwise to Cα atoms in
chain A of 6VXX using Pymol. (C–F) Examination of the two regions of S protein
structures exhibiting structural variability at amino acid segments
614–642 (colored magenta or green in all panels, red asterisk
denotes the end of the trace) and 828–854 (shown in blue):
(C) nine structures (6VXX, 6X29, 6X2C, 6X79, 6ZOX, 6ZOY, 6ZP0, 6ZP1,
6ZWV) displaying conformation 1 between the region 614–642
and the amino acid segment 621–640 was not modeled; (D) seven
structures (6XR8, 6ZGE, 6ZGH, 6ZGI, 6ZP2, 7JJI, 7JJJ) displaying conformation
2 in this region and missing amino acids 619–631 (except 7JJJ,
shown in green) and which also have an ordered segment spanning amino
acids 828–854 (colored blue) not observed in structures in
panel (C); (E) the structure from this work (6X6P, in green) overlaid
on the structures from panel (C) showing general structural agreement;
(F) five structures (6X6P, 6XLU, 6XM5, 6ZOZ with 6X6P in green for
621–640) for which amino acids from 621 to 640 in conformation
1 were modeled, amino acids 828–854 are shown in blue.
Comparison of 20 S protein structures in the RBD down conformation
reveals two distinct structural variations at amino acids 614–642.
(A) Side and (B) top views of superposition of 19 trimeric S structures
with RBDs in the down conformation (6VXX, 6X29, 6X2C, 6X6P, 6X79, 6XLU, 6XM5, 6XR8, 6ZGE, 6ZGH, 6ZGI, 6ZOX, 6ZOY, 6ZOZ, 6ZP0, 6ZP1, 6ZP2, 6ZWV, 7JJI) aligned pairwise to Cα atoms in
chain A of 6VXX using Pymol. (C–F) Examination of the two regions of S protein
structures exhibiting structural variability at amino acid segments
614–642 (colored magenta or green in all panels, red asterisk
denotes theend of the trace) and 828–854 (shown in blue):
(C) nine structures (6VXX, 6X29, 6X2C, 6X79, 6ZOX, 6ZOY, 6ZP0, 6ZP1,
6ZWV) displaying conformation 1 between the region 614–642
and the amino acid segment 621–640 was not modeled; (D) seven
structures (6XR8, 6ZGE, 6ZGH, 6ZGI, 6ZP2, 7JJI, 7JJJ) displaying conformation
2 in this region and missing amino acids 619–631 (except 7JJJ,
shown in green) and which also have an ordered segment spanning amino
acids 828–854 (colored blue) not observed in structures in
panel (C); (E) the structure from this work (6X6P, in green) overlaid
on the structures from panel (C) showing general structural agreement;
(F) five structures (6X6P, 6XLU, 6XM5, 6ZOZ with 6X6P in green for
621–640) for which amino acids from 621 to 640 in conformation
1 were modeled, amino acids 828–854 are shown in blue.The structure from the present work (6X6P) displays
conformation
1 and is overlaid (in green) on the structures in Figure C, which reveals closely matched
organization for the amino acid segments that are mutually present
(614–620 and 641–642) (FigureE).Wenext overlaid all structures displaying
conformation 1, which also exhibited a complete trace in this region
(residues 614–642) (6X6P, 6XLU, 6XM5, 6ZOZ in magenta; 6X6P
in green Figure F).
These structures are not fully conserved structurally in segment 614–642
but generally follow a similar chain path distinct from conformation
2. Three of these four conformation 1 structures also have features
modeled for amino acids 828–854 (colored blue Figure F), although these segments
do not overlay as well as those present in the conformation 2 structures
(colored blue, Figure D). Based on this analysis, it is likely that at least two discrete
structural states for the amino acid segment spanning residues 614–642
exist, which may influence the conformation of residues 828–854.
Discussion
To effectively address theCOVID-19 global health
crisis, reproducibility
in biological and clinical studies will be critical. Efficient and
uniform production of key biological reagents is essential for the
generation of reliable data in the coming months and years. Herein,
we describe our protocols (including the SOP) for the optimized production
of two recently reported recombinant forms of theSARS-CoV-2S protein.
We show that using theExpiCHO-Sexpression system, recombinant S
protein can be reproducibly produced in both high quantities and high
quality. Our reported yields of 28 mg/L for OptSpike1 and 18 mg/L
for OptSpike2 exceed those previously reported (OptSpike1: .5 mg/L
in FreeStyle293-F cells,[7] OptSpike2: 5
mg/L in Expi293F cells, and .5 mg/L in insect cells[15]). To validate protein quality, weemployed analytical sizeexclusion chromatography, multiangle light scattering, thermal denaturation,
and analytical ultracentrifugation and demonstrated that these recombinant
S proteins have similar biophysical properties and antigenicity.Interestingly, we found that all preparations consistently exhibited
different behaviors whenexamined on two different analytical sizeexclusion columns. These proteins eluted as a single peak on theSuperose
6 Increase 10/300 GL column (agarose-based resin) but eluted as two
overlapping peaks on the Yarra 3 μm SEC-4000 (silica-based resin).
We also noted a time-dependent evolution as the distribution of peaks
on the Yarra 3 μm SEC-4000 moved entirely to the faster migrating
species. Furthermore, MALS analysis demonstrated that there was no
mass difference between the two overlapping peaks and AUC analysis
of OptSpike1-ExpiCHO-S confirmed that the predominant species in solution
was a stable trimer. We propose a modest time-dependent structural
alteration of theS protein, involving two states with distinct physicochemical
properties, resulting in differential interactions with thesilica-based
Yarra 3 μm SEC-4000 resin. This proposal is consistent with
the reported cryo-EM structures displaying the trimeric S proteinexisting in both a closed conformation with all RBDs pointing downward
(PDB: 6VXX)
and an open conformation with one RBD pointing upward (PDB: 6VSB and 6VYB).[7,21] Future
work will be required to fully understand the nature and functional
consequences of the multiple solution species observed for theS protein
(Figure B).We demonstrate that the protein purified here is well suited for
structural studies by determining the cryo-EM structures of OptSpike1-Expi293F
at 3.44 Å resolution and OptSpike1-ExpiCHO-S at 3.22 Å.
These structures arenearly identical to each other and are in good
agreement with the closed form observed in previously reported structures
of theS protein (which were produced in FreeStyle293-F cells).[7,21] Interestingly, the structures determined from protein expressed
in mammalian cells differ from a recently reported cryo-EM structure
of wild-typeS protein produced from High Five insect cells. This
structure appeared to be more compact overall, and the model could
not accommodate the presence of the SD1 domain.[34] Whether these disparities are the consequence of differences
in expression hosts (e.g., glycosylation or other processing functions)
or the lack of stabilizing mutations remains an open question, although
RBD recognition and antigenicity were preserved.We compared
20 deposited cryoEM structures of theS protein with
the RBDs in the down conformation. We found that, though there was
generally good overall structural agreement, one region (amino acid
segment 614–642) was modeled as one of the two distinct paths
in each structure (Figure ). This behavior indicates that this region of theS protein
can adopt at least two discrete conformations (conformation 1: Figure C,E,F; conformation
2: Figure D), which
may influenceneighboring segments of theS protein (notably amino
acids 828–854). Interestingly, this region is proximal to both
S cleavage sites, and residue 614 (the start of the 614–642
site) is often mutated in SARS-CoV-2 (i.e., D614G, which is associated
with increased infectivity).[35] A recent
structural study of theD614G variant reported a model in which neither
residues 621–639 nor the RBD was included (PDB: 6XS6),[36] and the trace modeled for residues 614–620 follows
the path observed in Figure C,E,F (conformation 1). TheD614G structure did contain several
differences in theS protein conformation, including an increased
distance between protomers and an increase in the fraction of S proteinexhibiting two or three RBDs in the open (up) conformation compared
to D614.[35] However, it has been recently
reported that the structure of theS protein and the distribution
of RBD conformations are affected by changes in pH,[37] and yet, another group reported temperature sensitivity
of theS protein structure.[38] Thus, the
apparent propensity for the immediate physicochemical environment
of theS protein to alter conformation makes it difficult to compare
distributions of structures when samples were not prepared identically.One structure (PDB: 7JJJ – the only structure in the “down”
conformation to fully model every amino acid in conformation 2- Figure D)[39] formed a “dimer of trimers” in which the
region spanning amino acids 619–631 extends away from one trimer
and interacts with theNTD of a second trimer. While the biological
significance of this conformation and interaction is unclear, the
authors proposed that “inter-spike interactions may play a
role in viral viability”, which is consistent with a mutagenesis
experiment they conducted.[39]Thus,
the determinants of different observed S protein conformations
remain unclear, and future studies areneeded to fully define the
structural and dynamic characteristics of theS protein and to determine
how these properties contribute to SARS-CoV-2 infectivity.Both
OptSpike1 and OptSpike2 proteins exhibit comparable thermal
denaturation profiles with two similar melting transitions (Figure G). This finding
is consistent with the thermal denaturation profile of wild-type S
protein purified from High Five insect cells despite the overall structural
differences.[34] Additional work will be
required to assign these transitions to specific domains or modules
within the full-length COVID-19spike protein.Weexamined the
receptor-binding specificity of theSARS-CoV-2S protein. We demonstrate strong binding between theS protein and
hACE2, but not mACE2, as has been previously reported.[8,40] Furthermore, we screened recombinant OptSpike1 against a sublibrary
of 900 members of thehuman secretome that wereexpressed on FreeStyle293-F cells and did not identify any additional interactions (Figure S3C). This library included many proteins
that others have suggested may bind to theSARS-CoV-2S protein, including
CD147, Siglec 9, Siglec 10, and proteins that other coronaviruses
use for viral entry, including CD26 (MERS), Ceacam5 (MERS), and Ceacam1
(Murine Coronavirus).[28,29,41,42] CD147 is of particular interest because
of the ongoing clinical trial that aims to treat SARS-CoV-2 with Meplazumab
(a humanized anti-CD147 antibody).[26] Importantly,
for this subset of proteins, we conducted antibody staining to explicitly
evaluate cell surface localization (Figure S3I–L). While all members of this subset could be detected by antibody
staining and are therefore properly localize to the cell surface,
we were not able to detect binding to theS protein. It is important
to note that the assay weemployed represents only one possible format
for detecting protein–protein interactions (recombinant S protein
binding to the cell surface displayed receptor: Figure S3A). Because we did not directly recapitulate theexperiments that originally detected interactions between theS protein
and CD147 (SPR, ELISA, Co-IP[27]) or Siglec
9 and 10 (ELISA[28]), it is possible that
theS protein binds to these receptors in a way that is undetectable
in this assay or that there will be cell-specific differences in the
presentation of the putative receptors (i.e., the requirement an unknown
co-receptor not expressed on FreeStyle293-F cells, etc.). It is important
to note that anti-CD147 Abs were reported to block COVID-19infection
in Vero E6 cells, which also express ACE2,[43] highlighting the potential importance of cell-specific differences
in receptor presentation and recognition. Further investigations are
warranted to evaluate these potential interactions. Finally, as our
screening efforts only included ∼20% of thehuman secretome,
it is important to continue to examine the remaining 80% for alternateS protein interactions, which might reveal additional mechanisms of
viral entry for SARS-CoV-2.We additionally demonstrate that
these recombinant S proteins can
be used interchangeably and reproducibly in two different assays screening
for a human serum response to S protein (ELISA: see Figure , and protein microarray: see Figure , Figure S4, and below). Validating that both S proteins can
be utilized in a serum ELISA screen is important because both OptSpike1
and OptSpike2 are being widely utilized for serology testing at different
clinical sites. These data collectively confirm that the strategy
for antigen preparation described here can be used reliably and that
the antigenicity of OptSpike1 and OptSpike2 is comparable regardless
of the cell line used (Expi293F vs ExpiCHO-S: Figure B). These data are critical for interpreting
clinical results from various institutions, which are using a wide
array of different serological tests to detect anti-S antibodies (for
a complete list of antibody tests approved for use by the FDA, please
see their website[44]). Consequently, additional
versions of theS protein that are being used clinically should be
analyzed similarly, so as to further ensure clinical reproducibility
and compatibility of different antibody serology tests.We developed
a protein microarray platform to simultaneously analyze
serum for antibodies against theS protein, as well as recombinant
RBD and N protein. Microarray results from a small cohort of serum
from five seroreactiveSARS-CoV-2patients and serum from three individuals
collected beforeNovember 2019 revealed that detectable anti-S antibody
titer correlated well with results of serological ELISAs (Figures S4 and S5) and additionally demonstrated
the simultaneous detection of antibodies against theS protein, theN protein, and the RBD of theS protein. It is important to note the
limitations of the current platform. In particular, we stress that
at present, this is a qualitative approach as a number of variables
can impact the ability to detect antibody reactivity, including patient
titers for specific antigens and the relative affinities of an antigen-specific
pool. Furthermore, it should be appreciated that the current platform
is programmed to detect the capture of IgG antibodies. Thus, the resultant
IgG signal (or the lack thereof) for a particular antigen could be
the consequence of different prevalences between different isotypes
(IgA, IgD, IgE, IgG, IgM), which are known to evolve during the course
of infection and subsequent resolution. Although currently focused
on three antigens, this platform can be readily expanded to study
differential antibody responses to different SARS-CoV-2 antigens and
subdomains of those antigens among individuals, which are actively
being investigated by others.[45−48] We are currently working to include not only other
antigens from SARS-CoV-2 (E protein, M protein, etc.) but also antigens
from other coronaviruses that may be cross reactive with SARS-CoV-2
antibodies. The analysis of antibody reactivity to multipleSARS-CoV-2
and related antigens will provide broad insight into the humoral immune
response to SARS-CoV-2.Collectively, this work provides an
array of standards and metrics
for high-quality protein reagents that can yield comparable clinical,
biological, and structural data as we continue to combat the global
health crisis caused by SARS-CoV-2.
Materials and Methods
Recombinant
Spike Protein Production
An SOP for the
production of high-quality recombinant S antigen is provided in the Supporting Information Appendix 1. Both variants
of the recombinant SARS-CoV-2S protein (OptSpike1 and OptSpike2)
wereexpressed in Expi293F and ExpiCHO-S cells according to the manufacturer’s
instructions (Thermo Fisher Scientific). For Expi293F (Thermo Fisher
Cat # A14527) purifications, cells were grown in sterileTC flasks,
vented, with a baffled bottom (example: Fisher Scientific Cat # BBV12-5)
in a Climo Shaker ISF4-X at 110 rpm (an orbital diameter of 50 mm)
at 37 °C and 8% CO2. On the day of transfection, Expi293F
cells can reach a density between 4.5−5.5 × 106 cells/mL. TheExpi293F cells are diluted to 3 × 106 cells/mL in the appropriate volume and shaker flask. For example,
for a transfection volume of 30 mL, 75 × 106 cells
were diluted in 25 mL of fresh media in a 125 mL shaker flask. Cells
were then transfected by diluting 30 μg of plasmid DNA in Opti-MEM
I Reduced Serum Medium (Cat # 31985-062) to a total volume of 1.5
mL in a 15 mL conical tube and then mixing briefly by inverting the
tube. 80 μL of ExpiFectamine 293 Reagent was then diluted in
Opti-MEM I medium to a total volume of 1.5 mL in a separate 15 mL
conical tube. Both reactions were incubated for 5 min. The transfection
reagent mixture was thencarefully pipetted into the DNA mixture and
mixed by inverting the 15 mL conical tube three times. The combined
DNA and transfection reagent mixture was incubated at room temperature
for 25 min, after which the complexes were added to the cell culture
in a dropwise fashion while swirling the cell culture flask to ensure
uniform distribution of the DNA complexes. 16 h post-transfection,
150 μL of ExpiFectamine 293 Transfection Enhancer 1 and 1.5
mL of ExpiFectamine 293 Transfection Enhancer 2 were added to the
transfected cells, bringing the total volume of the transfection to
30 mL. Expi293F transfections were harvested on day 6 post transfection
unless otherwise specified. This protocol can be scaled linearly for
larger Expi293F cultures, according to the manufacturer’s protocol,
for both OptSpike1 and OptSpike2.For ExpiCHO-S transfections,
we obtained theExpiCHO-S (Thermo Fisher Cat # A29127) cells and ExpiCHOExpression System kit (Cat # A29133) from Thermo Fisher Scientific.
ExpiCHO-S cells were grown in sterileErlenmeyer-shaped flasks with
plain bottoms with vented screw caps (appropriate flasks for cell
culture volume should be used; see, for example, Fisher brand PBV125)
in a Climo Shaker ISF4-X at 110 rpm (an orbital diameter of 50 mm)
at 37 °C and 8% CO2. ExpiCHO-S cells were passaged
and split every 2–3 days when cell densities reached 0.3 ×
106 to 6 × 106 viable cells per mL. ExpiCHO-S
cells are allowed to reach a density of 7–10 × 106 cells per mL on the day of transfection, with a viability
of at least 95%. ExpiCHO-S cells are diluted to a final density of
6 × 106 cells per mL in a plain bottom Erlenmeyer
flask. For spike protein transfections, the manufacturer’s
ExpiCHOexpression system manual was followed closely. For example,
for a 25 mL transfection, 25 μg of filtered DNA was diluted
into 1 mL of OptiPRO SFM (Cat # 12309019), and 80 μL of ExpiFectamineCHO reagent was added to 920 μL of OptiPRO SFM. Dilutions were
mixed by inversion, and then diluted DNA was mixed with diluted ExpiFectamineCHO reagent and mixed by inversion. ExpiFectamineCHO and DNA complexes
were incubated at room temperature for 5 min and slowly transferred
to a 125 mL plain bottom flask containing 25 mL of ExpiCHO-S cells
at a cell density of 6 × 106 cells per mL. For all
spike transfections, we followed the max titer protocol (unless otherwise
stated), and thus, on day 1 post transfection, we added 150 μL
of ExpiCHO-SEnhancer and 4 mL of ExpiCHO-S Feed. On day 1 post transfection,
the cells were shifted to another Climo Shaker ISF4-X at 110 rpm (an
orbital diameter of 50 mm) with temperature set at 32 °C and
5% CO2. On day 5 post transfection, another 4 mL of ExpiCHO-S
Feed was added and the cells were set back at 32 °C. ExpiCHO-S
transfections were harvested on day 12 post transfection unless otherwise
specified.
S Protein Purification
For purifications
of theSARS-CoV-2S protein (OptSpike1 and OptSpike2), on the indicated day post transfection
(optimally day 6 for Expi293F and day 12 for ExpiCHO-S), the media
was harvested by centrifuging cells at 500g for 10
min, removing the supernatant, and centrifuging the supernatant at
2000g for 10 min. For ExpiCHO-S, the supernatant
was clarified by filtering through a rapid flow filter from Thermo
Fisher (example: 524-0020). The supernatant was then adjusted to 50
mM ArgCl using a 2 M stock solution of ArgCl at pH 6.5 to increase
stability of the protein during purification. OptSpike1 and OptSpike2
were subsequently purified by His60Ni2+ superflow resin
(Takara Cat: 635664) using a batch binding method: .6 mL of resin
was added for every 10 mL of supernatant, and supernatants were incubated
with resin for 2 h at 4 °C with shaking or rotating. Batch adsorption
was followed by gravity flow over a column. TheNi2+His60
resin was washed with 3 column volumes of wash buffer (50 mM Tris
pH 8.0, 100 mM ArgCl, 5 mM imidazole, 150 mM NaCl, 10% glycerol),
and the bound protein was eluted with 5 mL of the same buffer containing
500 mM imidazole. Elution from nickel resin was concentrated by centrifugation
in a 100 K concentrator (Thermo Fisher Cat # 88533) by spinning at
1100g for intervals of 8 min (if necessary) and further
purified by gel filtration on a HiLoad 16/600 Superdex 200 column
(GE) equilibrated with 50 mM Tris, 100 mM ArgCl, 150 mM NaCl, 10%
glycerol, pH 8.0. For use in cryo-EM experiments, nickeleluates were
further purified by gel filtration on a HiLoad 16/600 Superdex 200
(GE) equilibrated with 50 mM Tris, 150 mM NaCl, pH 8.0. For the protein
used in serum ELISA or protein microarray experiments, 30 mL of nickeleluates was dialyzed against 5 L of 50 mM Tris, 150 mM NaCl, pH 8.0
at 4 °C for 16 h. Generally, the SEC step did not affect the
serum ELISA experiments but was important for all other experiments.
Protein concentration was determined using an extinction coefficient
(146,850 M–1 cm–1) estimated from
the amino acid sequence by theExpasy online ProtParam tool.[49]
SARS-CoV-2 RBD Expression and Purification
The pCAGGS[50] SARS-CoV-2 RBD plasmid
(provided by Florian
Krammer) was used for recombinant RBD expression as previously described.
FreeStyle293-F cells (Thermo Fisher, R79007) were transiently transfected
with a mixture of plasmid DNA diluted in PBS (0.67 μg of total
plasmid DNA per mL of culture) and polyethylenimine (PEI) (Polysciences,
Inc., 23,966) at a DNA-to-PEI ratio of 1:3. At 6 days post-transfection,
cultures were harvested by centrifugation at 4000g for 20 min, and the supernatant was incubated with Ni-NTA resin
(Goldbio) for 2 h at 4 °C with gentle stirring. The resin was
collected in columns by gravity flow, washed with 16 column volumes
of wash buffer (50 mM Tris HCl pH 8.0, 250 mM NaCl, 20 mM imidazole),
and eluted in 12 mL of elution buffer (50 mM Tris HCl pH 8.0, 250
mM NaCl, 250 mM imidazole). Eluates were concentrated and exchanged
into storage buffer (50 mM Tris HCl pH 8.0, 250 mM NaCl) using an
Amicon centrifugal units (EMD Millipore). Protein concentration was
determined using an extinction coefficient (33,350 M–1 cm–1), estimated from the amino acid sequence
by Expasy online ProtParam, and was further analyzed by SDS-PAGE.
Recombinant Expression and Purification of SARS-CoV-2 N Protein
Thenucleocapsid sequence was PCR-amplified from a diagnostic test
positive control plasmid obtained from IDTDNA (cat # 10006625 GenBank: NC_045512.2) and InFusion-cloned into a derivative of theNYSGRC pSGC-HIS vector.
50 ng of plasmid was used to transform 20 μL of the BL21 DE3
strain of E. coli. Cultures were then grown overnight (16 h) in LB at 37 °C and
used to inoculateeither LB media thenext day (1:100× dilution
of overnight culture). Inoculated cultures were grown at 37 °C
until they reached OD600 of 0.7, at which point they were
induced using 500 μM IPTG. Upon induction of LB media, the temperature
of the cultures was immediately lowered to 25 °C for 16 h.To harvest protein, cells were lysed by sonic disruption using a
550 sonic dismembrator from Fisher Scientific. Every 5 g was resuspended
in 30 mL of a lysis buffer consisting of 50 mM HEPES, 250 mM KCl,
10% glycerol, 10 mM BME, 0.1% Igepal CA-630 (Sigma Aldrich), pH 7.5
and 1/2 protease inhibitor tablets (Roche). After lysis, samples were
cleared by centrifugation at 20,000 rpm. The resulting supernatant
was purified on an AKTA FPLC (GE Biosciences). Supernatants were loaded
onto fast flow HisTrap columns and washed with 20 column volumes of
lysis buffer and eluted with 2 column volumes of Buffer B (Buffer
A + 500 mM imidazole, pH 7.5). The resulting eluent with high OD280 absorbance was collected and loaded onto a HiPrep 16/60
S-200 sizeexclusion column equilibrated with 50 mM HEPES, 250 mM
KCl, 10% glycerol, 5 mM DTT, pH 7.5. Protein concentrations of fractions
were approximated using an extinction coefficient of 43,890 M–1 cm–1, and a molecular mass of 45.62570
kDa was estimated from the amino acid sequence by theExpasy online
ProtParam tool.[49]
Analytical Size Exclusion
Chromatography
After nickelelutes were concentrated and purified by gel filtration on a HiLoad
16/600 Superdex 200 column and concentrated, the protein aggregation
state was assessed by analytical gel filtration on a Superose 6 Increase
10/300 GL column. The void for this column runs at 8.5 mL. The aggregation
state was monitored over time and after freeze–thaw cycles
on this column.
Molecular Mass Determination Using Multiangle
Light Scattering
(MALS)
30 μL of OptSpike1, Optspike2, or Nucleocapsid
was run over a Yarra 3 μm SEC-4000 LC column using an Agilent
Technologies 1260 Infinity instrument, equipped with an autoinjector.
10 μL samples were injected onto the column in 50 mM Tris, 150
mM NaCl, 100 mM ArgCl, 10% glycerol, pH 8.0 at a flow rate of 0.25
mL/min. MALS analysis was performed using a miniDAWN Treos MALS detector
(Wyatt) and Optilab T-rEX and analyzed using the associated Astra
software. Baselines were determined automatically; peaks were manually
delineated.
OptSpike1 Protein Melting Curve
A 5000× stock
of SYPRO dye (Thermo Fisher Scientific) was diluted to 200× in
Tris HCl (pH 7.5), 100 mM KCl buffer, the same buffer as OptSpike1
and OptSpike2 that were diluted in for the assay. We prepared a mixture
with a final protein concentration of 1.58 μM and a final SYPRO
orange concentration of 20×.The samples were split into three
technical triplicates (one in each well of a 384-well plate), and
the reactions were placed into a 7900 HT Fast Real Time PCR System.
The PCR machine was programmed to monitor the fluorescence of the
dye over a temperature gradient spanning 25 to 99 °C. The protocol
was designed to hold the samples at 25 °C for 2 min and then
ramp the temperature. Background fluorescence was measured using a
20× dye, and all measurements were taken in technical triplicate.Data was exported in Excel and then analyzed and graphed by GraphPad
Prism. To discern the melting temperature (Tm) of the protein, two methods were used as outlined in (1)
fitting the Boltzmannequation to the nonlinear raw data and (2) detection
of local maximum values in the 1st derivative of melting curve. The
curve had two apparent melting transitions, one between 42 and 55
°C and the other between 55 and 70 °C. The curves in these
two temperatureranges were fitted with Boltzmannequations (listed
below). The function fit to data in each of the temperatureranges
contained an approximate value for each Tm. In order to determine the local maximum values, the 1st and 2nd
derivatives were graphed (the 2nd derivative should equal 0 at the
local maxima of the 1st derivative, which can be visually verified
by the graph of the 1st derivative). Both methods produced comparable
melting temperature values: OptSpike1 Tm1 = 49.04 ± 0.1699 °C and OptSpike2 Tm1 = 49.36 ± 0.1652 °C; OptSpike1 Tm2 = 63.04 ± 0.1659 °C and OptSpike2 Tm2 = 63.31 ± 0.09236 °C. The melting
temperature can also be approximated by determining local maxima of
the 1st derivative of the fluorescence vs temperature curve. There
was not a significant difference in either Tm1 and Tm2 between the OptSpike1
and OptSpike2 constructs.
Analytical Ultracentrifugation
of OptSpike1
Analytical
ultracentrifugation (AUC) studies were conducted with OptSpike1 in
250 mM NaCl, 50 mM Tris, pH 8.0 in a Beckman XL-AUC using the absorption
optics to scan cells assembled with a double sector charcoal EPON
centerpiece and sapphire glass windows inserted into an AN-60 Ti rotor.
Centrifuge runs were conducted at 20 °C and 30,000 rpm. The first
run was scanned at 230 nm. A second run was scanned at 280 nm. This
protocol allowed protein concentrations ranging from 136 nM to 4.8
μM to be analyzed. A minimum of 50 scans was acquired for each
sample. An overview of the sedimentation behavior of each sample was
obtained by time-derivative g(s*)
analysis conducted using either the program SedAnal[51,52] or the program DCDT+ by John Philo v2.5.1.[53,54] Component analysis was conducted using the program Sedfit[55,56] to deconvolute the species present in a solution. The program Prism
v8 was used to generate the AUC figures shown in the text and the Supporting Information.
Convalescent Serum and
Plasma Samples
Serum and plasma
samples were collected from healthy adult volunteers residing in Westchester
County, NY, who had recovered from COVID-19 in April 2020. Patients
had reported a positive nasopharyngeal swab by qPCR for SARS-CoV-2
during illness and had been asymptomatic for at least 14 days prior
to sample collection. After obtaining informed consent, serum was
obtained by venipuncture (BD Vacutainer, serum), centrifuged, aliquoted,
and stored at −80 °C prior to use. The sera were heat-inactivated
at 56 °C for 30 min and stored at 4 °C prior to analysis.
Protocol approval was obtained from the Institutional Review Board
(protocol IRB# 2016-6137) of the Albert Einstein College of Medicine.
Protein Microarray Production and Processing
TheCOVID-19
protein array included OptSpike1, OptSpike2, receptor binding domain
(RBD) of Spike protein, and nucleocapsid proteins, along with positive
(human IgG) and negative (human acetylcholinesterase, PBS, or printing
buffer) controls. Arrays were generated by printing purified proteins
onto aminosilane-coated slides with a printing buffer containing an
amine-to-amine homo-bifunctional cross-linker (bissulfosuccinimidyl
suberate, BS3, Thermo Scientific PierceCat # A39266) and glycerol.
BS3 was used for covalent immobilization of the proteins to the slide,
and glycerol was used to keep them hydrated at all times. Protein
concentrations were adjusted to 25, 50, 100, and 200 pg per spot (2-fold
dilutions between 25 and 200 pg per spot). The array layout was designed
with 16 identical subarrays. Each sample was spotted in duplicate
with a total drop deposition volume of 800 pL (8 drops of 100 pL)
per spot using a Marathon Argus piezoelectric printer (Arrayjet, Edinburgh,
UK) at 50–55% humidity. The slides were incubated in a humidity-controlled
(50%) enclosure for 45 min and blocked by incubation with Superblock
blocking buffer (Thermo Fisher Scientific, Rockford, IL) for 45 min
at room temperature with shaking at 60 rpm.After blocking,
a multiwell chamber (ProPlate Multi-Well Chambers, Grace Bio-labs,
Bend, OR) was used to screen multiple serum samples with subarrays
on the same slide. As negative control for screening, we used three
serum samples (1, 4, and 8), which were received from theNew York
Blood Center, which were collected before the 2019 SARS-CoV-2 outbreak
and stored as aliquots at −80 °C. COVID-19 convalescence
sera samples (2, 3, 5, 6, 7) were previously validated to benegative
with RT-PCR for SARS-CoV-2 and positiveELISA for antibodies against
SARS-CoV-2.Each subarray was challenged with 250 μL of
1:100 patient
sera diluted in 5% (w/v) milk, PBS, and 0.2% (v/v) Tween-20 (5% milk-PBST)
overnight at 4 °C with gentle shaking on a plate shaker. The
arrays were rinsed with 5% milk-PBST briefly and incubated for 1 h
at room temperature with a fluorescently labeled secondary antibody
(AlexaFluor 647 labeled goat anti-human IgG (H + L) cross-adsorbed
secondary antibody, Thermo Fisher Scientific, Cat # A21445) diluted
1:150 in 5% milk-PBST. After washing three times with PBST, the slides
were rinsed with water and dried in falcon tubes by centrifugation
at 1000g for 3 min. The slides were scanned with
a GenePix 4400A microarray scanner (Molecular Devices), and data were
analyzed with GenePix Pro 7 software. Mean fluorescence intensities
(MFI) of every spot were quantified after background correction was
performed for each serum sample by subtracting the MFI of huAche,
which was printed as negative control within the subarray. Duplicate
spot measurements were averaged. The results were normalized relative
to the corresponding signal of hIgG1 and multiplied by a hundred for
presentation as percentage.
Recombinant Protein ELISAs
Recombinant
SARS-CoV-2Spike
protein was coated onto high-binding 96-well plates (Corning, 3690)
at 2 μg/mL, overnight at 4 °C. Wash steps were done using
1× PBS containing 0.05% Tween-20 (Sigma, P1379), and all incubations
were done at 25 °C. Plates were blocked with 3% nonfat dry milk
(Bio-Rad, P1379) in 1× PBS for 1 h. Serum from COVID-19 convalescent
patients was serially diluted in 1× PBS, containing 1% nonfat
dry milk and 0.1% Tween-20, and then added to plates and incubated
for 2 h. Following plate washing, serum reactivity to theSpike protein
was measured using an HRP-conjugated goat anti-human IgG (H + L) (Invitrogen,
31410) at a 1:3000 dilution for 1 h. After a final wash, the Ultra-TMB
substrate (Thermo Fisher, 34029) was added to the plates and incubated
for 5 min followed by quenching of the reaction by addition of an
equal volume of 0.5 M H2SO4. Absorption was
measured at OD450 using a Synergy4 plate reader (Biotek),
and data was analyzed using GraphPad Prism7.0 to calculate IC50 values.
PNGase F Digestion of Spike Protein
For PNGase F digestion,
purified Spike protein expressed from Expi293F or ExpiCHO-S cells
was digested under denaturing reaction conditions. 5 μg of protein,
1 μL of glycoprotein denaturing buffer (NEB Cat # B0701S, 10X),
and 4 μL of water were mixed to a total of 10 μL. Thespike glycoproteins were denatured at 100 ° C for 10 min, and
the denatured proteins were incubated on ice for 5 min and centrifuged
for 10 s at max speed on a microcentrifuge. Next, 2 μL of GlycoBuffer
2 (NEB Cat # B0701S, 10X), 2 μL of 10% NP-40 (NEB Cat # B0701S),
and 6 μL of water were mixed with 10 μL of denatured spike
glycoprotein. Finally, 1 μL of PNGase F (NEB Cat # P0704S) was
added to the reaction, and the digestion was allowed to proceed at
37 ° C for 1 h. Glycoproteins were analyzed by SDS-PAGE to observe
loss of N-linked glycans compared to undigested glycoproteins.
FreeStyle
293-F Transient Transfections and Culture
FreeStyle293-F
suspension cells were cultured in FreeStyle 293 Expression
Media (Invitrogen) at 37 °C in a humidified shaking platform
incubator (Kuhner) with 5% CO2. For transfection, cells
were pelleted at 500g and resuspended in fresh media.
For small-scale (1 mL of cells at 1 × 106/mL) transient
transfections performed in 24-well nontreated tissue culture plates,
2 μg of polyethylenimine (PEI) was added to 0.5 μg of
diluted plasmid DNA in a final volume of 100 μL. For small-scale
(.25 mL of cells at 1 × 106/mL) transient transfections
performed in 48-well nontreated tissue culture plates, .5 μg
of polyethylenimine (PEI) was added to 0.125 μg of diluted plasmid
DNA in a final volume of 25 μL. DNA–PEI complexes were
incubated at room temperature for 10 min and then added directly to
cells in 48-well plates. FreeStyle293-F cells were utilized to screen
for spike protein interactions, so genes that were transfected into
FreeStyle293-F cells included humanACE2 and CD26, mouseACE2, the
secretome library, and GFP controls.
Flow Cytometry Titration
Experiment
Flow cytometry
titration assays were performed with theOptSpike1 and OptSpike2 proteins
described above. FreeStyle293-F suspension cells were transfected
with DNA encoding humanACE2, mouseACE2, humanCD26, or GFP. Both
ACE2 variants wereexpressed with GFP tags on their cytosolic C term,
whileCD26 was expressed in an IRES vector expressing GFP. Two days
post transfection, cells were counted and diluted to 1 × 106 cells/mL in 1× PBS, .2% BSA. OptSpike1 and OptSpike2
proteins were serially diluted to a range of concentrations from 1
nM to 10 μM. Subsequently, 10 μL of diluted Spike protein
was added to 90 μL of diluted cells in wells of a 96-well plate
(90,000 cells per well). Binding was performed at room temperature
for 1 h with shaking at 900 rpm, after which the cells were washed
twice with 1× PBS, .2% BSA by centrifugation. Cells were then
incubated with a PE labeled anti-6x His tag antibody (Abcam Cat #
ab72467) to detect Spike protein binding. Antibody binding was performed
for 30 min at room temperature with shaking, after which the cells
were washed twice with 1× PBS, .2% BSA by centrifugation. Cells
were analyzed by flow cytometry/spectral analysis on a Sony Spectral
Analyzer. Gated live cells were subgated for GFP, and GFP-positive
cells were subgated for PE-positiveevents. Data points represent
the average of three independent experiments fit to the single site
binding equation: Y=Bmax*Xh/(Kdh+Xh).
Generation of a Human Plasma Membrane Protein Library for Mammalian
Expression
Thehuman genome was analyzed for all transcripts
that contain at least one transmembrane domain (TMHMM Server 2.0)
and a predicted secretion signal peptide sequence.[57−59] The resulting
list of 14,028 transcripts was manually purged of mitochondrial, nuclear,
and ER membrane proteins, resulting in a final target set of 9065
potential human plasma membrane transcripts representing a set of
4860 genes. All of the full-length cDNAs available from GeneCopoeia
were purchased as covalent C-terminal GFP fusions in a CMV mammalianexpression plasmid (3926 total). To enhance library coverage of proteins
missing from the GeneCopoeia library, a separate set of synthetic
transcripts was ordered from Gen9 (1282 total). This set was selected
by identifying the largest transcript for each of the missing proteins
with the limitation that they be under 4 kb in length (the limit for
high-throughput synthesis). Each full-length cDNA was codon-optimized,
synthesized, and subcloned into the Clonetech pEGFP N1 vector as C-terminal
GFP fusions. Working libraries of Ig superfamily proteins (IGSF),
TNF receptor superfamily proteins (TNFRSF), G-Protein coupled receptor
proteins (GPCR), Integrins, and chemokines were identified for each
subfamily based on lists generated using the HUGO GeneNomenclature
Committee (HGNC) gene family resource.[60] Each set was re-arrayed manually from glycerolstocks into liquid
2xYT media in 96-well deep-well blocks. Overnight cultures were mini-prepped
in 96-well plates (Macherey Nagel kits) for use in downstream high-throughput
mammalian cell transfections.
High-Throughput Screening
of the Human Plasma Membrane Protein
Library
Thehuman plasma membrane protein library[59,61] was transfected into FreeStyle293-F cells in the 48-well format.
Two days post-transfection, cells were diluted to 1 × 106 cells/mL in .2% BSA. Binding reactions were set up in 96-well
V-bottom plates by incubating 100 μL of cells with 200 nM OptSpike1;
each plate also contained hACE2-GFP expressing cells as an internal
positive control. After 45 min, cells were pelleted and washed twice
and then incubated with an anti-HIS PE antibody, washed twice, and
analyzed on a Sony Spectral Analyzer. The percent bound was calculated
as the number of double-positiveevents (GFP and mCherry) divided
by the total number of GFP positive cells. Expressions of hACE2, mACE2,
CD147, CD26, Siglec 9, Siglec 10, Ceacam1, and Ceacam5 were confirmed
by antibody staining of transfected FreeStyle293-F cells. Binding
was conducted as described for S protein binding. Antibodies used
were as follows: hACE2 and mACE2 (R&D Cat # AF933-SP), CD147 (Biolegend
Clone HIM6), CD26 (Biolegend Clone BA5b), Siglec 9 (Biolegend Clone
K8), Siglec 10 (Biolegend Clone FG6), Ceacam1, and Ceacam5 (Biolegend
Clone ASL-32).
CryoEM
Grid Preparation
3 μL of protein (1 mg/mL in
50 mM Tris, 250 mM NaCl, pH 8.0) was applied to plasma-cleaned C-flat
1.2/1.3400 mesh Cu holey carbon grids (Protochips, Raleigh, NC) or
1.2/1.3300 mesh UltrAuFoil gold holey gold grids (Quantifoil Micro
Tools GmbH, Großlöbichau, Germany), blotted for 2.5 s
after a 30 s wait time, and then plunge frozen in liquid ethane, cooled
by liquid nitrogen, using theEM GP2 (Leica Microsystems, Inc., Buffalo
Grove, IL) or Vitrobot Mark IV (Thermo Fisher Scientific, Hillsboro,
Oregon).
Microscopy
A Thermo Fisher Titan
Krios operated at
300 kV, Gatan GIF-Bioquantum with a 20 eV slit, and K3 camera were
used with 100 μm C2 aperture, 100 μm objective aperture,
and 1.058 Å calibrated pixel size.
Imaging
Movies
were collected in counting mode using
Leginon[62] at a dose rate of 26.6 e–/Å2/s with a total exposure time of
2.5 s, for an accumulated dose of 66.5 e–/Å2. Intermediate frames were recorded every 50 ms for a total
of 50 frames per micrograph. Defocus values ranged from approximately
0.8 to 2.5 μm.
Image Processing
Movies recorded
on the K3 were aligned
using Appion[63] and MotionCor2[64] and CTF estimated and 2D classified in cryoSPARC
v2.14.2.[65] Particle picking was done with
TOPAZ[66] as implemented in cryoSPARC. A
Topaz picking model was trained using frame-summed micrographs of
4000 particles manually curated from 100 micrographs initially picked
from 7500 blob picks in cryoSPARC. For the final reconstruction, particles
were selected and subjected to 3D refinement in cryoSPARC with a final
box size of 384 × 384pixels. TheExpiCHOexpressed dataset (n20apr21a)
processing used 1131 micrographs and 75,582 particles that was curated
to 54,395 particles. TheExpi293Fexpressed dataset (n20apr22) processing
used 1694 micrographs and 99,154 particles that was curated to 54,066
particles.
Authors: Markus F Templin; Dieter Stoll; Monika Schrenk; Petra C Traub; Christian F Vöhringer; Thomas O Joos Journal: Drug Discov Today Date: 2002-08-01 Impact factor: 7.851
Authors: Eric L Smith; Kim Harrington; Mette Staehr; Reed Masakayan; Jon Jones; Thomas J Long; Khong Y Ng; Majid Ghoddusi; Terence J Purdon; Xiuyan Wang; Trevor Do; Minh Thu Pham; Jessica M Brown; Carlos Fernandez De Larrea; Eric Olson; Elizabeth Peguero; Pei Wang; Hong Liu; Yiyang Xu; Sarah C Garrett-Thomson; Steven C Almo; Hans-Guido Wendel; Isabelle Riviere; Cheng Liu; Blythe Sather; Renier J Brentjens Journal: Sci Transl Med Date: 2019-03-27 Impact factor: 17.956
Authors: Tongqing Zhou; Yaroslav Tsybovsky; Jason Gorman; Micah Rapp; Gabriele Cerutti; Gwo-Yu Chuang; Phinikoula S Katsamba; Jared M Sampson; Arne Schön; Jude Bimela; Jeffrey C Boyington; Alexandra Nazzari; Adam S Olia; Wei Shi; Mallika Sastry; Tyler Stephens; Jonathan Stuckey; I-Ting Teng; Pengfei Wang; Shuishu Wang; Baoshan Zhang; Richard A Friesner; David D Ho; John R Mascola; Lawrence Shapiro; Peter D Kwong Journal: Cell Host Microbe Date: 2020-11-17 Impact factor: 21.023
Authors: Christina M Zimanyi; Mykhailo Kopylov; Clinton S Potter; Bridget Carragher; Edward T Eng Journal: Trends Biochem Sci Date: 2022-02 Impact factor: 13.807
Authors: Timothy P Riley; Hui-Ting Chou; Ruozhen Hu; Krzysztof P Bzymek; Ana R Correia; Alexander C Partin; Danqing Li; Danyang Gong; Zhulun Wang; Xinchao Yu; Paolo Manzanillo; Fernando Garces Journal: Front Immunol Date: 2021-04-22 Impact factor: 7.561
Authors: Abigail E Powell; Kaiming Zhang; Mrinmoy Sanyal; Shaogeng Tang; Payton A Weidenbacher; Shanshan Li; Tho D Pham; John E Pak; Wah Chiu; Peter S Kim Journal: ACS Cent Sci Date: 2021-01-05 Impact factor: 14.553
Authors: Hyun Ah Yoon; Rachel Bartash; Inessa Gendlina; Johanna Rivera; Antonio Nakouzi; Robert H Bortz; Ariel S Wirchnianski; Monika Paroder; Karen Fehn; Leana Serrano-Rahman; Rachelle Babb; Uzma N Sarwar; Denise Haslwanter; Ethan Laudermilch; Catalina Florez; M Eugenia Dieterle; Rohit K Jangra; J Maximilian Fels; Karen Tong; Margarette C Mariano; Olivia Vergnolle; George I Georgiev; Natalia G Herrera; Ryan J Malonis; Jose A Quiroz; Nicholas C Morano; Gregory J Krause; Joseph M Sweeney; Kelsie Cowman; Stephanie Allen; Jayabhargav Annam; Ariella Applebaum; Daniel Barboto; Ahmed Khokhar; Brianna J Lally; Audrey Lee; Max Lee; Avinash Malaviya; Reise Sample; Xiuyi A Yang; Yang Li; Rafael Ruiz; Raja Thota; Jason Barnhill; Doctor Y Goldstein; Joan Uehlinger; Scott J Garforth; Steven C Almo; Jonathan R Lai; Morayma Reyes Gil; Amy S Fox; Kartik Chandran; Tao Wang; Johanna P Daily; Liise-Anne Pirofski Journal: JCI Insight Date: 2021-02-22
Authors: Rute Castro; Lígia S Nobre; Rute P Eleutério; Mónica Thomaz; António Pires; Sandra M Monteiro; Sónia Mendes; Ricardo A Gomes; João J Clemente; Marcos F Q Sousa; Filipe Pinto; Ana C Silva; Micael C Freitas; Ana R Lemos; Onome Akpogheneta; Lindsay Kosack; Marie-Louise Bergman; Nadia Duarte; Paula Matoso; Júlia Costa; Tiago M Bandeiras; Patricia Gomes-Alves; Carlos P Gonçalves; Jocelyne Demengeot; Paula M Alves Journal: Biotechnol Bioeng Date: 2021-03-27 Impact factor: 4.395