| Literature DB >> 36189278 |
Sona Aramyan1, Kirk McGregor1, Samarth Sandeep1, Angela Haczku2.
Abstract
The pulmonary surfactant protein A (SP-A) is a constitutively expressed immune-protective collagenous lectin (collectin) in the lung. It binds to the cell membrane of immune cells and opsonizes infectious agents such as bacteria, fungi, and viruses through glycoprotein binding. SARS-CoV-2 enters airway epithelial cells by ligating the Angiotensin Converting Enzyme 2 (ACE2) receptor on the cell surface using its Spike glycoprotein (S protein). We hypothesized that SP-A binds to the SARS-CoV-2 S protein and this binding interferes with ACE2 ligation. To study this hypothesis, we used a hybrid quantum and classical in silico modeling technique that utilized protein graph pruning. This graph pruning technique determines the best binding sites between amino acid chains by utilizing the Quantum Approximate Optimization Algorithm (QAOA)-based MaxCut (QAOA-MaxCut) program on a Near Intermediate Scale Quantum (NISQ) device. In this, the angles between every neighboring three atoms were Fourier-transformed into microwave frequencies and sent to a quantum chip that identified the chemically irrelevant atoms to eliminate based on their chemical topology. We confirmed that the remaining residues contained all the potential binding sites in the molecules by the Universal Protein Resource (UniProt) database. QAOA-MaxCut was compared with GROMACS with T-REMD using AMBER, OPLS, and CHARMM force fields to determine the differences in preparing a protein structure docking, as well as with Goemans-Williamson, the best classical algorithm for MaxCut. The relative binding affinity of potential interactions between the pruned protein chain residues of SP-A and SARS-CoV-2 S proteins was assessed by the ZDOCK program. Our data indicate that SP-A could ligate the S protein with a similar affinity to the ACE2-Spike binding. Interestingly, however, the results suggest that the most tightly-bound SP-A binding site is localized to the S2 chain, in the fusion region of the SARS-CoV-2 S protein, that is responsible for cell entry Based on these findings we speculate that SP-A may not directly compete with ACE2 for the binding site on the S protein, but interferes with viral entry to the cell by hindering necessary conformational changes or the fusion process.Entities:
Keywords: MaxCut; QAOA; SARS-CoV-2; SP-A; glycosylation; immunoprotection; in silico; quantum computation (QC)
Mesh:
Substances:
Year: 2022 PMID: 36189278 PMCID: PMC9519185 DOI: 10.3389/fimmu.2022.945317
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Hypothesis: SARS-CoV-2 Spike glycoprotein glycosylation sites are potential binding sites for SP-A (A): Glycosylation sites on the SARS-CoV-2 spike glycoprotein trimer are denoted by NAG residues, shown in (O): red; (C): grey; (N): blue space fill balls. Structure of SARS-CoV-2 spike glycoprotein with a single receptor-binding domain up NGL Viewer (AS Rose et al. 2018) PDB: 6VSB DOI: 10.2210/pdb6VSB/ pdb EM Map EMD-21375: EMDB EMDataResource (B): Structure of monomer (top panel), trimeric and octadecameric SP-A and potential carbohydrate recognition sites on the S protein by the Carbohydrate Recognition Domain (CRD) of the lectin head of SP-A. The CRD binds carbohydrate residues with high affinity in a Ca++ dependent manner. The X-ray Crystal Structure depicts the rat Surfactant Protein A neck and carbohydrate recognition domain ligated with mannose. Atoms represented by the spacefill balls are: (O): red; (C): grey; (Ca): green; (Na): purple. PDB: 3PAK DOI: 10.2210/pdb3PAK/ pdb 2010-11-03 Shang, F. et al. (X-RAY DIFFRACTION Resolution: 1.90 Å). (RBD, receptor binding domain; S1, Spike 1 region; S2, Spike 2 region; TMD, Transmembrane domain; CRD, Carbohydrate recognition domain).
Figure 2Pruning Program by utilizing the Quantum Approximate Optimization Algorithm (QAOA) on the Rigetti Quantum Processor. Process diagram for finding protein binding by Pruning. (A) Create graphs of 3 neighboring atoms each, with angles beta and gamma stored (B) Fourier-transform angles into frequencies to be placed on quantum chip by microwave. (C) Send microwaves to quantum chip. (D) Read results from quantum chip to determine which atoms to cut, with 00=Cut. (E) Summation of cut atom graphs to build reduced structures. (F) Binding studies between reduced structures with ZDOCK testing to identify the best binding sites.
Figure 3SARS-CoV-2 S protein (6VSB) modeling. (A): The S protein trimer model (green ribbons representing chains A, B and C with red denoting glycan residues) and bound SP-A (pink). (B): The final reduced S-protein-SP-A complex processed by our QAOA-based MaxCut protein pruning tool followed by ZDOCK docking (purple representing glycan residues and X marking the SP-A binding site. (A, B) were derived from the visualization software SAMSON. (C): The amino acid ASN 1134 on the S protein C chain is identified as a likely candidate to mediate SP-A binding. ASN 1134 is outlined by dark pink and the blue cubes represent NAG glycosylation.
Figure 4The complex resulting from docking of reduced structures of the S protein and SP-A with ZDOCK highlighting the top ranked binding sites on SP-A. (A): Visualization by SAMSON after clashes/contacts with less than 2 Å distance were identified using the Chimera program. (B): The top 5 binding sites are shown (ball and stick) highlighting bound NAG (purple space fill balls). SP-A and S protein amino acid residues are shown as a ball and stick. NAG1301 is bound to ASN1134 on the S protein and is shown in close proximity to ASN151 of SP-A. (C): The SP-A carbohydrate binding grove showing the amino acids identified in the pruned complex including ASN 151 (pink) TYR 188 (khaki), GLU 171 (bordeaux) and Ca++ (green) and Na+ (purple). ASN 151 is clustered with ASN 214 and ASN 190 amplifying carbohydrate binding ability. The groove is flanked by TYR 188 and GLU 171 and harbors a Ca++ and a Na+ ligand. Presence of Ca++ is known to be required for carbohydrae binding. Atoms represented by the balls and stick are: (O): red; (C): grey; (Ca): green; (Na): purple; (S): yellow.
Top 16 ZDOCK scores, representing the electrostatic and geometric fit between protein residues, out of 2000 potential conformations between SP-A and SARS-CoV-2 Spike.
| Conformation rank (of 2000) | X Docking Grid coordinate | Y Docking Grid coordinate | Z Docking Grid coordinate | ZDOCK Score |
|---|---|---|---|---|
| 1 | 140 | 38 | 30 | 448.202 |
| 2 | 173 | 21 | 60 | 405.703 |
| 3 | 5 | 27 | 55 | 390.222 |
| 4 | 163 | 48 | 44 | 386.713 |
| 5 | 140 | 38 | 28 | 386.160 |
| 6 | 178 | 31 | 65 | 376.122 |
| 7 | 139 | 40 | 31 | 373.924 |
| 8 | 147 | 49 | 26 | 372.902 |
| 9 | 5 | 25 | 53 | 372.588 |
| 10 | 145 | 39 | 27 | 371.860 |
| 11 | 139 | 43 | 29 | 365.741 |
| 12 | 173 | 22 | 60 | 364.003 |
| 13 | 138 | 34 | 22 | 359.788 |
| 14 | 8 | 30 | 55 | 352.675 |
| 15 | 140 | 46 | 31 | 349.967 |
| 16 | 151 | 55 | 27 | 342.542 |
The central atom position between each conformations of the bound SP-A and the S protein is determined by the X,Y,Z coordinates that reflect the spatial localization differences.
Top 16 binding sites between SP-A and SARS-CoV-2.
| S Protein Chain; Amino Acid/Glycan residue; Contact Atom for Binding SP-A | Position of NAG or amino acid on the S protein (6VSB, published Feb 2020) | SP-A Amino Acid; Contact Atom for Binding the S Protein | Position of Amino acid on SP-A (5FFR) | Distance between both atoms (Å) |
|---|---|---|---|---|
| C; NAG; O | 1301 | TYR; C | 188 | 1.308 |
| C; NAG; O | 1301 | TYR; C | 188 | 1.730 |
| C; NAG; O | 1301 | TYR; C | 188 | 1.905 |
| C; NAG; C | 1312 | ASN; C | 151 | 2.309 |
| C; NAG; O | 1312 | ARG; C | 197 | 2.079 |
| C; NAG; O | 1311 | PRO; C | 175 | 2.153 |
| C; NAG; C | 1312 | GLU; C | 171 | 2.455 |
| C; NAG; C | 1312 | GLN; C | 199 | 2.635 |
| C; CYS; C | 1126 | SER; C | 187 | 2.572 |
| C; ILE; C | 1130 | SER; C | 185 | 2.715 |
| C; NAG; C | 1302 | TYR; C | 188 | 2.620 |
| C; NAG; C | 1301 | THR; C | 189 | 2.653 |
| C; NAG; C | 1302 | TYR; C | 188 | 2.745 |
| C; NAG; C | 1302 | TRP; C | 213 | 2.749 |
| C; NAG; C | 1302 | ASN; C | 151 | 2.885 |
| C; NAG; C | 1311 | PRO; C | 175 | 2.809 |
The chain, the amino acid, and their respective atoms that are predicted to mediate binding between the S protein and SP-A according to the minimal distance that can be achieved between two atoms.
| ACE2 | Angiotensin Converting Enzyme 2 |
| ARG | Arginine |
| ASN | Asparagine |
| CHARMM | Chemistry at Harvard Macromolecular Mechanics |
| COVID-19 | Coronavirus Disease Pandemic 2019 |
| CRD | Carbohydrate Recognition Domain |
| Fc | Crystallizable Fragment |
| GLN | Glutamine |
| GLU | Glutamic acid |
| HAT | Human Airway Trypsin-like protease |
| HIS | Histidine |
| LEU | Leucine |
| MASP | Mannose Binding Lectin Associated Serine Protease |
| MBL | Mannose Binding Lectin |
| NET | Neutrophil Extracellular Traps |
| Extracellular Traps | |
| NHS | National Health Service United Kingdom |
| NISQ | Near Intermediate Scale Quantum |
| PAR4 | Protease-activated Receptor 4; |
| PHE | Phenylalanine |
| PRO | Proline |
| QAOA | Quantum Approximate Optimization Algorithm |
| RMSD | Root Mean Square Deviation |
| RSV | Respiratory Syncytial Virus |
| S1 | Subunit 1 of the SARS-CoV-2 Spike that is found at its tip |
| S2 | Subunit 2 of the SARS-CoV-2 Spike that is bound to the rest of the virion |
| SARS CoV-2 | Severe Acute Respiratory Syndrome Coronavirus 2 |
| SER | Serine |
| SIRPa | Signal Regulatory Protein-alpha |
| SP-A | Surfactant Protein-A |
| SP-B | Surfactant Protein-B |
| SP-C | Surfactant Protein-C |
| SP-D | Surfactant Protein-D |
| S Protein | SARS-CoV-2 Spike Protein |
| THR | Threonine |
| TMPRSS2 | Transmembrane Protease Serine 2 |
| TRP | Tryptophan |
| TTSP | Type II Transmembrane Serine Proteases |
| TYR | Tyrosine |
| UNK | Unknown/Unlabeled |
| VAL | Valine |
| ZDOCK | Docking Program based on the Fast Fourier Transform algorithm developed by the Zheng Lab, at the University of Massachusetts, Amherst |