| Literature DB >> 33236007 |
Lorenzo Casalino, Abigail Dommer, Zied Gaieb, Emilia P Barros, Terra Sztain, Surl-Hee Ahn, Anda Trifan, Alexander Brace, Anthony Bogetti, Heng Ma, Hyungro Lee, Matteo Turilli, Syma Khalid, Lillian Chong, Carlos Simmerling, David J Hardy, Julio D C Maia, James C Phillips, Thorsten Kurth, Abraham Stern, Lei Huang, John McCalpin, Mahidhar Tatineni, Tom Gibbs, John E Stone, Shantenu Jha, Arvind Ramanathan, Rommie E Amaro.
Abstract
We develop a generalizable AI-driven workflow that leverages heterogeneous HPC resources to explore the time-dependent dynamics of molecular systems. We use this workflow to investigate the mechanisms of infectivity of the SARS-CoV-2 spike protein, the main viral infection machinery. Our workflow enables more efficient investigation of spike dynamics in a variety of complex environments, including within a complete SARS-CoV-2 viral envelope simulation, which contains 305 million atoms and shows strong scaling on ORNL Summit using NAMD. We present several novel scientific discoveries, including the elucidation of the spike's full glycan shield, the role of spike glycans in modulating the infectivity of the virus, and the characterization of the flexible interactions between the spike and the human ACE2 receptor. We also demonstrate how AI can accelerate conformational sampling across different systems and pave the way for the future application of such methods to additional studies in SARS-CoV-2 and other molecular systems. ACM REFERENCE FORMAT: Lorenzo Casalino 1† , Abigail Dommer 1† , Zied Gaieb 1† , Emilia P. Barros 1 , Terra Sztain 1 , Surl-Hee Ahn 1 , Anda Trifan 2,3 , Alexander Brace 2 , Anthony Bogetti 4 , Heng Ma 2 , Hyungro Lee 5 , Matteo Turilli 5 , Syma Khalid 6 , Lillian Chong 4 , Carlos Simmerling 7 , David J. Hardy 3 , Julio D. C. Maia 3 , James C. Phillips 3 , Thorsten Kurth 8 , Abraham Stern 8 , Lei Huang 9 , John McCalpin 9 , Mahidhar Tatineni 10 , Tom Gibbs 8 , John E. Stone 3 , Shantenu Jha 5 , Arvind Ramanathan 2∗ , Rommie E. Amaro 1∗ . 2020. AI-Driven Multiscale Simulations Illuminate Mechanisms of SARS-CoV-2 Spike Dynamics. In Supercomputing '20: International Conference for High Performance Computing, Networking, Storage, and Analysis. ACM, New York, NY, USA, 14 pages. https://doi.org/finalDOI.Entities:
Year: 2020 PMID: 33236007 PMCID: PMC7685317 DOI: 10.1101/2020.11.19.390187
Source DB: PubMed Journal: bioRxiv
Figure 1:Multiscale modeling of SARS-CoV-2. A) All-atom model of the SARS-CoV-2 viral envelope (305 M atoms), including 24 spike proteins (colored in gray) in both the open (16) and closed states (8). The RBDs in the “up” state are highlighted in cyan) N-/O-Glycans are shown in blue. Water molecules and ions have been omitted for clarity. B) Full-length model of the glycosylated SARS-CoV-2 spike protein (gray surface) embedded into an ERGIC-like lipid bilayer (1.7 M atoms). RBD in the “up” state is highlighted in cyan. C) The glycan shield is shown by overlaying multiple conformations for each glycan collected at subsequent timesteps along the dynamics (blue bushlike representation). Highlighted in pink and red are two N-glycans (linked to N165 and N234, respectively) responsible for the modulation of the RBD dynamics, thus priming the virus for infection. The RBD “up” is depicted with a cyan surface. D) Two-parallel-membrane system of the spike-ACE2 complex (8.5 M atoms). The spike protein, embedded into an ERGIC-like membrane, is depicted with a gray transparent surface, whereas ACE2 is shown with a yellow transparent surface and it is embedded into a lipid bilayer mimicking the composition of mammalian cell membranes. Glycans are shown in blue, whereas water has been omitted for clarity. Visualizations were created in VMD using its custom GPU-accelerated ray tracing engine [23, 58–61].
Figure 2:Opening of the spike protein. VMD visualization of weighted ensemble simulations shows the transition of the spike’s RBD from the closed state to the open state. Many conformations of the RBD along its opening pathway are represented at the same time using cyan cartoons and a transparency gradient. Glycans appear as dark blue.
NAMD AVX-512 FP operation breakdown.
| FP Instr. | Ops | % total | FP Instr. | Ops | % total |
|---|---|---|---|---|---|
| DblScalar | 4.99e16 | 26.9% | SglScalar | 2.09e15 | 1.1% |
| Dbl128b | 6.86e15 | 3.7% | Sgl128b | 3.61e15 | 1.9% |
| Dbl256b | 1.06e17 | 57.1% | Sgl256b | 1.18e16 | 6.3% |
| Dbl512b | 4.96e15 | 2.7% | Sgl512b | 3.43e14 | 0.2% |
3D-AAE training performance on one V100 GPU.
| Latent Dimensions | Peak TFLOP/s | Sustained TFLOP/s |
|---|---|---|
| 32 | 2.96 | 0.97 |
| 64 | 3.16 | 2.28 |
| 128 | 3.13 | 0.91 |
NAMD simulation floating point ops per timestep.
| NAMD Simulation | Atoms | FLOPS/step |
|---|---|---|
| ACE2-RBD complex | 800k | 21.57 GFLOPS/step |
| Single Spike | 1.7M | 47.96 GFLOPS/step |
| Spike-ACE2 complex | 8.5M | 243.7 GFLOPS/step |
| SARS-CoV-2 virion | 305M | 8.3511 TFLOPS/step |
NAMD performance: 8.5M-atom Spike-ACE2.
| Nodes | Frontera CPU-only | Summit CPU-only | Summit CPU + GPU |
|---|---|---|---|
| 64 | 7.52 ns/day | 6.67 ns/day | 52.15 ns/day |
| 128 | 13.00 ns/day | 12.59 ns/day | 79.68 ns/day |
| 256 | 22.09 ns/day | 24.19 ns/day | 105.54 ns/day |
| 512 | 34.32 ns/day | 41.31 ns/day | 135.31 ns/day |
| 1024 | 41.88 ns/day | 66.31 ns/day | 162.22 ns/day |
Figure 4:NAMD scaling on Summit and Frontera for 8.5M-atom spike-ACE2 complex (upper lines) and 305M-atom virion (lower line). Thin lines indicate linear scaling.
NAMD performance: 305M-atom virion.
| Nodes | Summit CPU + GPU | Speedup | Efficiency |
|---|---|---|---|
| 128 | 4.23 ns/day | ~1.0× | ~100% |
| 256 | 8.02 ns/day | 1.9× | 95% |
| 512 | 15.32 ns/day | 3.6× | 91% |
| 1024 | 25.66 ns/day | 6.1× | 75% |
| 2048 | 44.27 ns/day | 10.5× | 65% |
| 4096 | 68.36 ns/day | 16.2× | 51% |
Peak NAMD FLOP rates, ORNL Summit
| NAMD Simulation | Atoms | Nodes | Sim rate | Performance |
|---|---|---|---|---|
| Spike-ACE2 complex | 8.5M | 1024 | 162 ns/day | 229 TFLOP/s |
| SARS-CoV-2 virion | 305M | 4096 | 68 ns/day | 3.06 PFLOP/s |
Figure 5:Flexibility of the spike bound to the ACE2 receptor. A) Schematic representation of the two-parallel-membrane system of the spike-ACE2 complex. (B‒E) Distributions of the ankle, knee, hip and spike-tilting angles resulting from MD replicas 1–3 (darker color) and 4–6 (lighter color). Starting points for replicas 4–6 have been selected using DeepDriveMD.
| Performance Attribute | Our Submission |
|---|---|
| Category of achievement | Scalability, Time-to-solution |