| Literature DB >> 34816263 |
Abigail Dommer, Lorenzo Casalino, Fiona Kearns, Mia Rosenfeld, Nicholas Wauer, Surl-Hee Ahn, John Russo, Sofia Oliveira, Clare Morris, Anthony Bogetti, Anda Trifan, Alexander Brace, Terra Sztain, Austin Clyde, Heng Ma, Chakra Chennubhotla, Hyungro Lee, Matteo Turilli, Syma Khalid, Teresa Tamayo-Mendoza, Matthew Welborn, Anders Christensen, Daniel G A Smith, Zhuoran Qiao, Sai Krishna Sirumalla, Michael O'Connor, Frederick Manby, Anima Anandkumar, David Hardy, James Phillips, Abraham Stern, Josh Romero, David Clark, Mitchell Dorrell, Tom Maiden, Lei Huang, John McCalpin, Christopher Woods, Alan Gray, Matt Williams, Bryan Barker, Harinda Rajapaksha, Richard Pitts, Tom Gibbs, John Stone, Daniel Zuckerman, Adrian Mulholland, Thomas Miller, Shantenu Jha, Arvind Ramanathan, Lillian Chong, Rommie Amaro.
Abstract
We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus ob-scure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized. ACM REFERENCE FORMAT: Abigail Dommer 1† , Lorenzo Casalino 1† , Fiona Kearns 1† , Mia Rosenfeld 1 , Nicholas Wauer 1 , Surl-Hee Ahn 1 , John Russo, 2 Sofia Oliveira 3 , Clare Morris 1 , AnthonyBogetti 4 , AndaTrifan 5,6 , Alexander Brace 5,7 , TerraSztain 1,8 , Austin Clyde 5,7 , Heng Ma 5 , Chakra Chennubhotla 4 , Hyungro Lee 9 , Matteo Turilli 9 , Syma Khalid 10 , Teresa Tamayo-Mendoza 11 , Matthew Welborn 11 , Anders Christensen 11 , Daniel G. A. Smith 11 , Zhuoran Qiao 12 , Sai Krishna Sirumalla 11 , Michael O'Connor 11 , Frederick Manby 11 , Anima Anandkumar 12,13 , David Hardy 6 , James Phillips 6 , Abraham Stern 13 , Josh Romero 13 , David Clark 13 , Mitchell Dorrell 14 , Tom Maiden 14 , Lei Huang 15 , John McCalpin 15 , Christo- pherWoods 3 , Alan Gray 13 , MattWilliams 3 , Bryan Barker 16 , HarindaRajapaksha 16 , Richard Pitts 16 , Tom Gibbs 13 , John Stone 6 , Daniel Zuckerman 2 *, Adrian Mulholland 3 *, Thomas MillerIII 11,12 *, ShantenuJha 9 *, Arvind Ramanathan 5 *, Lillian Chong 4 *, Rommie Amaro 1 *. 2021. #COVIDisAirborne: AI-Enabled Multiscale Computational Microscopy ofDeltaSARS-CoV-2 in a Respiratory Aerosol. In Supercomputing '21: International Conference for High Perfor-mance Computing, Networking, Storage, and Analysis . ACM, New York, NY, USA, 14 pages. https://doi.org/finalDOI.Entities:
Year: 2021 PMID: 34816263 PMCID: PMC8609898 DOI: 10.1101/2021.11.12.468428
Source DB: PubMed Journal: bioRxiv
Figure 1:Overall schematic depicting the construction and multiscale simulations of Delta SARS-CoV-2 in a respiratory aerosol. (N.B.: The size of divalent cations has been increased for visibility.)
Figure 2:Individual protein components of the SARS-CoV-2 Delta virion. The spike is shown with the surface in cyan and with Delta’s mutated residues and deletion sites highlighted in pink and yellow, respectively. Glycans attached to the spike are shown in blue. The E protein is shown in yellow and the M protein is shown in silver and white. Visualized with VMD.
Figure 5:Delta-variant spike opening from WE simulations, and AI/haMSM analysis. A) The integrated workflow. B) Snapshots of the ‘down’, ‘up’, and ‘open’ states for Delta S-opening from a representative pathway generated by WE simulation, which represents ~105 speedup compared to conventional MD. C) Rate-constant estimation with haMSM analysis of WE data (purple lines) significantly improves direct WE computation (red), by comparison to experimental measurement (black dashed). Varying haMSM estimates result from different featurizations which will be individually cross-validated. D) The first three dimensions of the ANCA-AE embeddings depict a clear separation between the closed (darker purple) and open (yellow) conformations of the Delta spike. A sub-sampled landscape is shown here where each sphere represents a conformation from the WE simulations and colored with the root-mean squared deviations (Å) with respect to the closed state. Visualized with VMD.
Figure 6:WE simulations reveal a dramatic opening of the Delta S (cyan), compared to WT S (white). While further investigation is needed, this super open state seen in the Delta S may indicate increased capacity for binding to human host-cell receptors.
Summary of all systems constructed in this work. See Fig 3 for illustration of aerosol construction.
| [ | [ | [ | [ | [ |
|---|---|---|---|---|
| [ | M | 125 × 125 × 124 | 164,741 | 700 |
| [ | E | 123 × 125 × 102 | 136,775 | 41 |
|
| ||||
| [ | S | 206 × 200 × 410 | 1,692,444 | 330 |
| [ | S | 204 × 202 × 400 | 1,658,224 | 330 |
| [ | SH | 172 × 184 × 206 | 615,593 | 73μs |
|
| ||||
| [ | mi | 123 × 104 × 72 | 87,076 | 25 |
| [ | m2 | 120 × 101 × 72 | 82,155 | 25 |
| [ | m3 | 810 × 104 × 115 | 931,778 | 23 |
| [ | m4 | 904 × 106 × 109 | 997,029 | 15 |
| [ | m5 | 860 × 111 × 113 | 1,040,215 | 18 |
| [ | SMA | 227 × 229 × 433 | 2,156,689 | 840 |
| [ | V | 1460 × 1460 × 1460 | 305,326,834 | 41 |
| [ | RAV | 2834 × 2820 × 2828 | 1,016,813,441 | 2.42 |
| TOTAL FLOPS | 2.4 ZFLOPS |
M, E, S, SH, and V models represent SARS-CoV-2 Delta strain.
Abbreviations used throughout document.
Periodic boundary dimensions.
Total number of atoms.
Total aggregate simulation time, including heating and equilibration runs.
Simulated with NAMD.
Simulated with NAMD, AMBER, and GROMACS.
Figure 3:Image of RAV with relative mass ratios of RA molecular components represented in the colorbar. Water content is dependent on the relative humidity of the environment and is thus omitted from the molecular ratios.
Figure 4:SMA system captured with multiscale modeling from classical MD to AI-enabled quantum mechanics. For all panels: S protein shown in cyan, S glycans in blue, m1/m2 shown in red, ALB in orange, Ca2+ in yellow spheres, viral membrane in purple. A) Interactions between mucins and S facilitated by glycans and Ca2+. B) Snapshot from SMA simulations. C) Example Ca2+ binding site from SMA simulations (1800 sites, each 1000+ atoms) used for AI-enabled quantum mechanical estimates from OrbNet Sky. D) Quantification of contacts between S and mucin from SMA simulations. E) OrbNet Sky energies vs CHARMM36m energies for each sub-selected system, colored by total number of atoms. Performance of OrbNet Sky vs. DFT in subplot (ωB97x-D3/def-TZVP, R2=0.99, for 17 systems of peptides chelating Ca2+ (Hu et al., 2021)). Visualized with VMD.
Figure 7:D-NEMD simulations reveal changes in key functional regions of the S protein, including the receptor binding domain, as the result of a pH decrease. Color scale and ribbon thickness indicate the degree of deviation of Cα atoms from their equilibrium position. Red spheres indicate the location of positively charged histidines.
NAMD performance: Respiratory Aerosol + Virion, 1B atoms, 4 fs timestep w/HMR, and PME every 3 steps.
| Nodes | Summit | Speedup | Efficiency |
|---|---|---|---|
| 256 | 4.18ns/day | ~1.0× | ~100% |
| 512 | 7.68 ns/day | 1.84× | 92% |
| 1024 | 13.64 ns/day | 3.27× | 81% |
| 2048 | 23.10ns/day | 5.53× | 69% |
| 4096 | 34.21 ns/day | 8.19× | 51% |
Peak NAMD FLOP rates, ORNL Summit
| NAMD Simulation | Atoms | Nodes | Sim rate | Performance |
|---|---|---|---|---|
| Resp. Aero.+Vir. | 1B | 4096 | 34.21 ns/day | 2.55 PFLOPS |
Figure 8:GROMACS performance across 1–8 A100 GPUs in ns/day (thicker, blue lines) and the fraction of maximum theoretical TFLOPS (thinner, green lines); production setup shown with solid line, and runs with the GPU-accelerated thermostat in dashed.
| Performance Attribute | Our Submission |
|---|---|
| Category of achievement | Scalability, Time-to-solution |
| Type of method used | Explicit, Deep Learning |
| Results reported on the basis of | Whole application including I/O |
| Precision reported | Mixed Precision |
| System scale | Measured on full system |
| Measurement mechanism | Hardware performance counters, Application timers, Performance Modeling |
MD simulation floating point ops per timestep.
| MD Simulation | Code | Atoms | [ |
|---|---|---|---|
| Spike, head |
| 0.6M | 62.14 |
| Spike | NAMD | 1.7M | 43.05 |
| S+m1/m2+ALB | NAMD | 2.1M | 54.86 |
| Resp. Aero.+Vir. | NAMD | 1B | 25.81 |
FLOPs/step data were computed by direct FLOP measurements from hardware performance counters for NAMD simulations, or by using the application-reported FLOP rates and ns/day simulation performance in the case of GROMACS.