| Literature DB >> 34370471 |
Johann V Pototschnig1, Anastasios Papadopoulos1, Dmitry I Lyakh2, Michal Repisky3, Loïc Halbert4, André Severo Pereira Gomes4, Hans Jørgen Aa Jensen5, Lucas Visscher1.
Abstract
In this paper, we report reimplementation of the core algorithms of relativistic coupled cluster theory aimed at modern heterogeneous high-performance computational infrastructures. The code is designed for parallel execution on many compute nodes with optional GPU coprocessing, accomplished via the new ExaTENSOR back end. The resulting ExaCorr module is primarily intended for calculations of molecules with one or more heavy elements, as relativistic effects on the electronic structure are included from the outset. In the current work, we thereby focus on exact two-component methods and demonstrate the accuracy and performance of the software. The module can be used as a stand-alone program requiring a set of molecular orbital coefficients as the starting point, but it is also interfaced to the DIRAC program that can be used to generate these. We therefore also briefly discuss an improvement of the parallel computing aspects of the relativistic self-consistent field algorithm of the DIRAC program.Entities:
Year: 2021 PMID: 34370471 PMCID: PMC8444343 DOI: 10.1021/acs.jctc.1c00260
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.578
Figure 1ExaCorr computational workflow based on the ExaTENSOR library.
Figure 2Workflow of ExaCorr computations; details can be found in the text.
Experimental Reference Values and Structures Used in the Computations for Lanthanide Monofluorides and DFT Bond Distances Applied in the Computations
| LaF | 2.0234[ | 2.0293 | 2.0150 | 2.0215 |
| EuF | 2.083[ | 2.0676 | 1.9992 | 2.0750 |
| YbF | 2.016516[ | 1.9868 | 1.9345 | 2.0204 |
Ionization Potential in eV of YbF for Different Numbers of Correlated Spinors Employing the dyall.v2z Basis Seta
| thresholdlow | thresholdhigh | nocc | nvir | % occ | % vir | CCSD |
|---|---|---|---|---|---|---|
| –20 | 2.3 | 49 | 89 | 63 | 21 | 4.49 |
| –20 | 6 | 49 | 137 | 63 | 32 | 5.89 |
| –20 | 150 | 49 | 267 | 63 | 63 | 5.89 |
| –20 | 10 000 | 49 | 367 | 63 | 86 | 5.89 |
| –60 | 10 | 61 | 155 | 78 | 36 | 5.90 |
| –60 | 20 | 61 | 195 | 78 | 50 | 5.90 |
| –60 | 150 | 61 | 267 | 78 | 63 | 5.90 |
| –3 | 40 | 31 | 213 | 40 | 50 | 5.95 |
| –20 | 40 | 49 | 213 | 63 | 50 | 5.90 |
| –40 | 40 | 51 | 213 | 65 | 50 | 5.90 |
| –60 | 40 | 61 | 213 | 78 | 50 | 5.90 |
| –400 | 40 | 77 | 213 | 99 | 50 | 5.90 |
| exp | 5.91 ± 0.05[ |
The number of occupied and virtual spinors refers to the neutral molecule; for the cation, one of these occupied spinors becomes a virtual spinor. The ΔSCF ionization potential computed using the reference determinant wave functions was 5.48 eV. The spinor thresholds are listed in atomic units.
Vertical (a) and Adiabatic (b) Ionization Energies (in eV) for the Lanthanide Monofluoridesa
| ΔSCF | CCSD | CCSD+T | CCSD(T) | CCSD-T | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| basis | (a) | (b) | (a) | (b) | T1 | (a) | (b) | (a) | (b) | (a) | (b) | ||
| LaF | NR | 2z | 3.41 | 3.44 | 5.03 | 5.06 | 0.01 | 5.15 | 5.18 | 5.11 | 5.14 | 5.10 | 5.13 |
| 3z | 3.43 | 3.44 | 4.71 | 4.71 | 0.02 | 4.90 | 4.91 | 4.84 | 4.84 | 4.82 | 4.74 | ||
| 4z | 3.43 | 3.43 | 4.61 | 4.60 | 0.02 | 4.81 | 4.81 | 4.74 | 4.74 | 4.72 | 4.72 | ||
| ∞z | 3.42 | 3.43 | 4.53 | 4.53 | 4.74 | 4.74 | 4.66 | 4.66 | 4.65 | 4.64 | |||
| X2C | 2z | 4.93 | 4.96 | 5.91 | 5.93 | 0.01 | 5.96 | 5.99 | 5.96 | 5.98 | 5.96 | 5.98 | |
| 3z | 4.87 | 4.87 | 5.87 | 5.87 | 0.01 | 5.98 | 5.98 | 5.96 | 5.96 | 5.96 | 5.95 | ||
| 4z | 4.86 | 4.86 | 5.87 | 5.86 | 0.01 | 5.99 | 5.99 | 5.97 | 5.97 | 5.97 | 5.96 | ||
| ∞z | 4.85 | 4.84 | 5.87 | 5.86 | 6.00 | 5.99 | 5.98 | 5.97 | 5.97 | 5.96 | |||
| exp. | 6.3 ± 0.3[ | ||||||||||||
| EuF | NR | 2z | 4.73 | 4.75 | 5.08 | 5.12 | 0.01 | 5.10 | 5.15 | 5.10 | 5.15 | 5.11 | 5.15 |
| 3z | 4.72 | 4.74 | 5.10 | 5.13 | 0.01 | 5.13 | 5.17 | 5.13 | 5.17 | 5.13 | 5.17 | ||
| X2C | 2z | 5.04 | 5.07 | 5.46 | 5.51 | 0.01 | 5.51 | 5.57 | 5.51 | 5.56 | 5.51 | 5.56 | |
| 3z | 5.02 | 5.05 | 5.48 | 5.52 | 0.01 | 5.54 | 5.58 | 5.53 | 5.57 | 5.53 | 5.57 | ||
| exp. | 5.9 ± 0.3[ | ||||||||||||
| YbF | NR | 2z | 5.04 | 5.09 | 5.39 | 5.39 | 0.01 | 5.43 | 5.43 | 5.42 | 5.43 | 5.42 | 5.43 |
| 3z | 4.93 | 4.98 | 5.40 | 5.43 | 0.01 | 5.47 | 5.49 | 5.45 | 5.47 | 5.46 | 5.48 | ||
| 4z | 4.93 | 4.98 | 5.42 | 5.44 | 0.01 | 5.59 | 5.60 | 5.56 | 5.57 | 5.57 | 5.58 | ||
| ∞z | 4.93 | 4.98 | 5.44 | 5.45 | 5.67 | 5.68 | 5.64 | 5.65 | 5.65 | 5.66 | |||
| X2C | 2z | 5.48 | 5.49 | 5.90 | 5.87 | 0.05 | 6.51 | 6.44 | 5.57 | 5.56 | 5.67 | 5.65 | |
| 3z | 5.44 | 5.46 | 6.00 | 5.98 | 0.09 | 7.76 | 7.73 | 5.30 | 5.29 | 5.76 | 5.75 | ||
| 4z | 5.44 | 5.46 | 6.00 | 5.97 | 0.05 | 6.82 | 6.78 | 5.57 | 5.56 | 5.75 | 5.74 | ||
| ∞z | 5.44 | 5.46 | 6.00 | 5.97 | 6.13 | 6.08 | 5.77 | 5.75 | 5.75 | 5.73 | |||
| exp. | 5.91 ± 0.05[ | ||||||||||||
The active ranges for LaF, EuF, and YbF are −20 to 40, −200 to 200, and −60 to 40 hartree, respecitvely. The X2C Hamiltonians used were X2C-AMFI for LaF and YbF (spinors obtained with the DIRAC code) and X2C-1e for EuF (spinors obtained with the ReSpect code). The complete basis set limit values (∞z) have been obtained with a two-point extrapolation formula based on the 3z and 4z values.
Total Binding Energy (ΔE, in eV) of AuAr Systems with dyall Basis Sets of Different Cardinal Numbersa
| Δ | ||||||||
|---|---|---|---|---|---|---|---|---|
| system | basis | V | HF | MP2 | CCSD | CCSD + T | CCSD(T) | CCSD – T |
| AuAr+ | 2z | 136 | –0.1341 | –0.5260 | –0.4620 | –0.5267 | –0.5165 | –0.5156 |
| 3z | 230 | –0.1401 | –0.5993 | –0.4699 | –0.5519 | –0.5408 | –0.5400 | |
| 4z | 400 | –0.1456 | –0.6408 | –0.4817 | –0.5689 | –0.5586 | –0.5581 | |
| ∞z | –0.1496 | –0.6711 | –0.4904 | –0.5814 | –0.5716 | –0.5713 | ||
| AuAr2+ | 2z | 168 | 0.0845 | –1.2292 | –1.0356 | –1.1981 | –1.1723 | –1.1706 |
| 3z | 288 | –0.0219 | –1.4124 | –1.0890 | –1.2813 | –1.2563 | –1.2550 | |
| 4z | 508 | –0.0399 | –1.4938 | –1.1161 | ||||
| ∞z | –0.0531 | –1.5532 | –1.1360 | |||||
| AuAr3+ | 2z | 200 | –0.0140 | –1.3416 | –1.1491 | –1.3100 | –1.2875 | –1.2860 |
| 3z | 346 | –0.0816 | –1.5202 | –1.1899 | –1.3902 | –1.3669 | –1.3652 | |
| 4z | 616 | –0.1019 | –1.6124 | –1.2268 | ||||
| ∞z | –0.1167 | –1.6797 | –1.2538 | |||||
| AuAr4+(3D) | 2z | 232 | –0.1061 | –1.4673 | –1.2681 | –1.4267 | –1.4090 | –1.4078 |
| 3z | 404 | –0.1367 | –1.6474 | –1.3009 | –1.5092 | –1.4890 | –1.4872 | |
| 4z | 724 | –0.1590 | –1.7527 | –1.3491 | ||||
| ∞z | –0.1753 | –1.8295 | –1.3843 | |||||
| AuAr4+(2D) | 2z | 232 | –0.1605 | –1.4326 | –1.2530 | –1.4024 | –1.3863 | –1.3852 |
| 3z | 404 | –0.1783 | –1.5967 | –1.2769 | –1.4750 | –1.4563 | –1.4545 | |
| 4z | 724 | –0.2001 | –1.6978 | –1.3243 | ||||
| ∞z | –0.2160 | –1.7715 | –1.3588 | |||||
| AuAr5+ | 2z | 264 | –0.2217 | –1.5945 | –1.3898 | –1.5398 | –1.5301 | –1.5750 |
| 3z | 462 | –0.2059 | –1.7730 | –1.4126 | –1.6238 | –1.6100 | –1.6081 | |
In all cases, spinors with energies between −10 and 20 hartree have been included in the correlation treatment. The number of virtual spinors in each case (V) is shown; see Section for the number of correlated electrons for each species. The complete basis set limit values (∞z) have been obtained with a two-point extrapolation formula based on the 3z and 4z values.
Wall Time in Seconds for the New Implementation and the RELCCSD Release 2019 Reference Implementation, with Symmetry (R19s) and without Using Symmetry (R19)a
| step | system | nodes | R19 | R19s | ExaCorr |
|---|---|---|---|---|---|
| integrals | AuAr+ | 4 | 371 | 71 | 106 |
| AuAr+ | 8 | 308 | 70 | 58 | |
| AuAr2+ | 8 | 715 | 87 | 77 | |
| AuAr3+ | 8 | 1602 | 810 | 133 | |
| AuAr3+ | 20 | 1450 | 703 | 87 | |
| CCSD | AuAr+ | 4 | 327 | 7 | 84 |
| AuAr+ | 8 | 232 | 6 | 71 | |
| AuAr2+ | 8 | 1129 | 7 | 73 | |
| AuAr3+ | 8 | 2512 | 849 | 183 | |
| AuAr3+ | 20 | 1585 | 586 | 143 | |
| triples | AuAr+ | 4 | 2055 | 42 | 1555 |
| AuAr+ | 8 | 3130 | 46 | 1294 | |
| AuAr2+ | 8 | >5200 | 90 | 2666 | |
| AuAr3+ | 20 | >5200 | >5200 | 5821 |
All computations were performed using the dyall.v2z basis set. There are 243/42 atomic orbitals, 32/14 occupied and 104/32 virtual spinors per gold/argon atom.
Time in Seconds for Integral Transformation (tI) and Solving the Coupled Cluster (tCCD, tCCSD) and Λ Equations (tΛ) for UF6 Using the dyall.v2z Basis Seta
| 16 | 841 | 1179 | 843 | 1314 | 2046 | |
| 24 | 654 | 902 | 711 | 1045 | 1869 | 1865 |
| 32 | 505 | 764 | 617 | 925 | 1645 | 1649 |
| 48 | 413 | 686 | 594 | 861 | 1512 | 1493 |
| 64 | 375 | 634 | 600 | 810 | 1545 | 1412 |
The CCD, CCSD, and Λ equations, took resp. 10, 20, and 21 iterations to solve. For the selected thresholds of −35 to 80 hartree, 110 occupied and 474 virtual spinors are included. n is the number of Summit nodes. The number in parentheses in the header is the segment size used by ExaTENSOR for chunking the occupied and virtual vector spaces.
Time in Seconds for Integral Transformation (tI) and Coupled Cluster (tCCSD) for UF6 Applying the dyall.v3z Basis Seta
| 32 | 1685 | 1279 |
| 64 | 1191 | 1205 |
| 96 | 817 | 1081 |
| 128 | 687 | 973 |
The CCSD equations took 21 iterations to solve. For thresholds of −10 to 8 hartree,l 70 occupied and 554 virtual spinors are active. n is the number of Summit nodes. The number in parentheses in the header is the segment size used by ExaTENSOR for chunking the occupied and virtual vector spaces.
Figure 3Orientation of the UF6 dimers.
UF6 Dimer Bonding Energies (dE, eV) for a System with 140 Occupied and 1108 Virtual Spinors for Different Computational Methodsa
| d | ||||||||
|---|---|---|---|---|---|---|---|---|
| sym | U–U | DFT | HF | MP2 | CCSD | |||
| D3d | 5.144 | –0.136 | 0.037 | –0.154 | –0.131 | 385 | 9244 | 687 |
| D2d | 5.139 | –0.160 | –0.003 | –0.189 | –0.178 | 513 | 8641 | 758 |
| C2h | 5.290 | –0.150 | –0.001 | –0.163 | –0.151 | 1025 | 8356 | 665 |
The U–F distances have been fixed at 1.996 Å;[66] the U–U distances are optimized with DFT and are listed in the first column in Å. n is the number of Summit nodes.
Strong Scaling Benchmark: Times in Seconds of Three Representative Tensor Contractions with 560 AO, 560 Virtual, and 280 Occupied Spinorsa
| no. of nodes | contraction 1 | contraction 2 | contraction 3 |
|---|---|---|---|
| 32 | 35.70 | 11.82 | 96.68 |
| 64 | 23.45 | 8.25 | 56.99 |
| 128 | 22.75 | 4.32 | 36.08 |
Execution configuration: 2 MPI processes per node and 3 GPU per MPI process. The number of nodes refers only to Summit nodes running worker processes.
Strong Scaling Benchmark: Times in Seconds of Three Representative Tensor Contractions with 840 AO, 840 Virtual, and 280 Occupied Spinorsa
| no. of nodes | contraction 1 | contraction 2 | contraction 3 |
|---|---|---|---|
| 128 | 87.66 | 18.11 | 177.13 |
| 256 | 71.88 | 10.32 | 125.52 |
| 512 | 80.64 | 7.70 | 89.11 |
Execution configuration: 2 MPI processes per node and 3 GPU per MPI process. The number of nodes refers only to Summit nodes running worker processes.
Figure 4Structure of the uranyl complex derived from the X-ray structure for the RbUO2(NO3)3 crystyal.[67]
Comparison between Electronic Structure Methods and Experimental Results for the q Component of the EFG Tensor for the Uranyl Tris-nitrate Complex ([UO2(NO3)3]−) for Basis Sets of Different Cardinal Numbersb
| method | 2z | 3z | 4z |
|---|---|---|---|
| HF | 14.67 | 14.85 | 14.97 |
| B3LYP | 6.64 | 6.79 | 6.89 |
| CAMB3LYP | 7.24 | 7.40 | 7.50 |
| PBE0 | 7.58 | 7.75 | 7.85 |
| CCSD | 8.54 | ||
| ref ( | 4.11 | ||
| ref ( | 15.17 | ||
| ref ( | 6.81 | ||
| ref ( | 8.33 | ||
| exp[ | 8.38 ± 0.13 | ||
The coupled cluster calculation employs 202 occupied and 896 virtual spinors. All values in are atomic units. The X2C-AMFI Hamiltonian is employed in all calculations.
Calculations on [UO2(HCO3)3]−.
Comparison of Times to Solution (TTS, in Hours, Equal to the Total Wall Time for Each Calculation and the Part Spent in the T and Λ Equations), T1 Diagnostic, and q Component of the EFG Tensor (in Atomic Units) for CCSD Expectation Value Calculations for the Uranyl Tris-nitrate Complex ([UO2(NO3)3]−) with the X2C-AMFI Hamiltoniana
| TTS | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| dyall.v2z | |||||||||||
| 1 | 106 | 0.53 | 534 | 0.42 | 160 | 9 | 1.00 | 3h34 | 2h55 | 0.0126 | 10.02 |
| 2 | 156 | 0.77 | 534 | 0.42 | 160 | 20 | 2.17 | 5h33 | 5h06 | 0.0103 | 9.70 |
| 3 | 156 | 0.77 | 818 | 0.64 | 360 | 109 | 5.30 | 11h21 | 10h43 | 0.0102 | 8.48 |
| 4 | 202 | 1.00 | 534 | 0.42 | 400 | 33 | 1.45 | 8h02 | 7h29 | 0.0091 | 9.73 |
| 5 | 202 | 1.00 | 680 | 0.53 | 2050 | 87 | 0.75 | 12h28 | 11h58 | 0.0090 | 8.89 |
| 6 | 202 | 1.00 | 818 | 0.64 | 2050 | 183 | 1.56 | 18h23 | 17h49 | 0.0089 | 8.59 |
| 7 | 202 | 1.00 | 896 | 0.68 | 2050 | 263 | 2.25 | 23h04 | 22h21 | 0.0089 | 8.54 |
| dyall.v3z | |||||||||||
| 8 | 106 | 0.53 | 694 | 0.33 | 480 | 26 | 0.95 | 6h | 4h08 | 0.0139 | 10.29 |
| 9 | 156 | 0.72 | 694 | 0.33 | 480 | 57 | 2.06 | 10h24 | 8h02 | 0.0114 | 10.04 |
| 10 | 156 | 0.72 | 944 | 0.46 | 2050 | 193 | 1.65 | 17h48 | 15h44 | 0.0104 | 9.99 |
O, number of occupied spinors correlated; V, number of virtual spinors correlated; M, number of ranks in the parallel calculation; C, cost estimate for the calculation (scaled by 1.0 × 1014); NormC, cost divided by the number of ranks and normalized with respect to the value for the smallest calculation. See the text for details.