| Literature DB >> 35277513 |
Lina Zhang1, Shuang Zhang1, Alec Owens2, Sergei N Yurchenko3, Pavlo O Dral4.
Abstract
High-level ab initio quantum chemical (QC) molecular potential energy surfaces (PESs) are crucial for accurately simulating molecular rotation-vibration spectra. Machine learning (ML) can help alleviate the cost of constructing such PESs, but requires access to the original ab initio PES data, namely potential energies computed on high-density grids of nuclear geometries. In this work, we present a new structured PES database called VIB5, which contains high-quality ab initio data on 5 small polyatomic molecules of astrophysical significance (CH3Cl, CH4, SiH4, CH3F, and NaOH). The VIB5 database is based on previously used PESs, which, however, are either publicly unavailable or lacking key information to make them suitable for ML applications. The VIB5 database provides tens of thousands of grid points for each molecule with theoretical best estimates of potential energies along with their constituent energy correction terms and a data-extraction script. In addition, new complementary QC calculations of energies and energy gradients have been performed to provide a consistent database, which, e.g., can be used for gradient-based ML methods.Entities:
Year: 2022 PMID: 35277513 PMCID: PMC8917215 DOI: 10.1038/s41597-022-01185-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
The number of grid points (grid size) for each molecule with references to original studies generating these grid points, theoretical best estimates (TBE), and TBE constituent terms.
| Molecule | Grid size | Reference |
|---|---|---|
| CH3Cl | 44819 | [ |
| CH4 | 97217 | [ |
| SiH4 | 84002 | [ |
| CH3F | 82653 | [ |
| NaOH | 15901 | [ |
| Total: 5 molecules | 324592 |
The number of grid points is slightly smaller than that reported in the original publications as we found very few duplicates in the original data set. See section Technical Validation.
Fig. 1Definition of internal coordinates in each molecule. Internal coordinates of (a) CH3Cl; r0 is C–Cl bond length, r and β are C–H bond lengths and ∠(HCCl) angles (I = 1, 2, 3), τ are HCClH dihedral angles (jk = 12, 13); only r0, r3, β1 and τ12 are shown; (b) CH4; r and α are C–H bond lengths and ∠(HCH) angles (i = 1, 2, 3, 4; jk = 12, 13, 14, 23, 24); only r4 and α14 are shown; (c) SiH4; r and α are Si–H bond lengths and ∠(HSiH) angles (i = 1, 2, 3, 4; jk = 12, 13, 14, 23, 24); only r4 and α14 are shown; (d) CH3F; r0 is C–F bond length, r and β are C–H bond lengths and ∠(HCF) angles (i = 1, 2, 3), τ are HCFH dihedral angles (jk = 12, 13); only r0, r3, β1 and τ12 are shown; (e) NaOH; rNaO and rOH are Na–O and O–H bond lengths, θNaOH is ∠(NaOH) bond angle.
The comparative table of the computational details behind the calculations of the constituent terms of theoretical best estimates for five molecules of the VIB5 database.
| Molecule | ∆ | ∆ | ∆ | ∆ | |
|---|---|---|---|---|---|
| CH3Cl | Software: MOLPRO2012 | The basis set: cc-pCVQZ-F12; Slater geminal exponent value | Levels of theory: CCSD(T), CCSDT, and CCSDT(Q); Basis sets for the full triples and the perturbative quadruples calculations are aug-cc-pVTZ(+d for Cl) and aug-cc-pVDZ(+d for Cl), respectively. | Method: one-electron mass velocity and Darwin (MVD1) terms from the Breit–Pauli Hamiltonian in first-order perturbation theory; All electrons correlated (except for the 1 | The 1 |
| CH4 | Software: MOLPRO2012 | The basis set: cc-pCVTZ-F12; Slater geminal exponent value | Levels of theory: CCSD(T), CCSDT, and CCSDT(Q); Basis sets for the full triples and the perturbative quadruples calculations are cc-pVTZ and cc-pVDZ, respectively. | Method: the second-order Douglas–Kroll–Hess approach; frozen core approximation; CCSD(T) | All electrons are correlated; basis set: aug-cc-pCVDZ |
| SiH4 | Software: MOLPRO2012 | The basis set: cc-pCVTZ-F12; Slater geminal exponent value | Levels of theory: CCSD(T), CCSDT, and CCSDT(Q); basis sets for the full triples and the perturbative quadruples calculations are cc-pVTZ(+d for Si) and cc-pVDZ(+d for Si), respectively. | Method: the second-order Douglas-Kroll-Hess approach; frozen core approximation; CCSD(T) | The correction was not included. |
| CH3F | Software: MOLPRO2012 | The basis set: cc-pCVTZ-F12; Slater geminal exponent value | Levels of theory: CCSD(T), CCSDT, and CCSDT(Q); basis sets for the full triples and the perturbative quadruples calculations are cc-pVTZ and cc-pVDZ, respectively. | Method: the second-order Douglas–Kroll–Hess approach; frozen core approximation; CCSD(T)/ cc-pVQZ-DK. Software: MOLPRO2012 | All electrons are correlated; basis set: aug-cc-pCVDZ |
| NaOH | Software: MOLPRO2015 | The basis set: cc-pCVTZ-F12; Slater geminal exponent value | Levels of theory: CCSD(T) and CCSDT; basis set: cc-pVTZ(+d for Na). | Method: the second-order Douglas–Kroll–Hess approach; frozen core approximation; CCSD(T)/cc-pVQZ-DK. Software: MOLPRO2015 | The 1 |
This table mainly emphasizes differences for each molecule, rather than giving the full description of computational details.
Fig. 2Typical CFOUR input options for (a) MP2/cc-pVTZ, (b) CCSD(T)/cc-pVQZ for CH3Cl, CH4, CH3F, NaOH and (c) CCSD(T)/cc-pVQZ for SiH4. The blue options were used for most cases and the light grey options are examples of options used to improve SCF convergence only for some geometries.
Layout of the VIB5.json file containing the VIB5 database.
| No. | Key | Description | Units |
|---|---|---|---|
| 1 | XYZ | Nuclear positions in Cartesian coordinates | Å |
| 2 | INT | Nuclear positions in internal coordinates | Å; degree |
| 3 | HF-TZ | Total energy at HF/cc-pVTZ | Hartree |
| 4 | HF-QZ | Total energy at HF/cc-pVQZ | Hartree |
| 5 | MP2 | Total energy at MP2/cc-pVTZ | Hartree |
| 6 | CCSD-T | Total energy at CCSD(T)/cc-pVQZ | Hartree |
| 7 | TBE | Theoretical best estimate of ab initio deformation energies | cm−1 |
| 8 | MP2_grad_xyz | Energy gradient in Cartesian coordinates at MP2/cc-pVTZ | Hartree/Å |
| 9 | MP2_grad_int | Energy gradient in internal coordinates at MP2/cc-pVTZ | Hartree/Å; Hartree/degree |
| 10 | CCSD-T_grad_xyz | Energy gradient in Cartesian coordinates at CCSD(T)/cc-pVQZ | Hartree/Å |
| 11 | CCSD-T_grad_int | Energy gradient in internal coordinates at CCSD(T)/cc-pVQZ | Hartree/Å; Hartree/degree |
| 12 | CBS | Deformation energies at CCSD(T)-F12b/CBS | cm−1 |
| 13 | VTZ | Deformation energies at CCSD(T)-F12b/cc-pVTZ-F12 (only for CH3Cl molecule) | cm−1 |
| 14 | VQZ | Deformation energies at CCSD(T)-F12b/cc-pVQZ-F12 (only for CH3Cl molecule) | cm−1 |
| 15 | CV | Deformation energy corrections to account for core-valence electron correlation | cm−1 |
| 16 | HO | Deformation higher-order coupled cluster terms beyond perturbative triples | cm−1 |
| 17 | SR | Deformation scalar relativistic (SR) effects | cm−1 |
| 18 | DBOC | Deformation diagonal Born–Oppenheimer corrections (only for CH3Cl, CH4, CH3F, and NaOH molecules) | cm−1 |
| 19 | MP2-aQZ | Deformation energies at MP2/aug-cc-pVQZ (only for CH3Cl molecule) | cm−1 |
Fig. 3Definition of internal coordinates for each molecule used in the database file VIB5.json and in the complimentary calculations. Internal coordinates of (a) CH3Cl; R0 is C–Cl bond length, R and A are C–H bond lengths and ∠(HCCl) angles (i = 1, 2, 3), D are HCClH dihedral angles (jk = 12, 13); only R0, R1, A1, A2, A3, D12, and D13 are shown; (b) CH4; R and A1 are C–H bond lengths and ∠(H2CH) angles (i = 1, 2, 3, 4; j = 2, 3, 4), D are HCH2H3 dihedral angles (k = 3, 4); only R1, A12, A13, A14, D32, and D42 are shown; (c) SiH4; R and A1 are Si–H bond lengths and ∠(H2SiH) angles (i = 1, 2, 3, 4; j = 2, 3, 4), D are HSiH2H3 dihedral angles (k = 3, 4); only R1, A12, A13, A14, D32, and D42 are shown; (d) CH3F; R0 is C–F bond length, R and A are C–H bond lengths and ∠(HCF) angles (i = 1, 2, 3), D are HCFH dihedral angles (jk = 12, 13); only R0, R1, A1, A2, A3, D12, and D13 are shown; (e) NaOH; R1 and R2 are Na–O and H–O bond lengths, RX is O–X bond length, AX1 and AX2 are ∠(XONa) and ∠(XOH) angles, and D is NaXOH dihedral angle. X is a dummy atom.
| Measurement(s) | potential energy surfaces |
| Technology Type(s) | quantum chemistry computational methods |