| Literature DB >> 32271575 |
Henning Henschel1, Alfred T Andersson1, Willem Jespers1, Mohammad Mehdi Ghahremanpour1,2, David van der Spoel1.
Abstract
Infrared spectroscopy can provide significant insight into the structures and dynamics of molecules of all sizes. The information that is contained in the spectrum is, however, often not easily extracted without the aid of theoretical calculations or simulations. We present here the calculation of the infrared spectra of a database of 703 gas phase compounds with four different force fields (CGenFF, GAFF-BCC, GAFF-ESP, and OPLS) using normal-mode analysis. Modern force fields increasingly use virtual sites to describe, e.g., lone-pair electrons or the σ-holes on halogen atoms. This requires some adaptation of code to perform normal-mode analysis of such compounds, the implementation of which into the GROMACS software is briefly described as well. For the quantitative comparison of the obtained spectra with experimental reference data, we discuss the application of two different statistical correlation coefficients, Pearson and Spearman. The advantages and drawbacks of the different methods of comparison are discussed, and we find that both methods of comparison give the same overall picture, showing that present force field methods cannot match the performance of quantum chemical methods for the calculation of infrared spectra.Entities:
Year: 2020 PMID: 32271575 PMCID: PMC7304875 DOI: 10.1021/acs.jctc.0c00126
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.006
Correlation between Computational and Experimental Spectra for Four Force Field and Two Frequency-Scaled QM Methods by Compound Classa
| B3LYP/6-31G(2df,p) | B3LYP/aug-cc-pVTZ | CGenFF | GAFF-BCC | GAFF-ESP | OPLS | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| class | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ | |
| alcohol | 79 | 0.723 | 0.803 | 0.696 | 0.809 | 0.243 | 0.725 | 0.164 | 0.612 | 0.207 | 0.590 | 0.193 | 0.690 |
| aldehyde | 14 | 0.663 | 0.682 | 0.679 | 0.275 | 0.258 | 0.169 | 0.677 | |||||
| alkane | 85 | 0.643 | 0.752 | 0.692 | 0.757 | 0.519 | 0.732 | 0.720 | 0.665 | 0.731 | |||
| alkene | 179 | 0.766 | 0.825 | 0.789 | 0.837 | 0.465 | 0.459 | 0.748 | 0.406 | 0.480 | |||
| alkyl bromide | 20 | 0.575 | 0.784 | 0.630 | 0.788 | 0.473 | 0.734 | 0.405 | 0.417 | 0.681 | 0.424 | 0.733 | |
| alkyl chloride | 42 | 0.515 | 0.305 | 0.666 | 0.246 | 0.645 | 0.217 | 0.604 | 0.282 | 0.654 | |||
| alkylfluoride | 9 | 0.696 | 0.757 | 0.520 | 0.766 | 0.241 | 0.644 | 0.256 | 0.223 | 0.149 | 0.630 | ||
| alkyne | 28 | 0.646 | 0.778 | 0.735 | 0.795 | 0.713 | 0.330 | 0.628 | 0.269 | 0.577 | 0.422 | ||
| amide | 12 | 0.729 | 0.333 | 0.192 | 0.667 | 0.206 | 0.688 | 0.341 | 0.723 | ||||
| amine | 45 | 0.593 | 0.815 | 0.662 | 0.810 | 0.223 | 0.531 | 0.543 | 0.145 | 0.576 | |||
| aromatic | 184 | 0.677 | 0.765 | 0.667 | 0.769 | 0.332 | 0.681 | 0.187 | 0.635 | 0.188 | 0.634 | 0.219 | 0.650 |
| aryl chloride | 23 | 0.744 | 0.578 | 0.751 | 0.296 | 0.649 | 0.096 | 0.590 | 0.110 | 0.589 | 0.160 | 0.626 | |
| arylfluoride | 15 | 0.651 | 0.744 | 0.742 | 0.243 | ||||||||
| cycloalkane | 41 | 0.514 | 0.701 | 0.662 | 0.639 | 0.437 | 0.676 | ||||||
| cycloalkene | 11 | 0.728 | 0.779 | 0.755 | 0.418 | 0.669 | 0.439 | 0.670 | 0.382 | 0.650 | 0.376 | 0.650 | |
| halogenated compound | 106 | 0.576 | 0.746 | 0.557 | 0.750 | 0.308 | 0.658 | 0.216 | 0.621 | 0.202 | 0.597 | 0.240 | 0.640 |
| heterocyclic | 65 | 0.770 | 0.757 | 0.743 | 0.761 | 0.345 | 0.673 | 0.247 | 0.636 | 0.225 | 0.633 | 0.281 | 0.658 |
| ketone | 34 | 0.685 | 0.807 | 0.613 | 0.812 | 0.400 | 0.743 | 0.254 | 0.712 | 0.238 | 0.718 | 0.339 | 0.720 |
| nitro | 11 | 0.637 | 0.734 | 0.585 | 0.730 | 0.110 | 0.654 | 0.151 | 0.633 | 0.205 | 0.707 | ||
| phenol | 18 | 0.621 | 0.761 | 0.567 | 0.763 | 0.247 | 0.638 | 0.214 | 0.607 | 0.189 | 0.592 | 0.577 | |
| thiol | 16 | 0.740 | 0.792 | 0.804 | 0.743 | 0.195 | 0.698 | 0.385 | 0.714 | ||||
| 703 | 0.712 ± 0.154 | 0.785 ± 0.099 | 0.710 ± 0.180 | 0.792 ± 0.099 | 0.417 ± 0.198 | 0.726 ± 0.121 | 0.322 ± 0.235 | 0.682 ± 0.135 | 0.307 ± 0.213 | 0.663 ± 0.135 | 0.361 ± 0.229 | 0.694 ± 0.137 | |
Only classes containing at least five compounds are shown. The two highest correlation coefficients of each type for each method are marked in bold; the two lowest, underlined. In the last line, the standard deviation of the correlation coefficient over all molecules is given. Please note that the sum of the compounds in all classes is larger than the total number of molecules, as each molecule can belong to several classes.
Figure 1Infrared spectra calculated by two quantum chemical and four force field methods in comparison with the experimental spectrum. Intensities for the experimental and quantum chemically calculated spectra are depicted negative.
Correlation Coefficients of the Spectra Shown in Figure
| 1,3-oxazole | ethyl sulfate | diethyl oxalate | ||||||
|---|---|---|---|---|---|---|---|---|
| method | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ | Pearson | Spearman ρ |
| B3LYP/6-31G(2df,p) | 0.740 | 0.607 | 0.859 | 0.605 | 0.222 | 0.840 | 0.846 | 0.855 |
| B3LYP/aug-cc-pVTZ | 0.801 | 0.602 | 0.857 | 0.602 | 0.309 | 0.852 | 0.761 | 0.853 |
| CGenFF | 0.661 | 0.430 | 0.231 | 0.502 | 0.420 | 0.873 | 0.098 | 0.787 |
| GAFF-BCC | 0.652 | 0.401 | 0.153 | 0.421 | 0.231 | 0.842 | 0.321 | 0.756 |
| GAFF-ESP | 0.517 | 0.306 | 0.212 | 0.378 | 0.213 | 0.856 | 0.231 | 0.756 |
| OPLS | 0.331 | 0.438 | 0.122 | 0.440 | 0.228 | 0.895 | 0.103 | 0.763 |
Scaling Factors Obtained by Optimizing the Respective Statistical Measurea
| Pearson | Spearman ρ | |||||
|---|---|---|---|---|---|---|
| level of theory | (unscaled) | optimized scaling factor | (scaled) | (unscaled) | optimized scaling factor | (scaled) |
| B3LYP/6-31G(2df,p) | 0.277 | 0.959 | 0.742 | 0.695 | 0.963 | 0.786 |
| 0.965 | 0.712 | 0.965 | 0.785 | |||
| B3LYP/aug-cc-pVTZ | 0.341 | 0.964 | 0.717 | 0.708 | 0.965 | 0.792 |
| 0.968 | 0.710 | 0.968 | 0.792 | |||
| CGenFF | 0.417 | 1.002 | 0.420 | 0.726 | 0.997 | 0.727 |
| GAFF-BCC | 0.322 | 1.002 | 0.324 | 0.682 | 0.997 | 0.682 |
| GAFF-ESP | 0.307 | 1.005 | 0.310 | 0.663 | 0.993 | 0.665 |
| OPLS | 0.361 | 0.996 | 0.377 | 0.694 | 0.994 | 0.697 |
Database (CCCBDB[43] and UMN[44]) values and correlation coefficients resulting from these for the quantum chemical methods are given in the second line. It should be noted that the exact combination of functional and basis set used here is not available in the UMN database. The value used is given for the marginally larger 6-31G(2df,2p) basis set.
CCCBDB.
UMN.
Correlation Coefficients Obtained Using the Standard (24 cm–1) and Optimized Values of the Full Width at Half-Maximum
| Pearson | Spearman ρ | |||||
|---|---|---|---|---|---|---|
| level of theory | (standard) | optimized fwhm (cm–1) | (optimized) | (standard) | optimized fwhm (cm–1) | (optimized) |
| B3LYP/6-31G(2df,p) | 0.712 | 65.6 | 0.791 | 0.785 | 29.1 | 0.786 |
| B3LYP/aug-cc-pVTZ | 0.710 | 68.2 | 0.792 | 0.792 | 27.7 | 0.792 |
| CGenFF | 0.417 | 90.7 | 0.490 | 0.726 | 47.7 | 0.728 |
| GAFF-BCC | 0.322 | 150.1 | 0.433 | 0.682 | 124.2 | 0.687 |
| GAFF-ESP | 0.307 | 149.7 | 0.410 | 0.663 | 38.9 | 0.663 |
| OPLS | 0.361 | 104.4 | 0.485 | 0.694 | 77.2 | 0.697 |