| Literature DB >> 30367288 |
Yasemin Yesiltepe1,2, Jamie R Nuñez2, Sean M Colby2, Dennis G Thomas2, Mark I Borkum2, Patrick N Reardon3, Nancy M Washton2, Thomas O Metz2, Justin G Teeguarden2, Niranjan Govind2, Ryan S Renslow4,5.
Abstract
When using nuclear magnetic resonance (NMR) to assist in chemical identification in complex samples, researchers commonly rely on databases for chemical shift spectra. However, authentic standards are typically depended upon to build libraries experimentally. Considering complex biological samples, such as blood and soil, the entirety of NMR spectra required for all possible compounds would be infeasible to ascertain due to limitations of available standards and experimental processing time. As an alternative, we introduce the in silico Chemical Library Engine (ISiCLE) NMR chemical shift module to accurately and automatically calculate NMR chemical shifts of small organic molecules through use of quantum chemical calculations. ISiCLE performs density functional theory (DFT)-based calculations for predicting chemical properties-specifically NMR chemical shifts in this manuscript-via the open source, high-performance computational chemistry software, NWChem. ISiCLE calculates the NMR chemical shifts of sets of molecules using any available combination of DFT method, solvent, and NMR-active nuclei, using both user-selected reference compounds and/or linear regression methods. Calculated NMR chemical shifts are provided to the user for each molecule, along with comparisons with respect to a number of metrics commonly used in the literature. Here, we demonstrate ISiCLE using a set of 312 molecules, ranging in size up to 90 carbon atoms. For each, calculation of NMR chemical shifts have been performed with 8 different levels of DFT theory, and with solvation effects using the implicit solvent Conductor-like Screening Model. The DFT method dependence of the calculated chemical shifts have been systematically investigated through benchmarking and subsequently compared to experimental data available in the literature. Furthermore, ISiCLE has been applied to a set of 80 methylcyclohexane conformers, combined via Boltzmann weighting and compared to experimental values. We demonstrate that our protocol shows promise in the automation of chemical shift calculations and, ultimately, the expansion of chemical shift libraries.Entities:
Keywords: Chemical shift; DFT; Density functional theory; Metabolomics; NMR; NWchem; Python; Quantum chemistry
Year: 2018 PMID: 30367288 PMCID: PMC6755567 DOI: 10.1186/s13321-018-0305-8
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Schematic representation of inputs and outputs of the ISiCLE NMR module
Fig. 2The step-by-step conceptual workflow for the ISiCLE NMR module. Conformer generation with Boltzmann weighting is optional and will be automated in subsequent versions. Please see github.com/pnnl/isicle for the latest versions
Demonstration set sources and details
| References | # of molecules | Ave. atoms per molecule | Ave. H per molecule | Ave. C per molecule | Types of molecules |
|---|---|---|---|---|---|
| Alver [ | 1 | 24 | 11 | 8 | Boron-based compound |
| Asiri et al. [ | 1 | 33 | 12 | 16 | Organic photochromic compound |
| Bagno et al. [ | 4 | 44 | 21 | 18 | Small organic molecules with constrained conformations |
| Barkowski et al. [ | 15 | 83 | 50 | 31 | Pentacyclic terpenoids (fernenes) |
| Coruh et al. [ | 1 | 25 | 8 | 13 | Heterocyclic aromatic compound |
| Fulmer et al. [ | 33 | 14 | 9 | 4 | Commonly used NMR solvents |
| Hill et al. [ | 1 | 70 | 33 | 30 | Complex drug with multiple chemical groups and one stereocenter |
| Izgi et al. [ | 1 | 24 | 15 | 8 | Molecule with cyclohexene (C6H10) attached to ethylamine (C2H7N) |
| Karabacak et al. [ | 1 | 17 | 8 | 7 | Planar benzene ring with attached B(OH)2 and two F groups |
| Krishnakumar et al. [ | 2 | 16 | 4 | 6 | Agrochemical intermediate compounds with planar rings |
| Krishnakumar et al. [ | 2 | 16 | 7 | 7 | Nitrotoluene derivatives |
| Krishnakumar et al. [ | 2 | 17 | 6 | 7 | Phenol derivatives |
| Kwan and Liu [ | 1 | 42 | 22 | 18 | Natural product |
| Li et al. [ | 78 | 11 | 4 | 5 | Molecules with a large number of connected substituent groups |
| Lomas [ | 15 | 18 | 12 | 5 | Saturated alcohols |
| Osmialowski et al. [ | 28 | 28 | 11 | 14 | Substituted phenacylpyridines (ketimine forms) and tautomers |
| Parlak et al. [ | 1 | 26 | 4 | 12 | Polyfluoroaromatic compound with two rings |
| Perez et al. [ | 2 | 22 | 10 | 8 | Chloropyrimidine species |
| Rablen et al. [ | 80 | 6 | 4 | 11 | Rigid organic compounds with constrained conformations |
| Sarotti and Pellegrinet [ | 66 | 15 | 8 | 6 | Low polarity compounds with constrained conformations |
| Sebestian et al. [ | 1 | 26 | 11 | 14 | Phenyl cyanide compound with two planar phenyl rings |
| Seca et al. [ | 4 | 40 | 24 | 65 | Light petroleum extracts |
| Senyel et al. [ | 1 | 23 | 13 | 9 | A structural element of many pharmaceutical drugs |
| Senyel et al. [ | 1 | 28 | 18 | 8 | 3-Piperidino-propylamine molecule |
| Sridevi et al. [ | 1 | 21 | 8 | 10 | Chromene, a two ringed planar compound |
| Tormena and da Silva [ | 3 | 17 | 8 | 8 | Para-X-sub’ed (X=H, CH3O, & NO2) aromatic carbonyl compounds |
| Vijaja and Sankaran [ | 1 | 47 | 24 | 20 | Azine |
| Watts et al. [ | 6 | 45 | 21 | 18 | Coniferol, a building block of lignin, stereoisomers and conformers |
| Wiitala et al. [ | 43 | 11 | 6 | 3 | Organic compounds |
| Wiitala et al. [ | 7 | 24 | 14 | 8 | |
| Willoughby et al. [ | 2 | 22 | 14 | 7 | |
| Yang et al. [ | 2 | 57 | 28 | 23 | Complex natural products |
| This study’s demonstration set | 312 | 20 | 10 | 8 | Small- to medium-sized organic molecules with constrained conformations |
Details (i.e. experimental conditions, total and average number of nuclei, molecule types and classes) for each molecule set are given in Additional file 1 (S3)
Fig. 3Mean absolute errors (MAE) and maximum absolute errors (MAXAE) of chemical shifts for the demonstration set. The grey bars represent MAE, the black bars represent MAXAE. For all methods, geometries are optimized at B3LYP/6-31G(d) in chloroform
Fig. 4Computational costs of DFT methods performed for the demonstration set. Each bar is for two DFT methods with basis sets of cc-pVDZ and cc-pVTZ. The grey bars represent CPU times for the methods with cc-pVDZ and the black bars represent those with cc-pVDZ and the black bars represent those with cc-pVTZ
Fig. 5Linear correlation plots of a 13C and c 1H isotropic shielding values, and b 13C and d 1H NMR chemical shifts versus experimental NMR chemical shifts. Chemical shifts are calculated using the GIAO/B3LYP/cc-pVDZ//B3LYP/6-31G(d) level of theory for the demonstration set in CDCl3 (312 molecules (1554 carbons and 1830 hydrogens)). R2 indicates the correlation coefficient
Fig. 6Chemical shifts of sp2- and sp3- hybridized carbon atoms. a Chemical shifts, b associated errors. Chemical shifts were calculated using the B3LYP/cc-pVDZ//B3LYP/6-31G(d) level of theory in CDCl3
Fig. 7Chemical shift prediction errors for different functional groups. a 13C NMR chemical shifts, b 1H NMR chemical shifts. All molecules are from the demonstration set and are calculated using the GIAO/B3LYP/cc-pVDZ//B3LYP/6-31G(d) level of theory in chloroform
Fig. 8The a equatorial and b axial structures of methylcyclohexane
Fig. 9Experimental and scaled chemical shifts (ppm) of methylcyclohexane