Literature DB >> 21797997

Theoretical NMR correlations based Structure Discussion.

Jochen Junker1.   

Abstract

The constitutional assignment of natural products by NMR spectroscopy is usually based on 2D NMR experiments like COSY, HSQC, and HMBC. The actual difficulty of the structure elucidation problem depends more on the type of the investigated molecule than on its size. The moment HMBC data is involved in the process or a large number of heteroatoms is present, a possibility of multiple solutions fitting the same data set exists. A structure elucidation software can be used to find such alternative constitutional assignments and help in the discussion in order to find the correct solution. But this is rarely done. This article describes the use of theoretical NMR correlation data in the structure elucidation process with WEBCOCON, not for the initial constitutional assignments, but to define how well a suggested molecule could have been described by NMR correlation data. The results of this analysis can be used to decide on further steps needed to assure the correctness of the structural assignment. As first step the analysis of the deviation of carbon chemical shifts is performed, comparing chemical shifts predicted for each possible solution with the experimental data. The application of this technique to three well known compounds is shown. Using NMR correlation data alone for the description of the constitutions is not always enough, even when including 13C chemical shift prediction.

Entities:  

Year:  2011        PMID: 21797997      PMCID: PMC3162559          DOI: 10.1186/1758-2946-3-27

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


Findings

Nuclear Magnetic Resonance allied with Elemental analysis or high resolution Mass Spectroscopy are the most common tools used for the structure elucidation of new compounds. The used 2D NMR experiments like COSY, HSQC, and 13C-HMBC deliver correlation information between atoms that can be translated into connectivity information. Out of these, correlation information from COSY and HSQC experiments can be transcribed directly into connectivity between atoms. But the 13C-HMBC correlations need more attention because of their ambiguity and complexity. Hence the difficulty of the structure elucidation problem depends more on the type of the investigated molecule than on its size [1]. Saturated compounds can usually be assigned unambiguously using mainly COSY and some 13C-HMBC data, whereas condensed heterocycles are problematic due to their lack of protons that could show interatomic connectivities. This ambiguity has driven the development of different software packages to aid in the interpretation of the 13C-HMBC correlation data [2-20] as much as the development of additional correlation experiments [21,22]. Most of these approaches have in common that they work only based on experimental NMR correlation data. COCON [1,4,23,24] has recently been extended with the capability to create a theoretical NMR correlation data set, based on a molecule's suggested constitution. The theoretical data set is used as input data for the structure elucidation software COCON. The resulting set of constitutional assignments indicates how unambiguous NMR would have been able to describe the originally suggested molecule. The freely accessible online version of COCON (WEBCOCON at http://cocon.nmr.de) offers this analysis as "Alternative Constitutions". The data derived from the NMR correlation spectra is the result of magnetization transfer via scalar coupling between the atoms in the molecule of interest. Since the scalar coupling is based on the interatomic bonds, the correlation data will reflect those bonds. Hence, a set of all feasible NMR correlation data (theoretical correlation data) can be derived from the molecular constitution. This is done by iteratively looking for all protons in the molecule, then building a list of their atoms in 2-bond and 3-bond distance. From each proton all connectivities are inspected recursively up to three bonds distance. If a carbon is found in a two bond distance, a 2J and a 1,1-ADEQUATE correlation are added to the list. If a carbon is found in a three bond distance, a HMBC correlation is added to the list, if a proton is found, a COSY correlation is added. In principle 4J correlations for COSY and HMBC could be generated, as sometimes they are observable in experiments as well. But, COCON can not handle 4J COSY correlations, therefore those are left out. The generation of 4J HMBC correlations is not used, because when the HMBC correlations are allowed to be 4J in the structure generation process, the process takes much more time and many more results are produced. Finally carbon chemical shifts are generated by table lookup, a table reverse generated based on the chemical shift rules that COCON uses. This values are not comparable to a chemical shift prediction, but enough to ensure that COCON will generate the starting structure. For online use, the MarvinSketch applet from ChemAxon is available for drawing or loading of the molecule. The resulting MDL file contains all atoms, their connectivity and multiplicity information. Based on this file, the recently developed Module "Alternative Constitutions" in WEBCOCON generates atomtypes, theoretical correlation data and table-based carbon chemical shifts. The actual magnitude of the scalar coupling, and therefore the observability of a correlation, depends on the atoms involved, their chemical environment and relative geometry. For 1J and 2J couplings mainly the atoms involved and their chemical environment are of importance, since the geometry varies little. That is different with 3J coupling, which depends on the dihedral angle, hence the actual molecular conformation decides on the magnitude of the coupling. The creation of theoretical correlation data disregards the molecule's real conformation, assuming that all correlations are observable. Hence the data set represents the upper limit of correlations that may be experimentally available for the constitution. Calculations were run with three molecules (Figure 1) on the publicly available WEBCOCON server, running times varied from one to twelve minutes. All molecules were drawn in the "Alternative Constitutions" module and submitted to the server. The number of solutions suggested for Ascomycin 1 and Oroidin 2 in runs with theoretical and experimental data are shown in table 1. Also, a webpage allowing direct access to the results shown here has been set up on the WEBCOCON server at http://cocon.nmr.de/StructureDiscussion/ (The results are mirrored at http://science.jotjot.net/StructureDiscussion/).
Figure 1

Ascomycin 1, Oroidin 2 and Aflatoxin B1 3 are used to evaluate the use of theoretical data.

Table 1

Number of constitutional assignments suggested for 1 and 2.

open atom typesfixed atom types


theoexptheoexp
1110011
216252.56611486
Ascomycin 1, Oroidin 2 and Aflatoxin B1 3 are used to evaluate the use of theoretical data. Number of constitutional assignments suggested for 1 and 2. Ascomycin 1 is a well known ethyl derivative of Tacrolimus, it serves as example of a large natural product, featuring 43 Carbon atoms. Using theoretical NMR correlation data (COSY and 13C-HMBC correlations) COCON generates only one solution, independent of whether atom types are defined or not. Using experimental COSY and 13C-HMBC correlation data the structure generator comes up with 100 structural assignments, which are reduced to one when the atom types are fixed as well. In this case NMR correlation data was able to define the constitution unambiguously. Oroidin 2 has been frequently used for the demonstration of COCON. The use of theoretical COSY and 13C-HMBC correlations leads to a total of 16 possible constitutional assignments, also predefining the atom types reduces this set to one constitutional assignment. The experimental data set leads to 252,566 structural assignments generated, which reduce to 1,486 when atom types are predefined as well. Hence the structure can not be safely determined by NMR alone. The original structure determination was carried out by chemical derivatization and total synthesis [25,26]. The pictures change with Aflatoxin B1 3 with 17 Carbon atoms. Using theoretical COSY and 13C-HMBC data alone, COCON generates 1,048 structures, compared to 1,932 solutions using experimental data. When the atom types are predefined, COCON generates 55 constitutional assignments, compared to 108 with experimental data. The molecule set generated contains constitutions with the element cyclobutadiene, a structural element that is very uncommon in natural products. COCON has several built-in rules that eliminate certain constitutional elements, like cyclobutadiene, cyclopropene and peroxides. By default these rules are not used, but in this special case we observed a substantial difference in the number of results. When these rules are activated the number of solutions drops to 58 for the experimental correlation data set and 33 for the theoretical data set. All planar molecules suggested are shown in Figure 2, the correct constitution and starting point of the analysis is 6. For the small number of interesting constitutions a back-calculation on the carbon chemical shifts was made (ChemDraw v11), that were compared to the experimental values (see table 2). The last line in the table contains the sum of the absolute chemical shift differences for all carbons, exposing molecule 6 as the one that best fits the experimental data [24,27,28].
Figure 2

Planar constitutions suggested for Aflatoxin B1. Suggestions 4 - 6 are obtained using theoretical data, 5 - 10 using experimental data. Constitution 6 is the correct one.

Table 2

Experimental and predicted 13C chemical shifts for the different constitutions suggested for Aflatoxin B1.

13C shifts for molecule
exp45678910
C201170167200203202190190
C177167166177161167175166
C166163163161155154169162
C161152154159150145158160
C155149149159137144153154
C153140142151136141152127
C117128125123133126140122
C108117116111129122131121
C104112113104128107104114
CH145149149149149149149149
CH114106106100105105105107
CH1031051051001009493101
CH919393971008893100
CH4843464548504345
CH23535353430323333
CH22924212825311414
CH35753535656595852

∑|Δδ|13012756171122116129
Planar constitutions suggested for Aflatoxin B1. Suggestions 4 - 6 are obtained using theoretical data, 5 - 10 using experimental data. Constitution 6 is the correct one. Experimental and predicted 13C chemical shifts for the different constitutions suggested for Aflatoxin B1. The theoretical NMR correlation dataset is the upper limit of number of correlations that are possible with a given constitution. Therefore all alternative constitutions generated with this data are "NMR-identical" with regard to correlation data. A careful analysis of this alternatives might be used to direct further investigations needed to confirm the proposed constitution. Whilst Ascomycin's structure can be confirmed by NMR correlations, Oroidin's structure can not. The results obtained would direct further work towards chemical derivatization and synthesis [25,26] or x-ray crystallography. The results obtained for Aflatoxin B1 show nicely how carbon chemical shift prediction can be used as tool for the structure discussion, exposing one suggested constitutional assignment as best fitting.

Availability

The WEBCOCON server is freely accessible via http://cocon.nmr.de.

Competing interests

The author declares that they have no competing interests.

Authors' contributions

JJ maintains the WEBCOCON software and has run all the calculations shown.
  8 in total

1.  SENECA: A platform-independent, distributed, and parallel system for computer-assisted structure elucidation in organic chemistry.

Authors:  C Steinbeck
Journal:  J Chem Inf Comput Sci       Date:  2001 Nov-Dec

2.  Validation of structural proposals by substructure analysis and 13C NMR chemical shift prediction.

Authors:  Jens Meiler; Erdogan Sanli; Jochen Junker; Reinhard Meusinger; Thomas Lindel; Martin Will; Walter Maier; Matthias Köck
Journal:  J Chem Inf Comput Sci       Date:  2002 Mar-Apr

3.  Applications of a HOUDINI-based structure elucidation system.

Authors:  K-P Schulz; A Korytko; M E Munk
Journal:  J Chem Inf Comput Sci       Date:  2003 Sep-Oct

Review 4.  Recent developments in automated structure elucidation of natural products.

Authors:  Christoph Steinbeck
Journal:  Nat Prod Rep       Date:  2004-07-14       Impact factor: 13.423

5.  Novel methods of automated structure elucidation based on 13C NMR spectroscopy.

Authors:  Jens Meiler; Matthias Köck
Journal:  Magn Reson Chem       Date:  2004-12       Impact factor: 2.447

6.  Computer-assisted constitutional assignment of large molecules: COCON analysis of ascomycin.

Authors:  J Junker; W Maier; T Lindel; M Köck
Journal:  Org Lett       Date:  1999-09-09       Impact factor: 6.005

7.  Fuzzy structure generation: a new efficient tool for Computer-Aided Structure Elucidation (CASE).

Authors:  Mikhail E Elyashberg; Kirill A Blinov; Sergey G Molodtsov; Antony J Williams; Gary E Martin
Journal:  J Chem Inf Model       Date:  2007-03-27       Impact factor: 4.956

8.  Automated structure elucidation of two unexpected products in a reaction of an alpha,beta-unsaturated pyruvate.

Authors:  Gary J Sharman; Ian C Jones; Mark P Parnell; Michael C Willis; Mary F Mahon; Dean V Carlson; Antony Williams; Mikhail Elyashberg; Kirill Blinov; Sergey G Molodtsov
Journal:  Magn Reson Chem       Date:  2004-07       Impact factor: 2.447

  8 in total
  4 in total

1.  Chemical graph generators.

Authors:  Mehmet Aziz Yirik; Christoph Steinbeck
Journal:  PLoS Comput Biol       Date:  2021-01-05       Impact factor: 4.475

2.  Surge: a fast open-source chemical graph generator.

Authors:  Brendan D McKay; Mehmet Aziz Yirik; Christoph Steinbeck
Journal:  J Cheminform       Date:  2022-04-23       Impact factor: 5.514

3.  MAYGEN: an open-source chemical structure generator for constitutional isomers based on the orderly generation principle.

Authors:  Mehmet Aziz Yirik; Maria Sorokina; Christoph Steinbeck
Journal:  J Cheminform       Date:  2021-07-03       Impact factor: 5.514

4.  Sampling CASE Application for the Quality Control of Published Natural Product Structures.

Authors:  Lorena Martins Guimarães Moreira; Jochen Junker
Journal:  Molecules       Date:  2021-12-13       Impact factor: 4.411

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.