Literature DB >> 34946623

Sampling CASE Application for the Quality Control of Published Natural Product Structures.

Lorena Martins Guimarães Moreira1, Jochen Junker1.   

Abstract

Structure elucidation with NMR correlation data is dicey, as there is no way to tell how ambiguous the data set is and how reliably it will define a constitution. Many different software tools for computer assisted structure elucidation (CASE) have become available over the past decades, all of which could ensure a better quality of the elucidation process, but their use is still not common. Since 2011, WebCocon has integrated the possibility to generate theoretical NMR correlation data, starting from an existing structural proposal, allowing this theoretical data then to be used for CASE. Now, WebCocon can also read the recently presented NMReDATA format, allowing for uncomplicated access to CASE with experimental data. With these capabilities, WebCocon presents itself as an easily accessible Web-Tool for the quality control of proposed new natural products. Results of this application to several molecules from literature are shown and demonstrate how CASE can contribute to improve the reliability of Structure elucidation with NMR correlation data.

Entities:  

Keywords:  CASE; natural products; theoretical NMR data; web-tool

Mesh:

Substances:

Year:  2021        PMID: 34946623      PMCID: PMC8708086          DOI: 10.3390/molecules26247543

Source DB:  PubMed          Journal:  Molecules        ISSN: 1420-3049            Impact factor:   4.411


1. Introduction

Software tools for computer assisted structure elucidation (CASE) of small molecules have been under development for over 50 years now. The methods used can roughly be divided into stochastic (S), deterministic (D), and hybrid (H), with hybrid representing various different combinations of stochastic and deterministic methods. The most prominent methods are fragment assemblers (H) [1,2,3,4,5,6], expert systems (H) [7,8,9], databases of C NMR chemical shifts or neural networks (S) [10,11,12,13], structure generation by reduction (S/H) [14], logic engines (D) [15], stochastic structure generators (S) [16], combinatorial brute force (D) [17,18,19,20,21], combinatorial brute force with restraints (D/H) [22,23], genetic algorithms (S) [24,25,26], simulated annealing (S) [27], convergent structure generation (S) [28,29], evolutionary algorithm (S) [30], fuzzy structure generation (S) [31], and expert systems with DFT (H) [32]. Altogether, 14 different methods were implemented in more than 20 different tools, some of which are freely usable, and many of them have not seen further development by their authors after the first publication. It can be observed that, out of the eight stochastic methods, only database methods are still actively developed, whereas both purely deterministic methods see ongoing development. However, the most active development happens for some of the hybrid methods, that combine stochastic and deterministic elements in various ways. Despite of all these efforts, none of these tools has become popular enough to be used for quality control of published structures by a wider community of researchers or journal editors. The C chemical shift-based tools have an outstanding position in this list, as they do not take NMR correlation data into account. They can either map chemical shifts to chemical neighborhoods, which in turn are assembled into a proposed constitution, or determine chemical neighborhoods for each carbon of a proposed constitution and predict a chemical shift. One such free to use tool is CSEARCH [10,13,33], which allows for an automated verification of C chemical shift assignments for a proposed structure combining a database and a neural network verification. This approach might fail if few similar structures are available, or if incorrect data is contained in the database. For such applications, tools using NMR correlation data are better suited. One tool that uses NMR correlation data is Cocon with the online interface WebCocon, which has been improved continuously over the past 20 years [22,34,35,36,37,38,39]. Today, Cocon can be used freely via a web interface called WebCocon, that facilitates the input file creation, submission, and visualization of the results. In a first step towards the use of CASE for quality control, the creation of theoretical NMR correlations from an existing constitution was implemented in WebCocon [37]. This way, the suggested structure is drawn on the web page, theoretical NMR correlations are created, and then they are submitted to a CASE run. It might seem simplistic, but, if WebCocon produces more than one structural proposal under these circumstances, it means that the constitution cannot be verified using NMR correlation data alone, and the use of other information is needed. One possibility is the inclusion of NOEs, which was explored for the first time in combination with genetic algorithms [26]. However, this approach was abandoned due to the very complex parametrization of the genetic algorithm, that defied automation [40]. Since then, NOEs have been used only after the constitution has been defined, in order to determine the stereochemistry of the resulting molecule [41,42]. WebCocon is the first CASE tool that can use NOEs in the ranking of the structural proposals, in a similar way as NOE data is frequently used in publications, but now backed up by structure calculations. It was already shown that, even with coarsely defined NOEs, the conformations obtained as results were similar to published X-ray structures. [39]. The exploration of C NMR chemical shift calculation is commonly accepted for the comparison of structural proposals, and it is carried out manually here. This is a first step towards a future quality control method for the structure discussion used in publications. With the popularization of file formats, such as NMReDATA [43,44,45], the structure verification could be carried out automatically using experimental data. WebCocon can read NMReDATA files, thus offering the possibility of a structure discussion based on theoretical and experimental data. New natural products published over the past years, whose structures were elucidated using NMR data were chosen from the literature and submitted to a structure discussion using WebCocon. As the molecules generally were not available as electronic files, structures were drawn in WebCocon. Theoretical NMR correlation data for HH-COSY and HC-HMBC correlations (HN-HMBC and 1,1-ADEQUATE correlations are generated but were not used) with carbon chemical shifts from an internal lookup table were created automatically for the structures, and the data was submitted to a CASE run. If more than one structure is proposed, the ranked results are compared automatically to the published structure. Additionally, the suggested alternative structures were compared manually to the experimental structures by back-calculating C chemical shifts using CS Chemdraw V15. When chemical shifts were used for the discussion, only the atoms with the largest expected deviations were selected, and the comparison was based on the average deviation observed (). Looking for further automation possibilities, the use of CSEARCH as an alternative to CS Chemdraw was investigated for one example, and, for two molecules, the use of the standard deviation () of the chemical shift deviations and their range was included. A standard deviation of 5.5 ppm is expected for the values back-calculated in CS Chemdraw, according to the manual. HH-COSY and HC-HMBC correlations are the most common experiments acquired that contain information about the heavy atom skeleton of a molecule. The three-bond-long COSY correlation between two hydrogen atoms indicates that the two bearing heavy atoms are directly connected. As both heavy atoms are required to be bonded to hydrogens, this experiment provides only limited information for proton poor compounds. The HC-HMBC correlations are observed between a hydrogen and a carbon that are two or three bonds away. This experiment requires only one hydrogen for the observation of the correlation; thus, it can be used to localize carbons without protons in the skeleton. However, as the observed correlations can originate over two or three bonds, and differentiation normally is not possible, the incorporation of this data is more difficult. The use of CASE for the interpretation is recommended for this ambiguous data. Other experiments have been developed over the years to provide additional information about a molecule’s skeleton, most prominently the HN-HMBC, 1,1-ADEQUATE, and H2BC. The HN-HMBC is similar to the carbon version described before but correlates to a nitrogen. The 1,1-ADEQUATE [22,34,35,46,47] and H2BC [48,49,50] are experiments that provide a HC-HMBC type of correlation and can be used complimentary to the aforementioned HC-HMBC data. Theoretical HN-HMBC and 1,1-ADEQUATE correlations were not used in this analysis because this data is not routinely acquired and, therefore, also not included into publications. WebCocon is a web-based interface to the program Cocon. Initially, WebCocon helps in the creation of the input file for Cocon, offering different functionalities for this purpose. With this input file, Cocon creates a list of all constitutions that are compatible with the provided correlation data. These constitutions are ranked on the server by different methods and then transferred back to the web interface for visualization. Alternatively, the suggested constitutions can be downloaded in a SDF file and inspected offline by the user.

2. Results

The results resumed in Table 1 are just samples from the literature. Further automation in the ranking process will streamline the analysis for many more molecules.
Table 1

Overview of the results.

Molecule# InspectedConfirmed
Luteolin 8-C-E-propenoic acid6yes
Tomentodiplacone2yes
Kadangustin A5no
Berkeleyamide D2no
14-norpseurotin A2no
Feruloylpodospermic acid A2no
Cochinchistemoninone3no
Milicifoline B3pending further analysis
5α-cyprinol sulfate4yes
viridiol aka TAEMC161>3yes

2.1. Luteolin 8-C-E-Propenoic Acid

Luteolin 8-C-E-propenoic acid [51] was published as molecule 1-A from Figure 1. WebCocon suggested 25 constitutions with the theoretical NMR correlation data set, including the published molecule. Out of those, only 6 underwent further inspection, as the others contained very restrained ring systems. The force field total energies calculated strongly favor molecules 1-A and 1-B.
Figure 1

Luteolin 8-C-E-propenoic acid (1-A) and the 5 alternative constitutions suggested by WebCocon.

The distinction of 1-A from the other suggested constitutions can be done using C chemical shifts of selected atoms, when comparing experimental data with back-calculated data in Table 2. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A1.
Table 2

Comparison of experimental and back-calculated C chemical shifts for luteolin 8-C-E-propenoic acid (1-A) and the 5 suggested alternative constitutions. C chemical shifts were back-calculated for the atoms numbered in the molecules. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 1-A 1-B 1-C 1-D 1-E 1-F
1169.0171.5171.5171.5-170.4-
2182.2182.1160.5160.5187.0175.0189.7
Δ¯ 1.312.112.12.44.33.8
Etotal [kcal/mol] 2629458074303
Figure A1

Luteolin 8-C-E-propenoic acid and the 5 alternative constitutions suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.2. Tomentodiplacone

Tomentodiplacone [52] was published as molecule 2-A from Figure 2. WebCocon suggested 2 constitutions with the theoretical NMR correlation data set, including the published molecule. The force field total energy calculated for both molecules is almost equal.
Figure 2

Tomentodiplacone (2-A) and the alternative constitution suggested by WebCocon. C chemical shifts were back-calculated for the atoms numbered in the molecules.

The distinction of 2-A from the other suggested constitution can be done using C chemical shifts of selected atoms. The average deviation of the back-calculated chemical shifts from the experimental values is considerably smaller for 2-A, as shown in Table 3. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A2.
Table 3

Comparison of experimental and back-calculated C chemical shifts for tomentodiplacone (2-A) and the suggested alternative constitution. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 2-A 2-B
1164.4162.5182.6
2196.4196.8170.0
Δ¯ 1.222.3
Etotal [kcal/mol] 6264
Figure A2

Tomentodiplacone and the alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.3. Kadangustin A

Kadangustin A [53] was published as molecule 3-A from Figure 3. WebCocon suggested 13 constitutions with the theoretical NMR correlation data set, 5 of which are shown here, including the published molecule. The force field total energy calculated for these 5 molecules is very similar.
Figure 3

Kadangustin A (3-A) and the 4 alternative constitutions suggested by WebCocon. C chemical shifts were back-calculated for the atoms numbered in the molecules.

The distinction of 3-A from the other suggested constitutions is possible using C chemical shifts of some selected atoms, when comparing experimental data with back-calculated data in Table 4 using the average deviation (). However, the standard deviation () and range () clearly favor 3-D as solution, in spite of the higher total energy. This case should be verified in more detail. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A3.
Table 4

Comparison of experimental and back-calculated C chemical shifts for kadangustin A (3-A) and the 4 suggested alternative constitutions. is the average deviation, is the standard deviation of the chemical shifts, and is the range of the deviations of the calculated chemical shifts to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 3-A 3-B 3-C 3-D 3-E
1172.0170.2162.1165.0165.4171.4
239.331.333.633.635.946.7
334.834.838.738.731.141.8
Δ¯ 3.36.55.54.65.0
σ 4.27.16.01.84.5
ΔmaxΔmin 8.013.810.96.68.0
Etotal [kcal/mol] 160165160168156
Figure A3

Kadangustin A and the 4 alternative constitutions suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.4. Berkeleyamide D

Berkeleyamide D [54] was published as molecule 4-A from Figure 4. WebCocon suggested 2 constitutions with the theoretical NMR correlation data set, including the published molecule. The force field total energy calculated for both molecules clearly favors 4-B.
Figure 4

Berkeleyamide D (4-A) and the alternative constitution suggested by WebCocon. C and H chemical shifts were back-calculated for the atoms numbered in the molecules.

The confirmation of 4-A as correct solution is also not possible using C and H chemical shift values of selected atoms, as the average deviation is smaller for 4-B, as shown in Table 5. The back-calculated C and H chemical shift values using CS Chemdraw for all atoms of both verified constitutions are in Figure A4 and Figure A5, respectively.
Table 5

Comparison of experimental and back-calculated C and H chemical shifts for berkeleyamide D (4-A) and the suggested alternative constitution. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions. Deviations ≫ 10 ppm are labeled with “*”.

δ13C [ppm]δ1H [ppm]
Atom Exp. 4-A 4-B Exp. 4-A 4-B
484.976.687.2
695.3100.190.2
7 4.414.424.0
8197.7165.5 *198.0
9104.4105.698.85.354.855.1
10199.4204.9196.5
1137.445.055.7 *3.983.224.2
Δ¯ 8.54.9 0.50.3
Etotal [kcal/mol] 6724
Figure A4

Berkeleyamide D and the alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

Figure A5

Berkeleyamide D and the alternative constitution suggested by WebCocon with all back-calculated H chemical shifts.

Although the average deviations shown in Table 5 indicate that 4-B matches the experimental chemical shift values better, for both suggested constitutions, there is one carbon with a chemical shift deviation ≫ 10 ppm (carbon 8 for 4-A and carbon 11 for 4-B). Therefore, all C chemical shift values back-calculated by CS Chemdraw (M-I) and CSEARCH (M-II) for all carbon atoms were included in a comparison in Table 6.
Table 6

Comparison of experimental and back-calculated C chemical shifts for all carbons in berkeleyamide D (4-A) and the suggested alternative constitution, using CS Chemdraw (M-I) and CSEARCH (M-II). is the standard deviation and is the range of the deviations between the back-calculated and experimental value.

4-A4-B
Atom Exp. M-I M-II M-I M-II
123.923.523.823.523.8
224.023.624.023.924.0
345.547.145.544.345.4
484.976.684.987.285.5
5164.1169.2171.6169.2170.2
695.3100.195.390.265.4
775.176.575.178.575.7
8197.8204.9181.2198.0156.9
9104.4105.6104.498.889.6
10199.4165.5199.4196.5198.4
1137.445.043.055.747.5
12133.2137.4133.2135.6134.6
13129.2129.0129.2129.6129.6
14129.2128.6129.2129.2128.9
15127.0125.7127.1127.6127.1
16129.2128.6129.2129.2128.9
17129.2129.0129.2129.6129.6
1823.823.523.823.523.8
σ 9.04.65.112.5
ΔmaxΔmin 41.524.123.951.0
The standard deviation and range observed in the comparison of the experimental and the different back-calculated chemical shift values using both methods lead to a contradictory result. Whereas method M-I favors 4-B, method M-II favors 4-A as correct constitution. However, for both cases, the range is very high, indicating that neither one of the constitutions might be correct. An analysis with experimental correlation data would be be the next step, followed by a new verification of back-calculated carbon chemical shift values for all alternatives. If the NMR data was available as NMReDATA, this would be straightforward.

2.5. 14-Norpseurotin A

14-norpseurotin A [55] was published as molecule 5-A from Figure 5. WebCocon suggested 13 constitutions with the theoretical NMR correlation data set, including the published molecule. The force field total energies calculated favors 5-B over 5-A; all other alternatives show considerably higher values.
Figure 5

14-norpseurotin A (5-A) and the alternative constitution suggested by WebCocon. C and H chemical shifts were back-calculated for the atoms numbered in the molecules.

The distinction of 5-A from 5-B is also not possible using C chemical shifts, when comparing experimental data with back-calculated data for selected atoms, as can be seen in Table 7. Actually, for 5-B the back-calculated C and H chemical shifts match the experimental values better, as can be verified by the average deviation. This case should be verified in more detail. The back-calculated C and H chemical shift values for all verified constitutions are in Figure A6 and Figure A7, respectively.
Table 7

Comparison of experimental and back-calculated C and H chemical shifts for 14-norpseurotin A (5-A) and the suggested alternative constitution. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]δ1H [ppm]
Atom Exp. 5-A 5-B Exp. 5-A 5-B
1185.7167.3188.0
270.469.769.84.784.04.2
370.371.688.94.624.04.2
473.268.170.14.705.04.8
Δ¯ 6.46.2 0.60.4
Etotal [kcal/mol] 160148
Figure A6

14-norpseurotin A and the alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

Figure A7

14-norpseurotin A and the alternative constitution suggested by WebCocon with all back-calculated H chemical shifts.

2.6. Feruloylpodospermic Acid A

Feruloylpodospermic acid A [56] was published as molecule 6-A from Figure 6. WebCocon suggested 2 constitutions with the theoretical NMR correlation data set, including the published molecule. The force field total energy calculated for both molecules slightly favors 6-B.
Figure 6

Feruloylpodospermic acid A (6-A) and the alternative constitution suggested by WebCocon. C chemical shifts were back-calculated for the atoms numbered in the molecules.

A comparison of the experimental and back-calculated C chemical shifts for the 5 carbons numbered in Figure 6 is shown in Table 8. The observed average deviation also slightly favors 6-B. This case should be verified in more detail. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A8.
Table 8

Comparison of experimental and back-calculated C chemical shifts for feruloylpodospermic acid A (6-A) and the suggested alternative constitution. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 6-A 6-B
1170.4171.0173.2
238.939.235.9
383.578.985.3
434.138.434.0
5173.1170.2173.1
Δ¯ 2.51.5
Etotal [kcal/mol] 7556
Figure A8

Feruloylpodospermic acid A and the alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.7. Cochinchistemoninone

Cochinchistemoninone [57] was published as molecule 7-A from Figure 7. WebCocon suggested 3 constitutions with the theoretical NMR correlation data set, including the published molecule. The force field total energy calculated for the molecules favors 7-C.
Figure 7

Cochinchistemoninone (7-A) and the alternative constitution suggested by WebCocon. C chemical shifts were back-calculated for the atoms numbered in the molecules.

A comparison of the experimental and back-calculated C chemical shifts for the 5 carbons numbered in Figure 7 is shown in Table 9. The observed average deviation favors 7-A as correct solution, but, in all suggested constitutions, the experimental value of the chemical shift of carbon 2 is considerably different from the back-calculated value. This case should be verified in more detail. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A9.
Table 9

Comparison of experimental and back-calculated C chemical shifts for cochinchistemoninone (7-A) and the 2 suggested alternative constitutions. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 7-A 7-B 7-C
1208.9211.2211.3211.3
290.6117.6127.4103.3
3169.2171.8182.6191.4
498.498.788.195.6
5173.1175.2197.9167.3
Δ¯ 6.917.59.2
Etotal [kcal/mol] 1168780
Figure A9

Cochinchistemoninone and the alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.8. Milicifoline B

Milicifoline B [58] was published as molecule 8-A from Figure 8. When using theoretical NMR correlation data from 8-A, WebCocon suggested 3 constitutions, including the published molecule. The force field total energy calculated for the three molecules is very similar.
Figure 8

Milicifoline B (8-A) and the 2 alternative constitution suggested by WebCocon.

The experimental and back-calculated C chemical shifts for two carbons of all constitutions are shown in Table 10. Due to the large average deviation, the suggested constitution 8-C can be excluded as possible solution. The average deviation favors 8-B over 8-A, but without a strong differentiation. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A10.
Table 10

Comparison of experimental and back-calculated C chemical shifts for milicifoline B (8-A) and the 2 suggested alternative constitutions. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 8-A 8-B 8-C
1144.1145.5145.5139.6
2142.8136.9144.491.7
Δ¯ 3.61.527.8
Etotal [kcal/mol] 267288265
Figure A10

Milicifoline B and the 2 alternative constitution suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

In their publication, the authors used 3 NOEs (see Figure 9) in order to decide between the two possible constitutions, so this information was also included into WebCocon.
Figure 9

NOEs (depicted in orange) used to identify the correct conformation of milicifoline B (8-A) in the literature and the corresponding contacts in 8-B.

The distances measured after the WebCocon run are shown in Table 11, and they favor the originally published constitution. However, the distances measured in the calculated molecules are all > 500 pm and, therefore, at the limit of measurable NOEs [59]. Due to this, no reliable decision between the two suggested constitutions can be made.
Table 11

Comparison of back-calculated distances for the NOEs for milicifoline B (8-A) and the suggested alternative constitution 8-B.

Δs [pm]
NOE 8-A 8-B
1705850
2612618
2503832

2.9. 5-Cyprinol Sulfate

5-cyprinol sulfate [60] was published as molecule 9-A from Figure 10. When using theoretical NMR correlation data from 9-A, WebCocon suggested only one solution. However, for this molecule, experimental data is available in the NMReDATA format. Thus, the WebCocon analysis was repeated using the published experimental data, resulting in 4 suggested constitutions, including the published molecule. The force field total energy calculated for the four molecules is very similar.
Figure 10

5-cyprinol sulfate (9-A) and the 3 alternative constitutions suggested by WebCocon.

All constitutions have the same skeleton; they differ only in the positioning of the sulfate group. Hence, C chemical shifts are expected to vary mainly for the carbons adjacent to the sulfate group. The experimental and back-calculated C chemical shifts for these 4 carbons of all constitutions are shown in Table 12. The calculated average deviation over the 4 carbons for the suggested constitutions clearly favors the constitution 9-A, matching the publication. The back-calculated C chemical shift values for all carbons of all verified constitutions are in Figure A11.
Table 12

Comparison of experimental and back-calculated C chemical shifts for 5-cyprinol sulfate (9-A) and the 3 suggested alternative constitutions for selected carbons. is the average of the deviations of the calculated to the experimental values. Etotal is the total force field energy calculated in kcal/mol for the constitutions.

δ13C [ppm]
Atom Exp. 9-A 9-B 9-C 9-D
167.677.764.264.264.2
272.373.373.373.387.6
368.468.491.568.468.4
471.471.471.489.571.4
Δ¯ 2.86.95.64.7
Etotal [kcal/mol] 42485653
Figure A11

5-cyprinol sulfate and the 3 alternative constitutions suggested by WebCocon with all back-calculated C chemical shifts. The C chemical shifts in red were used for the discussion.

2.10. Viridiol aka TAEMC161

Viridiol (10-C) was originally published in 1969, as derivative of viridin [61]. The crystal structure was published in 2013 [62], and, in recent years, the total synthesis has become available [63,64,65,66]. In 2000, TAEMC161 was isolated as metabolite from Trichoderma hamatum, and 10-B was suggested as structure [67]. Shortly thereafter, the structure was revised as being identical to viridiol [68]. In 2010, the structure revision was reviewed by the use of CASE software, that confirmed previous findings [69]. The revisions were based mainly on the comparison of experimental and calculated carbon chemical shifts, using DFT and database methods. The theoretical correlation data-based analysis of TAEMC161 reveals that this constitution could be described unambiguously by NMR correlation data. The same analysis with viridiol results in the 3 constitutions with similar total energies shown in Figure 11 and three more results with complex bridged ring systems that are not shown, with much higher total energies. From these 3 constitutions, 10-B is the originally published metabolite TAEMC161, meaning that NMR correlation data cannot distinguish between viridiol and TAEMC161. Thus, other means of distinction need to be used. For this analysis, we have performed carbon chemical shift calculation for all atoms in all 3 constitutions using several methods: M-I was CSEARCH, M-II was CS Chemdraw, M-IIIa DFT with ORCA v5.0.1 using the conformation obtained by WebCocon and M-IIIb DFT with ORCA v5.0.1 using a DFT optimized conformation. The results are shown in Table 13 for 10-A, Table 14 for TAEMC161 (10-B), and Table 15 for viridiol (10-C).
Figure 11

Viridiol (10-C), original TAEMC161 (10-B) and one other alternative constitution, all suggested by WebCocon. The positions with largest carbon chemical shift variations are marked in bold.

Table 13

Comparison of experimental and back-calculated C chemical shifts for all carbons in 10-A. The largest deviations are bold in the table.

AtomExp.M-IM-IIM-IIIaM-IIIb
160.760.857.861.060.4
281.782.088.993.689.5
361.670.466.168.869.3
4122.1138.8164.0154.2158.8
5 145.6 116.1 110.4 126.6 111.6
6173.4161.1162.6174.0165.8
7 142.4 118.6 116.5 133.2 130.4
842.347.344.855.555.1
971.771.980.780.075.9
1030.528.021.022.722.1
11 158.6 157.2 145.4 172.3 169.7
12127.3125.4125.1131.4128.8
13127.3126.6123.7131.6130.9
14129.8133.9133.9142.8144.9
15 158.0150.7137.8154.4158.5
16136.9131.5124.5136.9137.6
17 145.7 146.4148.6 167.9 172.1
1828.429.024.328.828.6
1936.535.736.638.339.3
20206.7206.8206.2213.0210.5
RMSD 10.215.612.114.4
Table 14

Comparison of experimental and back-calculated C chemical shifts for all carbons in TAEMC161 (10-B). The largest deviations are bold in the table.

AtomExp.M-IM-IIM-IIIaM-IIIb
160.760.857.861.659.8
281.781.788.999.496.9
361.669.162.979.276.1
4122.1122.1127.5129.0123.8
5 145.6138.3149.3 158.5 158.4
6173.4158.0153.4171.1163.0
7 142.4142.4 153.0 165.6 174.0
842.353.748.058.659.1
971.771.180.783.283.3
1030.520.521.021.120.0
11 158.6158.6 145.4 171.9 166.3
12127.3122.0125.1131.4129.1
13127.3127.3123.7131.0129.0
14129.8136.9133.9143.3145.4
15 158.0 139.2 137.8 157.8159.8
16136.9129.6134.0140.2141.9
17 145.7 123.5 124.5 138.0 129.5
1828.428.424.132.931.3
1936.536.536.639.039.5
20206.7206.7206.2214.2211.6
RMSD 8.89.811.012.0
Table 15

Comparison of experimental and back-calculated C chemical shifts for all carbons in viridiol (10-C). The largest deviations are bold in the table.

AtomExp.M-IM-IIM-IIIaM-IIIb
160.760.857.559.859.5
281.782.078.994.590.2
361.668.564.763.863.8
4122.1121.0125.9130.9128.5
5 145.6146.3144.5155.5153.2
6173.4178.9173.0153.8154.8
7 142.4140.4146.6149.6149.4
842.345.739.251.949.0
971.773.785.981.776.2
1030.520.223.531.532.4
11 158.6153.9149.3 171.5 165.7
12127.3127.3125.3137.4132.1
13127.3127.4128.4133.3132.2
14129.8133.2134.1141.4143.9
15 158.0 144.1 141.1 165.0168.8
16136.9126.5126.4141.2137.7
17 145.7148.0145.3 180.8 178.9
1828.428.423.532.933.2
1936.536.236.238.939.6
20206.7206.3206.2214.2211.7
RMSD 5.26.511.810.5
The resulting RMSD values for the chemical shift deviations observed for the different methods and constitutions show that the empirical methods outperform the DFT method in all conditions, as has already been observed previously [70,71]. In this case, 10-B and 10-C cannot be distinguished by DFT, whereas both empirical methods clearly favor 10-C as solution. Thus, the analysis confirms the published revisions but also proves that correlations data of viridiol are alone are not enough for a distinction of 10-B and 10-C.

3. Discussion & Conclusions

CASE revealed more than one solution for all molecules discussed here. Since the suggested constitutions are compatible with the theoretical NMR correlation data set of the originally published structure, other means of identification of the correct constitution had to be explored. First, an inspection with “a chemists eye” of the results was carried out, supported by force field total energies, as WebCocon might have generated structures that are not likely to exist. In one case, the inclusion of NOE data from the original publication favors one of the possible solutions, but was not enough for a decision. Secondly, calculation of C NMR chemical shifts for the suggested constitutions was carried out and compared to the experimental data. However, for most examples, this was also not enough for a final decision. The obtained results suggest that many of the new structures published using NMR correlation data could not be described unambiguously by theoretical NMR correlation data. Taking into account that the experimental data sets usually contain fewer data, even more new structures could be subjected to questions. Surveys of revised structures [32,69,71,72,73] clearly show that improvements as to how new small molecules are published are urgently needed. The use any of the software tools for CASE available, together with the popularization of more comprehensive data formats, such as NMReDATA, will lead to an improved traceability of the structure elucidation process, as shown here for 5-cyprinol sulfate. As it is, WebCocon already could easily be integrated into a quality control workflow.
  44 in total

1.  A new inhibitor of 5'-hydroxyaverantin dehydrogenase, an enzyme involved in aflatoxin biosynthesis, from Trichoderma hamatum.

Authors:  E Sakuno; K Yabe; T Hamasaki; H Nakajima
Journal:  J Nat Prod       Date:  2000-12       Impact factor: 4.050

2.  Structure reassignment of the fungal metabolite TAEMC161 as the phytotoxin viridiol.

Authors:  Peter Wipf; Angela D Kerekes
Journal:  J Nat Prod       Date:  2003-05       Impact factor: 4.050

3.  Structure Elucidator: a versatile expert system for molecular structure elucidation from 1D and 2D NMR data and molecular fragments.

Authors:  Mikhail E Elyashberg; Kirill A Blinov; Antony J Williams; Sergey G Molodtsov; Gary E Martin; Eduard R Martirosian
Journal:  J Chem Inf Comput Sci       Date:  2004 May-Jun

Review 4.  Structural revisions of natural products by Computer-Assisted Structure Elucidation (CASE) systems.

Authors:  Mikhail Elyashberg; Antony J Williams; Kirill Blinov
Journal:  Nat Prod Rep       Date:  2010-05-18       Impact factor: 13.423

5.  Statistical filtering for NMR based structure generation.

Authors:  Jochen Junker
Journal:  J Cheminform       Date:  2011-08-11       Impact factor: 5.514

6.  Fuzzy structure generation: a new efficient tool for Computer-Aided Structure Elucidation (CASE).

Authors:  Mikhail E Elyashberg; Kirill A Blinov; Sergey G Molodtsov; Antony J Williams; Gary E Martin
Journal:  J Chem Inf Model       Date:  2007-03-27       Impact factor: 4.956

7.  Heteronuclear two-bond correlation: suppressing heteronuclear three-bond or higher NMR correlations while enhancing two-bond correlations even for vanishing 2J(CH).

Authors:  Nils T Nyberg; Jens O Duus; Ole W Sørensen
Journal:  J Am Chem Soc       Date:  2005-05-04       Impact factor: 15.419

8.  The Advantage of Automatic Peer-Reviewing of 13C-NMR Reference Data Using the CSEARCH-Protocol.

Authors:  Wolfgang Robien
Journal:  Molecules       Date:  2021-06-04       Impact factor: 4.411

9.  The furan-osteroid viridiol.

Authors:  Pierre F Andersson; Anders Broberg; Daniel Lundberg
Journal:  Acta Crystallogr Sect E Struct Rep Online       Date:  2013-03-02
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.