| Literature DB >> 35209156 |
Renata Priscila Barros de Menezes1, Luciana Scotti1, Marcus Tullius Scotti1, Jesús García2, Rosalia González3, Lianet Monzote4, William N Setzer5,6.
Abstract
Essential oils (EOs) are a mixture of chemical compounds with a long history of use in food, cosmetics, perfumes, agricultural and pharmaceuticals industries. The main object of this study was to find chemical patterns between 45 EOs and antiprotozoal activity (antiplasmodial, antileishmanial and antitrypanosomal), using different machine learning algorithms. In the analyses, 45 samples of EOs were included, using unsupervised Self-Organizing Maps (SOM) and supervised Random Forest (RF) methodologies. In the generated map, the hit rate was higher than 70% and the results demonstrate that it is possible find chemical patterns using a supervised and unsupervised machine learning approach. A total of 20 compounds were identified (19 are terpenes and one sulfur-containing compound), which was compared with literature reports. These models can be used to investigate and screen for bioactivity of EOs that have antiprotozoal activity more effectively and with less time and financial cost.Entities:
Keywords: Cuban plants; antiprotozoal activity; essential oil; machine learning analysis
Mesh:
Substances:
Year: 2022 PMID: 35209156 PMCID: PMC8878085 DOI: 10.3390/molecules27041366
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Schematic representation of chemical composition of analyzed EOs. Graphic represent the distribution of total components described for EOs; while list of compounds corresponds to main compounds identified in the EOs.
Accuracy statistics of the training and tests groups of the 5-fold external cross-validation of the Self-Organizing map (SOM).
|
|
|
| ||||
|
|
|
|
|
| ||
| True positive rate | 0.90 | 0.95 | 0.99 | 0.90 | 0.80 | 0.91 |
| True negative rate | 0.70 | 0.65 | 0.68 | 0.70 | 0.75 | 0.71 |
| Accuracy | 0.81 | 0.80 | 0.83 | 0.81 | 0.78 | 0.81 |
|
|
|
| ||||
|
|
|
|
|
| ||
| True positive rate | 0.60 | 0.60 | 0.95 | 0.60 | 0.99 | 0.75 |
| True negative rate | 0.75 | 0.75 | 0.70 | 0.95 | 0.80 | 0.80 |
| Accuracy | 0.67 | 0.67 | 0.83 | 0.77 | 0.90 | 0.78 |
Summary of the statistics parameters of the RF model (average between the five models).
| Model | Specificity | Sensitivity | Accuracy | PPV | NPV |
|---|---|---|---|---|---|
| RF | 0.83 | 0.65 | 0.71 | 0.75 | 0.70 |
RF is random forest, PPV is positive predictive value, and NPV is negative predictive value.
Summary of test averages corresponding to 5-fold cross-validation using the different machine learning algorithms, self-organizing maps (SOM) and random forest (RF).
| Classification of EOs | Average | |
|---|---|---|
| SOM | RF | |
| Active | 0.85 | 0.75 |
| Not reported | 0.83 | 0.70 |
| Accuracy | 0.84 | 0.71 |
Figure 2Visualization of the self-organizing map (SOM) of essential oil (EO) data. In the upper corner we have the U-matrix. The left U-matrix does not identify the activities of the EOs while the right U-matrix identifies those activities by color: active is violet and yellow represents samples with no antiprotozoal activity or that have not been reported. The values shown on the scale between the two U-matrices represent the values of the percent of molecules present in the EOs, varying between 1.19 and 7. These values were used to group the EOs by activities. At the bottom, we have the principal component analysis (PCA) projection of the SOM measured by its two eigenvectors with higher eigenvalues. The activities were plotted using the same identification colors as the U-matrix.
Figure 3The most significant molecules for the EOs by activity. In the upper left we have the U-matrix of the self-organizing map generated in the study, with the upper U-matrix not identifying the tribes and the lower U-matrix identifying the tribes by color; active is violet and yellow represents samples with no antiprotozoal activity or that have not been reported.
In vitro antiprotozoal profile of Cuban and according literature review of EOs that present identified compounds in this study (previous shown in Figure 3).
| Compound | Country | Plant | Compound % | Targeted Protozoa (Result) | Ref. |
|---|---|---|---|---|---|
| ( | Brazil | 6.8% | [ | ||
| 11.7% | [ | ||||
| 6.8% | [ | ||||
| Cuba * | 13% | [ | |||
| 0.49% | [ | ||||
| ( | Brazil | 29% | [ | ||
| Cuba * | 0.9% | [ | |||
| 0.14% | [ | ||||
| β-Phellandrene | Cameroon | 21.1% | [ | ||
| Cuba * | 0.2% | [ | |||
| 2.1% | [ | ||||
| Camphor | Brazil | 17.1% | [ | ||
| 6.5% | [ | ||||
| 5.6% | [ | ||||
| Cuba * | 0.1% | [ | |||
| 17.1% | [ | ||||
| 13.8 and 9.4% | [ | ||||
| Ethiopia | 27.4% | [ | |||
| Morocco | 18.7% | [ | |||
| Spain | 4.5% | [ | |||
| 7.7% | [ | ||||
| Turkey | 42% | [ | |||
| Tunisia | 13.8% | [ | |||
| Germacrene D | Brazil | 19.6% | [ | ||
| 16.1% | [ | ||||
| 22.2% | [ | ||||
| 11.7% | [ | ||||
| 12.7% | [ | ||||
| 12.7% | [ | ||||
| 14.7% | [ | ||||
| 4.3% | [ | ||||
| 17.8% | [ | ||||
| Cuba * | 0.3% | [ | |||
| Methyl eugenol | Brazil | 14.8% | [ | ||
| 5.6% | [ | ||||
| Cuba * | 0.6% | [ | |||
| Bhutan | 3.8% | [ | |||
| Piperitone | Benin | 60.3% | [ | ||
| Cuba * | 0.1% | [ | |||
| 0.1% | [ | ||||
| 23.7% | [ | ||||
| 20.1 and 19.0% | [ | ||||
| Safrole | Brazil | 8.3% | [ | ||
| 6.5% | [ | ||||
| Cuba * | 71.8% | [ |
* Data of EOs from Cuba used in this study.