| Literature DB >> 35558685 |
Probir Kumar Ojha1, Kunal Roy1.
Abstract
Tea and coffee are the most attractive non-alcoholic beverages used worldwide due to the odorant properties of diverse components present in these beverages. The aim of this work is to investigate the key structural features which regulate the odorant properties of constituents present in black tea and coffee using regression-based chemometric models. We have also investigated the key structural properties which create the odor difference between tea and coffee. We have employed different variable selection strategies to extract the most relevant variables prior to development of final partial least squares (PLS) models. The models were extensively validated using different validation metrics, and the results justify the reliability and usefulness of the developed predictive PLS models. The best PLS model captured the necessary structural information on relative hydrophobic surface area, heteroatoms with higher number of multiple bonds, hydrogen atoms connected to C3(sp3)/C2(sp2)/C3(sp2)/C3(sp) fragments, electron-richness, C-O atom pairs at the topological distance 10 and surface weighted charged partial negative surface areas for explaining the odorant properties of the constituents present in black tea. On the other hand, C-S atom pairs at the topological distance 1, C-C atom pairs at the topological distance 5, donor atoms like N and O for hydrogen bonds, hydrogen atoms connected to C3(sp3)/C2(sp2)/C3(sp2)/C3(sp) fragments and R-CX-X fragments (where, R represents any group linked through carbon and X represents any heteroatom (O, N, S, P, Se, and halogens)) are the key structural components captured by the PLS model developed from the constituents present in coffee. The developed models can thus be successfully utilized for in silico prediction of odorant properties of diverse classes of compounds and exploration of the structural information which creates the odor difference between black tea and coffee. This journal is © The Royal Society of Chemistry.Entities:
Year: 2018 PMID: 35558685 PMCID: PMC9092630 DOI: 10.1039/c7ra12914a
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1Schematic representation of the steps involved in the development of final PLS models.
Statistical quality and validation parameters of the final PLS models (black tea and coffee)
| Dataset | Model type | Descriptors |
|
|
| LV |
|
|
|
| Δ | CCC | MAE based criteria (test) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Black tea | PLS model | H-049, ETA_Eta_F, ETA_BetaP_ns, Jurs-WNSA-3, F10[C–O], 〈Jurs-RASA-0.767154〉 | 0.616 | 0.578 | 0.534 | 5 | 1.112 | 0.608 | 0.586 | 0.536 | 0.152 | 0.791 | Moderate |
| Coffee | PLS model | C-029, H-049, F05[C–C], nHDon, B01[C–S], ETA_Eta | 0.722 | 0.696 | 0.639 | 3 | 1.068 | 0.781 | 0.781 | 0.777 | 0.101 | 0.905 | Moderate |
Results of the final PLS models (black tea and coffee) obtained according to Golbraikh and Tropsha's criteria
| Parameters | PLS model | Remarks | Threshold value | ||
|---|---|---|---|---|---|
| Black tea | 1 |
| 0.648 | Passed |
|
| 2 | [( | 0.015626143 | Passed | <0.1 | |
|
| 0.116611832 | Passed | |||
| 3 |
| 0.9252 | Passed | 0.85 < | |
|
| 1.0337 | ||||
| Coffee | 1 |
| 0.837 | Passed | r2 > 0.6 |
| 2 | [( | 0.015552055 | Passed | <0.1 | |
|
| 0.000415929 | Passed | |||
| 3 |
| 0.8815 | Passed | 0.85 < | |
|
| 1.0561 |
Fig. 2The PLS model developed from the constituents present in black tea: (A) the scatter plot of the observed and the predicted values of odorant property [log(1/OT)] for the final PLS model. The dashed line indicates the best fit line based on test set compounds and the solid line indicates the best fit line based on the training set compounds. (B) The PLS score plot of the training set compounds using the developed PLS model. (C) The loading plot of the model descriptors and dependent variable (log(1/OT)).
Fig. 3The PLS model developed from the constituents present in coffee: (A) the scatter plot of the observed and the predicted values of odorant property [log(1/OT)] for the final PLS model. The dashed line indicates the best fit line based on test set compounds and the solid line indicates the best fit line based on the training set compounds. (B) The PLS score plot of the training set compounds using the developed PLS model. (C) The loading plot of the model descriptors and dependent variable (log(1/OT)).