| Literature DB >> 35494377 |
Yu-Tang Wang1,2, Zhao-Xia Yang3, Zan-Hao Piao1,2, Xiao-Juan Xu1,2, Jun-Hong Yu3, Ying-Hua Zhang1,2.
Abstract
In order to make a preliminary prediction of flavor and retention index (RI) for compounds in beer, this work applied the machine learning method to modeling depending on molecular structure. Towards this goal, the flavor compounds in beer from existing literature were collected. The database was classified into four groups as aromatic, bitter, sulfury, and others. The RI values on a non-polar SE-30 column and a polar Carbowax 20M column from the National Institute of Standards Technology (NIST) were investigated. The structures were converted to molecular descriptors calculated by molecular operating environment (MOE), ChemoPy and Mordred, respectively. By combining the pretreatment of the descriptors, machine learning models, including support vector machine (SVM), random forest (RF) and k-nearest neighbour (kNN) were utilized for beer flavor models. Principal component regression (PCR), random forest regression (RFR) and partial least squares (PLS) regression were employed to predict the RI. The accuracy of the test set was obtained by SVM, RF, and kNN. Among them, the combination of descriptors calculated by Mordred and RF model afforded the highest accuracy of 0.686. R 2 of the optimal regression model achieved 0.96. The results indicated that the models can be used to predict the flavor of a specific compound in beer and its RI value. This journal is © The Royal Society of Chemistry.Entities:
Year: 2021 PMID: 35494377 PMCID: PMC9044825 DOI: 10.1039/d1ra06551c
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Overview of the resources used for creating beer flavor database
| Flavor | Number of molecules | Reference |
|---|---|---|
| FlavorDB | ||
| Aromatic | 139 | Coelho |
| Gonzalez | ||
| Ochiai | ||
| Lehnert | ||
| Bitter | 62 | Irwin |
| Kaneda | ||
| Verstrepen | ||
| Shen | ||
| Sulfury | 38 | Intelmann |
| Sanekata | ||
| Bettenhausen | ||
| Neiens | ||
| Other | 62 | Dresel |
| Pires | ||
| Sigler |
Fig. 1Variable correlation plot of flavor compounds shows contribution rate of each variable to the principal component (a) MOE, (b) ChemoPy, (c) Mordred.
Fig. 2Loading profiles for the PCs of flavor compounds based on the descriptors calculated by (a) MOE, (b) ChemoPy, (c) Mordred.
Fig. 3The accuracy comparisons of 9 models.
Fig. 4The AUC values of each flavor calculated under the RF, SVM, and kNN models include (a) aromatic, (b) bitter, (c) sulfury, (d) other.
Fig. 5Experimental versus predicted retention indices for (a) non-polar SE-30 column and (b) polar Carbowax 20M column.