| Literature DB >> 33385118 |
Shubham K Jaiswal1, Shitij Manojkumar Agarwal1, Parikshit Thodum1, Vineet K Sharma1.
Abstract
In addition to being pivotal for the host health, the skin microbiome possesses a large reservoir of metabolic enzymes, which can metabolize molecules (cosmetics, medicines, pollutants, etc.) that form a major part of the skin exposome. Therefore, to predict the complete metabolism of any molecule by skin microbiome, a curated database of metabolic enzymes (1,094,153), reactions, and substrates from ∼900 bacterial species from 19 different skin sites were used to develop "SkinBug." It integrates machine learning, neural networks, and chemoinformatics methods, and displays a multiclass multilabel accuracy of up to 82.4% and binary accuracy of up to 90.0%. SkinBug predicts all possible metabolic reactions and associated enzymes, reaction centers, skin microbiome species harboring the enzyme, and the respective skin sites. Thus, SkinBug will be an indispensable tool to predict xenobiotic/biotic metabolism by skin microbiome and will find applications in exposome and microbiome studies, dermatology, and skin cancer research.Entities:
Keywords: In Silico Biology; Metabolomics; Systems Biology
Year: 2020 PMID: 33385118 PMCID: PMC7772573 DOI: 10.1016/j.isci.2020.101925
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Diversity of metabolic enzymes on skin sites
(A) Proportions of different metabolic enzymes from six types of reaction classes (“Oxidoreductases,” “Hydrolases,” “Transferases,” “Lyases,” “Isomerases,” and “Ligases”) plotted as stacked bar plots across the 19 different skin sites.
(B) Silhouette plot for determining the optimum number of clusters for clustering different skin sites. The y axis is average silhouette width value, and the x axis is the number of clusters. The k-value at which the maximum of average silhouette width is achieved is the optimum number of clusters.
(C) The dendrogram showing the results of hierarchical clustering of the skin sites based on the 2,523 unique reactions that can be performed by the metabolic enzymes of species present in skin microbiome. The approximate unbiased p values (AUp) and the bootstrap probability (BP) values for each branch/cluster were calculated using multiscale bootstrap resampling and using normal bootstrap resampling, respectively. The AUp values are mentioned in red, and the BP values are mentioned in green.
See also Figures S1–S4.
Figure 2Evaluating the diversity and complexity of metabolic dataset
(A) The cumulative scree plot and a normal scree plot from the PCA analysis of all the substrate molecules from the complete input dataset based on the 2,322 selected features. The x axis is the principal component number, y axis for the dot plot is the cumulative variance explained by the individual principal components, and the y axis for the bar plot is the percentage of variance explained by the individual principal components.
(B) The density-based clustering of substrate molecules using the principal component PC1 and PC2 from the PCA analysis. The density-based clustering was performed with MinPts value 20 and epsilon value 3.25. Three dense clusters were identified, which are colored as blue for cluster-1, yellow for cluster-2, and gray for cluster-3.
(C) The proportion of different reaction classes in each cluster is shown as pie chart with percentage value labeled for each reaction class. The proportion is calculated based on the number of substrate molecules that can undergo a particular kind of reaction class in that cluster. The pie chart is mentioned for all three above identified clusters: cluster-1, cluster-2, and cluster-3.
See also Figures S5 and S6.
Figure 3The algorithm and complete workflow for the construction of SkinBug
Figure 4The multilabel and binary performance of reaction class-specific RFSRC prediction model
(A) The multilabel accuracy, multilabel sensitivity, multilabel precision, multilabel F1 score, and hamming loss mentioned for three type of statistical testing of the RFSRC model: stratified randomly sampled test dataset, 5-fold cross-validation, and blind dataset.
(B) The box plot for the performance across each fold of the 5-fold cross-validation for the five different performance measures: multilabel accuracy, multilabel sensitivity, multilabel precision, multilabel F1 score, and hamming loss. The y axis shows the fraction value for each of the performance measure. Fraction values were chosen so that all the measures including hamming loss could be plotted on the same y axis scale.
(C) The binary performance of the reaction class prediction RFSRC model for each of six different reaction classes (“Oxidoreductases,” “Hydrolases,” “Transferases,” “Lyases,” “Isomerases,” and “Ligases”). Fraction values were chosen so that all the measures could be shown on the same scale.
AUC = area under the curve, MMCE = binary mean misclassification error, FNR = binary false-negative rate, FPR = binary false-positive rate, ACC = binary accuracy, MCC = Matthews correlation coefficient, NPV = binary negative predictive value, PPV = binary positive predicted value, F1 = binary F1 score, FDR = binary false discovery rate, GPR = geometric mean of binary precision and binary recall.
See also Figures S7–S9.
Figure 5The optimization of epochs and performance evaluation of ANN model
(A) The plot of log loss or binary cross-entropy for different values of epochs for the training and testing dataset.
(B) The plot of binary accuracy for different values of epochs for the training and testing datasets.
(C) The plot of categorical accuracy for different values of epochs for the training and testing datasets.
(D and E) (D) The box plot for the performance across each fold of the 5-fold cross-validation for the three different performance measures of ANN model: categorical accuracy, binary accuracy, and log loss or binary cross-entropy. The y axis for the binary and categorical accuracy shows the percentage value, whereas the y axis for the log loss or binary cross-entropy shows the actual value. (E) The performance of ANN model measured using three evaluation measures, categorical accuracy, binary accuracy, and log loss or binary cross-entropy for two types of statistical testing methods: 5-fold cross-validation and testing on stratified randomly sampled testing dataset.
See also Figures S10–S14.
Multiclass multilabel performance metrics for the reaction subclass-specific models calculated using stratified random sampling
| Reaction class | Accuracy (%) | Sensitivity (%) | Precision (%) | F1 score (%) | Hamming loss |
|---|---|---|---|---|---|
| Oxidoreductases | 73.9 | 77.7 | 88.6 | 76.5 | 0.019 |
| Transferases | 61.0 | 63.7 | 83.3 | 63.9 | 0.072 |
| Hydrolases | 72.4 | 76.8 | 79.2 | 74.8 | 0.04 |
| Lyases | 74.4 | 76.7 | 85.6 | 76.2 | 0.058 |
| Isomerases | 72.2 | 73.6 | 91.4 | 73.1 | 0.06 |
| Ligases | 72.0 | 72.0 | 75.0 | 72.0 | 0.093 |
All the parameters mentioned above were calculated using multilabel evaluation methodology (see Tables S9–S14).
Multiclass multilabel performance metrics for the reaction subclass-specific models calculated using 5-fold cross-validation
| Reaction class | Accuracy (%) | Sensitivity (%) | Precision (%) | F1 score (%) | Hamming loss |
|---|---|---|---|---|---|
| Oxidoreductases | 61.5 | 64.3 | 82.8 | 64.9 | 0.036 |
| Transferases | 56.3 | 59.6 | 80.8 | 60.0 | 0.09 |
| Hydrolases | 63.1 | 66.3 | 80.5 | 66.3 | 0.052 |
| Lyases | 64.4 | 67.1 | 77.7 | 67.1 | 0.084 |
| Isomerases | 65.2 | 67.7 | 82.5 | 67.0 | 0.089 |
| Ligases | 77.7 | 80.0 | 84.5 | 79.9 | 0.075 |
All the parameters mentioned above were calculated using multilabel evaluation methodology (see also Tables S3–S8).
Figure 6The prediction steps and complete workflow for processing of any input molecule by the SkinBug web server
The prediction of metabolism of the 28 selected molecules by SkinBug along with their known metabolism from literature
| Sr. No. | Compound | Function | Reaction subclass from literature | Reaction subclass from SkinBug | Enzyme from literature | Enzyme from SkinBug | Skin microbial genus from literature | Skin microbes genus from SkinBug | References |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Cinnamyl alcohol | Perfuming, Masking | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen; acyltransferases | Alcohol dehydrogenase; cytochrome P450 | Cinnamyl alcohol dehydrogenase; acetyl CoA benzyl alcohol acetyltransferase | NA | No genus predicted | ( |
| 2 | Leucine | Antistatic, hair conditioning, skin conditioning | Transferring nitrogenous groups | Acting on the CH-NH2 group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen; acyltransferases; transferring nitrogenous groups; acting on carbon-nitrogen bonds, other than peptide bonds; intramolecular transferases; forming carbon-oxygen bonds | Branched chain amino acid aminotransferase | Isoleucine N-monooxygenase; valine N-monooxygenase; leucine N-acetyltransferase; branched chain amino acid transferase; leucine transaminase; leucine 2,3-aminomutase; leucine tRNA ligase; etc. | ( | ||
| 3 | Glycerol | Perfuming, denaturant, humectant, solvent, hair conditioning, viscosity controlling | Acting on the CH-OH group of donors; transferring phosphorus-containing groups | Acting on the CH-OH group of donors; glycosyltransferases; transferring phosphorus-containing groups; acting on ester bonds; glycosylases; carbon-carbon lyases; carbon-oxygen lyases | Glycerol kinase; glycerol dehydrogenase | Glycerol kinase; glycerol dehydrogenase; 1,2-alpha-glucosylglycerol phosphorylase; acylglycerol lipase; glycerol phosphatase; dihydroxy acid dehydratase; etc. | ( | ||
| 4 | Benzo(a)pyrene | Environmental pollutant found in soot, tobacco smoke, diesel exhaust etc. | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen | Cytochrome P450: CYP1A1, monooxygenases, dioxygenases | Naphthalene 1,2-dioxygenase; unspecific monooxygenases; steroid 21-monooxygenase; etc. | ( | ||
| 5 | Retinoic acid | Antiseborrheic | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the aldehyde or oxo group of donors; Glycosyltransferases | Cytochrome P450: CYP1A1 | Unspecific monooxygenase; alkane monooxygenase; retinal dehydrogenase; aldehyde oxidase; glucuronosyl transferase; etc. | NA | ( | |
| 6 | 7,12-Dimethylbenz | Environmental pollutant found in tobacco smoke | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen | Cytochrome P450: CYP1A1, cytochrome P450: CYP1B1 | Unspecific monooxygenase; aromatase; estradiol 6-beta monooxygenase | NA | No genus predicted | ( |
| 7 | Vitamin D3 | Skin conditioning | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on the CH-CH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen | Cytochrome P450 | Vitamin D3,24-hydroxylase; calcidiol 1-monooxygenase; vitamin D1, 25-hydroxylase | NA | ( | |
| 8 | 4-Allylanisole | Perfuming | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen; transferring one-carbon groups | Cytochrome P450 | 4-(hydroxymethyl) benzenesulfonate dehydrogenase; (iso) eugenol O-methyltransferase; trans-anol O-methyltransferase | NA | No genus predicted | ( |
| 9 | Isoeugenol | Perfuming, masking, drug: local antiseptic and analgesic | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen; transferring one-carbon groups | Cytochrome P450, cytochrome P450: CYP2E1, alcohol dehydrogenase | Alcohol dehydrogenase; isoeugenol synthase; salicylate 1-monooxygenase; 4-hydroxypheny | NA | ( | |
| 10 | 4-Phenylenediamine | Hair dyeing | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on other nitrogenous compounds as donors; acting on paired donors, with incorporation or reduction of molecular oxygen; carbon-carbon lyases | Cytochrome P450 | Nitrobenzene nitroreductase; histidine decarboxylase | NA | ( | |
| 11 | Aniline | Outdoor air, tobacco smoke | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on other nitrogenous compounds as donors; acting on paired donors, with incorporation or reduction of molecular oxygen; transferring one-carbon groups; acyltransferases; glycosyltransferases; transferring sulfur-containing groups; acting on carbon-nitrogen bonds, other than peptide bonds; carbon-carbon lyases; forming carbon-nitrogen bonds | Cytochrome P450 | Phenol 2-monooxygenase; azobenzene reductase; arylamine N-acetyltransferase; arylamine glucosyltransferase; amine sulfotransferase; aryl acylamidase; aminobenzoate decarboxylase; gamma-glutamylanilide synthase; etc. | NA | ( | |
| 12 | Methyl red or azo dye | Textile dyes, tattoo inks, and cosmetic colorant | Acting on other nitrogenous compounds as donors | Acting on the CH-CH group of donors; acting on other nitrogenous compounds as donors; acting on single donors with incorporation of molecular oxygen (oxygenases); acting on paired donors, with incorporation or reduction of molecular oxygen | Azoreductase (Azo1) | FMN-dependent NADPH azoreductase (Azo1); NAD(P)H-dependent oxidoreductase; 4-(dimethylamino) | ( | ||
| 13 | Fluorouracil | Used in actinic keratosis and skin warts conditions | Glycosyltransferases; acting on the CH-CH group of donors | Glycosyltransferase; acting on the CH-CH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen; acting on CH or CH2 groups; glycosylases; acting on carbon-nitrogen bonds, other than peptide bonds | Orotate phosphoribosyl | Orotate phosphoribosy | NA | ( | |
| 14 | Propylene glycol | Used as demulcent and in skin lotions and ointments | Acting on the CH-OH group of donors | Acting on the CH-OH group of donors; carbon-carbon lyases; carbon-oxygen lyases | Alcohol dehydrogenases | L-glycol dehydrogenase; aldehyde reductase; glycerol dehydrogenase; lactaldehyde reductase; propanediol dehydratase; etc. | NA | ( | |
| 15 | Hydroquinone | Demelanizing agent | Acting on single donors with incorporation of molecular oxygen; glycosyltransferases | Acting on single donors with incorporation of molecular oxygen (oxygenases); acting on paired donors, with incorporation or reduction of molecular oxygen; glycosyltransferases; transferring alkyl or aryl groups, other than methyl groups; glycosylases; carbon-carbon lyases; etc. | Dioxygenases; quinone oxidoreductase | Catechol 1 2-dioxygenase; hydroquinone 1 2-dioxygenase; toluene 4-monooxygenase; p-benzoquinone reductase; 4-hydroxybenzoate 1-hydroxylase; arylesterase; hydroquinone glucosyltransferase; etc. | NA | ( | |
| 16 | Para-aminobenzoic acid | Used as sunscreen and melanizing agent. Also used in treatment of fibrotic skin disorders | Transferring alkyl or aryl groups, other than methyl groups | Transferring alkyl or aryl groups, other than methyl groups; acting on paired donors, with incorporation or reduction of molecular oxygen; transferring one-carbon groups; carbon-carbon lyases; etc. | Dihydropteroate synthase | Benzoate 1 2-dioxygenase; phenylethanolamine N-methyltransferase; dihydropteroate synthase; aminobenzoate decarboxylase; etc. | NA | ( | |
| 17 | Benzophenone | Used as organic sunscreen and melanizing agent. Also used as a fragrance enhancer | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the CH-OH group of donors | Cytochrome P450 (CYPs) | Nitrilotriacetate monooxygenase | NA | ( | |
| 18 | Lindane | Used in scabies and pediculosis skin conditions | Acting on halide bonds; acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on halide bonds; acting on paired donors, with incorporation or reduction of molecular oxygen; etc. | Dehydrochlorinase enzyme | Haloalkane dehalogenase | NA | ( | |
| 19 | Ethinyl estradiol | Used for the treatment of moderate acne vulgaris (common acne) in females | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the CH-OH group of donors; glycosyltransferases; etc. | Cytochrome P450 (CYPs) | Unspecific monooxygenase; aromatase; 3-beta-hydroxy-delta-5-steroid dehydrogenase; etc. | NA | No genus predicted | ( |
| 20 | Docosanol | Used in the treatment of herpes virus infection-caused recurring episodes of small, painful, fluid-filled blisters on the skin | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on the CH-OH group of donors; acting on paired donors, with incorporation or reduction of molecular oxygen | NA | Hexadecanol dehydrogenase; long-chain-alcohol dehydrogenase; naphthalene 1 2-dioxygenase; etc. | NA | ( | |
| 21 | Lauric acid | Used on skin for its antibacterial properties and ability to effectively combat acne | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the aldehyde or oxo group of donors | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the aldehyde or oxo group of donors; acting on ester bonds | Fatty acid beta-oxidation enzymes such as acyl CoA dehydrogenase, hydrolase, etc. | Trimethyllysine dioxygenase; aldehyde dehydrogenase; oleoyl-(acyl-carrier-protein) hydrolase; clavaminate synthase; etc. | NA | ( | |
| 22 | Methyl lactate | Used as a soothing and cooling agent for skin. Also to provide relief against itching and irritation on skin | Acting on the CH-OH group of donors | Acting on the CH-OH group of donors; acting on the aldehyde or oxo group of donors; acyltransferases; etc. | Lactate dehydrogenase | L-lactate dehydrogenase; glyoxylate reductase; oxaloglycolate reductase (decarboxylating); pyruvate synthase; formate C-acetyltransferase; etc. | ( | ||
| 23 | Methylparaben | Used in cosmetic products as preservative to give products a longer shelf life | Glycosyltransferases; acting on ester bonds; acting on paired donors, with incorporation or reduction of molecular oxygen | Glycosyltransferases; acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the CH-OH group of donors; etc. | Glucuronosyltransferase | Cyanohydrin beta-glucosyltransferase; 4-hydroxypheny | NA | ( | |
| 24 | Triclosan | Used as antiseptic and antibacterial on the skin | Acting on paired donors, with incorporation or reduction of molecular oxygen; glycosyltransferases | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on single donors with incorporation of molecular oxygen (oxygenases); acting on the CH-CH group of donors | Sulfur oxygenase/reductase; cytochrome P450 (CYPs); glucuronosyltransferase | Sulfur oxygenase/reductase; carbazole 1,9a-dioxygenase; naphthalene 1,2-dioxygenase; persulfide dioxygenase; etc. | NA | ( | |
| 25 | Terbinafine | Used to treat the fungal infections on skin | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen | Cytochromes (CYPs) | Ammonia monooxygenase | NA | ( | |
| 26 | Alpha-tocopherol | Used in cosmetic products to prevent UV damage to the skin | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the CH-OH group of donors; transferring one-carbon groups | Cytochromes (CYPs) | Validamycin A dioxygenases; aurachin C monooxygenase/isomerase; calcidiol 1-monooxygenase; thymidylate synthase; etc. | ( | ||
| 27 | Cholesterol | Used as an emollient in cosmetic products such as eye makeup, face makeup, skin lotions, creams, and hair care formulations | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on the CH-OH group of donors; acting on the CH-CH group of donors; intramolecular oxidoreductases | Sterol 27-hydroxylase | Cholesterol 25-hydroxylase; vitamin D 25-hydroxylase; 11-beta-hydroxysteroid dehydrogenase; 3-beta-hydroxysteroid 3-dehydrogenase; etc. | ( | ||
| 28 | Linoleic acid | Used in the conditions of skin irritation and to reduce acne breakouts | Acting on paired donors, with incorporation or reduction of molecular oxygen | Acting on paired donors, with incorporation or reduction of molecular oxygen; acting on single donors with incorporation of molecular oxygen (oxygenases); acting on ester bonds | Lipoxygenase; delta-6-desaturase | Linoleate 13S-lipoxygenase; linoleate 11-lipoxygenase; acyl-CoA 6-desaturase; palmitoyl-CoA hydrolase; etc. | ( |
Threshold for the subclass prediction models of “Oxidoreductases” was set from 0.5 to 0.1, and for “Transferases,” “Hydrolases,” and “Lyases” the thresholds were set from 0.5 to 0.2. The Tanimoto coefficient threshold was set to 0.5.
Tanimoto coefficient threshold was set to 0.5 or 0.3.