| Literature DB >> 34793633 |
Wout Bittremieux1,2,3, Rohit S Advani1,2,4, Alan K Jarmusch1,2,5, Shaden Aguirre1,2, Aileen Lu1,2, Pieter C Dorrestein1,2, Shirley M Tsunoda1.
Abstract
Chemicals, including some systemically administered xenobiotics and their biotransformations, can be detected noninvasively using skin swabs and untargeted metabolomics analysis. We sought to understand the principal drivers that determine whether a drug taken orally or systemically is likely to be observed on the epidermis by using a random forest classifier to predict which drugs would be detected on the skin. A variety of molecular descriptors describing calculated properties of drugs, such as measures of volume, electronegativity, bond energy, and electrotopology, were used to train the classifier. The mean area under the receiver operating characteristic curve was 0.71 for predicting drug detection on the epidermis, and the SHapley Additive exPlanations (SHAP) model interpretation technique was used to determine the most relevant molecular descriptors. Based on the analysis of 2561 US Food and Drug Administration (FDA)-approved drugs, we predict that therapeutic drug classes, such as nervous system drugs, are more likely to be detected on the skin. Detecting drugs and other chemicals noninvasively on the skin using untargeted metabolomics could be a useful clinical advancement in therapeutic drug monitoring, adherence, and health status.Entities:
Mesh:
Year: 2021 PMID: 34793633 PMCID: PMC8932847 DOI: 10.1111/cts.13198
Source DB: PubMed Journal: Clin Transl Sci ISSN: 1752-8054 Impact factor: 4.689
FIGURE 1Body sites of the drugs found through spectral library searching. Body sites for the identified drugs were retrieved from the Uberon annotations specified in ReDU, and drug counts per body site were normalized by the total number of ReDU entries for each body site
FIGURE 2ROC curve indicating the performance of the random forest classifier to predict whether drugs can be observed on the epidermis. The curve is the mean ROC curve over 100 random stratified training (80% of the data) and test (20% of the data) splits. The standard deviation over the splits is indicated by the shaded area. The mean AUC is 0.707, with a standard deviation of 0.095. AUC, area under the curve; ROC, receiver operating characteristic
FIGURE 3SHAP features of importance for the top 20 most important Mordred features from the random forest classifier for the 145 training compounds. A positive SHAP feature importance contributes to drugs predicted to appear on the epidermis, whereas a negative SHAP feature importance contributes to drugs predicted to not appear on the epidermis. The top‐ranked features capture information about the volume, electronegativity, bond energy, and electrotopology of the molecules. See Table S4 for a full description of the Mordred features. SHAP, SHapley Additive exPlanations
FIGURE 4Force plots of the SHAP values to interpret predictions of individual drugs. The most important features, their values, and the direction in which they contribute to the predictions (higher/red: observed, lower/blue: not observed) are displayed. The horizontal axis shows the model probability, with the prediction score indicated by “f(x).” Scores above the expected value based on the training data (“base value”) constitute positive predictions, and scores below the expected value constitute negative predictions. The size of the bars for individual features indicates their magnitude contributing to a positive or negative prediction. (a) Diphenhydramine is predicted to be observed on the epidermis. (b) Diphenhydramine N‐hexose is predicted to not be observed on the epidermis. (c) Citalopram is predicted to be observed on the epidermis. (d) Tacrolimus is predicted to not be observed on the epidermis. SHAP, SHapley Additive exPlanations
FIGURE 5Prediction scores for 2561 FDA approved drugs and their 23,693 biotransformations, subdivided by their drug class in the ATC classification system. ATC, Anatomic Therapeutic Chemical; FDA, US Food and Drug Administration