Literature DB >> 32685821

Pruned Machine Learning Models to Predict Aqueous Solubility.

Alexander L Perryman¹, Daigo Inoyama¹, Jimmy S Patel¹, Sean Ekins², Joel S Freundlich^1,3.

Abstract

Solubility is a key metric for therapeutic compounds. Conversely, insoluble compounds cloud the accuracy of assays at all stages of chemical biology and drug discovery. Herein, we disclose naïve Bayesian classifier models to predict aqueous solubility. Publicly accessible aqueous solubility data were used to create two full, or nonpruned, training sets. These two sets were also combined to create a full fused set, and a training set comprised of a literature collation of solubility data was also considered as a reference. We tested different extents of data pruning on the training sets and constructed machine learning models that were evaluated with two independent, external test sets that contained compounds that were different from the training sets. The best pruned and fused model was significantly more accurate, in comparison to either the full model or the full fused model, with the prediction of these external test sets. By carefully removing data from the training set, less information can be used to create more accurate machine learning models for aqueous solubility. This knowledge and the curated training sets should prove useful to future machine learning approaches.

Entities: Chemical Disease Species

Year: 2020 PMID： 32685821 PMCID： PMC7364544 DOI： 10.1021/acsomega.0c01251

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

More than 20 years after the landmark publication that provided the rule of 5 and asserted the importance of high solubility alongside potency and permeability,[1] poor aqueous solubility (heretofore solubility) of drugs and their precursor hit, lead, and chemical tool compounds continue to be a major issue for chemical biology and drug discovery research. Solubility can have a dramatic impact on the absorption, distribution, metabolism, and excretion (ADME) and pharmacokinetic (PK) properties of a molecule, and it can necessitate significant efforts in formulation to enable downstream in vivo efficacy studies. Within the early stages of development, a compound’s solubility is initially gauged by its kinetic solubility and, in contrast, thermodynamic or equilibrium solubility is generally reserved for downstream assessment.[2] These values are crucial as low solubility can cause potentially valuable compounds to be ignored or neglected, because their activity may be underestimated.[3] Similarly, large differences in the solubility of compounds can produce noisy and inaccurate structure–activity relationship (SAR) data, while adversely affecting in vivo performance, diminishing PK parameters, reducing efficacy, and driving toxicity.[3] The ability to more accurately predict the solubility of druglike small molecules would, therefore, be of significant value. Publications have detailed studies of models and methods for predicting solubility from structure alone or from other measured or estimated molecular properties. Early studies by Huuskonen and collaborators collated sets of compounds (from 24 structurally similar compounds to 884 more diverse compounds) for which solubility was determined, and molecular descriptors with artificial neural networks (ANN) or multiple linear regression (MLR) were implemented to train and validate models.[4−9] The Huuskonen data have been used extensively by others to compare new methods for predicting solubility,[10−14] which highlights the value of developing new, well-curated data sets to the community, far beyond the utility of any initial model. Many different machine learning methods have since been used to predict solubility, such as random forest, partial least squares, Support Vector Machines, Gaussian process Bayesian, ridge regression, and deep learning.[15−20] Across these previous publications, we have seen a growth in the size of the solubility training and test sets, as well as the diversity of methods used to build predictive models. We present a new study on predicting solubility, in which we curated the largest publicly available solubility data set—the Molecular Libraries Small Molecule Repository (MLSMR) set of 57 824 unique compounds (PubChem AID 1996). Naïve Bayesian classifier (heretofore referred to as Bayesian) models were constructed using this set, an AstraZeneca (AZ) set of 1763 compounds (assay ID CHEMBL3301364), and a full fused set in which the MLSMR and AZ sets were combined together to generate 59 510 unique compounds (MSLMR + AZ). While contemplating training sets, it is important to mention that we previously proposed a novel data pruning strategy, in which moderately active compounds are deleted from the training set for a binary classifier model.[21] The pruned Bayesian model was more accurate than the conventional full (i.e., nonpruned) model for predicting the mouse liver microsomal (MLM) stability of independent, external test, and validation sets. The pruned Bayesian model, superior to random forest and support vector machine models that utilized the same training set and descriptors, was also able to guide the evolution of an antitubercular thienopyrimidine to afford analogs with superior metabolic stability.[22] In the present study, we systematically tested several different extents of pruning of the MLSMR set, the AZ set, and the fused MLSMR + AZ set to examine how different levels of pruning impact the ability to predict whether a compound has a solubility ≥100 μM (a typical goal value of this property[23]). Since we utilized a Bayesian classifier model with this 100 μM cutoff, our goal was to differentiate “good” or sufficiently soluble compounds, defined as having a solubility ≥100 μM, from “bad” or insufficiently soluble compounds, which have a solubility <100 μM. Although some may consider 100 μM a high bar for solubility, we believe it to be a reasonable metric given our experience. Since a purpose of the model is to enable the rapid filtering of large areas of chemical space (i.e., millions of compounds), we preferred the selection of a high bar, instead of a minimum value of solubility that could be tolerated for a given program, to help us focus on the regions of chemical space that are more likely to be fruitful. This is especially important because during the course of a molecular optimization, some solubility may be sacrificed in pursuit of other compound metrics. We hypothesized that pruning an insufficient number of compounds from the training set would not produce a significant boost in accuracy, while pruning too many compounds would significantly decrease the predictive power of the model produced. Although this hypothesis is fairly obvious, finding an optimal extent of pruning was nontrivial. The resulting best model, as judged by internal and external statistics, was found to be a model we have termed the Pruned and Fused MLSMR + AZ.

Results and Discussion

Our search of publicly available solubility assay data revealed that different approaches have been utilized to measure solubility.[24] For example, kinetic methods are generally used for high-throughput and early discovery studies. For this method, samples are usually introduced as dimethyl sulfoxide (DMSO) stock solutions into the solubility buffer (typically pH 7.4 phosphate-buffered saline (PBS)), with light scattering data obtained within 1–4 h. In the same vein, a semiequilibrium method also employs this protocol but with measurements being made after 24 h. It is important to note that both of these approaches assay the compound in the presence of typically 1–5% DMSO, which increases the ultimately measured solubility. In contrast, the third method of assaying solubility is thermodynamic or equilibrium. In this approach, the compound, in its crystalline form, is directly added to solubility buffer with measurements taken after 24 h.[2] It is crucial to point out that this method is a more conservative measure of solubility due to the absence of DMSO used in the kinetic and semiequilibrium protocols.[24,25] The more conservative method requires significantly greater amounts of the compound than the high-throughput methods (milligrams of the solid compound vs microliters of mM DMSO stocks).[2] Hence, for hit-to-lead optimization programs, where relatively small amounts of compounds are being synthesized and profiled, kinetic and semiequilibrium methods are more typically employed. The solubility assay data curated herein were identified within PubChem BioAssay and ChEMBL.[26−28] These data were reformatted by combining the PubChem data from a comma-separated-value file (csv; sorted by compound ID) and a structural data file (sdf) into a Discovery Studio spreadsheet or by converting the Excel file from ChEMBL into an sdf file using CDD (Collaborative Drug Discovery; www.collaborativedrug.com).[29] The data were then curated in Discovery Studio 4.5 (BIOVIA, Inc.), and a custom Pipeline Pilot script was used to delete duplicate compounds (according to canonical SMILES) and salts or buffers (by keeping the largest fragment component). Duplicates were deleted from each full training set, as well as from the full fused set and the two external test sets to be discussed later. This preparation process in Pipeline Pilot also included calculating the dominant protonation state and tautomer of each compound at pH 7.4. Our two original training sets were: (a) the Full MLSMR data set measured by the Sanford-Burnham Medical Research Institute (PubChem AID 1996), composed of 57 824 unique compounds, of which 31 644 compounds (54.7%) were defined as soluble (using the definition that solubility ≥100 μM = soluble = good or active), and (b) the Full AZ training set (assay ID CHEMBL3301364), composed of 1763 compounds with solubility data measured by AstraZeneca, of which 618 compounds (35.1%) were defined as soluble. The MLSMR data set contains solubility data that was measured by the semiequilibrium method, while the solubility data from the AZ set was measured via the equilibrium method. We also utilized the Huuskonen collation of solubility data as an additional training set,[4,6−9] containing 987 soluble compounds (76.5%) out of a total of 1290 compounds. Given the Huuskonen training set’s composition of solubility data measured with more than one methodology, we hypothesized that combining the kinetic (MLSMR) and equilibrium (AZ) solubility would ultimately train a better model. Hence, the Full Fused MLSMR + AZ set has 59 510 compounds (after deleting duplicates), of which 32 228 compounds (54.2%) were defined as soluble. Given our recent efforts with the MLM stability training set pruning to remove potential disinformation[21] and arrive at models with significant prospective predictive ability in a medicinal chemistry optimization,[22] we examined four different extents of data pruning on the MLSMR set, four different levels of pruning on the AZ set, and eight different ranges of pruning on the fused set of AZ + MLSMR (Full Fused AZ + MLSMR). To construct Bayesian models for each training set, Pipeline Pilot 9.5 (BIOVIA, Inc.) was utilized with the nine standard (default) descriptors (i.e., molecular fractional polar surface area, molecular weight, number of hydrogen bond donors, number of hydrogen bond acceptors, ALogP, number of aromatic rings, total number of rings, number of rotatable bonds, and the FCFP_6 substructural fingerprints) that have worked well for us in previous Bayesian models for predicting other properties.[21,22,31,32] The details regarding the different ranges of compounds that were deleted from the training set to generate the different pruned models are in the Supporting Information (Table S1). Thus, a total of 20 training sets was included in this study: the two full sets, the full fused set, sixteen different pruned sets, and the Huuskonen reference set. Internal statistics from five-fold cross-validation studies are shown in Tables and S1, consisting of sensitivity, specificity, concordance, and receiver-operator characteristic (ROC) score. Expectedly, pruning of either the MLSMR or AZ training set generally improved the resulting model’s performance at predicting fractions of its own training set. Pruned and then fused models, where both the MLSMR and pruned AZ training sets were pruned to varying extents and then combined, also exhibited this general trend in internal statistics. We also compared these twenty different solubility models by evaluating their predictive power with two independent external sets composed of compounds that were not part of the training sets: (a) The External PubChem test set (External PubChem set) is a concatenation of the three largest sets (after the MLSMR and AZ sets) of solubility results (AID reference numbers 367 335, 517 483, and 606 574). This external test set has 197 unique compounds, of which 35 compounds (17.8%) were defined as soluble; (b) The external validation set of our laboratory’s internal compounds (External JSF set) is composed of hits and leads synthesized during the course of our different infectious disease projects, which have been assayed for kinetic aqueous solubility.[23] In this set, 14 of the 52 compounds (26.9%) were soluble. Evaluations of the external statistics for these models with these two test sets are shown in Tables , S2, and S3. Examining model performance quantitatively through chance-corrected statistics and the calculation of a mean value for each model of its ROC and concordance statistics over the two different training sets led to the selection of an optimal model, termed the Pruned and Fused AZ + MLSMR model. This model’s training set was derived by fusing the AZ set, pruned such that compounds with a solubility of 25–99 μM were deleted, with a pruned version of the MLSMR set, such that only the subset of compounds with a solubility <25 μM were included.

Table 1

Internal Statistics and External Statistics for Select Models

Bayesian model	ROC scoref	sensitivityg	specificityh	concordancei	MCCl	Cohen’s kappam	F1 scoren
internal set (with five-fold cross validation)
Full MLSMRa	0.836	77.8	81.5	79.5	n/a	n/a	n/a
Full AZb	0.744	86.7	76.8	80.3	n/a	n/a	n/a
Full Fused AZ + MLSMRc	0.833	76.6	82.3	79.2	n/a	n/a	n/a
Pruned and Fused AZ + MLSMRd	0.941	93.9	86.7	86.9	n/a	n/a	n/a
Huuskonene	0.926	89.9	94.1	90.9	n/a	n/a	n/a
External PubChem setj
Full MLSMRa	0.534	37.1	45.7	44.2	–0.13	–0.10	0.19
Full AZb	0.761	34.3	32.7	33.0	–0.26	–0.17	0.15
Full Fused AZ + MLSMRc	0.687	34.3	42.6	41.1	–0.18	–0.13	0.17
Pruned and Fused AZ + MLSMRd	0.824	85.7	65.4	69.0	0.39	0.33	0.50
Huuskonene	0.630	25.7	59.9	53.8	–0.11	–0.10	0.17
External JSF setk
Full MLSMRa	0.756	35.7	89.5	75.0	0.30	0.28	0.43
Full AZb	0.714	78.6	60.5	65.4	0.35	0.31	0.55
Full Fused AZ + MLSMRc	0.842	35.7	89.5	75.0	0.30	0.28	0.43
Pruned and Fused AZ + MLSMRd	0.724	64.3	86.8	80.8	0.51	0.51	0.64
Huuskonene	0.835	78.6	68.4	71.2	0.42	0.39	0.59

The Full MLSMR model was constructed from the training set composed of 57 824 unique compounds, of which 54.7% were defined as soluble using our ≥100 μM criteria.

The Full AstraZeneca (or AZ) model was built from the training set composed of 1763 compounds, of which 35.1% were soluble.

The Full Fused MLSMR + AZ model was generated by combining all of the compounds from the Full MLSMR set with the Full AstraZeneca set. It had 59 510 compounds, of which 54.2% were defined as soluble.

The Pruned and Fused MLSMR + AZ model corresponds to the training set in which we combined a pruned version of the AZ set (compounds with a solubility of 25–99 μM were deleted) with a pruned version of the MLSMR set (only the subset of compounds with a solubility <25 μM were included). This training set contains 17 460 compounds, of which 3.5% were soluble.

The Huuskonen reference model was constructed using a concatenation of previously published training and test sets.[7] It has 1290 compounds, of which 76.5% are soluble.

The ROC score is the area under the receiver operating characteristic curve from a five-fold cross-validation study.

Sensitivity represents the percentage of correctly identified soluble compounds (true positives).

Specificity signifies the percentage of correctly identified insoluble compounds (true negatives).

Concordance corresponds to the overall accuracy: percentage of (true positives + true negatives)/total number of compounds.

The External PubChem set has 197 unique compounds (that are not part of any training set), of which 17.8% are soluble.

The External JSF set has 52 compounds, of which 26.9% are soluble.

Matthew’s Correlation Coefficient (MCC), also referred to as the phi coefficient, is a chance-corrected statistic where MCC = 1 indicates perfect agreement, MCC = −1 indicates total disagreement, and MCC = 0 indicates that the model is no better than random.

Cohen’s Kappa is a chance-corrected statistic that uses a different method to calculate the random likelihood of making correct predictions for the external set. Kappa <0 indicates no agreement, Kappa of 0–0.2 indicates slight agreement, Kappa of 0.21–0.40 is fairly predictive, and Kappa of 0.41–0.60 indicates moderate agreement.[30]

The F1 score is the harmonic mean of precision (i.e., the positive predictive value hit rate) and sensitivity. The best F1 score is 1, while the worst score possible is 0.

The Full MLSMR model was constructed from the training set composed of 57 824 unique compounds, of which 54.7% were defined as soluble using our ≥100 μM criteria. The Full AstraZeneca (or AZ) model was built from the training set composed of 1763 compounds, of which 35.1% were soluble. The Full Fused MLSMR + AZ model was generated by combining all of the compounds from the Full MLSMR set with the Full AstraZeneca set. It had 59 510 compounds, of which 54.2% were defined as soluble. The Pruned and Fused MLSMR + AZ model corresponds to the training set in which we combined a pruned version of the AZ set (compounds with a solubility of 25–99 μM were deleted) with a pruned version of the MLSMR set (only the subset of compounds with a solubility <25 μM were included). This training set contains 17 460 compounds, of which 3.5% were soluble. The Huuskonen reference model was constructed using a concatenation of previously published training and test sets.[7] It has 1290 compounds, of which 76.5% are soluble. The ROC score is the area under the receiver operating characteristic curve from a five-fold cross-validation study. Sensitivity represents the percentage of correctly identified soluble compounds (true positives). Specificity signifies the percentage of correctly identified insoluble compounds (true negatives). Concordance corresponds to the overall accuracy: percentage of (true positives + true negatives)/total number of compounds. The External PubChem set has 197 unique compounds (that are not part of any training set), of which 17.8% are soluble. The External JSF set has 52 compounds, of which 26.9% are soluble. Matthew’s Correlation Coefficient (MCC), also referred to as the phi coefficient, is a chance-corrected statistic where MCC = 1 indicates perfect agreement, MCC = −1 indicates total disagreement, and MCC = 0 indicates that the model is no better than random. Cohen’s Kappa is a chance-corrected statistic that uses a different method to calculate the random likelihood of making correct predictions for the external set. Kappa <0 indicates no agreement, Kappa of 0–0.2 indicates slight agreement, Kappa of 0.21–0.40 is fairly predictive, and Kappa of 0.41–0.60 indicates moderate agreement.[30] The F1 score is the harmonic mean of precision (i.e., the positive predictive value hit rate) and sensitivity. The best F1 score is 1, while the worst score possible is 0. The Pruned and Fused AZ + MSLMR model was then compared to its parental models as well as the model trained with the Huuskonen data set (Table ). While the Huuskonen model’s internal statistics were slightly better than those for the Pruned and Fused AZ + MLSMR model, the external statistics for the Pruned and Fused AZ + MLSMR model were superior to all models. Histogram-based analysis demonstrated the ability of the Pruned and Fused AZ + MLSMR model to establish a better separation between the actual soluble and insoluble compounds in the External PubChem set (Figure S1) and External JSF set (Figure S2). Visually, this is most apparent in the ROC curves comparing model performance with the External PubChem set (Figure A). While the Pruned and Fused AZ + MLSMR model did not compare as favorably with the Huuskonen model when using the smaller External JSF set (Figure B) and a ROC analysis, the Pruned and Fused AZ + MLSMR model did demonstrate superior concordance. The greater performance of the Pruned and Fused AZ + MLSMR model was also reached when expanding the model comparison to include the following chance-corrected statistics (Table ): Matthew’s Correlation Coefficient (MCC)[33] and Cohen’s Kappa.[34] Both metrics calculate the random probability of making correct predictions and account for unbalanced data sets. The MCC for the Pruned and Fused AZ + MLSMR model was better than random (>0) and also outperformed the Huuskonen model and both parental models with both external test sets. It also was superior to the Huuskonen model for both external sets when considering Cohen’s Kappa. Finally, when considering a measure of the model’s ability to predict only soluble compounds, F1 score[35]—the harmonic mean of the positive predictive value (PPV) hit rate (Table S4) and sensitivity (Table )—the Pruned and Fused AZ + MLSMR model again outperformed these other models.

Figure 1

ROC curves for evaluation of the solubility Bayesian models with the (A) External PubChem and (B) External JSF sets.

ROC curves for evaluation of the solubility Bayesian models with the (A) External PubChem and (B) External JSF sets. The internal and external statistics of the Pruned and Fused AZ + MLSMR model demonstrate potential predictive value superior to that of the Huuskonen model. Principal component analysis (PCA) (Figure ) demonstrated that the Pruned and Fused AZ + MLSMR training set also occupies different physiochemical property space than the Huuskonen set. This further highlights the novelty and significance of this curated training set to the modeling community, since a well-curated training set that includes novel parts of physiochemical property space can lead to new studies with different modeling approaches that far outlive the utility of any particular initial study. In addition, the soluble compounds (large spheres in Figure B) and insoluble compounds (smaller spheres in Figure B) in the Pruned and Fused AZ + MLSMR training set generally occupy similar regions of physiochemical property space. This observation underscores the need for combining structural fingerprints with the physiochemical properties to predict solubility accurately.

Figure 2

Principal component analysis (PCA) comparing the Pruned and Fused AZ + MLSMR training set (in red) to the Huuskonen reference training set (in blue). (A) shows all compounds with the same radii, while (B) has insoluble compounds displayed with smaller radii.

Conclusions

This study began with the MLSMR and AZ training sets of 57 824 and 1763 compounds, respectively, and explored fusion of the two data sets and then different levels of pruning the training set, following the precedent of our models for mouse liver microsomal stability[21] and Vero cell cytotoxicity.[31] The end result was a Bayesian model derived from the fusion of the MLSMR and AZ training sets where pruning of both parental data sets was conducted: compounds with a solubility of 25–99 μM were deleted in the AZ set and only the subset of MLSMR compounds with a solubility <25 μM were included. This model predicted subsets of its training set as well or better than the other models, depending on the five-fold cross-validation statistic, and, most importantly, it outperformed the other models in predicting the solubility of two independent external test sets (one from PubChem and one from our laboratory’s compound collection). Encompassing a differentially localized chemical space than the previous Huuskonen data set, we anticipate that the Pruned and Fused AZ + MLSMR training set and its machine learning models will be of value to the community to predict the solubility of the chemical tool and drug discovery of small molecules of therapeutic relevance.

Methods

Please refer to the main text for the construction of the Bayesian models and to the Supporting Information for additional details regarding their construction and evaluation.

Principal Component Analyses

The physiochemical property space sampled by each data set was compared to the other data sets by concatenating their relevant sdf files together and performing a principal component analysis (PCA) in Discovery Studio 4.5. (BIOVIA, Inc.), using the default protocol.[22,31] Briefly, eight different interpretable molecular descriptors (that characterize the properties of each compound as a whole entity) were calculated and utilized to describe the physiochemical properties of each compound: number of hydrogen bond donors, number of hydrogen bond acceptors, AlogP, molecular weight, number of rings, number of aromatic rings, number of rotatable bonds, and molecular fractional polar surface area. These are eight of the nine descriptors that were utilized to construct our Bayesian models. The ninth descriptor (which was used for the Bayesian models but not for PCA) was FCFP_6, which describes the topology of different substructural pieces of each compound.

33 in total

1. Development of quantitative structure-property relationship models for early ADME evaluation in drug discovery. 1. Aqueous solubility.

Authors: R Liu; S S So
Journal: J Chem Inf Comput Sci Date: 2001 Nov-Dec

2. Prediction of aqueous solubility of organic compounds by the general solubility equation (GSE).

Authors: Y Ran; N Jain; S H Yalkowsky
Journal: J Chem Inf Comput Sci Date: 2001 Sep-Oct

3. Prediction of aqueous solubility and partition coefficient optimized by a genetic algorithm based descriptor selection method.

Authors: Jörg K Wegner; Andreas Zell
Journal: J Chem Inf Comput Sci Date: 2003 May-Jun

4. Linear and nonlinear functions on modeling of aqueous solubility of organic compounds by two structure representation methods.

Authors: Aixia Yan; Johann Gasteiger; Michael Krug; Soheila Anzali
Journal: J Comput Aided Mol Des Date: 2004-02 Impact factor: 3.686

Review 5. Bridging solubility between drug discovery and development.

Authors: Li Di; Paul V Fish; Takashi Mano
Journal: Drug Discov Today Date: 2011-11-26 Impact factor: 7.851

6. Prediction of drug solubility from molecular structure using a drug-like training set.

Authors: J Huuskonen; D J Livingstone; D T Manallack
Journal: SAR QSAR Environ Res Date: 2008 Apr-Jun Impact factor: 3.000

7. Aqueous solubility prediction of drugs based on molecular topology and neural network modeling.

Authors: J Huuskonen; M Salo; J Taskinen
Journal: J Chem Inf Comput Sci Date: 1998 May-Jun

8. Optimizing solubility: kinetic versus thermodynamic solubility temptations and risks.

Authors: Christoph Saal; Anna Christine Petereit
Journal: Eur J Pharm Sci Date: 2012-08-03 Impact factor: 4.384

9. Addressing the Metabolic Stability of Antituberculars through Machine Learning.

Authors: Thomas P Stratton; Alexander L Perryman; Catherine Vilchèze; Riccardo Russo; Shao-Gang Li; Jimmy S Patel; Eric Singleton; Sean Ekins; Nancy Connell; William R Jacobs; Joel S Freundlich
Journal: ACS Med Chem Lett Date: 2017-09-14 Impact factor: 4.345

10. PubChem Substance and Compound databases.

Authors: Sunghwan Kim; Paul A Thiessen; Evan E Bolton; Jie Chen; Gang Fu; Asta Gindulyte; Lianyi Han; Jane He; Siqian He; Benjamin A Shoemaker; Jiyao Wang; Bo Yu; Jian Zhang; Stephen H Bryant
Journal: Nucleic Acids Res Date: 2015-09-22 Impact factor: 16.971

2 in total

1. Novel Solubility Prediction Models: Molecular Fingerprints and Physicochemical Features vs Graph Convolutional Neural Networks.

Authors: Sumin Lee; Myeonghun Lee; Ki-Won Gyak; Sung Dug Kim; Mi-Jeong Kim; Kyoungmin Min
Journal: ACS Omega Date: 2022-04-04

2. Random Forest Model Prediction of Compound Oral Exposure in the Mouse.

Authors: Haseeb Mughal; Han Wang; Matthew Zimmerman; Marc D Paradis; Joel S Freundlich
Journal: ACS Pharmacol Transl Sci Date: 2021-01-26

2 in total