| Literature DB >> 27401409 |
Chris D Edwards1, Chris Luscombe2, Peter Eddershaw3, Edith M Hessel3.
Abstract
PURPOSE: We developed and tested a novel Quantitative Structure-Activity Relationship (QSAR) model to better understand the physicochemical drivers of pulmonary absorption, and to facilitate compound design through improved prediction of absorption. The model was tested using a large array of both existing and newly designed compounds.Entities:
Keywords: intratracheal delivery; isolated perfused lung; physicochemical properties; pulmonary absorption; quantitative structure-activity relationship model
Mesh:
Substances:
Year: 2016 PMID: 27401409 PMCID: PMC5040732 DOI: 10.1007/s11095-016-1983-4
Source DB: PubMed Journal: Pharm Res ISSN: 0724-8741 Impact factor: 4.200
Data for Marketed Drugs Generated in the IPRLu Model and Included in the OPLS Model “Training set”, Mean (n = 2) Data are Displayed with the Range Quoted in Brackets
| Drug | %TDiP | %SDiP | % Dose in solution | Lung T1/2 (mins) | Recovery of total dose (%)a | Calculated LogPb |
|---|---|---|---|---|---|---|
| Indacaterol | 11 (10,11) | 12 (12) | 88 | 273 (255–291) | 91 (77–104) | 3.03 |
| Ambroxol | 38 (14–61) | 41 (18–64) | 95 | 39 (14–64) | 70 (66–74) | 2.65 |
| Formoterol Fumerate | 47 (42–52) | 47 (42–52) | 100 | 24 (22–26) | 73 (70–76) | 0.83 |
| Ipratropium Bromide | 49 (34–64) | 50c | 100 | 23 (14–32) | –1.82 | |
| Tiotropium Bromided | 61 (59–62) | 16 (15,16) | –1.76 | |||
| Amiloride | 62 (61–62) | 62 (61–62) | 100 | 8 (7–10) | 81 (76–85) | -0.5 |
| Lidocainee | 75 (64–82) | 75c | 100 | 4 (4,5) | 1.54 | |
| Zanamivirf | 100 (100–100) | 100 | 100 | 7 (5–8) | 104 (100–107) | –6.54 |
| Flunisolide | 41 (28–53) | 96 (66–126) | 42 | 15 (7–22) | 139 (135–142) | 1.56 |
| Montelukast | 67 (57–78) | 92 (78–106) | 73 | 42 (23–60) | 77 (70–85) | 8.49 |
| Fluticasone Propionate | 10 (10) | 2000c | 0.5 | 115 (98–131) | 3.72 | |
| Fluticasone Furoate | 10 (7–12) | 300c | 3.2 | 153 (112–194) | 4.13 | |
| Nedocromil | 32 (23–40) | 125 (83–166) | 24 | 32 (24–40) | 85 (79–90) | 2.5 |
| Tacrolimus (prograf) | 38 (38) | 178 (176–180) | 21 | 28 (27,28) | 5.59 | |
| Budesonide | 77 (76–79) | 551 (540–561) | 14 | 3.3 (3,4) | 84 (82–87) | 2.73 |
| Salmeterol | 15 (6–24) | 41 (16–66) | 36 | 146 (59–233) | 69 (61–76) | 3.59 |
| Salbutamol | 42 (33–51) | 77 (60–93) | 55 | 21 (13–28) | 78 (75–81) | 0.32 |
aData only available for compounds where lung analysis was carried out to confirm recovery
bCalculated using Chemaxon v5.4.1.1 (http://www.chemaxon.com/)
cSolubility in dose vehicle was determined post study on a discrete weighing of the compound
dSolubility of Tiotropium was not measured in the IPRLu model experiment, therefore it was not included in the “Training set”. Safety data sheet quotes solubility to be ~5 mg/ml in PBS (pH 7.2) https://www.caymanchem.com/msdss/15773m.pdf
eLidocaine data is from n = 4
fZanamivir dose analysis failed, expected to be fully solubilised in the dose vehicle (250 μg/ml 0.2% Tween 80 in saline) due to its zwitterionic nature. Merck Index states solubility of zanamivir to be 18 mg/ml in water
Fig. 1IPRLu model profiles for different salt forms of the same parent drug discovery compound expressed as (a) % dose in perfusate, where a 6 fold difference is noted between salt forms at 20 mins and (b) %SDiP i.e. normalised for the amount of dose in solution, where profiles are comparable. HNA = hydroxynapthoate, FB = free base, HCl = hydrochloride salt.
Range of Physicochemical Properties and IPRLu Model Endpoints Across the 98 Compounds (7 Zwitterions, 8 Acids, 31 Bases and 52 Neutral) in the “Training Set”
| Physicochemical property | IPRLu model endpoint | Mean | Lower range | Upper range |
|---|---|---|---|---|
| cLogP | 3.2 | –3.7 | 9.1 | |
| cLogD pH7.4 | 2.1 | –4.4 | 6.5 | |
| M.Wt. | 507 | 177 | 842 | |
| tPSA | 105 | 24 | 198 | |
| %TDiP | 32 | 0.1 | 100 | |
| %SDiP | 164 | 0.1 | 2400 | |
| Lung absorption half-life (mins) | 212 | 3.3 | 5210 |
Ionisation classes were predicted and cLogP/cLogD values calculated using the ACDlabs software ACD_v11: Advanced Chemistry Development, Inc. 8 King Street East, Suite 107, Toronto, Ontario, Canada M5C 1B5. tPSA = total Polar Surface Area
Fig. 2Distribution of data in training dataset “Log%SDiP”.
Fig. 3The scores plot from the resulting 2 component OPLS model generated within SIMCA-P+ on the 98 compounds where increasing size and increasing blue intensity of the spots are equated with the size of the response Log%SDiP.
Fig. 4OPLS model coefficient plot displaying the contribution of each descriptor to the model components and whether each descriptor correlates positively or negatively with Log%SDiP.
Information Around the in-House QSAR Models and Descriptors used to Build the Log%SDiP OPLS Model
| QSAR model / Descriptor | Details | Coefficients with regards to IPRLu QSAR model |
|---|---|---|
| FA_rat_v1.logFA_score | Output from an inhouse QSAR model that predicts fraction absorbed in the rat, this model was built on a GSK rat oral bioavailability dataset with total clearance values of less than 10 ml/min/kg and therefore minimal first pass effect. | + |
| Perm_chrom_p.perm_score | An inhouse PLS-Discriminant analysis model which predicts permeability and oral absorption of compounds. | + |
| MDCK2.Perm_pH74_nm_sec MDCK2.Perm_pH64_nm_sec | Inhouse QSAR models which predict passive permeability across a madine darby canine kidney (MDCK) cell monolayer ( | + |
| Pgp_v31.Pgp_Score | An inhouse PLS-Discriminant analysis model which predicts the likelihood of a compound being a substrate for the ABC active efflux transporter P-glycoprotein. | + |
| Chrom LogD_v3.value | An inhouse OPLS QSAR model that predicts hydrophobicity, the model was built on a GSK dataset of chromatographic hydrophobicity index (CHI) measurements ( | + |
| neutral_ionised_form | % of the molecule in the neutral form at pH7.4. pKa’s calculated using the ACDlabs softwarea ACD_v11. | + |
| logd_pH55_acd | Calculated logD value from the ACDlabs software at pH5.5 | + |
| logd_pH65_acd | Calculated logD value from the ACDlabs software at pH6.5 | + |
| Bpka1_modified | The most basic pka value calculated using ACDlabs software ACD_v11 | – |
| basic_ionised_form | % of the molecule in the acidic form at pH7.4 pKa’s calculated using the ACDlabs software ACD_v11. | – |
| abe | Andrews Binding energy ( | – |
| cmr | Calculated molar refraction | – |
| mw | Average molecular weight of parent | – |
| tpsa | Polar surface area calculated by the method of Ertl ( | – |
| hbd | Count of the number of hydrogen bond acceptors in a molecule. | – |
| pos | The count of the number of positively ionisable/charged groups in a molecule. | – |
| rb | count of the number of rotatable bonds in a molecule. | – |
| flex | related to the ratio of the number of rotatable bonds to total bonds = int(100*rotatable_bonds/total_bonds) | - |
| alpha | An Abraham’s molecular descriptor relating to Hydrogen Bond Acidity (hydrogen bond donors) ( | - |
| betah | An Abraham’s molecular descriptor relating to Hydrogen Bond Basicity (hydrogen bond acceptors) ( | - |
| pi | An Abraham’s molecular descriptor relating to the dipolarity/dipolarizability ( | - |
| vx | An Abraham’s molecular descriptor relating to McGowan characteristic volume (size) ( | - |
| total_HB | Sum of hydrogen bond donors and acceptors (physchem_desc.hba + physchem_desc.hbd) | - |
| total_charge | Sum of positive and negative charges (physchem_desc.neg + physchem_desc.pos) | - |
| nonPSA | Approximation of total surface area - polar surface area. | - |
aACDlabs software: Advanced Chemistry Development, Inc. 8 King Street East, Suite 107, Toronto, Ontario, Canada M5C 1B5
Statistical Output from SIMCA for the Final OPLS Predictive Model for Log%SDiP, OPLS Observations (N) = 107, Variables (K) = 27 (X = 26, Y = 1)
| A | R2X | R2X(cum) | Eigenvalues | R2Y | R2Y(cum) | Q2 | Limit | Q2(cum) | Significance |
|---|---|---|---|---|---|---|---|---|---|
| 0 | Cent. | Cent. | |||||||
| 1 + 0 | 0.462 | 0.462 | 6.26 | 0.34 | 0.34 | 0.322 | 0 | 0.322 | R1 |
| Rotation | –0.221 | 0.241 | 0.621 | 0.491 | |||||
| 1 + 1 | 0.26 | 0.26 | 6.77 | 0.0722 | 0.0722 | 0.038 | 0 | 0.038 | R1 |
| 1 + 2 | 0.143 | 0.403 | 3.71 | 0.119 | 0.192 | 0.0719 | 0 | 0.11 | R1 |
| 1 + 3 | 0.0884 | 0.492 | 2.3 | 0.0412 | 0.233 | 0.0403 | 0 | 0.15 | R1 |
| 1 + 4 | 0.0809 | 0.572 | 2.1 | 0.0328 | 0.266 | 0.0117 | 0 | 0.162 | R1 |
| 1 + 5 | 0.0549 | 0.627 | 1.43 | 0.0153 | 0.281 | 0.0074 | 0 | 0.169 | R1 |
| Sum | 0.868 | 0.621 | 0.491 |
Fig. 5The output from the OPLS model showing the correlation between predicted Log%SDiP on the x axis and observed Log%SDiP on the y axis. The model output R2 and Q2 values were 0.621 and 0.491 respectively. Circled area highlights potential for model to under-predict for some “high” classification compounds (>300%SDiP).
Comparison of Observed and Predicted %SDiP Data for the 9 “Test Set” Compounds
| Compound | Observed mean % [range] (category) | Predicted % (category) | Accuracy of prediction |
|---|---|---|---|
| GSK_A | <13 (low/mod) | 4 (low) | Good Prediction |
| GSK_B | 170 [169–170] (high) | 188 (high) | Good Prediction |
| GSK_C | 19 [15–23] (mod) | 58 (mod) | Same category but >2fold |
| GSK_D | 23 [15–30] (mod) | 26 (mod) | Good Prediction |
| GSK_E | 44 [32–56] (mod) | 5 (low) | Underestimate |
| GSK_F | 23 [16–29] (mod) | 14 (mod) | Good Prediction |
| GSK_G | 94 [77–111] (mod) | 150 (high) | Prediction within 2 fold |
| GSK_H | 100 [90–111] (mod) | 143 (high) | Prediction within 2 fold |
| GSK_I | 352 [309–395] (high) | 122 (high) | Same category but >2fold |
Observed data were generated by the IPRLu model and mean values are displayed along with the range (n = 2), predicted values are the anti-Log of the output from the OPLS model
The values of %SDiP were classified as Low (<10%) i.e. compounds which did not cross into the perfusate to any great extent, Moderate (10–100%) and High (>100%) indicting that absorption is not limited by solubility in the dose formulation
Fig. 6Comparison between predicted and mean observed %SDiP for the “Test set”. Dashed line = unity. Solid = linear trendline with R2 of 0.85 with outlier removed.