Literature DB >> 25798192

Predicting Abraham model solvent coefficients.

Jean-Claude Bradley1, Michael H Abraham2, William E Acree3, Andrew Sid Lang4.   

Abstract

BACKGROUND: The Abraham general solvation model can be used in a broad set of scenarios involving partitioning and solubility, yet is limited to a set of solvents with measured Abraham coefficients. Here we extend the range of applicability of Abraham's model by creating open models that can be used to predict the solvent coefficients for all organic solvents.
RESULTS: We created open random forest models for the solvent coefficients e, s, a, b, and v that had out-of-bag R(2) values of 0.31, 0.77, 0.92, 0.47, and 0.63 respectively. The models were used to suggest sustainable solvent replacements for commonly used solvents. For example, our models predict that propylene glycol may be used as a general sustainable solvent replacement for methanol.
CONCLUSION: The solvent coefficient models extend the range of applicability of the Abraham general solvation equations to all organic solvents. The models were developed under Open Notebook Science conditions which makes them open, reproducible, and as useful as possible. Graphical AbstractChemical space for solvents with known Abraham coefficients.

Entities:  

Keywords:  Abraham general solvation model; Partition coefficients; Solubility; Solvent coefficients; Solvent replacement; Sustainable solvents

Year:  2015        PMID: 25798192      PMCID: PMC4369285          DOI: 10.1186/s13065-015-0085-4

Source DB:  PubMed          Journal:  Chem Cent J        ISSN: 1752-153X            Impact factor:   4.215


Background

The Abraham model was developed and is widely used to predict partition coefficients for both conventional organic solvents [1-11] and ionic liquid solvents [12,13], for the partitioning of drug molecules between blood and select body organs [14-18], and for partitioning into micelles [19] and for prediction of enthalpies of solvation in organic solvents [20] and ionic organic liquids [21]. The Abraham model is based on the linear free energy relationship (LFER)where logP is the solvent/water partition coefficient. Under reasonable conditions, this model can also be used to predict the solubility of organic compounds in organic solvents [22] as followswhere S is the molar concentration of the solute in the organic solvent, S is the molar concentration of the solute in water, (c, e, s, a, b) are the solvent coefficients, and (E, S, A, B, V) are the solute descriptors: E is the solute excess molar refractivity in units of (cm^3/mol)/10, S is the solute dipolarity/polarizability, A and B are the overall or summation hydrogen bond acidity and basicity, and V is the McGowan characteristic volume in units of (cm^3/mol)/100. The solvent coefficients are obtained by linear regression using experimentally determined partitions and solubilities of solutes with known Abraham descriptors. Traditionally, the intercept c is allowed to float and is assumed to encode information not characterized by the other solvent-solute interaction terms. However, for some partitioning systems the value of c can vary greatly depending upon the training-set used [23]. This makes it difficult to directly compare different solvents by examining their solvent coefficients. Van Noort has even suggested that the c-coefficient be derived directly from structure before the other coefficients are determined [24]. A problem with this suggestion is that the c-coefficient depends on the standard state. Partition coefficients can be expressed in concentration units of molarity and mole fractions, and the numerical value of the c-coefficient will be different for each concentration unit. Abraham model correlations considered in this study have partition coefficients expressed in concentration units of molarity. To date, solvent coefficients have been determined for over 90 commonly used solvents (Additional file 1), and group contribution methods have been developed to approximate all coefficients for certain classes of solvents that do not have published solvent coefficients [25,26]. The solvent coefficients in the supporting material pertain to dry solvents, or solvents that take up very little water (hexane, toluene, etc.). This study expands the applicability of the Abraham model by developing open models, using open descriptors from the Chemistry Development Kit (CDK) [27] that can be used to predict the Abraham solvent coefficients of any organic solvent directly from structure.

Procedure

In order to directly compare various solvents, it is advantageous to first recalculate the solvent coefficients with the c-coefficient equal zero. This was accomplished by using equation (1) to calculate the log P values for 2144 compounds from our Open Data database of compounds with known Abraham descriptors [28] and then by regressing the results against the following equationwhere the subscript-zero indicates that c = 0 has been used in the regression [29]. As an informational note one could have set the c-coefficient of a given solvent equal to a calculated average value determined from numerical c-coefficients of solvents similar to the solvent under consideration. For example, the c-coefficient of all alkane solvents could be set equal to c = 0.225, which is the average value for the c-coefficients of the 13 alkane and cycloalkane solvents for which log P correlations have been determined. While average values could be used for several solvents, there is the problem of what value to use in the case of solvents for which a similar solvent log P solvent is not available. Abraham model correlations are available for two dialkyl ethers (e.g., diethyl ether and dibutyl ether) and for several alcohols, but not for alkoxyalcohols (e.g., 2-ethoxyethanol, 2-propoxyethanol, 2-butyoxyethanol) which contain both an ether and hydroxyl alcohol group. Our intended solvent set in the present communication includes the alternative “green” solvents, and there a number of solvents in this group that contain multi-functional groups. For several of the solvents on the list of alternative “green” solvents, such as 1,3-dioxan-5-ol, 1,3-dioxolane-4-methanol, 3-hydroxypropionic acid, 5-(hydroxymethyl)furfural, ethyl lactate, furfuryl alcohol, and other solvents, there are no similar solvents having a Abraham model log P correlation. To treat all solvents equally we have elected to set c = 0 in this study. Table 1 lists the original solvent coefficients together with the c = 0 adjusted coefficients. Comparing the coefficients, we see, not surprisingly, the largest changes in coefficient values occur for solvents with c-values furthest away from zero (Additional file 1). What is intriguing is that all the coefficients move consistently the same way. That is, solvents with negative c-values all saw an increase in e and b (and a decrease in s, a, and v) when recalculated, whereas solvents with positive c-values all saw an increase in s, a, and v (and decrease in e and b).
Table 1

Solvent coefficients

c e s a b v Solvent e 0 s 0 a 0 b 0 v 0
0.3510.223−0.150−1.035−4.5273.972methyl acetate0.195−0.068−0.924−4.5714.152
0.3280.369−0.446−0.700−4.9044.150ethyl acetate0.343−0.369−0.597−4.9454.319
0.2480.356−0.501−0.867−4.9734.281butyl acetate0.336−0.443−0.788−5.0054.409
−0.6050.930−1.153−1.682−4.0934.249isopropyl myristate0.977−1.295−1.870−4.0183.939
0.0900.205−0.1721.305−4.5893.833N-methylacetamide0.197−0.1511.335−4.6013.880
0.2840.128−0.4421.180−4.7283.856N-ethylacetamide0.105−0.3751.269−4.7644.002
−0.2710.0840.2090.915−5.0034.557dimethylacetamide0.1050.1450.832−4.9704.419
0.2130.0340.0891.342−5.0844.088N,N-diethylacetamide0.0170.1391.409−5.1114.198
−0.1710.0700.3080.589−3.1522.432formamide0.0830.2680.537−3.1322.345
0.1140.407−0.2870.542−4.0853.471N-methylformamide0.398−0.2600.579−4.1003.530
0.2200.034−0.1660.935−4.5893.730N-ethylformamide0.016−0.1141.005−4.6173.843
−0.305−0.0580.3430.358−4.8654.486DMF−0.0340.2710.264−4.8284.330
0.3320.302−0.4360.358−4.9023.952dibutylformamide0.275−0.3580.462−4.9444.123
0.2760.334−0.7140.243−3.3203.549methanol0.312−0.6490.330−3.3553.691
0.2220.471−1.0350.326−3.5963.857ethanol0.453−0.9830.396−3.6233.971
0.2430.213−0.5750.262−3.4503.545ethanol/water(90:10)vol0.193−0.5180.339−3.4813.670
0.1720.175−0.4650.260−3.2123.323ethanol/water(80:20)vol0.161−0.4240.314−3.2333.411
0.0630.085−0.3680.311−2.9363.102ethanol/water(70:30)vol0.079−0.3530.331−2.9443.134
−0.0400.138−0.3350.293−2.6752.812ethanol/water(60:40)vol0.141−0.3440.281−2.6702.792
−0.1420.124−0.2520.251−2.2752.415ethanol/water(50:50)vol0.135−0.2850.207−2.2572.342
−0.2210.131−0.1590.171−1.8091.918ethanol/water(40:60)vol0.148−0.2110.103−1.7821.805
−0.2690.107−0.0980.133−1.3161.414ethanol/water(30:70)vol0.128−0.1610.049−1.2831.276
−0.2520.043−0.0400.096−0.8320.916ethanol/water(20:80)vol0.063−0.0990.017−0.8010.787
−0.173−0.023−0.0010.065−0.3720.454ethanol/water(10:90)vol−0.009−0.0420.011−0.3500.365
0.0000.0000.0000.0000.0000.000water0.0000.0000.0000.0000.000
0.1390.405−1.0290.247−3.7673.9861-propanol0.393−0.9960.291−3.7854.058
0.0990.343−1.0490.406−3.8274.0332-propanol0.335−1.0260.438−3.8394.084
0.1880.354−1.1270.016−3.5683.9682-methyl-1-propanol0.339−1.0830.076−3.5924.065
0.2110.171−0.9470.331−4.0854.1092-methyl-2-propanol0.154−0.8970.398−4.1124.218
0.1650.401−1.0110.056−3.9584.0441-butanol0.388−0.9720.108−3.9794.129
0.1270.253−0.9760.158−3.8824.1142-butanol0.242−0.9460.199−3.8984.179
0.0730.360−1.2730.090−3.7704.3993-methyl-1-butanol0.354−1.2560.113−3.7794.437
0.1500.536−1.2290.141−3.8644.0771-pentanol0.524−1.1940.188−3.8834.154
0.1150.455−1.3310.206−3.7454.2012-pentanol0.445−1.3040.243−3.7594.260
0.1150.492−1.1640.054−3.9784.1311-hexanol0.483−1.1370.091−3.9934.191
0.0350.398−1.0630.002−4.3424.3171-heptanol0.395−1.0550.014−4.3474.335
−0.0340.489−1.044−0.024−4.2354.2181-octanol0.491−1.052−0.034−4.2314.201
−0.0580.616−1.3190.026−4.1534.2791-decanol0.620−1.3330.009−4.1464.250
−0.0960.148−0.841−0.438−4.0404.125octadecanol0.155−0.864−0.467−4.0284.076
0.3690.386−1.568−3.535−5.2154.514pentane0.357−1.481−3.419−5.2614.704
0.3330.560−1.710−3.578−4.9394.463hexane0.533−1.632−3.473−4.9814.634
0.2970.643−1.755−3.571−4.9464.488heptane0.619−1.685−3.477−4.9834.641
0.2310.738−1.840−3.585−4.9074.502octane0.719−1.786−3.512−4.9364.621
0.2400.619−1.713−3.532−4.9214.482nonane0.600−1.657−3.457−4.9514.606
0.1860.722−1.741−3.449−4.9704.476decane0.707−1.697−3.390−4.9934.572
0.0580.603−1.661−3.421−5.1204.619undecane0.598−1.647−3.402−5.1284.649
0.1140.668−1.644−3.545−5.0064.459dodecane0.659−1.617−3.509−5.0214.518
0.0870.667−1.617−3.587−4.8694.433hexadecane0.660−1.596−3.560−4.8804.478
0.3200.511−1.685−3.687−4.8114.3992,2,4-trimethylpentane0.485−1.610−3.586−4.8514.564
0.1040.615−1.796−3.070−4.2914.5181,9-decadiene0.606−1.771−3.037−4.3044.572
0.1160.706−1.616−3.181−4.7964.3221-hexadecene0.697−1.589−3.144−4.8114.382
0.1830.294−0.134−2.801−4.2914.1801,2-dichloroethane0.279−0.091−2.743−4.3144.274
0.2220.273−0.569−2.918−4.8834.4561-chlorobutane0.255−0.517−2.848−4.9114.570
0.3190.102−0.187−3.058−4.0904.324dichloromethane0.076−0.112−2.957−4.1304.488
0.1910.105−0.403−3.112−3.5144.395chloroform0.089−0.358−3.051−3.5384.493
0.1990.523−1.159−3.560−4.5944.618carbon tetrachloride0.507−1.112−3.497−4.6194.721
0.395−0.094−0.594−1.280−1.2743.088trifluoroethanol−0.126−0.501−1.156−1.3233.291
0.3500.358−0.820−0.588−4.9564.350diethyl ether0.330−0.737−0.478−5.0004.530
0.1760.394−0.985−1.414−5.3574.524dibutyl ether0.380−0.944−1.358−5.3794.615
0.3410.307−0.817−0.618−5.0974.425methyl tert-butyl ether0.280−0.737−0.510−5.1404.600
0.1420.464−0.588−3.009−4.6254.491benzene0.452−0.554−2.964−4.6434.564
0.1390.152−0.374−3.030−4.6014.540fluorobenzene0.140−0.341−2.985−4.6184.611
0.0650.381−0.521−3.183−4.7004.614chlorobenzene0.375−0.506−3.161−4.7084.648
−0.0170.436−0.424−3.174−4.5584.445bromobenzene0.437−0.428−3.178−4.5564.437
−0.1920.298−0.308−3.213−4.6534.588iodobenzene0.313−0.353−3.272−4.6294.490
0.1250.431−0.644−3.002−4.7484.524toluene0.421−0.615−2.962−4.7644.589
0.0930.467−0.723−3.001−4.8444.514ethylbenzene0.459−0.701−2.971−4.8564.562
0.1220.377−0.603−2.981−4.9614.535m-xylene0.367−0.574−2.941−4.9774.598
0.0830.518−0.813−2.884−4.8214.559o-xylene0.511−0.793−2.857−4.8314.602
0.1660.477−0.812−2.939−4.8744.532p-xylene0.463−0.773−2.886−4.8954.618
−0.1960.5370.042−2.328−4.6084.314nitrobenzene0.552−0.004−2.388−4.5844.214
0.1590.784−1.678−3.740−4.9294.577cyclohexane0.771−1.640−3.689−4.9494.659
0.023−0.0910.793−1.463−4.3643.460nitromethane−0.0930.799−1.454−4.3683.472
0.2460.782−1.982−3.517−4.2934.528methylcyclohexane0.762−1.924−3.439−4.3244.655
0.2230.363−0.384−0.238−4.9324.450THF0.345−0.332−0.167−4.9604.565
0.1230.347−0.033−0.582−4.8104.1101,4-dioxane0.337−0.004−0.542−4.8264.173
0.0040.1680.504−1.283−4.4073.421propylene carbonate0.1670.505−1.281−4.4083.423
0.0380.2250.058−0.976−4.8424.315cyclohexanone0.2220.067−0.963−4.8474.335
0.1470.5320.2250.840−4.7943.674N-methylpyrrolidinone0.5200.2600.887−4.8133.750
0.0560.3320.2571.556−5.0353.983N-methyl-2-piperidone0.3270.2711.575−5.0444.012
−0.0320.696−0.0620.014−4.0923.405N-formylmorpholine0.698−0.0690.005−4.0893.389
0.0970.2850.059−1.605−4.5624.028benzonitrile0.2770.082−1.574−4.5754.078
0.4130.0770.326−1.566−4.3913.364acetonitrile0.0440.423−1.436−4.4433.576
−0.2700.578−0.5110.715−2.6192.729ethylene glycol0.599−0.5750.631−2.5852.591
0.3130.312−0.121−0.608−4.7533.942acetone0.287−0.047−0.509−4.7924.103
0.2460.256−0.080−0.767−4.8554.148butanone0.236−0.022−0.689−4.8864.275
−0.1940.3270.7911.260−4.5403.361DMSO0.3420.7461.200−4.5173.262
0.0470.686−0.943−3.603−5.8184.921carbon disulfide0.682−0.932−3.587−5.8254.946
0.0000.1470.601−0.381−4.5413.290sulfolane0.1470.601−0.380−4.5423.290
0.0220.350−0.4320.708−4.7254.192tributyl phosphate0.544−0.761−0.966−4.3744.087
0.5740.715−1.027−1.296−4.5123.446peanut oil0.670−0.892−1.121−4.5823.744

Original on the left, with c = 0 on the right.

Solvent coefficients Original on the left, with c = 0 on the right. One way to measure the effect of making c = 0 is to evaluate how the values of each solute-solvent term change as measured against the average solute descriptors (Eave = 0.884, Save = 1.002, Aave = 0.173, Bave = 0.486, Vave = 1.308). By multiplying the average absolute deviation of the solvent coefficients and the mean solute descriptor value, e.g. AAE(v) * Mean(Vave), the coefficients shifted from greatest to least in the following order v (0.124), s (0.043), e (0.013), b (0.011), a (0.010).

Results and discussion

Modeling

We calculated CDK descriptors for each solvent using the cdkdescui [30] and then created five random forest models for e0, s0, a0, b0, and v0 using R. The resulting models had out of bag (OOB) R2 values ranging between the barely significant 0.31 for e0 to the very seignificant 0.92 for a0, see the Open Notebook page for more details [29]. It is important to note that due to the limited number of data points, we decided not to split the data into training and test sets and instead use the OOB values which are automatically generated with random forest models as our means of validation. A summary of the modeling results can be found in Table 2.
Table 2

Summary of statistical measures of the results of modeling

Model N OOB RMSE OOB R 2 RMSE R 2 Most significant descriptor
e0 890.1810.3080.0740.885XLogP
s0 890.3260.7680.1350.960XLogP
a0 890.4770.9190.2050.985nHBAcc
b0 890.4710.4740.2030.903khs.sOH
v0 890.2280.6270.1220.933TopoPSA
Summary of statistical measures of the results of modeling Quite why some endpoints are more difficult to model than others is not known. Comparing the OOB R2 values with the standard deviation of the endpoints (e0: 0.31, s0: 0.77, a0: 0.92, b0:0.47, and v0: 0.63) we see no negative correlation between the range of a given endpoint and the actual prediction performances of the associated models as one would possibly suspect. It is our conjecture that as more measured values become available that refined models will have better performance. For now, these models should be used only as an initial starting point for exploring the wider solvent chemical space. Errors in the predications of the coefficients for new solvents are not equivalent because when used to predict partition coefficients they are scaled by their corresponding Abraham descriptors, see equation 3. Thus, on average, when predicting solvent coefficients for new solvents, the errors in predicting v and s are more significant that errors in predicting a and b due to the difference in the sizes of average values for the solute descriptors. Multiplying the OOB-RMSE for each coefficient by the corresponding average descriptors value we see the following scaled RMSE values for e0, s0, a0, b0, and v0 of 0.16, 0.33, 0.08, 0.23, and 0.30 respectively. Thus the poor OOB R2 values for e0 (0.31) and b0 (0.47) seem not to be as detrimental to the applicability of the model as suggested by a first glance. To analyze the modeling results further and to investigate model outliers we calculated an adjusted error D, the distance between the observed values and the predicted values scaled by the average descriptor values, for each solvent using the following equation:where the superscript p indicates the predicted value. These distances were then plotted as colors on a graph with the x and y axes corresponding to the first two principal components of the measured values for e0, s0, a0, b0, and v0, see Figure 1. Those solvents colored red have higher calculated distances between their measured and predicted values [Figure 1].
Figure 1

Performance of the models on the existing chemical space of solvents with known coefficients. The red color indicates poor performance – model outliers.

Performance of the models on the existing chemical space of solvents with known coefficients. The red color indicates poor performance – model outliers. As we can see from the figure, model outliers include: formamide, trifluoroethanol, carbon disulfide, and DMSO. These solvents are on the outskirts of the chemical space. In fact, we can clearly see that the model makes far better predictions for solvents towards the center of the chemical space with particular success in predicting the coefficients for series such as alkanes and alcohols. These observations should give us caution when using the models to predict the solvent coefficients for novel solvents, especially when they do not lie within the chemical space established by solvents with known coefficients. These Open Models (CC0) can be downloaded from the Open Notebook pages [29,31] and can be used to predict the solvent coefficients for any organic solvent; either with the view of predicting partition coefficients or other partitioning processes including solubilities via equation (1); or with the view of finding replacement and novel solvents for current syntheses, recrystallization procedures, and other solvent dependent processes [32]. As an informational note we remind readers that solute solubility and partitioning are only two of the considerations in finding an appropriate replacement solvent. Other considerations include the toxicity and the purchase price of the solvent, disposal costs of the solvent, physical properties of the solvent, and whether or not the solvent undergoes any undesired chemical reactions with other chemical compounds that might be present in the solution. For example, some chemical reactions take place at elevated temperatures and here one would want to use a solvent having a sufficiently high boiling point temperature that it would not vaporize under the experimental conditions.

Sustainable solvents

As an example of the application of our models, we used our models to calculate the solvent descriptors for a list of sustainable solvents from a paper by Moity et. al. [33]. The resulting coefficients for 119 select novel sustainable solvents are presented in Table 3. A complete set of coefficients for all 293 solvents (sustainable, classic, and measured) can be found in Additional file 2. These values should be used in light of the limitation of the model as described above, as possible starting places for further investigation, and not as gospel.
Table 3

Predicted solvent coefficients for select sustainable solvents

Solvent e 0 s 0 a 0 b 0 v 0
1,3-dioxan-5-ol0.407−0.238−0.110−3.6163.523
1,3-dioxolane0.311−0.233−0.305−4.6614.029
1,3-dioxolane-4-methanol0.404−0.250−0.108−3.6413.528
1,4-cineol0.397−0.616−0.909−4.7184.299
1,8-cineol0.393−0.581−0.921−4.7234.316
2-butoxy-1,3-propanediol0.452−0.493−0.285−3.5313.826
2-furfuraldehyde0.3000.023−0.539−4.3053.885
2-methyltetrahydrofuran0.344−0.557−0.565−4.6864.440
2-pyrrolidone0.3060.0110.734−4.7094.020
3-hydroxypropionic acid0.324−0.155−0.180−3.7583.500
3-methoxy-3-methyl-1-butanol0.310−0.637−0.200−3.9164.002
5-(hydroxymethyl)furfural0.4130.084−0.314−3.4513.729
acetic acid0.1930.016−0.103−3.6253.565
acetyl tributyl citrate0.689−0.837−1.091−4.4153.969
alpha-pinene0.544−1.225−3.200−4.7194.511
alpha-terpineol0.410−0.853−0.527−4.0034.172
benzyl alcohol0.365−0.399−0.381−3.9494.143
benzyl benzoate0.483−0.550−1.155−4.5264.072
beta-farnesen0.576−1.418−3.214−4.8524.470
beta-myrcene0.572−1.421−3.173−4.9044.599
beta-pinene0.543−1.245−3.217−4.7234.511
beta-terpineol0.439−0.807−0.554−4.0384.158
butyl laurate0.617−0.934−1.210−4.5784.108
butyl myristate0.635−0.922−1.210−4.5414.108
butyl palmitate0.623−0.917−1.210−4.5414.106
butyl stearate0.615−0.917−1.208−4.5384.106
caprylic acid diethanolamide0.476−0.532−0.279−3.7173.864
cyclademol0.463−1.076−0.413−4.0124.182
cyclopentyl methyl ether0.385−0.387−0.654−4.7174.470
decamethylcyclo-pentasiloxane0.460−0.728−0.788−4.3183.963
dibutyl sebacate0.680−0.892−1.239−4.3293.976
diethyl adipate0.359−0.384−0.954−4.5154.025
diethyl glutarate0.349−0.308−0.979−4.4774.021
diethyl phthalate0.444−0.397−1.082−4.4124.016
diethyl succinate0.354−0.169−0.897−4.4803.956
dihydromyrcenol0.479−1.150−0.458−4.0254.272
diisoamylsuccinate0.571−0.700−1.006−4.2863.973
diisobutyl adipate0.464−0.607−0.997−4.3384.007
diisobutyl glutarate0.481−0.543−1.019−4.3014.000
diisobutyl succinate0.424−0.430−1.007−4.3263.982
diisooctyl succinate0.711−0.861−1.127−4.2623.967
dimethyl 2-methylglutarate0.344−0.144−0.783−4.3653.879
dimethyl adipate0.345−0.156−0.875−4.4463.878
dimethyl glutarate0.342−0.099−0.855−4.4223.849
dimethyl isosorbide0.353−0.132−0.394−4.0833.587
dimethyl phthalate0.411−0.189−1.005−4.3853.940
dimethyl succinate0.3370.063−0.704−4.4303.830
dioctyl succinate0.701−0.891−1.233−4.2983.966
dipropyleneglycol0.392−0.442−0.225−3.4683.748
d-limonene0.558−1.298−3.188−4.8324.527
ethyl lactate0.242−0.026−0.412−3.5753.868
ethyl laurate0.590−0.902−1.133−4.6604.124
ethyl linoleate0.531−0.837−1.094−4.5194.129
ethyl linolenate0.535−0.838−1.100−4.5224.135
ethyl myristate0.601−0.934−1.168−4.5854.116
ethyl oleate0.577−0.885−1.120−4.5204.114
ethyl palmitate0.630−0.922−1.179−4.5444.111
ethylhexyllactate0.515−0.690−0.821−3.6433.974
furfuryl alcohol0.351−0.195−0.047−3.5533.880
gamma-valerolactone0.2930.151−0.795−4.5213.957
geraniol0.435−0.953−0.428−4.1134.238
geranyl acetate0.517−0.799−1.124−4.6254.128
glycerol0.405−0.4300.076−3.4213.476
glycerol carbonate0.2820.082−0.587−3.5303.529
glycerol triacetate0.325−0.139−0.913−4.3813.893
glycerol-1,2,3-tributyl ether0.542−0.934−0.994−4.2574.082
glycerol-1,2,3-triethyl ether0.370−0.473−0.778−4.4274.078
glycerol-1,2,3-trimethyl ether0.315−0.358−0.407−4.2803.931
glycerol-1,2-dibutyl ether0.437−0.680−0.624−3.7093.983
glycerol-1,2-diethyl ether0.361−0.415−0.244−3.6633.932
glycerol-1,2-dimethyl ether0.338−0.410−0.121−3.5253.663
glycerol-1,3-Dibutyl ether0.423−0.658−0.583−3.5924.001
glycerol-1,3-diethyl ether0.357−0.398−0.255−3.5553.864
glycerol-1,3-dimethyl ether0.324−0.402−0.131−3.4673.676
glycerol-1-ethyl monoether0.424−0.380−0.172−3.4323.583
glycerol-1-methyl monoether0.394−0.376−0.106−3.4033.510
glycerol-2-ethyl monoether0.435−0.400−0.151−3.4303.579
glycerol-2-methyl monoether0.403−0.429−0.108−3.3823.500
glycofurol (n = 2)0.479−0.420−0.427−3.3543.673
isoamyl acetate0.310−0.358−0.830−4.7544.262
isobutyl acetate0.251−0.237−0.798−4.7714.249
isododecane0.631−1.656−3.473−4.8424.548
isopropyl palmitate0.730−0.984−1.332−4.3544.040
isopropylacetate0.232−0.186−0.803−4.7084.234
isosorbide dioctanoate0.618−0.827−1.092−4.2163.888
menthanol0.485−1.103−0.435−4.0314.184
menthanyl acetate0.568−0.685−1.121−4.4724.094
menthyl acetate0.566−0.697−1.117−4.5084.107
methyl 5-(dimethylamino) 2-methyl-oxopentanoate0.323−0.119−0.405−4.3783.848
methyl abietate0.635−0.720−1.152−4.4504.083
methyl laurate0.535−0.858−1.071−4.6764.090
methyl linoleate0.505−0.806−1.028−4.5244.081
methyl linolenate0.510−0.794−1.023−4.5234.097
methyl myristate0.583−0.885−1.110−4.6454.078
methyl oleate0.572−0.852−1.072−4.5234.078
methyl palmitate0.611−0.890−1.127−4.5784.076
methyl ricinoleate0.578−0.808−0.890−3.9153.979
methyl stearate0.591−0.880−1.121−4.5544.074
N,N-diethylolcapramide0.485−0.640−0.330−3.7183.857
N,N-dimethyldecanamide0.563−0.767−0.035−4.8124.103
N,N-dimethyloctanamide0.484−0.549−0.010−4.8434.121
nopol0.365−0.784−0.385−4.0264.112
n-propyl acetate0.299−0.349−0.738−4.8894.267
oleic acid0.485−0.817−0.611−4.1064.042
p-cymene0.564−1.163−3.112−4.7974.526
PEG 2000.490−0.423−0.310−3.2973.495
PEG 6000.469−0.528−0.309−3.3073.502
perfluorooctane0.386−0.813−2.663−4.0334.079
propionic acid0.207−0.105−0.185−3.9813.840
1,2-propylene glycol0.387−0.4470.259−3.4473.586
ricinoleic acid0.477−0.812−0.787−3.9383.971
solketal0.297−0.208−0.251−3.6783.789
terpineol acetate0.470−0.618−1.089−4.5414.097
terpinolene0.544−1.209−3.212−4.8604.535
tetrahydrofurfurylic alcohol0.433−0.365−0.168−3.5443.857
tributyl citrate0.572−0.723−0.887−3.8923.961
triethyl citrate0.379−0.317−0.618−3.8353.826
trimethylene glycol0.434−0.6270.236−3.7263.600
Predicted solvent coefficients for select sustainable solvents By comparing the predicted solvent coefficients to that of solvents with measured coefficients, we can make solvent replacement suggestions both in general and in particular. In general, the distance between solvents can be measured as the difference in predicted solubilities for the average compound. Using this method we found several possible replacements. For example, 1,2-propylene glycol (e0 = 0.387, s0 = −0.447, a0 = 0.259, b0 = −3.447, v0 = 3.586) and methanol (e0 = 0.312, s0 = −0.649, a0 = 0.330, b0 = −3.355, v0 = 3.691) have a d-value of 0.07. This suggests that 1,2-propylene glycol may be a general sustainable solvent replacement for methanol. To confirm our model’s suggestion, we compared the solubilities of compounds from the Open Notebook Science Challenge solubility database [34] that had solubility values for both 1,2-propylene glycol and methanol, see [Figure 2].
Figure 2

Experimental solubilities in both methanol and 1,2-propylene glycol.

Experimental solubilities in both methanol and 1,2-propylene glycol. Examining Figure 2, we see that solubility values are of the same order in most cases. The biggest discrepancy being for dimethyl fumerate. The measured solubility values are reported to be 0.182 M and 0.005 M for methanol and propylene glycol respectively [34], whereas the predicted solubilities are 0.174 M for methanol and 0.232 M for propylene glycol based upon the Abraham descriptors: E = 0.292, S = 1.511, A = 0.000, B = 0.456, V = 1.060 [35]. This suggests that the reported value for the solubility of dimethyl fumerate in ethylene glycol may be incorrect and that, in general, 1,2-propylene glycol is a sustainable solvent replacement for methanol. Other strongly suggested general replacements include: dimethyl adipate for hexane, ethanol/water(50:50)vol for o-dichlorobenzene, and alpha-pinene for 1,1,1-trichloroethane. Many more replacement suggestions can be generated by this technique. In a similar manner to the above procedure for general solvent replacement for all possible solutes, one can easily compare partition and solvation properties across all solvents for a specific solute (or set of solutes) with known or predicted Abraham descriptors (E, S, A, B, V). For example, using descriptors E = 0.730, S = 0.90, A = 0.59, B = 0.40, V = 0.9317 for benzoic acid (and using d = 0.001), we can make several benzoic acid-specific solvent replacement recommendations, see Table 4. These replacement suggestions do not seem unreasonable chemically and several examples can be explicitly verified by comparing actual measured solubility values [34]. Such a procedure can easily be done for other specific compounds with known or predicted Abraham descriptors to find alternative green solvents in varying specific circumstances (solubility, partition, etc.).
Table 4

Replacement solvent suggestions for procedures involving benzoic acid

Solvents Possible replacement
benzonitrile1,4-cineol
1-hexanolN,N-diethylolcapramide
sulfolaneethylene glycol
methyl tert-butyl etherdiethyl succinate
diethyl etherethylhexyllactate
2-pentanolgamma-valerolactone
2-methyl-2-propanolglycerol, cyclopentyl methyl ether
trifluoroethanolmethyl ricinoleate
diphenyl etherisopropyl palmitate
n-propylaminericinoleic acid
propionitrile1,4-cineol
ethylenediaminediisobutyl adipate
2,6-dimethylpyridineethyl acetate
4-picolineethanol/water(80:20)vol
diethylamineglycerol-1,2,3-triethyl ether, dihydromyrcenol
morpholineethanol/water(90:10)vol
diethylene glycolglycerol-1-ethyl monoether
2-aminoethanolglycerol-1-methyl monoether
3-pentanolglycerol-2-methyl monoether
anilinedimethyl phthalate
2-methyl-2-butanol2-furfuraldehyde
nitroethanebutyl myristate
Replacement solvent suggestions for procedures involving benzoic acid In addition to sustainable solvents, we also considered the list of commonly used solvents in the pharmaceutical industry [36]. Of all the solvents listed, the only one not covered previously by this work (Additional file 2) was 4-methylpent-3-en-2-one which has SMILES: O = C(\\C = C(/C)C)C and predicted solvent coefficients: e0 = 0.269, s0 = −0.362, a0 = −0.610, b0 = −4.830, v0 = 4.240.

Conclusions

We have provided a set of Open Models that can be used to predict the Abraham coefficients for any organic solvent. These coefficients can then in turn be used to predict various partition processes and solubilities of compounds with known or predicted Abraham descriptors. We illustrated the usefulness of the models by demonstrating how one can compare solvent coefficients both in general and in particular for specific solutes or sets of solutes to find solvent replacement leads.
  9 in total

1.  Air to muscle and blood/plasma to muscle distribution of volatile organic compounds and drugs: linear free energy analyses.

Authors:  Michael H Abraham; Adam Ibrahim; William E Acree
Journal:  Chem Res Toxicol       Date:  2006-06       Impact factor: 3.739

2.  A data base for partition of volatile organic compounds and drugs from blood/plasma/serum to brain, and an LFER analysis of the data.

Authors:  Michael H Abraham; Adam Ibrahim; Yuan Zhao; William E Acree
Journal:  J Pharm Sci       Date:  2006-10       Impact factor: 3.534

3.  Air to fat and blood to fat distribution of volatile organic compounds and drugs: linear free energy analyses.

Authors:  Michael H Abraham; Adam Ibrahim
Journal:  Eur J Med Chem       Date:  2006-09-22       Impact factor: 6.514

4.  Air to liver partition coefficients for volatile organic compounds and blood to liver partition coefficients for volatile organic compounds and drugs.

Authors:  Michael H Abraham; Adam Ibrahim; William E Acree
Journal:  Eur J Med Chem       Date:  2007-01-09       Impact factor: 6.514

5.  Air to brain, blood to brain and plasma to brain distribution of volatile organic compounds: linear free energy analyses.

Authors:  Michael H Abraham; Adam Ibrahim; William E Acree
Journal:  Eur J Med Chem       Date:  2006-03-03       Impact factor: 6.514

6.  Prediction of solubility of drugs and other compounds in organic solvents.

Authors:  Michael H Abraham; Robert E Smith; Ron Luchtefeld; Aaron J Boorem; Rensheng Luo; William E Acree
Journal:  J Pharm Sci       Date:  2010-03       Impact factor: 3.534

Review 7.  Organic solvents in the pharmaceutical industry.

Authors:  Katarzyna Grodowska; Andrzej Parczewski
Journal:  Acta Pol Pharm       Date:  2010 Jan-Feb       Impact factor: 0.330

8.  Partition of compounds from water and from air into amides.

Authors:  Michael H Abraham; William E Acree; J Enrique Cometto-Muñiz
Journal:  New J Chem       Date:  2009       Impact factor: 3.591

9.  Solvation thermodynamics and the physical-chemical meaning of the constant in Abraham solvation equations.

Authors:  Paul C M van Noort
Journal:  Chemosphere       Date:  2011-12-22       Impact factor: 7.086

  9 in total
  3 in total

1.  Prediction of 1-octanol solubilities using data from the Open Notebook Science Challenge.

Authors:  Michael A Buonaiuto; Andrew S I D Lang
Journal:  Chem Cent J       Date:  2015-09-24       Impact factor: 4.215

2.  Using water-solvent systems to estimate in vivo blood-tissue partition coefficients.

Authors:  Caitlin E Derricott; Emily A Knight; William E Acree; Andrew Sid Lang
Journal:  Chem Cent J       Date:  2015-10-16       Impact factor: 4.215

3.  Multi-phase multi-layer mechanistic dermal absorption (MPML MechDermA) model to predict local and systemic exposure of drug products applied on skin.

Authors:  Nikunjkumar Patel; James F Clarke; Farzaneh Salem; Tariq Abdulla; Frederico Martins; Sumit Arora; Eleftheria Tsakalozou; Arran Hodgkinson; Omid Arjmandi-Tash; Sinziana Cristea; Priyanka Ghosh; Khondoker Alam; Sam G Raney; Masoud Jamei; Sebastian Polak
Journal:  CPT Pharmacometrics Syst Pharmacol       Date:  2022-06-27
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.