Literature DB >> 35936486

General Equation to Express Changes in the Physicochemical Properties of Organic Homologues.

Chao-Tun Cao1, Chenzhong Cao1.   

Abstract

Changes in various physicochemical properties (P (n)) of organic compounds with the number of carbon atoms (n) can be roughly divided into linear and nonlinear changes. To date, there has been no general equation to express nonlinear changes in the properties of organic homologues. This study proposes a general equation expressing nonlinear changes in the physicochemical properties of organic homologues, including boiling point, viscosity, ionization potential, and vapor pressure, named the "NPOH equation", as follows: P (n) = P (1) α n - 1 e ∑i=2 n(β/(i - 1)) where α and β are adjustable parameters, and P (1) represents the property of the starting compound (pseudo-value at n = 1) of each homologue. The results show that various nonlinear changes in the properties of homologues can be expressed by the NPOH equation. Linear and nonlinear changes in the properties of homologues can all be correlated with n and the "sum of carbon number effects", ∑i=2 n(1/i - 1). Using these two parameters, a quantitative correlation equation can be established between any two properties of each homologue, providing convenient mutual estimation of the properties of a homologue series. The NPOH equation can also be used in property correlation for structures with functionality located elsewhere along a linear alkyl chain as well as for branched organic compounds. This work can provide new perspectives for studying quantitative structure-property relationships.
© 2022 The Authors. Published by American Chemical Society.

Entities:  

Year:  2022        PMID: 35936486      PMCID: PMC9352247          DOI: 10.1021/acsomega.2c02828

Source DB:  PubMed          Journal:  ACS Omega        ISSN: 2470-1343


Introduction

In 2021, Kontogeorgis et al.[1] investigated the industrial requirements for thermodynamic and transport properties, reporting that: “In terms of models, companies ideally wish for a single universal model for all/many applications, but there is understanding that this is possibly utopian. The second major wish is the need for predictive models validated on extensive experimental databases and not only on just a few available experimental data points.” This shows the importance of molecular modeling. The quantitative structure–property relationship (QSPR) method is important for understanding and predicting compound properties and it has seen rapid development.[2,3] Recently, molecular dynamics and machine learning have also been used in drug design.[4] The two key steps for developing successful QSPR models are molecular descriptor calculations and equation selection. Generally, the development of validated QSPR models first involves calculating the molecular descriptors and then, based on experimental data, selecting an appropriate number of descriptors from a large number of calculated molecular descriptors to establish the QSPR equation.[2,5−9] In this method, descriptor screening and equation optimization are time-consuming, and the physical meaning of the resulting equations is unclear and difficult to interpret. Other methods are also available for predicting the physicochemical properties of compounds. In these methods, theoretical equations are first proposed and molecular characteristic parameters are extracted for specific compounds. Data fitting is then used to form specific application equations, such as in state equation 1(2)where u and w are characteristics of the equation (u = 1 and w = 0 or u = 2 and w = −1 for Redlich-Kwong[10] or Peng–Robinson (PR),[11] respectively), and a and b are parameters computed using critical pressure (Pc) and temperature (Tc), as shown in eqs and 3where Ωa and Ωb have specific values depending on the selected equation, and the temperature function, α(T), is equal to 1 for T = Tc. Similar to eq , these theoretical equations have clear physical meanings, allowing molecular descriptors to be calculated in a targeted manner. However, this type of equation is not easy to propose. The aforementioned methods show that a QSPR equation can be established more easily if the general regularity of changes in compound properties is known in advance and then molecular descriptors are selected. Therefore, we have considered classifying chain organic compounds and discussing the regularity of changes in their properties, which is more conducive to establishing QSPR equations. Accordingly, organic compounds with different functional groups were first divided into homologues and then into isomers. Following the concept of homologue–isomer property correlation, we can establish a structure–property correlation equation for chain organic compounds with different functional groups. Changes in the properties (P) of organic homologues are known to be related to the number of carbon atoms (n), allowing a theoretical equation between P and n, P = f(n), to be established. On this basis, the characteristic parameter of the molecular structure of an isomer (s) can be extracted using the molecular descriptor calculation in the QSPR method, and then the property-change equation of the chain organic compound (P = f(n, s)) can be established. This accounts for the advantages of the two aforementioned methods and reduces the difficulty of establishing structure–property correlation equations. This study aimed to explore a general equation for expressing changes in the physicochemical properties of organic homologues. Much research has focused on expressing the regularity of property changes in organic homologues, with four types of empirical expression having been proposed. As early as 1961, Kreglewsky[12] proposed eq to evaluate the boiling points of n-paraffin compounds in the range of C1–C100. This work pioneered the quantitative correlation of homologue propertieswhere Y is the boiling point value at carbon number n and Y0 is the boiling point pseudo-value at n = 0, Y∞ = 1078, and β = 4.999 × 10–3. In 1985, Gasem and Robinson et al.[13,14] proposed eq for evaluating the critical properties of n-paraffins, known as the ABC equationwhere variables Y, Y∞, and Y0 are property values at carbon number n, the limiting property value as n becomes very large, and the property pseudo-value at n = 1, respectively, and β and α are adjustable parameters. In 1997, Marano and Holder[15] proposed eq to correlate the thermodynamic and transport properties of n-paraffins and other homologous series, which can be used for a wide range of different thermophysical propertieswhere five adjustable parameters, n0, Y0, Y∞, β, and γ, are employed. In 2009, Zenkevich[16] employed a recurrent function to correlate the properties of homologous series, as shown in eq where A(n + 1) is the physicochemical property value for homologues with (n + 1) carbon atoms, A(n) is the property value for homologues with n carbon atoms, and coefficients a and b are calculated by the least squares method. Equation is different from eqs –6 because the parameter Y∞ is not required. Notably, eq was proposed more than 60 years ago, and the inherent change regularity of homologue properties with carbon number n remains unknown. The equations proposed previously provide great insight, but two problems remain: (i) Their physical meaning is unclear, making them difficult to explain; and (ii) they cannot correlate one property to another property for a series of homologues. Therefore, this study investigated the change regularity of homologue properties with carbon atom number n to explore the relationship between the properties of homologues.

Results and Discussion

Theoretical Analysis

Changes in the physicochemical properties (P) of organic homologues with carbon atom number n can be roughly divided into linear and nonlinear changes. The linear change of a property is relatively simple and will not be discussed in detail here. This study focuses on the complex nonlinear change of a property. Using n-paraffins (H–(CH2)–H) as an example, some P values increase with an increasing n value, while some P values decrease with an increasing n value, The P changes show significant differences, as shown in Figure . We investigated how these different changes are dominantly affected by n, and whether all of these changes follow a common rule, through theoretical analysis. Usually, organic homologues can be represented as X–(A)–Y, where A is a repeating structural unit, n is the number of repeating structural units, and X and Y are end groups. For n-paraffins, A is CH2, and both X and Y are H atoms, represented by H–(CH2)–H. In a series of organic homologues, the property of compound X–(A)–Y containing n repeating structural units is represented by P(, and the property of another compound containing n – 1 repeating structural units X–(A)–Y is represented by P(. The molecular structure of X–(A)–Y is known to be formed by adding repeating structural unit A to molecule X–(A)–Y, namely, X–(A)–Y becomes X–(A)–A–Y. The addition of unit A causes a change of property P( of X–(A)–Y to property P( of X–(A)–Y. Therefore, property P( of X–(A)–Y can be considered a perturbation based on property P( of X–(A)–Y. This perturbation effect must be related to the original molecular chain length (A). Assuming that the perturbation effect acts in the form of an exponential, exp(β/(n – 1)), then, the relationship between P( and P( can be expressed by eq where α is the proportional coefficient of the property change and β is an adjustable parameter of the perturbation effect. Based on eq , expressions for P(2), P(3), P(4), ..., can be deduced as followsFrom eqs –11, eq can also be expressed as eq Equation expresses the relationship between the property of organic homologues X–(A)–Y and the number of repeating structural units, n. If the logarithm is taken, eq can be expressed as eq where a = ln(P(1)), b = ln(α), and c = β. These are characteristic coefficients related to the properties of each homologue and can be obtained by regression analysis. For convenience, term in eqs and 13 is subsequently denoted as SCNE (abbreviation of “sum of carbon number effects”). Accordingly, eqs and 13 can be written in compact forms of eqs and 15, respectively, named the “NPOH equation” (that is, the abbreviation of “Nonlinear Properties of Organic Homologues equation”)In contrast to eqs –7, eqs and 14 show that property P( of homologue X–(A)–Y is related to the starting compound property P(1) (property pseudo-value at n = 1) of each homologue, as well as the number of repeating structural units, n, and the cumulative perturbation effect, SCNE, of repeating structural units.
Figure 1

(a) Relationship of boiling point (Tb), (b) viscosity (μ), and (c) ionization potential (Ip) with the carbon atom number (n) of n-paraffins.

(a) Relationship of boiling point (Tb), (b) viscosity (μ), and (c) ionization potential (Ip) with the carbon atom number (n) of n-paraffins.

Applicability of NPOH Equation

Correlation with the Properties of n-Paraffins

To test the applicability of NPOH equations 14 or 15, quantitative correlation analysis was conducted using various physicochemical properties (nonlinear changes) of n-paraffins. First, the SCNE value was calculated, as listed in Table , and then eq was employed as a model to perform regression analysis of properties P( of n-paraffins. The results are listed in Table .
Table 1

Number of Repeating Structural Units, n, and SCNE Values

nn – 1SCNEnn – 1SCNEnn – 1SCNEnn – 1SCNE
10 26253.816051504.499276754.9014
211.000027263.854452514.518877764.9145
321.500028273.891553524.538078774.9275
431.833329283.927254534.556979784.9403
542.083330293.961755544.575480794.9530
652.283331303.995056554.593681804.9655
762.450032314.027257564.611582814.9778
872.592933324.058558574.629083824.9900
982.717934334.088859584.646384835.0021
1092.829035344.118260594.663285845.0140
11102.929036354.146861604.679986855.0257
12113.019937364.174662614.696387865.0374
13123.103238374.201663624.712488875.0489
14133.180139384.227964634.728389885.0602
15143.251640394.253565644.743990895.0715
16153.318241404.278566654.759391905.0826
17163.380742414.302967664.774492915.0936
18173.439643424.326768674.789493925.1044
19183.495144434.350069684.804194935.1152
20193.547745444.372770694.818695945.1258
21203.597746454.394971704.832896955.1363
22213.645447464.416772714.846997965.1468
23223.690848474.438073724.860898975.1571
24233.734349484.458874734.874599985.1673
25243.776050494.479275744.8880100995.1774
Table 2

Correlation Equations of Properties for n-Paraffins H–(CH2)–H (Model Equation: ln(P() = a + b(n – 1) + cSCNE)

no.propertyarange of nbrefabcRSNcF
1Tb2–100(17)4.6973–0.0045690.510400.99990.00573473 691.18
2Tc2–20(20)5.2972–0.0089810.427580.99990.00301965 747.43
3Pc2–20(20)1.9097–0.050147–0.246980.99920.0191195084.85
4Cp4–16(21)3.63820.0256660.663680.99970.0099117888.46
5λ5–16(22)–2.7579–0.0146470.307620.99440.01017177.44
6ST5–16(22)1.1369–0.0480770.867170.99900.01087978.56
7μ5–35(18)–5.62860.0434851.799920.99960.04482814 301.74
8nD4–20(21)0.15997–0.0034470.0762260.99870.0012162534.24
9Pv2–16(21)7.4528–0.461790–0.368341.00000.010315347 794.19
10ω2–20(23)–3.2314–0.0056940.905140.99970.01581913 904.65
11Ip2–11(19)2.56430.004981–0.121560.99960.0020104094.40
12D5–16(19)–1.0445–0.0161740.309660.99940.0025123850.05
13TF3–16(24) and (25)4.47830.0000310.454560.99700.021314915.62
14Pw2–8(26)1.12610.5364780.124950.99900.07166736.35

Tb, boiling point (K); Tc, critical temperature (K); Pc, critical pressure (MPa); Cp, heat capacity (J·mol–1·K–1); λ, thermal conductivity (W·m–1 K–1); ST, surface tension (mN·m–1); μ, liquid viscosity [μm Pa·s(300 K)]; nD, refractive Index; Pv, vapor pressure (regression in the form of log Pv); ω, acentric factor; Ip, ionization potential (eV); D, liquid density (g·cm3); TF, flash point (K); and Pw, partition coefficient (regression in the form of log Pw). These properties data are listed in the Supporting Information.

Carbon atom number range.

Number of data points.

Tb, boiling point (K); Tc, critical temperature (K); Pc, critical pressure (MPa); Cp, heat capacity (J·mol–1·K–1); λ, thermal conductivity (W·m–1 K–1); ST, surface tension (mN·m–1); μ, liquid viscosity [μm Pa·s(300 K)]; nD, refractive Index; Pv, vapor pressure (regression in the form of log Pv); ω, acentric factor; Ip, ionization potential (eV); D, liquid density (g·cm3); TF, flash point (K); and Pw, partition coefficient (regression in the form of log Pw). These properties data are listed in the Supporting Information. Carbon atom number range. Number of data points. The results listed in Table show that the correlation coefficients (R) were all above 0.99 for the correlation equations of 14 properties of n-paraffins, indicating that eq can be used to express the regularity of property changes among homologues using the carbon atom number n. That properties with different change regularities can have the same expression is surprising and has not been reported previously. We noted that a value of boiling point Tb equation in Table (no. 1) is 4.6973, indicating that ln(P(1)) is 4.6973, and the starting compound boiling point (pseudo-value at n = 1), P(1), is 109.65 K for n-paraffins. The experimental boiling point of methane is known to be 109.15 K, which is very close to 109.65 K. Interestingly, for the linear change property, PLC(, of homologues, it was only necessary to directly replace term ln(P() in eq with term PLC(, as shown in eq For example, for the critical volume[21] (Vc) of n-paraffins (n = 2–18), n was used directly to perform regression against Vc, giving eq , while eq was used as the model equation to perform regression against Vc to obtain eq . Equation showed better correlation compared with eq , and the standard error (S) was greatly reducedAs another example, the vaporization enthalpy[21] (Hv) of n-paraffins (n = 2–20) was regressed with the methods used in eqs and 18, giving eqs and 20. The correlation of eq was better than that of eq , and the standard error (S) of eq was greatly reducedThe above regression analysis results show that eq is more suitable for expressing the linear change properties of homologues compared with directly using n. However, for some linear change properties of homologues, such as molar mass (Mmol), coefficient c in eq was equal to zero, meaning that the term SCNE did not need to be used in such cases.

Correlation with the Properties of Other Homologues

Regarding whether eq was applicable to other homologues, cycloalkanes, terminal alkenes, terminal alkynes, linear alcohols, 2-methylparaffins, 2,2-dimethylparaffins, 2,3-dimethylparaffins, and 1-cyclopentylparaffins were selected as model compounds for testing. Using the boiling points Tb of cycloalkanes, terminal alkenes, terminal alkynes, linear alcohols, 2-methylparaffins, 2,2-dimethylparaffins, 2,3-dimethylparaffins, and 1-cyclopentylparaffins and the partition coefficient Pw of alcohols as verification examples, quantitative correlation analysis was conducted using model equation15. The results are listed in Table , showing that the property correlation was very good for each homologue, and that the correlation coefficients (R) of the nine equations were all greater than 0.999. These results showed that eq is a general equation that can be used to express the change regularity of properties P of various homologues using carbon atom number n.
Table 3

Correlation Equations of Properties for Cycloalkanes, Alkenes, Alkynes, Alcohols, 2-Methylparaffins, 2,2-dimethylparaffins, 2,3-Dimethylparaffins, and 1-Cyclopentylparaffins (Model Equation: ln(P() = a + b(n – 1) + cSCNE)

Pw, partition coefficient (regression in the form of logPw); other symbols are defined in Table . These properties data are listed in the Supporting Information.

Carbon atom number range.

Number of data points.

Pw, partition coefficient (regression in the form of logPw); other symbols are defined in Table . These properties data are listed in the Supporting Information. Carbon atom number range. Number of data points. The results obtained in Sections and 2.2.2 show that, regardless of whether the properties of homologues vary linearly or nonlinearly with carbon atom number n, they can be related using variables (n – 1) and SCNE.

Relationship between Nonlinear Properties of Homologues

Equation not only expresses the change regularity of the properties of homologues with the carbon atom number n but can also link the change regularities of different properties of homologues. If one property of homologue is P( and the other property is P′(, basing on eq , we can theoretically deduce eqs and 22In eq , ar = ln(P(1)) – ln(P′(1)), br = ln(α) – ln(α′), and cr = β – β′, while in eq , as = ln(P(1)) + ln(P′(1)), bs = ln(α) + ln(α′), and cs = β + β′. For the two specific physicochemical properties, P( and P′(, of the same series of compounds, parameters ar, br, cr, as, bs, and cs can be obtained using the regression method. When the logarithms of eqs and 22 are removed, they return to eqs and 24Equations and 24 express the relationship between two properties of a homologues series, where k, l, cr, m, q, and cs are all coefficients. Therefore, one property of homologues can be correlated with another using eqs and 24, providing convenient correlation of the homologue properties. For example, the relationship between the critical temperature (Tc) and boiling point (Tb) of n-paraffins is shown in eq Equation expresses the quantitative relationship between Tb and Tc for n-paraffins. Using eq , the critical temperatures of n-paraffins can be calculated from their boiling points. Table shows the results of critical temperature calculations for n-paraffins with n = 2–32, where the average absolute error between the experimental value (Tc,exp.) and the calculated value (Tc,cal.) was 1.84 K for the 19 compounds (n = 2–20). Figure shows a plot of the calculated and experimental values of critical temperature versus the boiling point of n-paraffins.
Table 4

Experimental and Calculated Critical Temperature, Tc (K), for n-Paraffins H–(CH2)–H

naTb,exp.bTc,exp.cTc,cal.dnaTb,exp.bTc,exp.cTc,cal.dnaTb,exp.bTc,exp.cTc,cal.d
2184.55305.6307.8413508.29676.1678.3624664.29 793.90
3231.05369.2368.8514526.69693.1694.8625674.29 798.76
4272.65425.2421.9115543.59708.1709.2326685.19 804.63
5309.25469.9466.8416559.99722.1722.8227695.09 809.28
6342.15507.7505.8017574.79734.1734.2328704.59 813.41
7371.55540.7539.2518589.09747.1744.9029713.79 817.16
8398.85568.9569.3519601.99755.4753.7130722.69 820.52
9423.92594.6596.0020615.99767.6763.8131731.69 823.96
10447.27617.8620.0021629.49 773.2032740.09 826.68
11468.79639.1641.2422641.59 780.77    
12489.29658.2660.9023653.19 787.67    

Number of carbon atoms.

Experimental boiling point (K), from ref (17).

Experimental critical temperature (K), from ref (20).

Critical temperature calculated using eq .

Figure 2

Plot of calculated (solid line) and experimental (open circle) critical temperature (Tc) values versus experimental boiling point (Tb) of n-paraffins (n = 2–32).

Plot of calculated (solid line) and experimental (open circle) critical temperature (Tc) values versus experimental boiling point (Tb) of n-paraffins (n = 2–32). Number of carbon atoms. Experimental boiling point (K), from ref (17). Experimental critical temperature (K), from ref (20). Critical temperature calculated using eq . As further examples, eqs and 24 were used as model equations to quantitatively correlate the boiling point of n-paraffins with other properties in Table . The results are listed in Table , showing that any nonlinear change property of n-paraffins could be correlated with the boiling point. As another example, the liquid viscosity (μ) equation in Table (no. 6) was used to calculate the μ values (at 300 K) of n-paraffins using boiling points (Tb), with the results shown in Figure . The calculated values were in good agreement with the experimental values.
Table 5

Correlation Equations between the Boiling Point, Tb, and Other Properties of n-Paraffins (Model Equations 23 and 24)

no.propertyarange of nar(as)br(bs)cr(cs)RSNF
1Tc2–200.593807–0.0056465–0.076510.99890.0042193715.10
2Pc2–20–2.79364–0.0468120–0.751070.99960.02351910 289.86
3Cp4–16–1.011020.03146570.131460.99880.0095111670.97
4λ5–16–7.39959–0.0084513–0.228590.99790.01047476.21
5STb5–16(5.77866)(−0.0542733)(1.40338)0.99980.010674300.28
6μ5–32–10.29840.04872151.277320.99920.0475256681.44
7nD4–20–4.481130.0030487–0.461181.00000.00191698 646.53
8Pv2–162.737019–0.4605237–0.861351.00000.011915339 306.10
9ω2–10–7.934717–0.00235960.4010520.99910.0126194279.41
10Ip2–11–2.1721430.0013372–0.5933270.99990.00561019 231.41
11D5–16–5.686163–0.0099647–0.2265920.99980.00261212752.15
12TFb3–16(9.143340)(−0.0048236)(0.977959)0.99930.0206144213.98
13Pw2–8–3.6314010.5256402–0.3216550.99850.06866487.63

Symbols are defined in Table , obtained using model equation 23 except when noted otherwise.

Obtained using model equation 24.

Figure 3

Plot of calculated (−) and experimental (△) liquid viscosity, μ, values versus experimental boiling point, Tb, of n-paraffins (n = 5–32).

Plot of calculated (−) and experimental (△) liquid viscosity, μ, values versus experimental boiling point, Tb, of n-paraffins (n = 5–32). Symbols are defined in Table , obtained using model equation 23 except when noted otherwise. Obtained using model equation 24.

Relationship between Nonlinear and Linear Properties of Homologues

Equation expresses the change regularity of the linear properties of homologues, while eq expresses the change regularity of the nonlinear properties of homologues. Comparing eq with eq showed that they had the same variables (n – 1) and SCNE. Therefore, a correlation between the nonlinear and linear properties of the homologues can be established by combining eqs and 15, as shown in eq where, for the two specific physicochemical properties, P( and PLC(, of the same series of homologues, parameters a′, b′, and c′ can be obtained by the regression method Using the boiling point (Tb) and critical volume (Vc) of n-paraffins as an exampleThereforeHere, taking Vc, (n – 1), and SCNE as variables, regression equation can be obtained for ln(Tb)Equation can be rewritten as eq Using eq and employing parameters Vc, (n – 1), and SCNE of n-paraffins, the Tb values of n-paraffins can be calculated. The average absolute error between the experimental and calculated Tb values was 0.94 K for the 17 n-paraffins with n = 2–18.

Simplification of Equation

Equations and 13 contain the term , which is inconvenient to use. This section discusses the simplification of this term. According to mathematical principles, for the infinite series SS{1, 1/2, 1/3, ..., 1/n}, the sum of the first n terms is shown in eq Equation can also be approximately expressed as eq Therefore, the term can be expressed as followsTherefore, eqs and 13 can be simplified to eqs and 35For example, using eq as a model equation to perform regression analysis of Tb and Tc values of n-paraffins, respectively, afforded eqs and 37 For the boiling point (Tb)For the critical temperature (Tc)Equations and 37 still show good correlation, which is only slightly worse than that of the corresponding equations (nos. 1 and 2 of Table ). However, eqs and 37 are more convenient to use than eqs and 13.

Relationship between SCNE and the Parameter c(G)

Recently, Mukwembi et al.[27] proposed a new graph parameter c(G), called the conduction of a graph, and exploited the conduction of a graph to develop a single parameter model for predicting the boiling point of any given alkane. This work is very meaningful, which simplifies the parameters for estimating the boiling point of alkanes. It is interesting to compare and contrast the SCNE parameter with the parameter c(G). Here, we made a plot of exp(SCNE) of this work versus the parameter c(G)[27] of n-paraffins (n = 4–14), and obtained Figure . From Figure , we can see that there is a good linear correlation of exp(SCNE) with the parameter c(G). It means that the parameter exp(SCNE) is intrinsically related to the parameter c(G) for n-paraffins. Whether there is a relationship of exp(SCNE) to other graph theoretical parameters or not is worth studying, and the interested readers can pursue this area.
Figure 4

Plot of exp(SCNE) versus the parameter c(G) of ref (27) for n-paraffins (n = 4–14).

Plot of exp(SCNE) versus the parameter c(G) of ref (27) for n-paraffins (n = 4–14).

Application of NPOH Equation in Branched Alkanes and Alcohols

For the branched alkanes and organic compounds with functionality located elsewhere along a linear alkyl chain, their physical properties are first affected by the number of carbon atoms in the structure, and second by their molecular structure differences. Therefore, we can establish an estimation model of their properties by adding structural parameters that characterize the molecular carbon atomic skeleton to the NPOH equation. We take the boiling points of alkanes and alcohols for example.

Example 1, Boiling Point of Alkanes

Alkane molecules are almost nonpolar, and their boiling points are affected by both the number of carbon atoms and the molecular carbon atomic skeleton in the molecule. The effect of the number of carbon atoms on the boiling point can be expressed with the parameters (n – 1) and SCNE, while that of the molecular carbon atomic skeleton can be expressed using the Odd–Even Index (OEI) difference, ΔOEI.[21] That is, for the alkane molecule with n carbon atoms, ΔOEI = OEI (branched alkane) – OEI (normal alkane). Based on Yuan’s method,[21] the OEI values of 2-methylpropane and butane are 4.5000 and 5.2222, respectively. Thus, ΔOEI(2-methylpropane) = 4.5000 – 5.2222 = −0.7222; ΔOEI (butane) = 4.5000 – 4.5000 = 0. It means that ΔOEI = 0 for any normal alkane. Then, we employed (n – 1), SCNE, and (ΔOEI/n) as variables to perform the regression analysis of boiling points, Tb(, of alkanes and obtained eq in which the boiling points for all alkanes were taken from a listing by Cao et al.[28] These data are listed in the Supporting InformationIn eq , the 210 alkanes involve all isomers of C2–C10, and the carbon atoms are in the range of C1–C100. The average absolute error between the experimental value (Tb,exp.) and the calculated value (Tb,cal.) was 3.07 K, and the average percentage error was 0.69%.

Example 2, Boiling Point of Alcohols

The molecular structure of alcohol is different from that of alkane in which there is a polar functional group OH. Thus, the boiling points of alcohols are affected by the number of carbon atoms, molecular carbon atomic skeleton, and position of group OH located at the alkyl chain in a molecule. The effect of the latter two factors on the boiling point can be expressed by the polarization effect index PEIOH difference, ΔPEIOH, because the PEIOH of the alcohol is related to the alkyl skeleton and the position of the OH group in the molecule. That is, for the alcohol molecule with n carbon atoms, ΔPEIOH = PEIOH (branched alcohol) – PEIOH (normal alcohol). Taking 2-propanol (CH3)2CHOH (i-PrOH) and 1-propanol CH3CH2CH2OH (PrOH), for example, the PEIOH values of i-Pr and Pr are 1.2811 and 1.1887, respectively. Hence, the difference in PEIOH of i-PrOH is ΔPEIOH(i-PrOH) = 1.2811 – 1.1887 = 0.0924, and the difference in PEIOH of PrOH is ΔPEIOH(PrOH) = 1.1887 – 1.1887 = 0. It means that ΔPEIOH = 0 for any normal alcohol. For the detailed calculation method of PEIOH, one can refer to Cao’s report.[28] Thus, we employed (n – 1), SCNE, and (ΔPEIOH/n) as variables to perform regression analysis of boiling points Tb( of alcohols and obtained eq 39 in which the boiling points for all alcohols were taken from a listing by Cao et al.[28] These data are listed in the Supporting InformationIn eq , the 105 alcohols involving the carbon atoms are in the range of C2–C11. The average absolute error between the experimental value (Tb,exp.) and the calculated value (Tb,cal.) was 4.40 K, and the average percentage error was 1.01%. The results indicate that eqs and 39 provide very satisfactory results. We noted that it is difficult to compare the results of this work with those of other works because different data sets and different number of variables were employed in various works. In particular, many of the other works only include alkanes with below 11 carbon atoms. Nevertheless, it is instructive that Allison et al.,[29] in 2022, used a graph convolutional neural network to predict the normal boiling point and obtained an average absolute error of 6 K (corresponding to an average percentage error of 1.32%). Cordes and Rarey[30] compared some group contribution methods for predicting the boiling points of acyclic alkanes and found that these methods produced average absolute errors ranging from 6.5 to 26.7 K. Palatinus et al.[31] proposed eq to calculate the YR parameter and then used the calculated YR values and eq to predict the boiling point for linear and branched alkanes in which the average absolute deviation between experimental and predicted boiling points was 1.45 K, and the average percentage error was 0.32%. Liu et al.[32] used the QSPR method to predict the boiling points of alcohols and obtained the standard error of 5.69 KThe above two examples show that the NPOH equation can also be used in property correlation for structures with functionality located elsewhere along a linear alkyl chain as well as for branched organic compounds.

Conclusions

This research shows that various nonlinear change properties of homologues can be expressed using a general equation, named the NPOH equation (eqs or 15), while various linear change properties of homologues can be expressed by general equation16. Therefore, both linear and nonlinear change properties of homologues can be correlated with the number of carbon atoms (n) and the “sum of carbon number effects” (SCNE). Furthermore, using the two parameters n and SCNE, a quantitative correlation equation can be established between any two properties of each homologue, providing convenient mutual estimation of properties of a series of homologues. Finally, for nonstraight-chain isomers, using the NPOH equation, we can add structural parameters that characterize the molecular carbon atomic skeleton to establish an estimation model for their properties. The study on the properties of branched chain organic compounds will be reported in a subsequent paper. This work can provide a new perspective for studying quantitative structure–property relationships (QSPRs).
  11 in total

1.  Prediction of surface tension, viscosity, and thermal conductivity for common organic solvents using quantitative structure-property relationships.

Authors:  G W Kauffman; P C Jurs
Journal:  J Chem Inf Comput Sci       Date:  2001 Mar-Apr

2.  Topological indices based on vertex, edge, ring, and distance: application to various physicochemical properties of diverse hydrocarbons.

Authors:  Hua Yuan; Chenzhong Cao
Journal:  J Chem Inf Comput Sci       Date:  2003 Mar-Apr

Review 3.  Quantitative correlation of physical and chemical properties with chemical structure: utility for prediction.

Authors:  Alan R Katritzky; Minati Kuanar; Svetoslav Slavov; C Dennis Hall; Mati Karelson; Iiris Kahn; Dimitar A Dobchev
Journal:  Chem Rev       Date:  2010-10-13       Impact factor: 60.622

4.  Inductive modeling of physico-chemical properties: flash point of alkanes.

Authors:  D Mathieu
Journal:  J Hazard Mater       Date:  2010-03-24       Impact factor: 10.588

5.  On the thermodynamics of solutions; an equation of state; fugacities of gaseous solutions.

Authors:  O REDLICH; J N S KWONG
Journal:  Chem Rev       Date:  1949-02       Impact factor: 60.622

6.  Industrial Requirements for Thermodynamic and Transport Properties: 2020.

Authors:  Georgios M Kontogeorgis; Ralf Dohrn; Ioannis G Economou; Jean-Charles de Hemptinne; Antoon Ten Kate; Susanna Kuitunen; Miranda Mooijer; Ljudmila Fele Žilnik; Velisa Vesovic
Journal:  Ind Eng Chem Res       Date:  2021-03-26       Impact factor: 3.720

7.  Graph convolutional neural network applied to the prediction of normal boiling point.

Authors:  Chen Qu; Anthony J Kearsley; Barry I Schneider; Walid Keyrouz; Thomas C Allison
Journal:  J Mol Graph Model       Date:  2022-02-04       Impact factor: 2.518

8.  Deciphering Antifungal Drug Resistance in Pneumocystis jirovecii DHFR with Molecular Dynamics and Machine Learning.

Authors:  Florian Leidner; Nese Kurt Yilmaz; Celia A Schiffer
Journal:  J Chem Inf Model       Date:  2021-06-17       Impact factor: 6.162

9.  A new model for predicting boiling points of alkanes.

Authors:  Simon Mukwembi; Farai Nyabadza
Journal:  Sci Rep       Date:  2021-12-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.