Literature DB >> 34930927

A new model for predicting boiling points of alkanes.

Simon Mukwembi1, Farai Nyabadza2.   

Abstract

A general perception among researchers is that boiling points, which is a key property in the optimization of lubricant performance, are difficult to predict successfully using a single-parameter model. In this contribution, we propose a new graph parameter which we call, for lack of better terminology, the conduction of a graph. We exploit the conduction of a graph to develop a single-parameter model for predicting the boiling point of any given alkane. The model was used to predict the boiling points for three sets of test data and predicted with a coefficient of determination, [Formula: see text] and 0.6488, respectively. The accuracy of our model compares favourably to the accuracy of experimental data in the literature. Our results have significant implications on the estimation of boiling points of chemical compounds in the absence of experimental data.
© 2021. The Author(s).

Entities:  

Year:  2021        PMID: 34930927      PMCID: PMC8688456          DOI: 10.1038/s41598-021-03541-z

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

A graph is a mathematical object which consists of a finite set V of elements called vertices, together with a set E, of 2-element subsets of V, called the edges of G. As early as 1875, Cayley (see for instance[1]), in his quest to enumerate chemical molecules called alkanes, he made an observation that molecules can be modelled by graphs where atoms are represented by vertices and two vertices are joined by an edge if the corresponding atoms are linked by bonds. This graph model became widely known as the molecular graph. An interesting numeric value attached to each molecule is the boiling point, i.e., the temperature at which a substance has a vapour pressure of 760 mmHg[2]. Whilst the boiling point is a key property in classifying molecules, such as alkanes which dominate mixtures of lubricants in industry, there is generally lack of experimental boiling point data (see, for instance[1] and references cited therein). Nevertheless, although expected to be accurate—experimental boiling points are sometimes inaccurate due to the presence of impurities, and as a result, have wide discrepancies especially for higher boiling points. This has the propensity to lead to wider and elevated boiling point ranges. For instance, the boiling point of cyclooctane (an alkane) is reported to range from to C, and that of methylcycloheptane ranges from to C[2]. The lack of data, and the inaccuracies where data is available, have necessitated the development of boiling point models which can be used to estimate boiling points of chemical compounds for which no boiling point data is available or where inaccuracies exist. A natural foundation on which these models are built are graph parameters[3,4]. The Wiener index of a graph, introduced by Harold Wiener in 1947[5], was the first graph parameter to be used in chemistry. In particular, it was used to predict boiling points of alkanes. Ever since its introduction, a lot of effort was invested in research that gave rise to the development of extensions of the Wiener index[3], and of other indices such as the Hosoya index[3], the Gutman and the Schultz indices[6], and a legion of other distance-based topological indices (see, for example[7]), which are used for the prediction of various properties of molecules. For alkanes, single variable models that are available are either weak[3] or they consider only a special class of the molecules[1]. Burch, Wakefield and Whitehead[1] successfully developed a single variable model to calculate boiling points of special families of alkanes and developed multivariate models that can predict boiling points of all alkanes up to and including those of order 12. Several other models, see for instance[8-10], abound in literature with the most recent one being due to Sandak and Conduit[11] who trained artificial neural networks to predict the physical properties of linear, single branched, and double branched alkanes. Clearly, as Dearden[2], pointed out several years ago, modeling a property is easier when one is dealing with a single chemical class. However, from the point of view of an engineer concerned with a wide range of compounds, the methods of greater interest are those that can adequately model the behavior of varied data sets. In this paper, we are concerned with relating two mathematical objects, associated with a chemical molecule, namely the molecular graph and the numeric value attached to the boiling point of the molecule. We focus on molecular graphs for all alkanes and introduce a new graph parameter which, we define as the ‘conduction’. We use this new parameter, the conduction, to develop a single-variable model that predicts the boiling point of any given alkane. We use[1] as the source for the experimental data of boiling points. This paper is organised as follows: in the next section, we introduce the new graph parameter, the conduction of a graph, and illustrate how it is computed by determining the conduction of four special classes of graphs representing some important series of alkanes. The model is developed in the “Model development”. The main results are presented in “Main results” in which the model is tested. This will be followed by the discussion and the conclusion is “Discussion and conclusion”.

The graph parameter: conduction

Consider a connected graph of order n. The distance between vertices u and v in G is defined as the length of a shortest path joining u and v in G. For vertex v denote by , the breadth-first search tree of G based at v, and let be the set of end vertices of . We select to be the tree that minimizes amongst all breadth-first search trees of G based at v. To contrive the conduction parameter, we envisage heat flowing from one atom of a chemical compound to the other atoms. We assume that in the heat conduction process between atoms (depicted by vertices of a graph), heat is transferred from vertex v outwardly to all other vertices in the graph through contact via the edges of the tree until it reaches the end vertices in . The speed of transfer, conceivably, is governed by the breadth of which can be approximated by the square of the degree, , of vertex v. As a result, in the heat conduction process, let the score s(v) of vertex v be the quantity Mathematically, the conduction of a graph G, c(G),  is defined as For instance, the graph in Fig. 1 has conduction .
Figure 1

Molecular graph for the alkane, 2,2,3,3-tetramethylbutane.

Molecular graph for the alkane, 2,2,3,3-tetramethylbutane. We use four special classes of graphs. The first is the path, , of order n. The broom graph, , is a graph of order n obtained by taking the path and attaching q end vertices to one end of . The second class of graphs we will consider is while the third class is . These classes of graphs, namely , and , represent the class of normal-alkanes, the 2-methyl, and the 2,2-dimethyl series of alkanes, respectively. The fourth class is that of graphs of order n where has fairly large conduction among all graphs of order n, and consists of a path P, and having as many vertices of degree 4 on P as possible. For instance, is given in Fig. 1. Table 1 shows some of the graphs, . In Tables 1, 2, 3, 4, 5, 6, 7, 8 and 9, is the conduction.
Table 1

Some graphs.

NumberAlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_5$$\end{document}E52,2-Dimethylpropane17.69.5
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_6$$\end{document}E62,3-Dimethylbutane23.3358.1
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_7$$\end{document}E72,2,3-Trimethylbutane33.7181
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_8$$\end{document}E82,2,3,3-Tetramethylbutane45.75106.5
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_9$$\end{document}E92,3,3,4-Tetramethylpentane52141.5
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_{10}$$\end{document}E102,2,3,3,4-Pentamethylpentane66.3161.1
Table 2

Data for 5 alkanes with 6 carbon atoms.

NumberAlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
1Hexane1568.8
23-Methylpentane18.8363.3
32-Methylpentane19.560.9
42,2-Dimethylbutane2349.8
52,3-Dimethylbutane23.3358.1
Table 3

Data for 11, normal-alkanes.

NumberAlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
1Butane7.50
2Pentane11.236.1
3Hexane1568.8
4Heptane18.8571498.38
5Octane22.75125.6
6Nonane26.66667150.7
7Decane30.6174.1
8Undecane34.54545195
9Dodecane38.5216
10Tridecane42.46154234
11Tetradecane46.42857255
Table 4

Data for 8, 2-methyl series.

NumberAlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
12-Methylbutane14.527.4
22-Methylpentane19.560.9
32-Methylhexane24.8571490.1
42-Methylheptane30.375117.6
52-Methyloctane36143
62-Methylnonane41.7166.9
72-Methyldecane47.45455189.3
82-Methylundecane53.25208.9
92-Methyldodecane59.07692229.5
102-Methyltridecane64.92857247.9
Table 5

Data for 8, 2,2-dimethyl series.

NumberAlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
12,2-Dimethylpropane17.69.5
22,2-Dimethylbutane2349.8
32,2-Dimethylpentane29.1428679.2
42,2-Dimethylhexane35.75106.9
52,2-Dimethylheptane42.66667131.9
62,2-Dimethyloctane49.8154
72,2-Dimethylnonane
82,2-Dimethyldecane64.5200.12
92,2-Dimethylundecane72220
102,2-Dimethyldodecane
Table 6

Data for 4 alkanes of order n.

AlkanecBoiling point in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_n$$\end{document}Pn\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_1$$\end{document}x1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_1=\alpha \left( x_1\right) +\beta + \alpha _1\log _{\alpha _2}\left[ \alpha _3\left( x_1\right) +\alpha _4\right] $$\end{document}y1=αx1+β+α1logα2α3x1+α4
Where constants are as given in (4)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{n,2}$$\end{document}Bn,2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_2$$\end{document}x2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_2=\alpha \left( x_2\right) +\beta + \alpha _1\log _{\alpha _2}\left[ \alpha _3\left( x_2\right) +\alpha _4\right] $$\end{document}y2=αx2+β+α1logα2α3x2+α4
Where constants are as given in (5)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{n,3}$$\end{document}Bn,3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_3$$\end{document}x3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_3=\alpha \left( x_3\right) +\beta + \alpha _1\log _{\alpha _2}\left[ \alpha _3\left( x_3\right) +\alpha _4\right] $$\end{document}y3=αx3+β+α1logα2α3x3+α4
Where constants are as given in (6)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$E_{n}$$\end{document}En\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$x_4$$\end{document}x4\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y_4=\alpha \left( x_4\right) +\beta + \alpha _1\log _{\alpha _2}\left[ \alpha _3\left( x_4\right) +\alpha _4\right] $$\end{document}y4=αx4+β+α1logα2α3x4+α4
Where constants are as given in (7)
Table 7

All alkanes of order 6: Predictive ability of model (8).

NumberAlkanecBp in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}CPredicted
12-Methylpentane19.560.959.8
23-Methylpentane18.863.361.1
32,2-Dimethylbutane2349.853.1
42,3-Dimethylbutane23.358.152.4
5Hexane1568.868.5
Table 8

All alkanes of order 6: Predictive ability of model (8).

NumberAlkanecBp in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}CPredicted
12,2,3-Trimethylbutane33.7180.980.7
23-Ethylpentane23.1493.592.1
33,3-Dimethylpentane2886.186.9
42,4-Dimethylpentane30.8680.683.8
52,3-Dimethylpentane28.5789.886.2
62,2-Dimethylpentane29.1479.285.6
72-Methylhexane24.8690.190.3
Figure 9

Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 8.

Some graphs. A simple calculation establishes the following formulae.

Proposition 2.1

Let . Then , , Using data in Table 1, we can approximate the conduction of by an equation of the formwhere and are constants. Fitting data in Table 1 to Eq. (1), we get the following equation with a coefficient of determination, CoD, We discuss in detail the concept of the CoD is “Model quality assessment”.

Proposition 2.2

Let . Then

Model development

Model quality assessment

In the development process of our model, we adopt the folkore method, goodness of fit, of determining the quality of the model. After performing a fitting process, it is important to determine the goodness of the fit. While there are many methods of determining the goodness of a fit, in this paper we evaluate the residuals and make a plot of the residuals. The evaluation of the residuals is important in determining the goodness of fit of our model to data. We use the CoD, that is used to explain the variability between our model output and the data. This coefficient is commonly known as R-squared, (). The formula of coefficient of determination is given by:where, , and A CoD value of 1 indicates a perfect fit, and thus the model can be deemed very reliable for any future forecasts, while a value of 0 indicates that the model does not accurately model the data. Some researchers have argued that it is better to look at adjusted R-squared rather than the R-squared, but for the work presented in this paper, it suffices to use the non-adjusted R-squared.

Problem statement

Let be an alkane with n carbon atoms. Our problem is to estimate the boiling point of the alkane, . We will represent this by a graph, , of order n. We will often use and interchangeably. We will thus denote the boiling point of by or

Model when sufficient data is available

We first consider a situation in which there is sufficient data on the boiling points of a group of alkanes. The general observation is that the conduction values of alkanes of order n have a global linear relationship with the experimental boiling points and there are localised oscillations about the regression line where the diameter of the oscillations decreases as conduction increases. In general, as a first approximation, to find , we propose a model of the form:where is the conduction of graph and and are constants to be determined by fitting the model to boiling points data of alkanes of order n. To improve the fit and accuracy, we propose a combination of linear fit with logarithmic and trigonometric functions. The logarithmic and trigonometric components are incorporated to capture the oscillatory tendencies. We thus propose a function of the form,where , , , are constants to be determined by fitting the model to data of alkanes of order n. To illustrate the application of (3), we consider the data set depicted in Table 2 of all alkanes of order 6. Data for 5 alkanes with 6 carbon atoms. While a linear model, (2), fits the data with and a CoD value of our model, (3), fits the data withand a CoD value of . The graphs of the fit using the least squares (lsq) fitting method and the residuals are given in Fig. 2. It is clear that the proposed function (3) fits well to the data in Table 2 with very low values of the residuals.
Figure 2

shows the model fit with a 95% confidence interval and the residuals.

shows the model fit with a 95% confidence interval and the residuals.

Model with insufficient data

Our model to estimate the boiling point of an alkane of order n depends on the availability of data for alkanes of order n. In the absence of such data, our initial step will be to develop the data. Once we develop the data, we can then use (2) [or (3), for more accuracy] to find the equation that estimates the boiling point to a reasonable degree of accuracy (namely, coefficient of determination values of and 0, 6488 for three test data sets, respectively). We cover this below.

Generating data

We need to generate data for some alkanes of order n as more data yields better accuracy during the fitting process. The graphs , , and are all alkanes of order n we estimate their boiling points in turn below.

Boiling point for

Comparing the boiling points of normal alkanes with their conduction values, we see that growth of boiling points of the normal-alkanes is linear in conduction values but with an additional logarithmic increase added to it. We, thus, propose the modelwhere , , , are constants to be determined by fitting the model to data of normal-alkanes. From Proposition 2.1, this reduces to Fitting (4) to the data of the first few normal-alkanes given in Table 3, we obtain the values of the constants aswith a CoD value of . Data for 11, normal-alkanes. The graph and the plot of residuals are given in Fig. 3.
Figure 3

Shows the model fit and the residuals.

Shows the model fit and the residuals. The proposed function (4) fits well to the data in Table 3 and the residual values are also very low. As in the case of normal-alkanes, we propose the modelwhere , , , are constants to be determined by fitting the model to data of 2-methyl series of alkanes. From Proposition 2.1, this reduces to Fitting (5) to the data of the first few 2-methyl series given in Table 4, we obtain the values of the constants aswith a CoD value of . Data for 8, 2-methyl series. The graph and the plot of residuals are given in Fig. 4.
Figure 4

Shows the model fit and the residuals.

Shows the model fit and the residuals. The function (5) produces a perfect fit to the data in Table 4 and the resulting residual values are very low. similar observations are made for Figs. 5 and 6.
Figure 5

Shows the model fit and the residuals.

Figure 6

Shows the model fit and the residuals.

As in the case of normal-alkanes, we propose the modelwhere , , , are constants to be determined by fitting the model to data of 2-dimethyl series of alkanes. From Proposition 2.1, this reduces to Fitting (6) to the data of the first few 2-dimethyl series given in Table 5, we obtain the values of the constants aswith a CoD value of . Data for 8, 2,2-dimethyl series. The graph and the plot of residuals are given in Fig. 5. Shows the model fit and the residuals. As in the case of normal-alkanes, we propose the modelwhere , , , are constants to be determined by fitting the model to data given in Table 1. From Proposition 2.2, this reduces to Fitting (7) to the data given in Table 1, we obtain the values of the constants aswith a CoD value of . The graph and the plot of residuals are given in Fig. 6. Shows the model fit and the residuals.

Main results

From the previous sections, we have now generated data for boiling points of some alkanes with n carbon atoms. LetandWe present the data in Table 6. Data for 4 alkanes of order n. To find , we propose the lsq model:whereand

Model testing

We first test the predictive ability of our model, (8), on all alkanes of order 6. We present the results in Table 7. In Tables 7, 8 and 9, Bp in C means the experimental boiling points in degree Celsius, while predicted means the boiling points predicted by our model. All alkanes of order 6: Predictive ability of model (8). The CoD value is and the graphs are presented in Fig. 7 below.
Figure 7

Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 6.

Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 6. Next we test the predictive ability of our model, (8), on all alkanes of order 7. We present the results in Table 8. All alkanes of order 6: Predictive ability of model (8). The CoD value is and the graphs are presented in Fig. 8 below.
Figure 8

Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 7.

Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 7. We now test the predictive power of our model, (8), on alkanes of order 8. We present the results in Table 9.
Table 9

All alkanes of order 8: Predictive ability of model (8).

NumberAlkanecBp in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^\circ $$\end{document}CPredicted
12,2,3,3-Tetramethylbutane45.75106.5106.7
22,2,3-Trimethylpentane39.625110111.1
32,3,3-Trimethylpentane39.125114.7111.4
42,2,4-Trimethylpentane43.37599.2108.4
52,2-Dimethylhexane35.75106.9113.8
63,3-Dimethylhexane33.75112115.2
73-Ethyl-3-methylpentane32.75118.2115.9
82,3,4-Trimethylpentane40.625113.4110.4
92,3-Dimethy lhexane34.5115.6114.7
103-Ethyl-2-methylpentane33.5115.6115.4
113,4-Dimethylhexane33.5117.7115.4
122,4-Dimethylhexane36109.4113
132,5-Dimethylhexane38.5109111.9
142-Methylheptane30.375117.6117.6
153-Methylheptane25.875118120.8
164-Methylheptane28.375117.7119
173-Ethy lhexane27.875118.5119.4
18Octane22.75125.7123
All alkanes of order 8: Predictive ability of model (8). The CoD value is and the graphs are presented in Fig. 9 below. Shows a comparison of the linear least squares fit to experimental data and the linear model (8) for all alkanes of order 8.

Discussion and conclusion

In this paper we provide a novel model for predicting the boiling points of alkanes. We consider alkanes of orders 6, 7 and 8 to test the usefulness of our model. We consider alkanes with n carbon atoms represented by a graph whose boiling points may be known experimentally. In particular, the methods presented here have the ability to develop the boiling points data using the conduction parameter, which we defined as the conduction of a graph. The nature of the boiling points data necessitated the use of a combination of trigonometric and logarithmic functions in coming up with models that fit to the data. The models are then used to predict the boiling points of alkanes whose number of carbon atoms are known. The model presented in this article is without inculpability. First we fit linear models to experimental data that has very few number of data points in some instances. Second, some boiling points of certain compounds could not be ascertained from literature and last, the approximation of the numerical values of some parameters give rise to variability in the model predictions. Despite these shortcomings, the model presents some strengths to literature. Firstly, our single-variable model puts to rest the general perception among researchers that boiling points are difficult to predict successfully using a single-parameter model[3,4]. Our key contribution has therefore been the development of a new parameter, the conduction of a graph, which could adequately capture the boiling points. In predicting boiling points, the conduction of a graph has proved to be more superior than previously considered parameters such as the Wiener index, and other commonly used topological indices (see, for example[3]). In light of this, it will be interesting for future research to see how the conduction of a graph relates, mathematically, to the other indices such as the Wiener index. Secondly, as Sandak and Conduit[11] puts it, an engineer wants a model that accurately predicts boiling points of the full range of alkanes. Whilst existing models have been successful in predicting boiling points for only a sub-class of alkanes, our model predicts boiling points for the full range of alkanes. As seen above, our model for the considered data sets successfully predicted boiling points with CoD values, and 0.6488. Whilst these CoD values are considered satisfactory, one way of improving accuracy of the model is to include more special graphs that generate data for alkanes of order n (see, for instance Table 6).
  4 in total

Review 1.  Quantitative structure-property relationships for prediction of boiling point, vapor pressure, and melting point.

Authors:  John C Dearden
Journal:  Environ Toxicol Chem       Date:  2003-08       Impact factor: 3.742

2.  Structural determination of paraffin boiling points.

Authors:  H WIENER
Journal:  J Am Chem Soc       Date:  1947-01       Impact factor: 15.419

Review 3.  Principal component analysis: a review and recent developments.

Authors:  Ian T Jolliffe; Jorge Cadima
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2016-04-13       Impact factor: 4.226

4.  Neural network based quantitative structural property relations (QSPRs) for predicting boiling points of aliphatic hydrocarbons

Authors: 
Journal:  J Chem Inf Comput Sci       Date:  2000-05
  4 in total
  1 in total

1.  General Equation to Express Changes in the Physicochemical Properties of Organic Homologues.

Authors:  Chao-Tun Cao; Chenzhong Cao
Journal:  ACS Omega       Date:  2022-07-25
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.