| Literature DB >> 32825028 |
Meade E Erickson1, Marvellous Ngongang1, Bakhtiyor Rasulev1.
Abstract
Predicting the activities and properties of materials via in silico methods has been shown to be a cost- and time-effective way of aiding chemists in synthesizing materials with desired properties. Refractive index (n) is one of the most important defining characteristics of an optical material. Presented in this work is a quantitative structure-property relationship (QSPR) model that was developed to predict the refractive index for a diverse set of polymers. A number of models were created, where a four-variable model showed the best predictive performance with R2 = 0.904 and Q2LOO = 0.897. The robustness and predictability of the best model was validated using the leave-one-out technique, external set and y-scrambling methods. The predictive ability of the model was confirmed with the external set, showing the R2ext = 0.880. For the refractive index, the ionization potential, polarizability, 2D and 3D geometrical descriptors were the most influential properties. The developed model was transparent and mechanistically explainable and can be used in the prediction of the refractive index for new and untested polymers.Entities:
Keywords: QSAR; QSPR; descriptors; ionization potential; polarizability; polymers; refractive index
Mesh:
Substances:
Year: 2020 PMID: 32825028 PMCID: PMC7503810 DOI: 10.3390/molecules25173772
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1(a) Correlation plot of the experimental refractive index data vs. that predicted by the model; (b) Williams plot; and (c) the y-scrambling plot, where the red dots are for the R values and the yellow dots are for Q values.
List of the best models with 1–5 variables.
| Variable | Equations |
|
|
| RMSEtr | r2m_ave | R2ext | RMSEext | CCCcv |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Log(Ri) = −0.1124 Mi + 0.205 | 0.676 | 0.675 | 0.665 | 0.012 | 0.559 | 0.684 | 0.010 | 0.802 |
| 2 | Log(Ri) = −0.080 Mi – 0.049 GATS1p + 0.222 | 0.847 | 0.845 | 0.840 | 0.008 | 0.752 | 0.867 | 0.009 | 0.913 |
| 3 | Log(Ri) = –0.046 GATS1p – 0.029 VR2_G – 0.064 R2s +0.228 | 0.883 | 0.881 | 0.876 | 0.007 | 0.847 | 0.921 | 0.008 | 0.934 |
| 4 | Log(Ri) = −0.068 Mi + 0.031 SpMAD_A – 0.048 GATS1p + 0.040 WiA_RG + 0.191 | 0.904 | 0.902 | 0.897 | 0.007 | 0.842 | 0.880 | 0.008 | 0.946 |
| 5 | Log(Ri) = –0.017 SpMAD_D/DT – 0.039 GATS1p – 0.019 VR2_G + 0.009 G2m – 0.075 H0i + 0.238 | 0.907 | 0.903 | 0.899 | 0.006 | 0.886 | 0.823 | 0.009 | 0.938 |
Description and statistical coefficients of the descriptors in the 4-variable model.
| Name of Descriptor | Coefficient | Std. Coeff. | Co. Int. 95% | Description | Type of Descriptor |
|---|---|---|---|---|---|
| Intercept | 0.191 | ------ | 0.009 | ------ | ------ |
| Mi | −0.068 | −0.491 | 0.009 | mean first ionization potential (scaled on Carbon atom) | Constitutional indices |
| GATS1p | −0.048 | −0.443 | 0.006 | Geary autocorrelation of lag 1 weighted by polarizability | 2D autocorrelations |
| WiA_RG | 0.040 | 0.266 | 0.009 | average Wiener-like index from reciprocal squared geometrical matrix | 3D matrix-based descriptors |
| SpMAD_A | 0.031 | 0.281 | 0.009 | spectral mean absolute deviation from adjacency matrix | 2D matrix-based descriptors |