| Literature DB >> 34266497 |
Ruben Pawellek1, Jovana Krmar2, Adrian Leistner1, Nevena Djajić2, Biljana Otašević2, Ana Protić3, Ulrike Holzgrabe4.
Abstract
The charged aerosol detector (CAD) is the latest representative of aerosol-based detectors that generate a response independent of the analytes' chemical structure. This study was aimed at accurately predicting the CAD response of homologous fatty acids under varying experimental conditions. Fatty acids from C12 to C18 were used as model substances due to semivolatile characterics that caused non-uniform CAD behaviour. Considering both experimental conditions and molecular descriptors, a mixed quantitative structure-property relationship (QSPR) modeling was performed using Gradient Boosted Trees (GBT). The ensemble of 10 decisions trees (learning rate set at 0.55, the maximal depth set at 5, and the sample rate set at 1.0) was able to explain approximately 99% (Q2: 0.987, RMSE: 0.051) of the observed variance in CAD responses. Validation using an external test compound confirmed the high predictive ability of the model established (R2: 0.990, RMSEP: 0.050). With respect to the intrinsic attribute selection strategy, GBT used almost all independent variables during model building. Finally, it attributed the highest importance to the power function value, the flow rate of the mobile phase, evaporation temperature, the content of the organic solvent in the mobile phase and the molecular descriptors such as molecular weight (MW), Radial Distribution Function-080/weighted by mass (RDF080m) and average coefficient of the last eigenvector from distance/detour matrix (Ve2_D/Dt). The identification of the factors most relevant to the CAD responsiveness has contributed to a better understanding of the underlying mechanisms of signal generation. An increased CAD response that was obtained for acetone as organic modifier demonstrated its potential to replace the more expensive and environmentally harmful acetonitrile.Entities:
Keywords: Charged aerosol detector (CAD); Fatty acids; Gradient boosted trees (GBT); High-performance liquid chromatography (HPLC); Quantitative structure–property relationship modeling (QSPR)
Year: 2021 PMID: 34266497 PMCID: PMC8281619 DOI: 10.1186/s13321-021-00532-0
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Structural formulas of the seven fatty acids utilized as model substances, with the corresponding vapor pressure and boiling point values
Fig. 2Average CAD response for the fatty acids investigated obtained from FIA. The fatty acids are coded with their respective number of C-atoms. The degree of unsaturation is indicated in brackets where applicable
Fig. 3a Regression plot of the optimized GBT-QSPR model. b Residual plot of the optimized GBT-QSPR model
Fig. 4The independent variables (y-axis) and their importance (x-axis) toward CAD response
Fig. 5Graphs showing the relationships between the predicted CAD response and a flow rate, b evaporation temperature, c type of organic solvent: 1—ACN; 2—MeOH; 3—Acetone; 4—EtOH, d the content of organic solvent in the mobile phase (v/v)
Fig. 6Effect of the MW of the fatty acids investigated on the CAD response
Fig. 7Geometrical center of the 3D represented myristic acid