| Literature DB >> 31336981 |
J Jay Liu1, Alham Alipuly1, Tomasz Bączek2, Ming Wah Wong3, Petar Žuvela4.
Abstract
In this work, we employed a non-linear programming (NLP) approach via quantitative structure-retention relationships (QSRRs) modelling for prediction of elution order in reversed phase-liquid chromatography. With our rapid and efficient approach, error in prediction of retention time is sacrificed in favor of decreasing the error in elution order. Two case studies were evaluated: (i) analysis of 62 organic molecules on the Supelcosil LC-18 column; and (ii) analysis of 98 synthetic peptides on seven reversed phase-liquid chromatography (RP-LC) columns with varied gradients and column temperatures. On average across all the columns, all the chromatographic conditions and all the case studies, percentage root mean square error (%RMSE) of retention time exhibited a relative increase of 29.13%, while the %RMSE of elution order a relative decrease of 37.29%. Therefore, sacrificing %RMSE(tR) led to a considerable increase in the elution order predictive ability of the QSRR models across all the case studies. Results of our preliminary study show that the real value of the developed NLP-based method lies in its ability to easily obtain better-performing QSRR models that can accurately predict both retention time and elution order, even for complex mixtures, such as proteomics and metabolomics mixtures.Entities:
Keywords: chromatography; elution order prediction; non-linear programming (NLP); quantitative structure-retention relationships (QSRR); reversed phase-liquid chromatography (RP-LC)
Mesh:
Year: 2019 PMID: 31336981 PMCID: PMC6678770 DOI: 10.3390/ijms20143443
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Schematic depiction of the non-linear programming (NLP)-based elution order prediction methodology. Abbreviations (in order of appearance): RP-LC—reversed-phase liquid chromatography, RMSE—root mean square error, QSRR—quantitative structure-retention relationships.
Figure 2QSRR model performance expressed in terms of %RMSE(tR) for MLR (control) and MLR-NLP models. Legend: Kal—Supelcosil LC18, tG = 10 min, T = 35 °C (case study 1); Xt—Xterra, tG = 20 min, T = 40 °C; L1—Licrospher, tG = 20 min, T = 40 °C; L2—tG = 60 min, T = 40 °C; L3—tG = 120 min, T = 40 °C; L4—tG = 20 min, T = 60 °C; L5—tG = 20 min, T = 80 °C; L6—Licrospher CN, tG = 20 min, T = 40 °C; P1—PRP, tG = 20 min, T = 40 °C; P2—tG = 60 min, T = 40 °C; P3—tG = 20 min, T = 60 °C; P4—tG = 60 min, T = 60 °C; P5—tG = 20 min, T = 80 °C; P6—tG = 60 min, T = 80 °C; D1—Discovery RP-Amide C-16, tG = 20 min, T = 40 °C; D2—tG = 20 min, T = 60 °C; D3—tG = 20 min, T = 80 °C; D4—Discovery HS F5-3, tG = 20 min, T = 40 °C; C1—Chromolith, tG = 20 min, T = 40 °C (case study 2). Abbreviations: QSRR—quantitative structure-retention relationships; %RMSE(tR)—percentage root mean square error of retention time.
Figure 3Distribution of %RMSE (order) values of MLR (control) and MLR-NLP models. The legend for the X-axis is analogous to the one in Figure 2.
Summary of the paired t-test for all the QSRR model performances for all the columns between the two approaches (MLR and MLR-NLP).
| Statistics | %RMSE(tR) MLR | %RMSE(tR) MLR-NLP |
|---|---|---|
| Mean | 26.635 | 36.848 |
| Variance | 135.67 | 490.97 |
| Observations | 19 | 19 |
| Pearson Correlation | 0.961 | |
| Df | 18 | |
| t Stat | −3.897 | |
| P(T<=t) one-tail | 0.00053 | |
| t Critical one-tail | 1.734 | |
| P(T<=t) two-tail | 0.00106 | |
| t Critical two-tail | 2.100 |
Figure 4Relative difference in retention time and elution order %RMSE values between MLR and MLR-NLP models. The legend for the X-axis is analogous to the one in Figure 2. Abbreviations: %RMSE—percentage root mean square error, MLR—multiple linear regression, MLR-NLP—MLR–non-linear programming.
Summary of model performances for the first and second case studies.
| CS a | Column | Analysis Parameters b | Model | %RMSE( | %RMSE(order) |
|---|---|---|---|---|---|
| I | Supelcosil | MLR (control) | 8.57 | 59.07 | |
| MLR-NLP | 8.07 | 51.77 | |||
| II | Xterra | MLR (control) | 11.50 | 25.01 | |
| MLR-NLP | 15.17 | 22.40 | |||
| II | Licrospher | MLR (control) | 13.25 | 30.28 | |
| MLR-NLP | 12.42 | 39.59 | |||
| II | Licrospher | MLR (control) | 25.60 | 34.11 | |
| MLR-NLP | 37.94 | 30.10 | |||
| II | Licrospher | MLR (control) | 42.31 | 153.00 | |
| MLR-NLP | 85.62 | 25.17 | |||
| II | Licrospher | MLR (control) | 18.45 | 36.12 | |
| MLR-NLP | 16.86 | 40.70 | |||
| II | Licrospher | MLR (control) | 18.82 | 35.25 | |
| MLR-NLP | 21.06 | 34.65 | |||
| II | Licrospher | MLR (control) | 39.28 | 195.82 | |
| MLR-NLP | 55.53 | 53.45 | |||
| II | PRP | MLR (control) | 20.07 | 69.44 | |
| MLR-NLP | 20.72 | 58.09 | |||
| II | PRP | MLR (control) | 37.92 | 107.94 | |
| MLR-NLP | 52.40 | 41.33 | |||
| II | PRP | MLR (control) | 21.75 | 94.97 | |
| MLR-NLP | 24.06 | 82.54 | |||
| II | PRP | MLR (control) | 40.11 | 321.65 | |
| MLR-NLP | 54.35 | 37.16 | |||
| II | PRP | MLR (control) | 22.36 | 137.16 | |
| MLR-NLP | 26.19 | 53.30 | |||
| II | PRP | MLR (control) | 42.60 | 194.56 | |
| MLR-NLP | 61.56 | 40.18 | |||
| II | Discovery | MLR (control) | 36.73 | 261.22 | |
| MLR-NLP | 58.07 | 91.81 | |||
| II | Discovery | MLR (control) | 36.37 | 219.01 | |
| MLR-NLP | 57.16 | 96.70 | |||
| II | Discovery | MLR (control) | 36.74 | 241.63 | |
| MLR-NLP | 54.75 | 81.05 | |||
| II | Discovery | MLR (control) | 12.81 | 34.00 | |
| MLR-NLP | 13.84 | 28.12 | |||
| II | Chromolith | MLR (control) | 20.82 | 43.81 | |
| MLR-NLP | 24.36 | 28.55 |
a CS—case study; b tG—gradient retention time; MLR—multiple linear regression; MLR-NLP—multiple linear regression–non-linear programming.
Figure 5Performance of the MLR-NLP method for prediction of (A) retention time, (B) elution order, and (C) applicability domain for case study 1 (separation of organic molecules using Supelcosil LC, tG = 10 min, T = 35 °C), (D) prediction of retention time, (E) elution order, and (F) applicability domain for case study 2 (separation of synthetic peptides on Xterra, tG = 20 min, T = 40 °C). Abbreviations: tG—gradient retention time, T—temperature.