| Literature DB >> 30155120 |
Jyoti K Gupta1, Dave J Adams1, Neil G Berry1.
Abstract
The self-assembly of low molecular weight gelators to form gels has enormous potential for cell culturing, optoelectronics, sensing, and for the preparation of structured materials. There is an enormous "chemical space" of gelators. Even within one class, functionalised dipeptides, there are many structures based on both natural and unnatural amino acids that can be proposed and there is a need for methods that can successfully predict the gelation propensity of such molecules. We have successfully developed computational models, based on experimental data, which are robust and are able to identify in silico dipeptide structures that can form gels. A virtual computational screen of 2025 dipeptide candidates identified 9 dipeptides that were synthesised and tested. Every one of the 9 dipeptides synthesised and tested were correctly predicted for their gelation properties. This approach and set of tools enables the "dipeptide space" to be searched effectively and efficiently in order to deliver novel gelator molecules.Entities:
Year: 2016 PMID: 30155120 PMCID: PMC6016447 DOI: 10.1039/c6sc00722h
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Fig. 1Generic structure of library (AA – amino acid); see ESI† for specific structures.
Fig. 2Overall QSPR modelling, synthesis and testing workflow.
Optimisation and performance statistics of the QSPR models developed for the training set
| Method | Resampling results of optimal model | Performance of optimal model on training set | ||||
|
| Kappa | Balanced accuracy |
|
| Overall quality of model | |
| SVM | 0.764 ± 0.28 | 0.941 | 0.971 | 2.04 × 10–9 | 1 |
|
| RF | 0.771 ± 0.22 | 0.941 | 0.971 | 2.04 × 10–9 | 1 |
|
|
| 0.570 ± 0.26 | 0.824 | 0.912 | 3.83 × 10–7 | 0.738 |
|
| NN | 0.774 ± 0.24 | 0.941 | 0.971 | 2.04 × 10–9 | 0.907 |
|
| PLS | 0.751 ± 0.22 | 0.529 | 0.765 | 1.47 × 10–3 | 0.761 |
|
| NB | 0.701 ± 0.24 | 0.765 | 0.882 | 3.08 × 10–6 | 0.761 |
|
| C5.0 | 0.646 ± 0.25 | 1 | 1 | 5.82 × 10–11 | 1 |
|
Performance on the models predicting the gelator properties of the 12 external test set compounds within the model domain of applicability. Green – meets criteria. Red – fails criteria. (Criteria for good: kappa > 0.4, balanced accuracy > 0.7, H > 0.6)
| Method | Performance on external test set of 14 compounds in models applicability domain | |||
| Kappa | Balanced accuracy |
| Quality of predictions | |
| SVM | 0.417 | 0.708 | 0.703 |
|
| RF | 0.759 | 0.958 | 1.000 |
|
|
| 0.286 | 0.7941 | 0.311 |
|
| NN | 0.462 | 0.875 | 1.000 |
|
| PLS | 0.177 | 0.625 | 0.526 |
|
| NB | 0.286 | 0.791 | 0.526 |
|
| C5.0 | 0.103 | 0.583 | 0.334 |
|
Fig. 3ROC curves for the SVM (), RF () and NN () models (RF and NN plots lie on top of each other).
Structures of molecules predicted, synthesized and tested for gelation property. % likelihood is the average probability from SVM, RF and NN models that the prediction is as indicated
| Compound | Prediction (% likelihood) | Measurement |
|
| No (85%) | No |
|
| No (85%) | No |
|
| No (85%) | No |
|
| No (82%) | No |
|
| No (83%) | No |
|
| Yes (83%) | Yes |
|
| Yes (75%) | Yes |
|
| Yes (79%) | Yes |
|
| Yes (63%) | Yes |