| Literature DB >> 34123314 |
Spyridon Koutsoukos1, Frederik Philippi1, Francisco Malaret2, Tom Welton1.
Abstract
There are thousands of papers published every year investigating the properties and possible applications of ionic liquids. Industrial use of these exceptional fluids requires adequate understanding of their physical properties, in order to create the ionic liquid that will optimally suit the application. Computational property prediction arose from the urgent need to minimise the time and cost that would be required to experimentally test different combinations of ions. This review discusses the use of machine learning algorithms as property prediction tools for ionic liquids (either as standalone methods or in conjunction with molecular dynamics simulations), presents common problems of training datasets and proposes ways that could lead to more accurate and efficient models. This journal is © The Royal Society of Chemistry.Entities:
Year: 2021 PMID: 34123314 PMCID: PMC8153233 DOI: 10.1039/d1sc01000j
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Fig. 1Web of science search of “Ionic Liquids” and “Machine Learning” (search January 2021).
Fig. 2Categorisation of AI computational methods discussed in this work.
Fig. 3Schematic representation of the promise versus reality of the use of ML for chemical reaction prediction. Reprinted with permission from Kammeraad et al.[25] Copyright 2020 American Chemical Society.
Fig. 4Comparison between conventional ML and DL workflows. Redrawn from Visvikis et al.[33]
Fig. 5Conventional feedforward ANNs (FFANN) (a) differ from DNNs (b) by having only one hidden neuron layer. Bias terms (output of the NNs when input is zero) are not connected in DNN for simplicity.
Fig. 6Structure of a simple DT.
Summary of works using ML methods for prediction of properties in IL
| Property | IL family | Method | Distinct ILs | Training/test set points | Ref. |
|---|---|---|---|---|---|
| Viscosity | Im, Py, Quin, Pyr, Ox, Pip, Mo, Azp, Guan, N, P, S, dicationic | FFANN | 1484 | 11031/613 |
|
| Im, Py, Pyr, N, P | FFANN | 81 | 654/81 |
| |
| Im, AA, N, Guan, Quin, Mo, Ox, P, Pip, Py, Pyr, Pyrr, S | LSSVM | 443 | 1254/418 |
| |
| Im, Py, Pyr, P, Quin, N | FFANN | 66 | 612/124 |
| |
| Im, Py, Pyr, P, N, Mo, Pip, S | ELM (FFANN) | 89 | 1205/297 |
| |
| Im, Py, Pyr, P, N | MLP (FFANN) | 33 | 651/72 |
| |
| Im, N, Py, Pyr, P, Pip, Mo, S, Cprop, Azp, Guan, Trz, Bic, Pz, Thur, Quin, thz, amd, ox, pipz, tetraz | FFANN and LSSVM | 1974 | 1437/159 and 4479/453 |
| |
| Im, Py, N | FFANN | 31 | 327/31 |
| |
| Density | Im | MLP (FFANN) and RBF | n/a | 317/68 |
|
| Im, N, Py, Pyr, P, Pip, Mo, S, Cprop, Azp, Guan, Trz, Bic, Pz, Thur, Quin, thz, amd, ox, pipz, tetraz | MLR, FFANN and LSSVM | 1999 | 5632/625 |
| |
| Im, Py, Pyr | FFANN | 50 | 399/83 |
| |
| Melting point | Trz, Pyr, Py, Pip, P, Mo, Im, N, S | PLSR, SVM, RF, GBM and k-nn | 2212 | 1486/726 |
|
| Im, Py, Pip, P, N | FFANN | 62 | 50/12 |
| |
| Im | Regression trees and SVR | 281 and 134 | 225/22 and 107/13 |
| |
| Trz, Pyr, Py, Pip, P, Mo, Im, N, S | KKR | 2212 | 1770/442 |
| |
| Im, N, P, Py, Pyr, S | PLSR, GBM, Cubist, RF, CART | 467 | 1646/1501 |
| |
| Py | FFANN, DT | 126 | n/a |
| |
| Guan | CPG NN | 101 | 81/20 |
| |
| Py | RNN | 126 | 84/42 |
| |
| Surface tension | Im, Py, P | FFANN | 79 | 616/132 |
|
| Toxicity | Im, Py, Pyr, P, N, Pip, Mo, Quin, S | GFA and LSSVM | 270 | 203/67 |
|
| Im, Py, Pyr, Pip, N, Quin | ELM (FFANN) | 119 | 100/19 |
| |
| Im, Py, Pyr, Pip, P, N, Quin | MLR and ELM | 160 | 128/32 |
| |
| Im, Py, Pyr, Pip, N, P, Mo | CCN and SVM | 292 | 204/88 |
| |
| Im, Py, Pyr, Pip, P, N, Mo | ELM | 142 | 113/29 |
| |
| CO2 solubility | Im, N, P | MLFNN (FFANN) | 144 (pre-trained on H2S) |
| |
| Im, P, Pyr | MLP and ANFIS | 14 | 546/182 |
| |
| Im, N, Py, Pyr | MLR and LSSVM | 21 | 16/5 |
| |
| Im, N, Guan, Py, Pyr, P, Ur | PLSR, CTREE and RF | 158 | 5424/5424 |
| |
| Im, P | LSSVM | 11 | 128/385 |
| |
| Im, P | MLP | 20 | 907/208 |
| |
| Im, Pyr, P | DNN, RNN and CNN | 13 | n/a (ratio 7/3) |
| |
| Im, Pyr, P | MLP | 13 | 595/149 |
| |
| Im, PY, Pyr, P, N | LSSVM, MLR, RF and DT | 36 | 1241/414 |
| |
| Im, Py, Pyr, Pip, N, P, S | FFANN and SVM | 124 | 8093/2023 |
| |
| H2S solubility | Im, N, P | MLFNN (FFANN) | 513/165 |
| |
| Im | MLFNN (FFANN) | 11 | 372/93 |
| |
| Im, N | ELM (FFANN) | 37, 27 | 1025/257 |
| |
| Im | ANFIS, MLP, RBF | 13 | 554/1140 |
| |
| Im | LSSVM | 9 | 590/62 |
| |
| Im | SGB (DT) | 11 | 369/96 |
| |
| Im, N | ELM (FFANN) | 28 | 1055/263 |
|
Explanation of cations abbreviations presented in Table 1. Structures given in the ESI (see ESI)
| Cation names | Cation names |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fig. 7(a) Logarithmic plot of the isomer count for imidazolium cations in the work of Paduszyński;[85] (b) taking into account the 86 different functionalized substituents that are shown in the paper.
Fig. 8(a) Logarithmic plot of the isomer count for ammonium ILs in the work of Paduszyński;[85] (b) taking into account the 64 different functionalized substituents that are shown in the paper.
Fig. 9Interpolation vs. extrapolation with ML of the function |x|0.5 (black line). ML predictions (blue line) were obtained with kernel ridge regression trained on 25 randomly drawn points (red dots) from x ∈ [0; 5]. Reprinted with permission from Pavlo Dral.[141] Copyright 2020 American Chemical Society.
Fig. 10Typical steps of an MD simulation.
Fig. 11Scheme of the general approach to automatically construct a force field using ML, in this case HDNNP. The ML algorithm is trained using the output (forces, energies) of a more expensive higher level method. The simulation is evolved using the MLIP, and re-trained every few steps to avoid extrapolation. Once converged, the computationally inexpensive MLIP can be used for production purposes. Reprinted by Gastegger et al.[233] – published by The Royal Society of Chemistry.
Fig. 12Subtopics of ML applications for chemistry research, categorised by the number of published works. Red: highly underexplored; yellow: some attempts demonstrated; green: fields of major attention. Reprinted by Pflüger and Glorius[267] – Published by John Wiley & Sons.