| Literature DB >> 35010109 |
Lifeng Li1, Zenan Shi1, Hong Liang1, Jie Liu2, Zhiwei Qiao1.
Abstract
Atmospheric water harvesting by strong adsorbents is a feasible method of solving the shortage of water resources, especially for arid regions. In this study, a machine learning (ML)-assisted high-throughput computational screening is employed to calculate the capture of H2O from N2 and O2 for 6013 computation-ready, experimental metal-organic frameworks (CoRE-MOFs) and 137,953 hypothetical MOFs (hMOFs). Through the univariate analysis of MOF structure-performance relationships, Qst is shown to be a key descriptor. Moreover, three ML algorithms (random forest, gradient boosted regression trees, and neighbor component analysis (NCA)) are applied to hunt for the complicated interrelation between six descriptors and performance. After the optimizing strategy of grid search and five-fold cross-validation is performed, three ML can effectively build the predictive model for CoRE-MOFs, and the accuracy R2 of NCA can reach 0.97. In addition, based on the relative importance of the descriptors by ML, it can be quantitatively concluded that the Qst is dominant in governing the capture of H2O. Besides, the NCA model trained by 6013 CoRE-MOFs can predict the selectivity of hMOFs with a R2 of 0.86, which is more universal than other models. Finally, 10 CoRE-MOFs and 10 hMOFs with high performance are identified. The computational screening and prediction of ML could provide guidance and inspiration for the development of materials for water harvesting in the atmosphere.Entities:
Keywords: absorption; algorithm; metal-organic frameworks; molecular simulation; water harvesting
Year: 2022 PMID: 35010109 PMCID: PMC8746952 DOI: 10.3390/nano12010159
Source DB: PubMed Journal: Nanomaterials (Basel) ISSN: 2079-4991 Impact factor: 5.076
Figure 1The relationship of selectivity S0[H versus (a) LCD, (b) VSA, (c) ρ, and (d) PLD. The color code represents void fraction ϕ.
Figure 2The relationship of selectivity S0[H versus (a) Qst, (b) KH2O. The color code represents void fraction ϕ.
Figure 3Model (a) NCA, (b) GBRT, and (c) RF trained by 6013 CoRE-MOFs. The color represents a base-e logarithm of the number of MOFs.
Figure 4Relative importance of the six descriptors on the selectivity of MOFs by three ML prediction.
Figure 5Model (a) NCA, (b) GBRT, and (c) RF prediction for 10,000 hMOFs. The color represents a base-e logarithm of the number of MOFs.
Top 10 CoRE-MOFs with optimal performance of water harvesting.
| No. | CSD Code | LCD (nm) |
| VSA | PLD (nm) |
| |||
|---|---|---|---|---|---|---|---|---|---|
|
| QUTHAP | 0.569 | 0.44 | 654.97 | 0.441 | 1257.79 | 479.91 ± 8.31 | 2.78 × 10124 | 4.14 × 10128 |
|
| CAJWIV | 0.620 | 0.49 | 856.37 | 0.380 | 1078.91 | 307.79 ± 10.19 | 4.30 × 1077 | 6.85 × 1082 |
|
| PIBLUJ | 0.595 | 0.39 | 473.39 | 0.352 | 1304.13 | 261.19 ± 5.98 | 5.81 × 1041 | 2.35 × 1046 |
|
| LIRVAK | 0.355 | 0.22 | 17.88 | 0.301 | 1535.50 | 318.82 ± 7.42 | 7.16 × 1044 | 3.11 × 1045 |
|
| HUZSUR01 | 0.988 | 0.62 | 1422.66 | 0.867 | 842.16 | 255.45 ± 8.23 | 3.18 × 1039 | 2.48 × 1043 |
|
| HEWFUL | 0.558 | 0.16 | 293.03 | 0.494 | 1665.78 | 271.03 ± 3.11 | 1.63 × 1036 | 1.31 × 1042 |
|
| YUJWAD | 0.388 | 0.26 | 16.62 | 0.264 | 1429.41 | 265.55 ± 9.25 | 6.35 × 1036 | 9.73 × 1041 |
|
| YUJWAD01 | 0.398 | 0.28 | 42.73 | 0.270 | 1409.42 | 261.01 ± 9.77 | 1.73 × 1036 | 1.30 × 1041 |
|
| - | 0.384 | 0.26 | 10.46 | 0.271 | 1414.41 | 254.38 ± 9.03 | 1.85 × 1034 | 4.26 × 1039 |
|
| ECUFEP | 0.532 | 0.22 | 491.72 | 0.447 | 2912.23 | 247.36 ± 7.13 | 2.53 × 1032 | 1.17 × 1039 |
CSD code is the number of MOFs in the Cambridge Structural Database. This MOF came from Tominaka et al.’s work [60].