| Literature DB >> 30897779 |
Wenyuan Yang1, Hong Liang2, Feng Peng3,4, Zili Liu5, Jie Liu6, Zhiwei Qiao7,8.
Abstract
The Monte Carlo and molecular dynamics simulations are employed to screen the separation performance of 6013 computation-ready, experimental metal⁻organic framework membranes (CoRE-MOFMs) for 15 binary gas mixtures. After the univariate analysis, principal component analysis is used to reduce 44 performance metrics of 15 mixtures to a 10-dimension set. Then, four machine learning algorithms (decision tree, random forest, support vector machine, and back propagation neural network) are combined with k times repeated k-fold cross-validation to predict and analyze the relationships between six structural feature descriptors and 10 principal components. Based on the linear correlation value R and the root mean square error predicted by the machine learning algorithm, the random forest algorithm is the most suitable for the prediction of the separation performance of CoRE-MOFMs. One descriptor, pore limiting diameter, possesses the highest weight importance for each principal component index. Finally, the 30 best CoRE-MOFMs for each binary gas mixture are screened out. The high-throughput computational screening and the microanalysis of high-dimensional performance metrics can provide guidance for experimental research through the relationships between the multi-structure variables and multi-performance variables.Entities:
Keywords: gas separation; linear dimension reduction; machine learning; metal–organic framework; molecular simulation
Year: 2019 PMID: 30897779 PMCID: PMC6474094 DOI: 10.3390/nano9030467
Source DB: PubMed Journal: Nanomaterials (Basel) ISSN: 2079-4991 Impact factor: 5.076
Figure 1Schematic diagram of the project framework.
Figure 2Relationships among (a) PO, (b) PH, and pore limiting diameter (PLD); relationships among (c) Sperm(H, (d) Sperm(O, and PLD; relationship between (e) Sperm(H and PH and (f) Sperm(O and PO; the red line represents Robeson’s penetration data based on a wide range of polymer film upper bounds [39].
The prediction of R values and root mean square error (RMSE) for principal component (PC1, PC2, PC3) on the algorithms model of four machine learning. DT: decision tree, SVM: support vector machine, BPNN: back propagation neural network.
| ML | RMSE | |||||
|---|---|---|---|---|---|---|
| PC1 | PC2 | PC3 | PC1 | PC2 | PC3 | |
|
| 0.80 | 0.89 | 0.62 | 0.081 | 1.190 | 0.601 |
|
| 0.81 | 0.93 | 0.72 | 0.080 | 1.000 | 0.700 |
|
| 0.79 | 0.89 | 0.63 | 0.080 | 1.210 | 0.790 |
|
| 0.74 | 0.87 | 0.60 | 0.090 | 1.230 | 0.820 |
Figure 3Predicted performance of the first three principal components (PC1, PC2, PC3) by the machine learning algorithm model of random forest (RF) versus the simulated results of computation-ready, experimental metal–organic framework membranes (CoRE-MOFMs) on the test set. The color of the point represents the amount of materials.
Figure 4Relative importance of six feature descriptors versus PC1, PC2, PC3, PC1 + PC2 + ... + PC3 from the RF algorithm model.
Best CoRE-MOFMs. LCD: large cavity diameter, PLD: pore limiting diameter, PSD: pore size distribution, VSA: volumetric surface area, CSD: Cambridge structural database.
| No. | CSD code | LCD |
| VSA | PLD | PSD% |
| ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | CO2/CH4 | XUZDUS | 4.25 | 0.16 | 15.50 | 2.99 | 1.82 | 3.31 | 1.26 × 109 | 1.75 × 103 | 7.16 × 105 |
| XEJXER | 4.12 | 0.19 | 4.33 | 3.58 | 1.60 | 0 | 1.33 × 108 | 9.26 × 102 | 1.44 × 105 | ||
| 2 | CO2/N2 | ELUQIM06 | 2.89 | 0.04 | 0 | 2.41 | 1.80 | 100 | 7.20 × 106 | 7.72 × 102 | 9.32 × 103 |
| NHBZZN10 | 3.41 | 0.08 | 0 | 2.94 | 1.54 | 68.84 | 1.57 × 104 | 2.32 × 102 | 67.63 | ||
| 3 | H2/CO2 | TUMGOX | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.75 × 103 | 1.76 × 103 | 4.39 |
| HEDCEA | 4.76 | 0.27 | 226.10 | 2.93 | 1.28 | 2.15 | 2.35 × 103 | 5.95 × 102 | 3.95 | ||
| 4 | H2S/CH4 | SEYFAE | 4.14 | 0.28 | 87.08 | 3.31 | 2.03 | 0.43 | 5.57 × 104 | 7.32 × 102 | 76.12 |
| GUXPUL | 2.79 | 0.02 | 0 | 2.58 | 1.60 | 100 | 4.11 × 104 | 3.15 × 103 | 13.07 | ||
| 5 | H2/CH4 | TESGUU | 4.82 | 0.26 | 338.16 | 3.58 | 1.92 | 0 | 8.90 × 102 | 0.07 | 1.20 × 104 |
| ZIJVOF | 5.29 | 0.43 | 622.86 | 3.32 | 1.23 | 0.10 | 9.35 × 102 | 0.10 | 9.37 × 103 | ||
| 6 | H2/O2 | TOWPAY | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.75 × 103 | 1.89 × 103 | 4.10 |
| POWBIO | 4.34 | 0.19 | 142.64 | 2.60 | 3.66 | 1.54 | 4.56 × 103 | 1.18 × 103 | 3.87 | ||
| 7 | CO2/H2S | FAPYEA | 2.53 | 0.00 | 0 | 2.46 | 1.58 | 100 | 1.93 × 109 | 5.11 × 105 | 3.78 × 103 |
| XUZDUS | 4.25 | 0.16 | 15.50 | 2.99 | 1.82 | 3.31 | 1.26 × 109 | 6.80 × 106 | 1.85 × 102 | ||
| 8 | H2/N2 | TUMGOX | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.75 × 103 | 2.01 × 103 | 3.85 |
| DIMQOH | 4.69 | 0.42 | 626.81 | 3.15 | 1.40 | 2.97 | 3.80 × 103 | 1.12 × 103 | 3.39 | ||
| 9 | He/N2 | TUMGOX | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.11 × 103 | 2.01 × 103 | 3.53 |
| COWXOC | 5.83 | 0.30 | 517.90 | 2.91 | 1.28 | 0.57 | 7.79 × 103 | 2.43 × 103 | 3.21 | ||
| 10 | He/H2 | BUYNAL | 4.62 | 0.47 | 617.88 | 3.97 | 0.92 | 0 | 20.34 | 5.12 | 3.97 |
| VULKOD | 5.02 | 0.33 | 489.24 | 3.96 | 1.40 | 0.95 | 25.60 | 6.52 | 3.93 | ||
| 11 | He/CH4 | YEKWOC | 8.21 | 0.50 | 1099.57 | 3.05 | 1.28 | 0 | 1.21 × 102 | 0.11 | 1.15 × 103 |
| COXFOL | 5.01 | 0.33 | 403.58 | 3.19 | 1.43 | 1.70 | 3.24 × 102 | 7.48 | 43.32 | ||
| 12 | N2/CH4 | YEKWOC | 8.21 | 0.50 | 1099.57 | 3.05 | 1.28 | 0 | 6.82 × 103 | 0.11 | 6.44 × 104 |
| BAHGUN04 | 4.27 | 0.26 | 108.86 | 3.29 | 1.49 | 0 | 6.24 × 105 | 11.11 | 5.62 × 104 | ||
| 13 | He/CO2 | PUPNAQ | 3.59 | 0.15 | 0 | 2.70 | 1.44 | 32.37 | 38.34 | 4.11 | 9.33 |
| TUMGOX | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.11 × 103 | 1.76 × 103 | 4.03 | ||
| 14 | O2/N2 | GETXAG | 3.33 | 0.10 | 0 | 2.61 | 1.54 | 99.99 | 1.11 × 104 | 3.17 × 103 | 3.48 |
| GOLQII | 3.73 | 0.14 | 9.74 | 3.37 | 2.17 | 0 | 8.32 × 102 | 2.48 × 102 | 3.35 | ||
| 15 | He/O2 | COWXOC | 5.83 | 0.30 | 517.90 | 2.91 | 1.28 | 0.57 | 7.79 × 103 | 2.07 × 103 | 3.76 |
| TOWPAY | 3.47 | 0.27 | 0 | 2.61 | 1.87 | 54.55 | 7.11 × 103 | 1.89 × 103 | 3.76 |
represent the same metal–organic framework membranes (MOFMs) for different binary gas mixtures.