| Literature DB >> 35056671 |
Andrei Tereshchenko1, Danil Pashkov1,2, Alexander Guda1, Sergey Guda1,2, Yury Rusalev1, Alexander Soldatov1.
Abstract
Catalytic properties of noble-metal nanoparticles (NPs) are largely determined by their surface morphology. The latter is probed by surface-sensitive spectroscopic techniques in different spectra regions. A fast and precise computational approach enabling the prediction of surface-adsorbate interaction would help the reliable description and interpretation of experimental data. In this work, we applied Machine Learning (ML) algorithms for the task of adsorption-energy approximation for CO on Pd nanoclusters. Due to a high dependency of binding energy from the nature of the adsorbing site and its local coordination, we tested several structural descriptors for the ML algorithm, including mean Pd-C distances, coordination numbers (CN) and generalized coordination numbers (GCN), radial distribution functions (RDF), and angular distribution functions (ADF). To avoid overtraining and to probe the most relevant positions above the metal surface, we utilized the adaptive sampling methodology for guiding the ab initio Density Functional Theory (DFT) calculations. The support vector machines (SVM) and Extra Trees algorithms provided the best approximation quality and mean absolute error in energy prediction up to 0.12 eV. Based on the developed potential, we constructed an energy-surface 3D map for the whole Pd55 nanocluster and extended it to new geometries, Pd79, and Pd85, not implemented in the training sample. The methodology can be easily extended to adsorption energies onto mono- and bimetallic NPs at an affordable computational cost and accuracy.Entities:
Keywords: adaptive sampling; adsorption energy; machine learning; palladium nanoparticles; probing molecules; radial distribution function
Year: 2022 PMID: 35056671 PMCID: PMC8780420 DOI: 10.3390/molecules27020357
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Scheme 1Algorithm of adaptive sampling.
Figure 1Machine learning (ML)-predicted vs. Density Functional Theory (DFT)-calculated binding energy in eV for different ML algorithms: (a) Ridge regression; (b) Decision tree; (c) Least absolute shrinkage and selection operator (Lasso); (d) AdaBoost; (e) XGBoost; (f) Gradient boosting; (g) Random forest; (h) Extra trees; (i) support vector machines (SVM). The entire interval of radial distribution functions (RDF) 0–7 Å from the carbon atom of CO was used as a descriptor. Histograms on the opposite axis represent distributions of respective values of energy.
Comparison of the used ML algorithms in terms of their efficiency for predicting the binding energy over the entire interval of the calculated RDF.
| ML Algorithm | MAE, eV | MSE, eV | R2-Score |
|---|---|---|---|
| Ridge regression | 0.40 | 0.28 | 0.31 |
| Decision tree | 0.30 | 0.27 | 0.33 |
| Lasso | 0.39 | 0.26 | 0.36 |
| AdaBoost | 0.29 | 0.16 | 0.60 |
| XGBoost | 0.20 | 0.15 | 0.64 |
| Gradient boosting | 0.22 | 0.14 | 0.64 |
| Random forest | 0.22 | 0.14 | 0.65 |
| Extra trees | 0.19 | 0.13 | 0.68 |
| SVM | 0.15 | 0.08 | 0.81 |
Figure 2DFT potential energy scans upon moving CO molecules (a) from adsorption sites on the surface of Pd55 nanocluster; (b) along the plane perpendicular to Pd(100) (nearest Pd atoms are marked by red and green dashes). (c) Series of smeared RDF upon CO moving away from the respective adsorbing site, colored according to binding energies. The color scale is the same for parts (b,c).
Figure 3Comparison of the quality of prediction of binding energy when using different descriptors/ML methods.
Figure 4The quality of binding-energy prediction when using a different length of RDF.
Figure 5Energy surface of Pd55 predicted by trained SMV ML method. A grid with steps π/100 for φ and θ was used.