Literature DB >> 36245939

Accelerating the theoretical study of Li-polysulfide adsorption on single-atom catalysts via machine learning approaches.

Abstract

Li-S batteries are a promising alternative to Li-ion batteries, offering large energy storage capacity and wide operating temperature range. However, their performance is heavily affected by the Li-polysulfide (LiPS) shuttling. Computational screening of LiPS adsorption on single-atom catalyst (SAC) substrates is of great aid to the design of Li-S batteries which are robust against the LiPS shuttling from the cathode to the anode and the electrolyte. To facilitate this process, we develop a machine learning (ML) protocol to accelerate the systematic mapping of dominant local energy minima found with calculations based on the density functional theory (DFT), and, in turn, fast screening of LiPS adsorption properties on SACs. We first validate the approach by probing the potential energy surface for LiPS adsorbed on graphene decorated with a Fe-N4-C SAC. We identify minima whose binding energies are better or on par with the one previously reported in the literature. We then move to analyze the adsorption trends on Zn-N4-C SAC and observe similar adsorption strength and behavior with the Fe-N4-C SAC, highlighting the good predictive power of our protocol. Our approach offers a comprehensive and computationally efficient alternative to conventional approaches studying LiPS adsorption.

Entities: Chemical

Keywords: DFT; Li‐polysulfide (LiPS) shuttling; Li–S batteries; Zn–N4–C SAC; adsorption; machine learning; single‐atom catalysts

Year: 2022 PMID： 36245939 PMCID： PMC9541244 DOI： 10.1002/qua.26956

Source DB: PubMed Journal: Int J Quantum Chem ISSN： 0020-7608 Impact factor: 2.437

INTRODUCTION

Lithium–sulfur (Li–S) batteries are an emerging technology in the field of energy storage. Their high‐density energy storage capacity, of ∼2500 W h kg−1 theoretical specific energy [1], is indeed much higher than that of Li‐ion or other commercial batteries. However, in practice, the Li–S battery rate capability and life cycle degrades severely over time because of the so called “shuttle” effect. The latter is caused by lithium polysulfide (LiPS) molecules remaining in the electrolyte during the charge–discharge process, leading to loss of active material and reduced battery performance. Controlling the LiPS rate of migration to the anode and the reaction kinetics in the cathode is important for tackling the issue of the “shuttle” effect. Several strategies have been adopted to reduce the diffusivity of LiPS to the electrolyte, mainly focusing on the development of cathode host materials that can strongly bind the LiPS while also improving the reaction kinetics [2, 3, 4]. However, regardless of the vast amount of research in the field over the recent years, prevention of battery degradation still remains a challenge. To overcome the issue, recent studies focus on single‐atom catalyst (SAC) materials. The latter provide strong polysulfide adsorption and improve the reaction kinetics, thanks to their exceptional electrocatalytic properties. Recent studies indeed suggest that SAC with atomically dispersed transition metals (TMs) on graphene lattices are particularly promising candidates. In particular, calculations on the effect of SACs with TM–N4–C formation (TM = Co, Fe, Mn, Ru, V, W, Zn, and C as a graphene lattice) on the LiPS adsorption, the Gibbs free energy change and the reaction activation barrier [5, 6, 7, 8, 9, 10, 11, 12, 13], predict a remarkable improvement in the cathode properties which could potentially reduce the LiPS “shuttle” effect. Numerical methods provide ground to screen SAC adsorption properties. The identification of the global energy minimum LiPS/SAC structure, a stepping stone to predict adsorption trends, is however rather challenging. Several different initial guess structures and structural optimizations have to be considered in order to identify the global minimum structure. For example, in high‐order LiPS intermediates (Li2S4, Li2S6, and Li2S8) the final binding energies varies over .5 eV [14] depending on the initial LiPS structure that is used as input in the relaxation. By the same token, reported values of Li2S6 binding energy on Fe‐based SAC, one of the most examined SAC material for Li–S batteries, vary by about .6 eV, from −.9 eV to −1.5 eV [5, 7, 9, 13]. The large discrepancy in the results, raises concerns regarding the quality of the protocols adopted to identify the minimum energy structure, and adds uncertainty when comparing SAC results from different studies. Here, we demonstrate the use of machine learning (ML) methods to accelerate the prediction of LiPS binding energies on SAC materials. The proposed approach gauges on the recent advances in ML methods applied to the modeling of catalysts and materials for energy applications [15, 16]. While application of ML methods emerged as a new paradigm to accelerate the exploration of complex interactions between small and medium molecules with a variety of binding sites, [17, 18, 19, 20] and to predict properties in energy storage materials [21, 22, 23, 24, 25, 26, 27, 28, 29] data‐driven approaches in the field of Li–S batteries still remain under‐exploited [30]. Data being the primary key in data‐driven approaches, the construction of non‐noisy databases has been the first stepping‐stone toward the rapid development of ML accelerated design of Li‐based materials, which exploit experimental data [31, 32, 33]. On the other hand, ML provide a platform to largely speed‐up quantum mechanical calculations, which can be promptly organized in homogeneous databases, and enable in silico high‐throughput screening [34]. To date, ML modeling in Li–S batteries has been used to examine the impact of materials and battery design on the battery performance [35]. Also, the LiPS adsorption on two‐dimensional layer‐structured MoSe2 and WSe2 was probed by calculating the binding energy of LiPSs for arbitrary spatial configurations at random sites on the host material [36]. Further, a recent study combined a machine learning framework, based on a modified Crystal Graph Convolutional Neural Network and DFT calculations to offer rational design of SACs for Li–S batteries [37]. The authors focus mainly on the low‐order LiPS and S8 to reveal the binding energy pattern of LiPSs on a wide range of SACs. In this work, we develop a protocol which hinges on a supervised/unsupervised machine learning strategy, in combination with density functional theory (DFT) calculations, to efficiently screen the stable structures of Li2S (n = 1, 2, 4, 6, 8) on TM–N4–C (TM = Fe, Zn) anchoring materials. The Fe based system will be used to benchmark with previous results in the literature [13]. The Zn‐containing material is instead selected as a focus of our investigation because it presents a similar chemistry to the one of Fe, but with a potential for improved properties when applied as a LiPs anchoring site [31, 32, 38]. To isolate a structurally diverse and energetically favorable set of structures from a database encompassing roto‐translations of LiPSs on a graphene‐based SAC substrate we adopt unsupervised and supervised machine learning schemes. We then show that by relaxing these structures we identify SAC/LiPS geometries whose binding energy is on par or lower with those reported in the literature. We thus demonstrate how machine learning methods of different nature can accelerate the systematic identification of the energetically favorable LiPS/SAC substrate geometries.

COMPUTATIONAL METHODS

The workflow we adopt in this study is visualized in Figure 1. While we refer the interested reader to the dedicated subsections (2.3–2.5) for detailed technical description on each technique, we here briefly summarize the key aspect in each step of the workflow:

FIGURE 1

Schematic representation of the five steps describing the binding energy prediction workflow in this study

Database construction: Firstly, we define a supercell space and relax the forces on atoms in the LiPS and in the SAC, each considered as an isolated system, at DFT level. This ensures that the LiPS and SAC structure are more likely to be energetically favorable when modeling LiPS adsorption on the SAC. We define a 3D grid in the supercell, where translations of the LiPS will be placed. The grid spacing is .5 Å along the plane parallel to the substrate, and .25 Å along the axis perpendicular to the substrate. For each translation, we apply rotation of 10° in the solid angle centered at the center of mass of the polysulfide, and obtain the full set of structures. The final number roto‐translation is reduced and differs for each LiPS, as we apply cutoff criteria so that no LiPS atom is closer than 1.5 Å to atoms in the substrate. Reference force‐energy calculations: We map each structure generated in the previous step into a high dimensional feature space. The latter is defined through a smooth overlap of atomic orbitals (SOAP) representation [39], employing seven angular and six radial components and a radial cut‐off of 6.5 Å. We then select a maximum of 10% of the most different structures, in terms of SOAP representation, within the full set of LiPS roto‐translations along the substrate, by means of a furthest point sampling (FPS) algorithm [40]. We calculate from single‐point DFT calculations the binding energy corresponding to each of these structures. ML training: we train a Kernel Ridge regressor [41] to predict the binding energies of selected LiPS–SAC configuration, based on its SOAP representation. We utilize the regressor to map predicted binding energies in the full set of roto‐translations generated in the initial step of this protocol. Surrogate generalized convex hull (GCH) construction: We build a GCH which utilizes the first principal component analysis (PCA) [42] component projection of the SOAP features assigned to each LiPS–SAC as the free thermodynamic variable, and the predicted binding energy as the energetic coordinate. Minimize GCH Vertexes: We perform DFT geometry optimization to the structures which lie at vertexes of the GCH. We make this choice under the heuristic hypothesis that vertexes of the GCH are the most likely to relax into dissimilar low‐energy structures [43]. Schematic representation of the five steps describing the binding energy prediction workflow in this study

Materials

In this study, we examine two SAC materials with TM–N4–C formation (TM = Fe, Zn) and five LiPS intermediates (Li2S , n = 1, 2, 4, 6, 8). The TM–N4–C configurations, based on a modified 72‐atom relaxed pristine graphene lattice, consist of 66 C atoms (92.96 at%), four substitutional N atoms (5.63 at%) and one interstitial TM atom (1.41 at%). The TM and two N atoms were manually shifted out of plane by .5 Å and .3 Å, respectively, in order to realistically represent the previously observed geometry in SAC materials, when LiPSs are present [7, 9]. The top and side views of the TM–N4–C, and the relaxed LiPS structures are shown in Figure 2.

FIGURE 2

Top (A) and side (B) views of TM–N4–C structure. Geometry optimized Li2S (n = 1, 2, 4, 6, 8) structures. Gray, blue, purple, magenta and yellow denote the C, N, TM, Li, and S elements, respectively (C)

DFT set up

We perform all spin‐polarized DFT simulations with the CASTEP code [44]. We calculate the exchange and correlation potential using the generalized gradient approximation (GGA) with the Perdew–Burke–Ernzerhof (PBE) functional, using plane‐wave cut‐off energy of 500 eV. We use energy and force convergence criteria of 10−5 eV and 5 × 10−5 eV/Å respectively for the self‐consistent field (SCF) for single‐point and geometry optimization simulations, and 10−5 eV and 10−3 eV/Å respectively, for the geometry optimization simulations. The sampling of the Brillouin zone is carried on a Monkhorst‐Pack grid, using a 3 × 3 × 1 k‐point mesh. We include Van der Waals dispersion corrections as described in Grimme's empirical method [45]. We consider periodic boundaries conditions in the simulation box and a vacuum separation distance of 18 Å at the direction normal to the substrate's surface to avoid replica interaction. We define the binding energy (E b) as: where E SAC, E LiPS, and E mat are the energies of the substrate, LiPS and substrate and LiPS together, respectively. For reference, we report the total energies for the SACs and LiPSs as: , , , , , , .

SOAP representation

Each configuration consisting of a LiPS and the SAC is described by means of D features deriving from the smooth overlap of atomic orbitals representation [39]. This representation is smooth, invariant to permutations, rotations, and translations of the system. Within the SOAP representation, structures are encoded by means Gaussian smeared atomic densities expanded via orthonormal functions, based on spherical harmonics and radial basis functions, centered at atoms positions. Operatively, the SOAP power spectrum is calculated as: where n and n are the indices for the radial basis functions (up to n max, l labels the degree of the spherical harmonics (up to l max) and Z 1 and Z 2 is a label associated to the atom species. The coefficients c in Equation (2) are instead defined as (inner products): where ρ (r), is the Gaussian smoothed atomic density for atoms with atomic number Z defined as Y (θ, Φ) are the real spherical harmonics, and g (r) is the radial basis function (spherical Gaussian type orbitals here). To calculate the SOAP power spectra we adopt the DSCRIBE [46] library, and utilize the following parameters: n max = 6, l max = 7, and r cut = 6.5 Å. After calculating the power spectrum associated to each atomic sites in a structure, we evaluate an average power spectrum which is associated to the atomic configuration. We utilize the elements in the averaged p power spectrum as the features in kernel ridge regression (KRR). SOAP power spectrum coefficients [47, 48, 49], and local‐density expansion coefficients more in general [50, 51, 52, 53], have been largely successful features in kernel‐based and linear machine learning models for structure classification and energy regression. SOAP representations, and in general local‐density representations, when coupled with kernel and linear methods alike, are proven to be capable of accurate and fast predictions of atomistic systems properties in bulk and of molecular materials with diverse chemistries, also providing fairly accurate predictions when trained on small (∼103 configurations) datasets. [51, 54, 55]. Atom‐density representation further encode by design the physical symmetries (rotation, permutation, translation invariance) which rule interatomic interaction, and, though not‐transparent, correlate with more interpretable “classical” descriptors, for example, coordination [56, 57, 58].

Kernel ridge regression

KRR is a well‐known and popular non‐parametric regression approach. In particular, it consists of L2‐norm regularized least squares fitting which exploits the kernel trick [41], where the latter allows for an efficient framework to transform and compare data in an high number dimensions. Within the framework of kernel methods, data are represented by a matrix of pairwise similarity where each (i, j) entry is defined by the kernel function k(x , x ). The latter returns the dot product of the features vectors associated to each i and j data point in the projected space, for any mapping . The kernel trick thus allows for operations in a high dimensional space without the explicit need of calculating ϕ(x). Given a set of covariate variables x , the KRR prediction f(x) is written as a linear combination [59]: and the vector of α weights is found by solving: where K is the kernel matrix, λ tunes the regularization strength and y labels response variables in the training set. In the reported case study, we choose λ to 0.025 and a radial basis function kernel (length scale l = 1) to evaluate the K elements of the K matrix. We do not exclude that other values of λ and l could also work well, as they have been chosen following a non‐systematic trial‐and‐error procedure.

Generalized convex hull

The generalized convex hull (GCH) allows to evaluate the probabilities of structures being stabilized by general thermodynamic constraints. In a nutshell, the convex hull of a finite number of points is defined as the smallest convex set that contains them. In materials science, the convex hull construction is used to identify structures that are stable. Given a set of energies, as a function of one or more thermodynamic variables, only the configurations which lie at the vertexes of the convex hull which are lower in energy will be stable. If a structure does not lie on one of such vertexes it is unstable with respect to decomposition into two or more parent structures (assuming fixed thermodynamic conditions and disregarding possible stabilization due to kinetic effects). The generalized convex hull construction adopts a geometric fingerprints = {Φ }, to encode the full structural diversity of a dataset of configurations, rather than one or more explicit thermodynamic variables to predict stable phases. The GCH framework has been successfully used in the past for predicting previously unseen ice phases [56] and molecular crystals polymorphs [43]. Overall, this elegant construction provides a rigorous framework to account on the same footing for both geometrically diversity and predicted energetic stability in the choice of diverse candidates for relaxation at DFT level.

WORKFLOW VALIDATION FOR FE–N–C SAC

Li on Fe–N–C

To showcase in a detailed manner the application of the workflow we discuss the case of Li2S binding on a Fe‐based SAC. Figure 3A shows a 2D map where the two axes are the first and second principal components of the SOAP features, associated to all structures generated for this system. Each point in the map corresponds to one of those structures. The color scale refers to the order with which structures have been generated during the systematic enumeration of LiPS roto‐translations. Points in black corresponds to structures selected via an heuristic FPS scheme where the SOAP features for each structure and an Euclidean metric were considered to evaluate the distance between different structures.

FIGURE 3

(A) Visualization of the SOAP features for initial full set of ∼27 000 structure generated for Li2S on Fe‐based SAC, when projected in the space determined by their first two PCA components (adimensional). Points in the map are color coded according to their structure index, while the ∼2000 structures selected by the FPS procedure are shown in black. Note their heterogeneous distribution in the PCA space, which also comprises outliers. (B) Parity plot between the ML‐predicted and DFT‐calculated binding energies for the training, testing, and validation datapoints. (C) Visualization of the GCH construction when utilizing the first SOAP feature's PCA component as the thermodynamic variable and binding energy as the energetic variables. Vertexes are highlighted with red crosses. The binding energy of structures found by relaxing each GCH vertex is shown in blue. A thick line acts as a guide to the eye in connecting each GCH vertex energy to its relaxed configuration binding energy We repeat the same process for the other LiPS molecules and run DFT‐based single point calculation on the first ∼10% most different structures (i.e., ∼2000% out of the total ∼20 000% structures per each LiPS), and utilize the ∼80% of that data as the training dataset in the Kernel Ridge regression fitting. Features associated to each data point are normalized by removing the mean and scaling to unit variance before performing the KRR training. A grand total of 7793 datapoints are used as the training test, after excluding configurations with too unfavorable binding energies. The model Mean Absolute Error (MAE) and R 2 correlation coefficient for training set predictions are .08 eV and .99, respectively. The same quantities for validation set predictions are instead equal to .12 eV and .98. The models errors are well within the acceptable limit for this type of calculations. MAE and R 2 values are obtained by averaging result found when performing an 8‐fold cross‐validation test [60]. To further evaluate our model's accuracy, we select the ∼2% most different structures with respect to the ones in the training set as our test set (i.e., ∼50 structures per each LiPS). More in detail, if the index 1…20000 list the structures ranked via a furthest point sampling starting from structure 1, the training set comprises a random selection of 80% of structures with index between 1 and 2000, the validation set contains the remaining 20% of structures with index ranging from 1 to 2000, while the test set includes structures 2001 to 2050. We then calculate the binding energy in the test set configurations via single point DFT calculations. The parity plot between the ML predicted and DFT true binding energies for this test is illustrated in Figure 3B. A MAE of .11 eV is registered for the test set, along with a very good correlation (R 2 = .99) between the single‐point predicted binding energy and the ground truth energy. We utilize the KRR regressor to evaluate the binding energies for the whole set of Li2S structures, generated at the first stage of our minima exploration protocol. We then map them on an Energy—First SOAP PCA dimension map, Figure 3C, where each LiPS roto‐translated configuration corresponds to one orange dot. The thin red edges and the 11 bold red crosses respectively identify the edges and vertexes of the convex hull construction associated to this dataset. While a number of CH vertexes appear at very negative first PCA component values, we consider only one because of their small structural and energetic diversity. The blue dots label the DFT‐level Binding Energy found when relaxing each of the structure lying at a CH vertexes. We then analyze the intermolecular distances in the LiPS molecule and the intramolecular one between the LiPS and the Fe and N atoms in the support, and that all lie in a distinct local minima.

LiPSs on Fe–N–C

We follow the protocol described for Li2S2 to calculate the binding energy for the rest of the examined LiPSs (Li2S n = 1, 4, 6, 8) when interacting with Fe–N4–C. Our training set initially comprises of approximately 10 000 structures, that is, the first most diverse ∼2000 Li2S , n = 1, 2, 4, 6, 8) structures and associated binding‐energies. Configurations with too unfavorable energies (above 5 eV) are then discarded. The training test thus consists of 7793 datapoints. A single ML model to learn predict binding energies regardless of the LiS stoichiometry is finally trained on this dataset. Figure 4 illustrates the GCH construction resulting from the SOAP‐KRR model. Each LiPS configuration is reported as an orange dot, the GCH vertexes are highlighted with red crosses, and the edge connecting to vertexes is visualized with a red line. The blue dots in Figure 4 instead report the DFT‐level Binding Energy found when relaxing each GCH vertex. For the four examined cases, the LiPS isomers found to give the lowest binding energies after geometry optimization are GCH vertex with positive 1st PCA component and low single‐point binding energy (lower right‐hand side region of the plots).

FIGURE 4

The surrogate GCH construction for Li2S (n = 1, 4, 6, 8) structures adsorbed on Fe–N4–C is illustrated with orange points. GCH vertexes are shown with red crosses. Blue dots show the binding energy corresponding to structures sampled after relaxing the GCH vertexes First, we compare the binding energies found with our protocol with respect to those reported in the literature, and gather them in Figure 5. The reported binding energies are in good agreement with reported values from the literature, showing similar or stronger binding, but the Li2S2 case were the binding energy is underestimated by .1 eV. This highlights the predictive power of the framework under investigation, which is generally able to identify structures offering the strongest binding. We note that when we include higher energy Li2S2 CH vertexes at the relaxation stage, we identify a configuration with binding energy of −2.04 eV, in line with what reported in the literature. In this regards, the binding energy change trend among the various LiPSs is also in good agreement with our previous study [13], which adopts the exact same DFT calculation set up.

FIGURE 5

Summary of the highest LiPS binding energies for Fe–N4–C SAC calculated by geometry optimized DFT simulations found in “This work” or found in the literature [5, 6, 7, 9, 13]. All datapoints are in eV. If datapoint are not available, the bar chart is not shown Figure 6 shows the top and side views of LiPS structures lowest energy structures. All relaxed LiPS structures have a S atom located on/near the top of the TM atom, while the Li atoms are located closer to the substrate. This is verified by an atomic distance analysis, reported in Figure 6, describing the shortest Li–TM and S–TM distances for each LiPS molecule corresponding to the lowest energy structure. The distance between Li atoms and the TM is always smaller than the one between the S atoms and the TM. The closest S–TM distance decreases for high order LiPSs, while the opposite is observed for the closest Li–TM distance. The tendency of the TM in “pulling” the S atoms closer while “pushing” the Li atoms away is in agreement with our previous report in the literature [13].

FIGURE 6

Top and side views, binding energy, and structural descriptors of the lowest energy configurations for each LiPS (Li2S , n = 1, 2, 4, 6, 8) on Fe–N4–C. Color code follows Figure 2

WORKFLOW EXPLOITATION: ZN–N–C SAC

We move on to exploit the binding energy prediction workflow described in this study and identify the low‐lying minima for the shuttle‐effect cycle on a Zn‐based SAC. The system configuration and geometry of the Zn‐based SAC structures were designed in accordance to the previously described Fe‐based SAC, and are illustrated in Figure 7. We examine the binding energies for five LiPSs (Li2S , n = 1, 2, 4, 6, 8), and compare LiPS energetics and geometries against the one found for Fe‐based SAC.

FIGURE 7

Top and side views, binding energy, and structural descriptors of the lowest energy configurations for each LiPS (Li2S , n = 1, 2, 4, 6, 8) on Zn–N4–C. Color code follows Figure 2

Top and side views, binding energy, and structural descriptors of the lowest energy configurations for each LiPS (Li2S , n = 1, 2, 4, 6, 8) on Zn–N4–C. Color code follows Figure 2 Figure 5 reports on the lowest identified binding energies for all examined structures lying on a GCH vertex. Although the Zn–N4–C has not been extensively studied previously, the observed binding energies are in good agreement with previously reported values ( [9]). Atomic binding on Zn‐based SAC seems stronger than on Fe‐based SAC, by .45–.64 eV depending on the LiPS order. The presence of Zn rather than FE in the SAC tends to affect more the lower order LiPSs comparing, a property that can be of help toward the reduction of the “shuttle” effect at the final catalytic steps of a LiPS. The shortest Li–TM distances behave similarly for either Zn and Fe‐based SACs, although these are generally shorter when Zn is present. In contrast, the shortest reported S–TM distances for the two systems follow an opposite trend. When Zn is present the S–TM distance tends to increase as the LiPS order increases, and, generally, the distances are larger when comparing to the Fe‐based SAC. Based on the above, we can conclude that Zn tends to prefer Li bonding for high order LiPSs. This can potentially improve the conversion rate at that stage. As a next step, we compare the lowest found energy configurations for the two different SACs, as reported in Figures 6 and 7. We note that some of the LiPSs which end up in the most competitive minima find the same, or almost the same, initial vertex in the GCH, comparing the Fe and Zn SAC cases. For Li2S, Li2S2, and Li2S8, where the initial unrelaxed configurations are not lying in the same vertex for the two different SACs, the final relaxed configurations are minimizing into similar configurations, as shown in Figures 6 and 7. It is a matter of debate whether best LiPS configurations found for a given substrate could be translated to other systems of similar nature. Our results suggest that relaxing a LiPS/SAC configuration which resulted in lower binding energy for the case of a given TM, is likely, yet not always, to lead to favorable configurations when substituting the TM itself. More robust approaches to identify putative minimum energy configurations, like the one here proposed, are thus necessary toward more accurate predictions. For completeness in the discussion on the computational costs associated with he discussed protocol, we display the number of DFT relaxation steps necessary in our workflow for each Zn‐LiPS system in Table 1. In particular, we report the total number of relaxation steps required in this workflow, including the single‐point calculations necessary during the “Reference Force‐Energy calculations” step, and the structure relaxation calculations of all GCH vertexes during the “Minimize GCH Vertexes” step. From that table, it is clear that the computational complexity increases for the high‐order LiPSs (Li2S6 and Li2S8) as the number of possible local energy minima increases for larger molecules. We recognize that the computational cost of DFT‐based calculations widely depends on multiple system and software dependent factors, hence effective cross‐reference comparison is not trivial. Additionally, the computational cost per DFT relaxation step varies when using self‐consistent functionals, hence time comparison would not be immediately insightful.

TABLE 1

Material	S‐P number	Vertexes	Total steps
Zn–N₄–C/Li₂S	537	9	3962
Zn–N₄–C/Li₂S₂	496	9	3922
Zn–N₄–C/Li₂S₄	505	8	3139
Zn–N₄–C/Li₂S₆	404	7	8831
Zn–N₄–C/Li₂S₈	408	8	7700

Reported relaxation steps for each material. “Single‐point (S‐P) number” refers to the calculations required during the “Reference Force‐Energy calculations” to train the ML model, the label “Vertexes” refers the total number of geometry optimizations performed, and “Total steps” refers to the sum of the geometry optimization steps for all vertexes The standard approach of manually, and often arbitrarily, constructing the LiPS/SAC configuration is known to be ineffective and often might not correspond to the most energetically favorable structure, unless excessive testing is performed. On one hand we cannot exclude that other minima searches methods, for example, via genetic algorithm approaches, will find minima structures competitive with those found in this study. On the other we note that these more extensive searches, involving hundreds or thousands structural relaxations, rather than tens of structural relaxations as in the proposed protocol, would result in much larger computational costs and practically unusable. However, we argue that the workflow under examination proposes a good set of candidate structures and offers a fair balance between accuracy and computational efficiency.

DISCUSSION AND CONCLUSIONS

The binding energy of LiPSs on TM–N–C substrates is a useful descriptor to infer which SAC material is likely to provide an optimal performance in preventing shuttle effects in Li–S batteries. Here, we propose an efficient protocol to identify a likely minimum energy configuration candidate, and thus the most informative binding energy prediction, via a combination of DFT single‐point calculations, unsupervised and supervised data‐driven algorithms, and DFT structure optimizations. The protocol hinges on: I) the construction of a database of roto‐translation of LiPS molecules deposited on a rigid TM–N–C substrate, II) the calculation of the binding energy for a carefully selected number of these, III) the training of a kernel ridge regressor to chart structure‐binding energy relationships systematically, IV) the use of a generalized convex hull construction to isolate likely‐stable and structurally diverse geometries, V) their final optimization at DFT level. We benchmark the protocol robustness for the case of LiPS on a Fe–N4–C SAC and recover binding energies on‐par or lower than the ones reported in the literature. We move to analyze the LiPS adsorption properties on a Zn–N4–C SAC and find that the latter systematically displays a strong interaction (i.e. lower binding energies). From a structural perspective, we confirm that the minimum S–TM distances are always lower than the minimum Li–TM distances, for both systems. We further find that, regardless of the TM nature, an increase in the nuclearity of the LiPS molecule corresponds to a decrease in the S–TM minimum distance and an increase in the Li–TM minimum distance. Interestingly, we note that notwithstanding a change in the transition metal single atom catalysts, the lowest energy relaxed structure often, but not always, originates from a LiPS located and oriented in similar initial position and orientation w.r.t. the substrate TM. Few upgrades could further benefit the efficiency and accuracy of the search strategy we propose. If deemed necessary, the GCH vertex search can encompass not the first PCA component but also high order ones. Further, adopting more complex statistical learning methods (e.g., Gaussian Process regression or Neural Network based) or other more recent local‐density featurization approaches (e.g., Atomic Cluster Expansion based [52, 61, 62] approaches) could enable models which retain the same target accuracy but demand for a lesser number of training structures. Second, co‐regionalized [63] or transfer learning [64] approaches may further benefit the making of more efficient and accurate models when tackling the study of LiPS adsorption on a substrates with many and diverse SACs transition metals. Third, ML approaches could be also adopted to accelerate the minimization [65, 66, 67] and reduce drastically the number of DFT force‐and‐energy calculations which are necessary at the “structure optimization” stage following the surrogate generalized convex hull construction. Currently, the proposed strategy does not aim to replace DFT‐based calculations, but rather to guide and speed‐up DFT‐based searches of LiPS configurations for strong adsorption on SACs. The ML acceleration of the structural relaxation step could further diminish the need of DFT single‐point calculations by a factor of 2 or larger. Finally, we remark that the method is general and transferable to study not only other LiPS‐substrate interactions, but, in principle, also other molecular system–substrate interactions. Here, we demonstrate our protocol's transferability by examining the Zn–N4–C SAC, and observing consistent results as in the Fe–N4–C SAC. Conversely we believe this work will nurture the advancement in minima searches protocols that investigate medium and large molecules adsorption on rigid and soft substrates of diverse nature. From an application standpoint, the synthetic process and performance of single atom catalysts have witnessed ground‐breaking advancement since the emergence of this class of materials, almost 15 year ago [68, 69]. We hope our work aids and contributes to raise interest in the integration of that category of materials in the next generation of Li–S‐based energy storage device.

AUTHOR CONTRIBUTIONS

Eleftherios Andritsos: Conceptualization; data curation; formal analysis; investigation; methodology; project administration; resources; software; validation; visualization; writing – original draft; writing – review and editing. Kevin Rossi: Conceptualization; data curation; formal analysis; investigation; methodology; project administration; resources; software; validation; visualization; writing – original draft; writing – review and editing.

23 in total

1. Semiempirical GGA-type density functional constructed with a long-range dispersion correction.

Authors: Stefan Grimme
Journal: J Comput Chem Date: 2006-11-30 Impact factor: 3.376

2. Zinc Complex-Based Multifunctional Reactive Lithium Polysulfide Trapper Approaching Its Theoretical Efficiency.

Authors: Zhong Ma; Zhijun Zuo; Yuning Li
Journal: ACS Appl Mater Interfaces Date: 2021-05-13 Impact factor: 9.229

3. Cobalt in Nitrogen-Doped Graphene as Single-Atom Catalyst for High-Sulfur Content Lithium-Sulfur Batteries.

Authors: Zhenzhen Du; Xingjia Chen; Wei Hu; Chenghao Chuang; Shuai Xie; Ajuan Hu; Wensheng Yan; Xianghua Kong; Xiaojun Wu; Hengxing Ji; Li-Jun Wan
Journal: J Am Chem Soc Date: 2019-02-21 Impact factor: 15.419

4. On-the-Fly Machine Learning of Atomic Potential in Density Functional Theory Structure Optimization.

Authors: T L Jacobsen; M S Jørgensen; B Hammer
Journal: Phys Rev Lett Date: 2018-01-12 Impact factor: 9.161

Review 5. Artificial Intelligence Applied to Battery Research: Hype or Reality?

Authors: Teo Lombardo; Marc Duquesnoy; Hassna El-Bouysidy; Fabian Årén; Alfonso Gallo-Bueno; Peter Bjørn Jørgensen; Arghya Bhowmik; Arnaud Demortière; Elixabete Ayerbe; Francisco Alcaide; Marine Reynaud; Javier Carrasco; Alexis Grimaud; Chao Zhang; Tejs Vegge; Patrik Johansson; Alejandro A Franco
Journal: Chem Rev Date: 2021-09-16 Impact factor: 72.087

6. Machine Learning Derived Blueprint for Rational Design of the Effective Single-Atom Cathode Catalyst of the Lithium-Sulfur Battery.

Authors: Zan Lian; Min Yang; Faheem Jan; Bo Li
Journal: J Phys Chem Lett Date: 2021-07-22 Impact factor: 6.475

7. Theoretical Calculation Guided Design of Single-Atom Catalysts toward Fast Kinetic and Long-Life Li-S Batteries.

Authors: Guangmin Zhou; Shiyong Zhao; Tianshuai Wang; Shi-Ze Yang; Bernt Johannessen; Hao Chen; Chenwei Liu; Yusheng Ye; Yecun Wu; Yucan Peng; Chang Liu; San Ping Jiang; Qianfan Zhang; Yi Cui
Journal: Nano Lett Date: 2020-01-03 Impact factor: 11.189

8. Balancing surface adsorption and diffusion of lithium-polysulfides on nonconductive oxides for lithium-sulfur battery design.

Authors: Xinyong Tao; Jianguo Wang; Chong Liu; Haotian Wang; Hongbin Yao; Guangyuan Zheng; Zhi Wei Seh; Qiuxia Cai; Weiyang Li; Guangmin Zhou; Chenxi Zu; Yi Cui
Journal: Nat Commun Date: 2016-04-05 Impact factor: 14.919