| Literature DB >> 34878706 |
Rémi Pétuya1,2, Samantha Durdy3,4, Dmytro Antypov1,3, Michael W Gaultois1,3, Neil G Berry1, George R Darling1, Alexandros P Katsoulidis1,3, Matthew S Dyer1,3, Matthew J Rosseinsky1,3.
Abstract
The choice of metal and linker together define the structure and therefore the guest accessibility of a metal-organic framework (MOF), but the large number of possible metal-linker combinations makes the selection of components for synthesis challenging. We predict the guest accessibility of a MOF with 80.5 % certainty based solely on the identity of these two components as chosen by the experimentalist, by decomposing reported experimental three-dimensional MOF structures in the Cambridge Structural Database into metal and linker and then learning the connection between the components' chemistry and the MOF porosity. Pore dimensions of the guest-accessible space are classified into four ranges with three sequential models. Both the dataset and the predictive models are available to download and offer simple guidance in prioritization of the choice of the components for exploratory MOF synthesis for separation and catalysis based on guest accessibility considerations.Entities:
Keywords: Database; Guest accessibility; Machine learning; Metal-organic frameworks; Porosity
Year: 2022 PMID: 34878706 PMCID: PMC9303542 DOI: 10.1002/anie.202114573
Source DB: PubMed Journal: Angew Chem Int Ed Engl ISSN: 1433-7851 Impact factor: 16.823
Figure 1Classification of the 3D MOF component (organic linkers and metal species) dataset from the Cambridge Structural Database 3D MOF subset. Once the MOF structures are cleaned by removing species not bound to metal atoms, their porosity is evaluated by calculating their pore limiting diameter (PLD) with Zeo++. UiO‐66 (refcode RUBTAK) is shown here as an illustrative example and its Connolly surface, highlighting its porosity, is displayed for a probe diameter of 2.4 Å.
Figure 2Workflow of creating the 1M1L3D dataset and using it to develop machine learning tools. The starting point is the information contained in the MOF subset of the experimental structures in the CSD that is used to select 3D MOF structures (step 1). These structures are decomposed into metal and linker (step 2) to produce the 1M1L3D dataset containing materials with a single metal and single linker (step 3). The evaluation of this dataset then takes place (step 4) to produce both the features (shown in blue for the linker and grey for the metal) and the porosity target (shown in green for one of the models as an example) on which the ML models are trained (step 5) to predict MOF guest accessibility (step 6). MOF are considered guest‐accessible when their pore limiting diameter is larger than 2.4 Å. The model is 80.5 % accurate in predicting guest accessibility based on the nature of the metal and the linker.
Figure 3Sequence of three binary classifier models that predict the range of the pore limiting diameter (PLD) of a candidate MOF based on its linker and metal components. The four ranges are defined as non‐porous (PLD<2.4 Å, red), small pores (2.4 Å