Literature DB >> 29892644

Data and Supplemental information for predicting the thermodynamic stability of perovskite oxides using machine learning models.

Wei Li1, Ryan Jacobs1, Dane Morgan1.   

Abstract

To better present the machine learning work and the data used, we prepared a supplemental spreadsheet to organize the full training dataset prepared from DFT calculations, the individual elemental properties, the generated element-based descriptors derived from the elements present in each perovskite composition, and lists of the specific features selected and used our machine learning models. We have also provided supplemental information which contains additional details related to our machine learning models which were not provided in the main text (Li et al., In press) [1]. In particular, the supplemental information provides results on training and testing five regression models (using the same data and descriptors as the regression of Ehull in main text) to predict the formation energies of perovskite oxides. We provided source code that trains the machine learning models on the provided training dataset and predicts the stability for the test data.

Entities:  

Year:  2018        PMID: 29892644      PMCID: PMC5992996          DOI: 10.1016/j.dib.2018.05.007

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data Provide thermodynamic stability data of more than 1900 perovskite oxides, which can be future extended to other data mining / machine learning tasks. Present an informative elemental property list for all elements in the periodic table, which can be valuable for building descriptors for other materials informatics projects. Provide detailed information on the feature selection and model training results, making it easy for other researchers to study feature importance, and to understand and reproduce the results in the manuscript. Provided source code for training machine learning models on provided training dataset and making predictions on new perovskite compounds. Users can supply their own perovskite dataset for training or testing.

Data

Project_complete_dataset.xlsx:

This spreadsheet contains 5 sheets, which are listed below. DFT Calculated Dataset: The dataset of DFT-calculated Ehull and formation energies of 1929 perovskites. Elemental Property Table: The 82 elemental properties of all elements in the periodic table. Shannon Radius: The Shannon radii of the elements that appeared in the 1929 perovskites which were used to calculate geometric descriptors unique to the perovskite structure. Dataset with Generated Features: The 962 descriptors (including 171 constant descriptors that were not used and 791 descriptors that were used for feature selection) generated from the elemental properties. Top features Selected: The descriptors selected during feature selection as optimal (top 70 from the 791 descriptors) for use the regression model and the classification model.

Supplemental information.docx

This file contains additional details related to our machine learning models which were not provided in the main text. In particular, the supplemental information provides results on training and testing five regression models (using the same data and descriptors as the regression of Ehull in main text) to predict the formation energies of perovskite oxides. The supplemental information contains 4 sections: Models/APIs used in this work, Best Parameters for Each Model Investigated, Regression of Formation Energy, Learning Curve with the Best Model for Stability Classification.

Code repository

This repository contains source code for training/testing machine learning models on the provided dataset. This package can provide predictions of stability of new perovskite compounds using the machine learning approaches developed in this work. The code repository is available online at https://github.com/uw-cmg/perovskite-oxide-stability-prediction.

Experimental design, materials and methods

N/A.
Subject areaPhysics, Chemistry, Machine Learning
More specific subject areaThermodynamic stability of perovskite oxides dataset and source code for training of machine-learning predictive model
Type of dataSpreadsheet, Source Code, Word Document
How data was acquiredDensity Functional Theory Calculations[2], MAGPIE Elemental Property Database[3], Online resources[4], [5](See Manuscript[1]Sec. 2.1)
Data formatAnalyzed
Experimental factorsN/A
Experimental featuresN/A
Data source locationN/A
Data accessibilityData is with this article
  6 in total

Review 1.  A Survey of Datasets, Preprocessing, Modeling Mechanisms, and Simulation Tools Based on AI for Material Analysis and Discovery.

Authors:  Faiza Qayyum; Do-Hyeun Kim; Seon-Jong Bong; Su-Young Chi; Yo-Han Choi
Journal:  Materials (Basel)       Date:  2022-02-15       Impact factor: 3.623

2.  Screening Perovskites from ABO3 Combinations Generated by Constraint Satisfaction Techniques Using Machine Learning.

Authors:  Jie Zhao; Xiaoyan Wang
Journal:  ACS Omega       Date:  2022-03-16

3.  Environmental Stability of Crystals: A Greedy Screening.

Authors:  Nicholas M Twyman; Aron Walsh; Tonio Buonassisi
Journal:  Chem Mater       Date:  2022-03-02       Impact factor: 9.811

4.  Data-Driven Analysis of Hole-Transporting Materials for Perovskite Solar Cells Performance.

Authors:  Marcos Del Cueto; Charles Rawski-Furman; Juan Aragó; Enrique Ortí; Alessandro Troisi
Journal:  J Phys Chem C Nanomater Interfaces       Date:  2022-07-29       Impact factor: 4.177

5.  Physics-Guided Descriptors for Prediction of Structural Polymorphs.

Authors:  Bastien F Grosso; Nicola A Spaldin; Aria Mansouri Tehrani
Journal:  J Phys Chem Lett       Date:  2022-08-03       Impact factor: 6.888

6.  Rapidly predicting Kohn-Sham total energy using data-centric AI.

Authors:  Hasan Kurban; Mustafa Kurban; Mehmet M Dalkilic
Journal:  Sci Rep       Date:  2022-08-24       Impact factor: 4.996

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.