Literature DB >> 31432806

Statistically representative databases for density functional theory via data science.

Pierpaolo Morgante1, Roberto Peverati1.   

Abstract

The amount of data and number of databases for the assessment and parameterization of density functional theory methods has grown substantially in the past two decades. In this work, we introduce a novel cluster analysis technique for density functional theory calculations of the electronic structure of atoms and molecules with the goal of creating new statistically significant databases with broad chemical scope, and a manageable number of data-points. By analyzing without a priori chemical assumptions a population of almost 350k data-points, we create a new database called ASCDB containing only 200 data-points. This new database holds the same chemical information as the larger population of data from which it is obtained, but with a computational cost that is reduced by several orders of magnitude. The labelling of the significant chemical properties is performed a posteriori on the resulting 16 subsets, classifying them into four areas of chemical importance: non-covalent interactions, thermochemistry, non-local effects, and unbiased calculations. The analysis of the results and their transferability shows that ASCDB is capable of providing the same information as that of the larger collection of data-such as GMTKN55, MGCDB84, and Minnesota 2015B-for several density functional theory methods and basis sets. In light of these results, we suggest the use of this new small database as a first inexpensive tool for the evaluation and parameterization of electronic structure theory methods.

Entities:  

Year:  2019        PMID: 31432806     DOI: 10.1039/c9cp03211h

Source DB:  PubMed          Journal:  Phys Chem Chem Phys        ISSN: 1463-9076            Impact factor:   3.676


  1 in total

1.  Competition between cyclization and unusual Norrish type I and type II nitro-acyl migration pathways in the photouncaging of 1-acyl-7-nitroindoline revealed by computations.

Authors:  Pierpaolo Morgante; Charitha Guruge; Yannick P Ouedraogo; Nasri Nesnas; Roberto Peverati
Journal:  Sci Rep       Date:  2021-01-14       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.