Literature DB >> 25547365

Statistical analysis and interpolation of compositional data in materials science.

Misha Z Pesenson1, Santosh K Suram, John M Gregoire.   

Abstract

Compositional data are ubiquitous in chemistry and materials science: analysis of elements in multicomponent systems, combinatorial problems, etc., lead to data that are non-negative and sum to a constant (for example, atomic concentrations). The constant sum constraint restricts the sampling space to a simplex instead of the usual Euclidean space. Since statistical measures such as mean and standard deviation are defined for the Euclidean space, traditional correlation studies, multivariate analysis, and hypothesis testing may lead to erroneous dependencies and incorrect inferences when applied to compositional data. Furthermore, composition measurements that are used for data analytics may not include all of the elements contained in the material; that is, the measurements may be subcompositions of a higher-dimensional parent composition. Physically meaningful statistical analysis must yield results that are invariant under the number of composition elements, requiring the application of specialized statistical tools. We present specifics and subtleties of compositional data processing through discussion of illustrative examples. We introduce basic concepts, terminology, and methods required for the analysis of compositional data and utilize them for the spatial interpolation of composition in a sputtered thin film. The results demonstrate the importance of this mathematical framework for compositional data analysis (CDA) in the fields of materials science and chemistry.

Keywords:  big data; complex data; compositional data; electrocatalyst; high-throughput screening; inkjet printing; interpolation; sputtering; statistical data analysis; thin-films

Mesh:

Year:  2015        PMID: 25547365     DOI: 10.1021/co5001458

Source DB:  PubMed          Journal:  ACS Comb Sci        ISSN: 2156-8944            Impact factor:   3.784


  3 in total

1.  Performance Assessment in Water Polo Using Compositional Data Analysis.

Authors:  Enrique García Ordóñez; María Del Carmen Iglesias Pérez; Carlos Touriño González
Journal:  J Hum Kinet       Date:  2016-12-15       Impact factor: 2.193

2.  Robust Compositional Analysis of Physical Activity and Sedentary Behaviour Data.

Authors:  Nikola Štefelová; Jan Dygrýn; Karel Hron; Aleš Gába; Lukáš Rubín; Javier Palarea-Albaladejo
Journal:  Int J Environ Res Public Health       Date:  2018-10-14       Impact factor: 3.390

3.  A Lachnospiraceae-dominated bacterial signature in the fecal microbiota of HIV-infected individuals from Colombia, South America.

Authors:  Homero San-Juan-Vergara; Eduardo Zurek; Nadim J Ajami; Christian Mogollon; Mario Peña; Ivan Portnoy; Jorge I Vélez; Christian Cadena-Cruz; Yirys Diaz-Olmos; Leidy Hurtado-Gómez; Silvana Sanchez-Sit; Danitza Hernández; Irina Urruchurtu; Pierina Di-Ruggiero; Ella Guardo-García; Nury Torres; Oscar Vidal-Orjuela; Diego Viasus; Joseph F Petrosino; Guillermo Cervantes-Acosta
Journal:  Sci Rep       Date:  2018-03-14       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.