Assessing and improving the stability of chemometric models in small sample size situations.

Claudia Beleites, Reiner Salzer.

Abstract

Small sample sizes are very common in multivariate analysis. Sample sizes of 10-100 statistically independent objects (rejects from processes or loading dock analysis, or patients with a rare disease), each with hundreds of data points, cause unstable models with poor predictive quality. Model stability is assessed by comparing models that were built using slightly varying training data. Iterated k-fold cross-validation is used for this purpose. Aggregation stabilizes models. It is possible to assess the quality of the aggregated model without calculating further models. The validation and aggregation methods investigated in this study apply to regression as well as to classification. These techniques are useful for analyzing data with large numbers of variates, e.g., any spectral data like FT-IR, Raman, UV/VIS, fluorescence, AAS, and MS. FT-IR images of tumor tissue were used in this study. Some tissue types occur frequently, while some are very rare. They are classified using LDA. Initial models were severely unstable. Aggregation stabilizes the predictions. The hit rate increased from 67% to 82%.
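The validation scheme described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. A nearest-centroid classifier stands in for LDA so that the example needs only the Python standard library. The data are synthetic, and all function names (make_toy_data, iterated_kfold_predictions, summarize) are hypothetical. Iterated k-fold cross-validation gives every sample one out-of-fold prediction per iteration. Disagreement among those predictions indicates instability. A majority vote over them is one possible reading of how an aggregated model could be assessed without fitting further models.

```python
import random
from collections import Counter


def make_toy_data(n_per_class=15, n_features=50, shift=0.4, seed=1):
    """Synthetic small-n, many-variate data: two classes with a small mean shift."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([rng.gauss(label * shift, 1.0) for _ in range(n_features)])
            y.append(label)
    return X, y


def fit_centroids(X, y):
    """Nearest-centroid model (assumed stand-in for LDA): one mean vector per class."""
    sums, counts = {}, Counter(y)
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
    return {c: [s / counts[c] for s in acc] for c, acc in sums.items()}


def predict(centroids, row):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    return min(
        centroids,
        key=lambda c: sum((a - b) ** 2 for a, b in zip(centroids[c], row)),
    )


def iterated_kfold_predictions(X, y, k=5, n_iter=20, seed=0):
    """Collect, for every sample, one out-of-fold prediction per iteration."""
    rng = random.Random(seed)
    n = len(X)
    preds = [[] for _ in range(n)]
    for _ in range(n_iter):
        idx = list(range(n))
        rng.shuffle(idx)                       # new random fold assignment
        folds = [idx[f::k] for f in range(k)]
        for test in folds:
            held_out = set(test)
            train = [i for i in idx if i not in held_out]
            model = fit_centroids([X[i] for i in train], [y[i] for i in train])
            for i in test:
                preds[i].append(predict(model, X[i]))
    return preds


def summarize(preds, y):
    """Return (mean single-model hit rate, stable fraction, aggregated hit rate)."""
    n_iter = len(preds[0])
    # Hit rate of the individual surrogate models, pooled over all iterations.
    single = sum(p == t for ps, t in zip(preds, y) for p in ps) / (len(y) * n_iter)
    # Fraction of samples on which every surrogate model gave the same answer.
    stable = sum(len(set(ps)) == 1 for ps in preds) / len(y)
    # Aggregation by majority vote over the out-of-fold predictions.
    voted = [Counter(ps).most_common(1)[0][0] for ps in preds]
    aggregated = sum(v == t for v, t in zip(voted, y)) / len(y)
    return single, stable, aggregated


if __name__ == "__main__":
    X, y = make_toy_data()
    single, stable, aggregated = summarize(iterated_kfold_predictions(X, y), y)
    print(f"single-model hit rate: {single:.2f}")
    print(f"samples with fully stable prediction: {stable:.2f}")
    print(f"aggregated (majority-vote) hit rate: {aggregated:.2f}")
```

The numbers printed depend on the synthetic data and the random seeds, and they are not expected to reproduce the 67% and 82% hit rates reported in the abstract. With grouped measurements, such as many spectra from one patient, folds would have to be split by the statistically independent unit rather than by row, which this sketch does not do.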

Year:  2008        PMID: 18228011     DOI: 10.1007/s00216-007-1818-6

Source DB:  PubMed          Journal:  Anal Bioanal Chem        ISSN: 1618-2642            Impact factor:   4.142


  7 in total

1.  Ensemble multivariate analysis to improve identification of articular cartilage disease in noisy Raman spectra.

Authors:  Wade Richardson; Dan Wilkinson; Ling Wu; Frank Petrigliano; Bruce Dunn; Denis Evseenko
Journal:  J Biophotonics       Date:  2014-09-26       Impact factor: 3.207

2.  Classification and prediction of HCC tissues by Raman imaging with identification of fatty acids as potential lipid biomarkers.

Authors:  T Tolstik; C Marquardt; C Beleites; C Matthäus; C Bielecki; M Bürger; C Krafft; O Dirsch; U Settmacher; J Popp; A Stallmach
Journal:  J Cancer Res Clin Oncol       Date:  2014-09-20       Impact factor: 4.553

3.  Emerging Themes in Image Informatics and Molecular Analysis for Digital Pathology. (Review)

Authors:  Rohit Bhargava; Anant Madabhushi
Journal:  Annu Rev Biomed Eng       Date:  2016-07-11       Impact factor: 9.590

4.  Individual differences in local functional brain connectivity affect TMS effects on behavior.

Authors:  Carsten Gießing; Mohsen Alavash; Christoph S Herrmann; Claus C Hilgetag; Christiane M Thiel
Journal:  Sci Rep       Date:  2020-06-26       Impact factor: 4.379

5.  A design of experiments approach for the rapid formulation of a chemically defined medium for metabolic profiling of industrially important microbes.

Authors:  Chloe Singleton; James Gilman; Jessica Rollit; Kun Zhang; David A Parker; John Love
Journal:  PLoS One       Date:  2019-06-12       Impact factor: 3.240

6.  Alignment-Free Method to Predict Enzyme Classes and Subclasses.

Authors:  Riccardo Concu; M Natália D S Cordeiro
Journal:  Int J Mol Sci       Date:  2019-10-29       Impact factor: 5.923

7.  Sniffing Bacteria with a Carbon-Dot Artificial Nose.

Authors:  Nitzan Shauloff; Ahiud Morag; Karin Yaniv; Seema Singh; Ravit Malishev; Ofra Paz-Tal; Lior Rokach; Raz Jelinek
Journal:  Nanomicro Lett       Date:  2021-04-20