
Probabilistic classifiers with high-dimensional data.

Kyung In Kim, Richard Simon.

Abstract

For medical classification problems, it is often desirable to have a probability associated with each class. Probabilistic classifiers have received relatively little attention for small-n, large-p classification problems despite their importance in medical decision making. In this paper, we introduce two criteria for the assessment of probabilistic classifiers, well-calibratedness and refinement, and develop corresponding evaluation measures. We evaluated several published high-dimensional probabilistic classifiers and developed two extensions of the Bayesian compound covariate classifier. Based on simulation studies and analysis of gene expression microarray data, we found that proper probabilistic classification is more difficult than deterministic classification. It is important to ensure that a probabilistic classifier is well calibrated, or at least not "anticonservative", using the methods developed here. We provide this evaluation for several probabilistic classifiers and also evaluate their refinement as a function of sample size under weak and strong signal conditions. We also present a cross-validation method for evaluating the calibration and refinement of any probabilistic classifier on any data set.
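As a rough illustration of the two criteria named in the abstract, the sketch below uses the classical decomposition of the Brier score into a calibration term and a refinement term. This is an assumption made for illustration only: the abstract does not state which evaluation measures the authors developed, and the function names `brier_score` and `brier_decomposition` are hypothetical. In practice the predicted probabilities would be out-of-fold predictions from cross-validation rather than predictions on the training samples.

```python
from collections import defaultdict


def brier_score(probs, labels):
    """Mean squared difference between predicted probability and 0/1 outcome."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(labels)


def brier_decomposition(probs, labels, ndigits=1):
    """Split the Brier score into (calibration, refinement).

    Illustrative sketch, not the measures from the paper. Predictions are
    rounded to `ndigits` decimals and grouped by the rounded value; the
    returned terms sum exactly to the Brier score of the rounded predictions.
    """
    groups = defaultdict(list)
    for p, y in zip(probs, labels):
        groups[round(p, ndigits)].append(y)

    n = len(labels)
    calibration = 0.0  # gap between stated probability and observed frequency
    refinement = 0.0   # outcome variance left within each prediction group
    for p, ys in groups.items():
        freq = sum(ys) / len(ys)
        calibration += len(ys) * (p - freq) ** 2
        refinement += len(ys) * freq * (1.0 - freq)
    return calibration / n, refinement / n


# Toy binary example: class-1 probabilities and true 0/1 labels.
probs = [0.2] * 5 + [0.8] * 5
labels = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]
calibration, refinement = brier_decomposition(probs, labels)
```

A calibration term near zero means the stated probabilities match the observed class frequencies; a smaller refinement term means the predictions sit closer to 0 or 1 while remaining consistent with the outcomes.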


Year:  2010        PMID: 21087946      PMCID: PMC3138069          DOI: 10.1093/biostatistics/kxq069

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


