Regression on imperfect class labels derived by unsupervised clustering.

Rasmus Froberg Brøndum, Thomas Yssing Michaelsen, Martin Bøgsted.   

Abstract

Regressing an outcome on class labels identified by unsupervised clustering is customary in many applications. However, it is common to ignore the misclassification of class labels caused by the learning algorithm, which can lead to serious bias in the estimated effect parameters. Because of their generality, we suggest addressing the problem with regression calibration or the misclassification simulation and extrapolation method. Performance is illustrated with data simulated from Gaussian mixture models, which document reduced bias and improved coverage of confidence intervals when adjusting for misclassification with either method. Finally, we apply our method to data from a previous study, which regressed overall survival on class labels derived from unsupervised clustering of gene expression data from bone marrow samples of multiple myeloma patients.
© The Author(s) 2020. Published by Oxford University Press.
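
The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' implementation: it assumes a two-component, equal-weight, unit-variance Gaussian mixture with known parameters, a simple one-dimensional 2-means clustering as a stand-in for the learning algorithm, and a linear outcome model. All function names and parameter values are hypothetical. It compares the naive slope from regressing the outcome on the estimated labels with a regression-calibration style estimate that replaces each label by the estimated probability of true class membership given that label. The simulation and extrapolation approach is not shown.

```python
import numpy as np

def simulate(n=20000, beta0=0.5, beta1=1.0, delta=2.0, seed=0):
    """Draw true classes z, a clustering feature x and an outcome y (assumed model)."""
    rng = np.random.default_rng(seed)
    z = rng.integers(0, 2, size=n)            # true (latent) class, P(z=1) = 0.5
    x = rng.normal(loc=delta * z, scale=1.0)  # feature used only for clustering
    y = beta0 + beta1 * z + rng.normal(scale=1.0, size=n)
    return x, y, z

def two_means_labels(x, n_iter=50):
    """Plain 1-D 2-means; label 1 is the cluster with the larger centre."""
    centers = np.array([x.min(), x.max()], dtype=float)
    for _ in range(n_iter):
        w = (np.abs(x - centers[1]) < np.abs(x - centers[0])).astype(int)
        centers = np.array([x[w == 0].mean(), x[w == 1].mean()])
    return w

def ols_slope(u, y):
    """Slope of the least-squares fit of y on an intercept and u."""
    design = np.column_stack([np.ones_like(u, dtype=float), u])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[1]

def posterior_class1(x, delta=2.0):
    """P(z=1 | x) under the assumed mixture (equal weights, unit variances)."""
    return 1.0 / (1.0 + np.exp(-(delta * x - delta**2 / 2.0)))

def calibrated_slope(w, tau, y):
    """Regress y on E[z | w], estimated by averaging posteriors within each label."""
    ez_given_w = np.array([tau[w == 0].mean(), tau[w == 1].mean()])
    return ols_slope(ez_given_w[w], y)

x, y, z = simulate()
w = two_means_labels(x)        # imperfect labels from clustering
tau = posterior_class1(x)      # assumes the mixture parameters are known
naive = ols_slope(w, y)        # ignores misclassification
corrected = calibrated_slope(w, tau, y)
print(f"true slope 1.0, naive {naive:.3f}, calibrated {corrected:.3f}")
```

Under these assumptions the naive slope is attenuated toward zero because some observations carry the wrong label, while the calibrated slope should lie close to the true value of 1.0. In practice the mixture parameters would have to be estimated, which adds uncertainty that this sketch ignores.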

Keywords:  Clustering; cancer; machine learning; statistics; survival analysis

Year: 2021; PMID: 32124917; PMCID: PMC7986660; DOI: 10.1093/bib/bbaa014

Source DB: PubMed; Journal: Brief Bioinform; ISSN: 1467-5463; Impact factor: 11.622
