Literature DB >> 19948743

Boosting with missing predictors.

C Y Wang1, Ziding Feng.   

Abstract

Boosting is an important tool in classification methodology. It combines the performance of many weak classifiers to produce a powerful committee, and its validity can be explained by additive modeling and maximum likelihood. The method has very general applications, especially for high-dimensional predictors. For example, it can be applied to distinguish cancer samples from healthy control samples by using antibody microarray data. Microarray data are often high-dimensional and many of them are incomplete. One natural idea is to impute a missing variable based on the observed predictors. However, the calculation of imputation for high-dimensional predictors with missing data may be rather tedious. In this paper, we propose 2 conditional mean imputation methods. They can be applied to the situation even when a complete-case subset does not exist. Simulation results indicate that the proposed methods are superior than other naive methods. We apply the methods to a pancreatic cancer study in which serum protein microarrays are used for classification.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19948743      PMCID: PMC2830576          DOI: 10.1093/biostatistics/kxp052

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  6 in total

1.  Partially supervised learning using an EM-boosting algorithm.

Authors:  Yutaka Yasui; Margaret Pepe; Li Hsu; Bao-Ling Adam; Ziding Feng
Journal:  Biometrics       Date:  2004-03       Impact factor: 2.571

2.  Survival ensembles.

Authors:  Torsten Hothorn; Peter Bühlmann; Sandrine Dudoit; Annette Molinaro; Mark J van der Laan
Journal:  Biostatistics       Date:  2005-12-12       Impact factor: 5.899

3.  Numerical equivalence of imputing scores and weighted estimators in regression analysis with missing covariates.

Authors:  C Y Wang; Shen-Ming Lee; Edward C Chao
Journal:  Biostatistics       Date:  2006-09-12       Impact factor: 5.899

4.  Antibody microarray profiling reveals individual and combined serum proteins associated with pancreatic cancer.

Authors:  Randal Orchekowski; Darren Hamelinck; Lin Li; Ewa Gliwa; Matt vanBrocklin; Jorge A Marrero; George F Vande Woude; Ziding Feng; Randall Brand; Brian B Haab
Journal:  Cancer Res       Date:  2005-12-01       Impact factor: 12.701

5.  Regression calibration in failure time regression.

Authors:  C Y Wang; L Hsu; Z D Feng; R L Prentice
Journal:  Biometrics       Date:  1997-03       Impact factor: 2.571

6.  A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection.

Authors:  Yutaka Yasui; Margaret Pepe; Mary Lou Thompson; Bao-Ling Adam; George L Wright; Yinsheng Qu; John D Potter; Marcy Winget; Mark Thornquist; Ziding Feng
Journal:  Biostatistics       Date:  2003-07       Impact factor: 5.899

  6 in total
  1 in total

1.  The Marker State Space (MSS) method for classifying clinical samples.

Authors:  Brian P Fallon; Bryan Curnutte; Kevin A Maupin; Katie Partyka; Sunguk Choi; Randall E Brand; Christopher J Langmead; Waibhav Tembe; Brian B Haab
Journal:  PLoS One       Date:  2013-06-04       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.