Literature DB >> 24770370

stepwiseCM: An R Package for Stepwise Classification of Cancer Samples Using Multiple Heterogeneous Data Sets.

Askar Obulkasim1, Mark A van de Wiel2.   

Abstract

This paper presents the R/Bioconductor package stepwiseCM, which classifies cancer samples using two heterogeneous data sets in an efficient way. The algorithm is able to capture the distinct classification power of two given data types without actually combining them. This package suits for classification problems where two different types of data sets on the same samples are available. One of these data types has measurements on all samples and the other one has measurements on some samples. One is easy to collect and/or relatively cheap (eg, clinical covariates) compared to the latter (high-dimensional data, eg, gene expression). One additional application for which stepwiseCM is proven to be useful as well is the combination of two high-dimensional data types, eg, DNA copy number and mRNA expression. The package includes functions to project the neighborhood information in one data space to the other to determine a potential group of samples that are likely to benefit most by measuring the second type of covariates. The two heterogeneous data spaces are connected by indirect mapping. The crucial difference between the stepwise classification strategy implemented in this package and the existing packages is that our approach aims to be cost-efficient by avoiding measuring additional covariates, which might be expensive or patient-unfriendly, for a potentially large subgroup of individuals. Moreover, in diagnosis for these individuals test, results would be quickly available, which may lead to reduced waiting times and hence lower the patients' distress. The improvement described remedies the key limitations of existing packages, and facilitates the use of the stepwiseCM package in diverse applications.

Entities:  

Keywords:  R package; classification; data integration; high-dimensional data

Year:  2014        PMID: 24770370      PMCID: PMC3885337          DOI: 10.4137/CIN.S13075

Source DB:  PubMed          Journal:  Cancer Inform        ISSN: 1176-9351


  11 in total

1.  Adjusting confounders in ranking biomarkers: a model-based ROC approach.

Authors:  Tao Yu; Jialiang Li; Shuangge Ma
Journal:  Brief Bioinform       Date:  2012-03-06       Impact factor: 11.622

2.  Random forest similarity for protein-protein interaction prediction from multiple sources.

Authors:  Yanjun Qi; Judith Klein-Seetharaman; Ziv Bar-Joseph
Journal:  Pac Symp Biocomput       Date:  2005

3.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

4.  Gene expression profiling: does it add predictive accuracy to clinical characteristics in cancer prognosis?

Authors:  Daniela Dunkler; Stefan Michiels; Michael Schemper
Journal:  Eur J Cancer       Date:  2007-01-25       Impact factor: 9.162

5.  Microarray-based classification and clinical predictors: on combined classifiers and additional predictive value.

Authors:  Anne-Laure Boulesteix; Christine Porzelius; Martin Daumer
Journal:  Bioinformatics       Date:  2008-06-09       Impact factor: 6.937

6.  Prediction of central nervous system embryonal tumour outcome based on gene expression.

Authors:  Scott L Pomeroy; Pablo Tamayo; Michelle Gaasenbeek; Lisa M Sturla; Michael Angelo; Margaret E McLaughlin; John Y H Kim; Liliana C Goumnerova; Peter M Black; Ching Lau; Jeffrey C Allen; David Zagzag; James M Olson; Tom Curran; Cynthia Wetmore; Jaclyn A Biegel; Tomaso Poggio; Shayan Mukherjee; Ryan Rifkin; Andrea Califano; Gustavo Stolovitzky; David N Louis; Jill P Mesirov; Eric S Lander; Todd R Golub
Journal:  Nature       Date:  2002-01-24       Impact factor: 49.962

7.  Confidence scores for prediction models.

Authors:  Thomas A Gerds; Mark A van de Wiel
Journal:  Biom J       Date:  2011-02-17       Impact factor: 2.207

8.  Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer.

Authors:  Anna V Ivshina; Joshy George; Oleg Senko; Benjamin Mow; Thomas C Putti; Johanna Smeds; Thomas Lindahl; Yudi Pawitan; Per Hall; Hans Nordgren; John E L Wong; Edison T Liu; Jonas Bergh; Vladimir A Kuznetsov; Lance D Miller
Journal:  Cancer Res       Date:  2006-11-01       Impact factor: 12.701

9.  Integrative mixture of experts to combine clinical factors and gene markers.

Authors:  Kim-Anh Lê Cao; Emmanuelle Meugnier; Geoffrey J McLachlan
Journal:  Bioinformatics       Date:  2010-03-11       Impact factor: 6.937

10.  Stepwise classification of cancer samples using clinical and molecular data.

Authors:  Askar Obulkasim; Gerrit A Meijer; Mark A van de Wiel
Journal:  BMC Bioinformatics       Date:  2011-10-28       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.