Literature DB >> 20719761

Module-based prediction approach for robust inter-study predictions in microarray data.

Zhibao Mi1, Kui Shen, Nan Song, Chunrong Cheng, Chi Song, Naftali Kaminski, George C Tseng.   

Abstract

MOTIVATION: Traditional genomic prediction models based on individual genes suffer from low reproducibility across microarray studies due to the lack of robustness to expression measurement noise and gene missingness when they are matched across platforms. It is common that some of the genes in the prediction model established in a training study cannot be matched to another test study because a different platform is applied. The failure of inter-study predictions has severely hindered the clinical applications of microarray. To overcome the drawbacks of traditional gene-based prediction (GBP) models, we propose a module-based prediction (MBP) strategy via unsupervised gene clustering.
RESULTS: K-means clustering is used to group genes sharing similar expression profiles into gene modules, and small modules are merged into their nearest neighbors. Conventional univariate or multivariate feature selection procedure is applied and a representative gene from each selected module is identified to construct the final prediction model. As a result, the prediction model is portable to any test study as long as partial genes in each module exist in the test study. We demonstrate that K-means cluster sizes generally follow a multinomial distribution and the failure probability of inter-study prediction due to missing genes is diminished by merging small clusters into their nearest neighbors. By simulation and applications of real datasets in inter-study predictions, we show that the proposed MBP provides slightly improved accuracy while is considerably more robust than traditional GBP. AVAILABILITY: http://www.biostat.pitt.edu/bioinfo/ CONTACT: ctseng@pitt.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2010        PMID: 20719761      PMCID: PMC2951088          DOI: 10.1093/bioinformatics/btq472

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  44 in total

1.  Averaged gene expressions for regression.

Authors:  Mee Young Park; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2006-05-11       Impact factor: 5.899

2.  A rapid method for microarray cross platform comparisons using gene expression signatures.

Authors:  Chris Cheadle; Kevin G Becker; Yoon S Cho-Chung; Maria Nesterova; Tonya Watkins; William Wood; Vinayakumar Prabhu; Kathleen C Barnes
Journal:  Mol Cell Probes       Date:  2006-08-10       Impact factor: 2.365

3.  Penalized and weighted K-means for clustering with scattered objects and prior information in high-throughput biological data.

Authors:  George C Tseng
Journal:  Bioinformatics       Date:  2007-06-27       Impact factor: 6.937

4.  How large a training set is needed to develop a classifier for microarray data?

Authors:  Kevin K Dobbin; Yingdong Zhao; Richard M Simon
Journal:  Clin Cancer Res       Date:  2008-01-01       Impact factor: 12.531

5.  Ratio adjustment and calibration scheme for gene-wise normalization to enhance microarray inter-study prediction.

Authors:  Chunrong Cheng; Kui Shen; Chi Song; Jianhua Luo; George C Tseng
Journal:  Bioinformatics       Date:  2009-05-04       Impact factor: 6.937

6.  Revealing targeted therapy for human cancer by gene module maps.

Authors:  David J Wong; Dimitry S A Nuyten; Aviv Regev; Meihong Lin; Adam S Adler; Eran Segal; Marc J van de Vijver; Howard Y Chang
Journal:  Cancer Res       Date:  2008-01-15       Impact factor: 12.701

7.  Cross platform microarray analysis for robust identification of differentially expressed genes.

Authors:  Roberta Bosotti; Giuseppe Locatelli; Sandra Healy; Emanuela Scacheri; Luca Sartori; Ciro Mercurio; Raffaele Calogero; Antonella Isacchi
Journal:  BMC Bioinformatics       Date:  2007-03-08       Impact factor: 3.169

8.  Promises and caveats of in silico biomarker discovery.

Authors:  L Pusztai; B Leyland-Jones
Journal:  Br J Cancer       Date:  2008-08-05       Impact factor: 7.640

9.  Module-based outcome prediction using breast cancer compendia.

Authors:  Martin H van Vliet; Christiaan N Klijn; Lodewyk F A Wessels; Marcel J T Reinders
Journal:  PLoS One       Date:  2007-10-17       Impact factor: 3.240

10.  Cross-species and cross-platform gene expression studies with the Bioconductor-compliant R package 'annotationTools'.

Authors:  Alexandre Kuhn; Ruth Luthi-Carter; Mauro Delorenzi
Journal:  BMC Bioinformatics       Date:  2008-01-17       Impact factor: 3.169

View more
  4 in total

1.  MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis.

Authors:  SungHwan Kim; Chien-Wei Lin; George C Tseng
Journal:  Bioinformatics       Date:  2016-03-02       Impact factor: 6.937

Review 2.  Comprehensive literature review and statistical considerations for microarray meta-analysis.

Authors:  George C Tseng; Debashis Ghosh; Eleanor Feingold
Journal:  Nucleic Acids Res       Date:  2012-01-19       Impact factor: 16.971

3.  High accordance in prognosis prediction of colorectal cancer across independent datasets by multi-gene module expression profiles.

Authors:  Wenting Li; Rui Wang; Zhangming Yan; Linfu Bai; Zhirong Sun
Journal:  PLoS One       Date:  2012-03-16       Impact factor: 3.240

4.  Prediction of breast cancer metastasis by gene expression profiles: a comparison of metagenes and single genes.

Authors:  Mark Burton; Mads Thomassen; Qihua Tan; Torben A Kruse
Journal:  Cancer Inform       Date:  2012-12-10
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.