Literature DB >> 18312212

Comparing the characteristics of gene expression profiles derived by univariate and multivariate classification methods.

Manuela Zucknick1, Sylvia Richardson, Euan A Stronach.   

Abstract

One application of gene expression arrays is to derive molecular profiles, i.e., sets of genes, which discriminate well between two classes of samples, for example between tumour types. Users are confronted with a multitude of classification methods of varying complexity that can be applied to this task. To help decide which method to use in a given situation, we compare important characteristics of a range of classification methods, including simple univariate filtering, penalised likelihood methods and the random forest. Classification accuracy is an important characteristic, but the biological interpretability of molecular profiles is also important. This implies both parsimony and stability, in the sense that profiles should not vary much when there are slight changes in the training data. We perform a random resampling study to compare these characteristics between the methods and across a range of profile sizes. We measure stability by adopting the Jaccard index to assess the similarity of resampled molecular profiles. We carry out a case study on five well-established cancer microarray data sets, for two of which we have the benefit of being able to validate the results in an independent data set. The study shows that those methods which produce parsimonious profiles generally result in better prediction accuracy than methods which don't include variable selection. For very small profile sizes, the sparse penalised likelihood methods tend to result in more stable profiles than univariate filtering while maintaining similar predictive performance.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18312212      PMCID: PMC2496885          DOI: 10.2202/1544-6115.1307

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  21 in total

Review 1.  Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification.

Authors:  Richard Simon; Michael D Radmacher; Kevin Dobbin; Lisa M McShane
Journal:  J Natl Cancer Inst       Date:  2003-01-01       Impact factor: 13.506

2.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

Authors:  Rafael A Irizarry; Bridget Hobbs; Francois Collin; Yasmin D Beazer-Barclay; Kristen J Antonellis; Uwe Scherf; Terence P Speed
Journal:  Biostatistics       Date:  2003-04       Impact factor: 5.899

3.  Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions.

Authors:  R L Somorjai; B Dolenko; R Baumgartner
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  Outcome signature genes in breast cancer: is there a unique set?

Authors:  Liat Ein-Dor; Itai Kela; Gad Getz; David Givol; Eytan Domany
Journal:  Bioinformatics       Date:  2004-08-12       Impact factor: 6.937

5.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

6.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.

Authors:  T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

7.  Prognostically useful gene-expression profiles in acute myeloid leukemia.

Authors:  Peter J M Valk; Roel G W Verhaak; M Antoinette Beijen; Claudia A J Erpelinck; Sahar Barjesteh van Waalwijk van Doorn-Khosrovani; Judith M Boer; H Berna Beverloo; Michael J Moorhouse; Peter J van der Spek; Bob Löwenberg; Ruud Delwel
Journal:  N Engl J Med       Date:  2004-04-15       Impact factor: 91.245

8.  A gene-expression signature as a predictor of survival in breast cancer.

Authors:  Marc J van de Vijver; Yudong D He; Laura J van't Veer; Hongyue Dai; Augustinus A M Hart; Dorien W Voskuil; George J Schreiber; Johannes L Peterse; Chris Roberts; Matthew J Marton; Mark Parrish; Douwe Atsma; Anke Witteveen; Annuska Glas; Leonie Delahaye; Tony van der Velde; Harry Bartelink; Sjoerd Rodenhuis; Emiel T Rutgers; Stephen H Friend; René Bernards
Journal:  N Engl J Med       Date:  2002-12-19       Impact factor: 91.245

9.  Gene expression in ovarian cancer reflects both morphology and biological behavior, distinguishing clear cell from other poor-prognosis ovarian carcinomas.

Authors:  Donald R Schwartz; Sharon L R Kardia; Kerby A Shedden; Rork Kuick; George Michailidis; Jeremy M G Taylor; David E Misek; Rong Wu; Yali Zhai; Danielle M Darrah; Heather Reed; Lora H Ellenson; Thomas J Giordano; Eric R Fearon; Samir M Hanash; Kathleen R Cho
Journal:  Cancer Res       Date:  2002-08-15       Impact factor: 12.701

10.  Selection of potential markers for epithelial ovarian cancer with gene expression arrays and recursive descent partition analysis.

Authors:  Karen H Lu; Andrea P Patterson; Lin Wang; Rebecca T Marquez; Edward N Atkinson; Keith A Baggerly; Lance R Ramoth; Daniel G Rosen; Jinsong Liu; Ingegerd Hellstrom; David Smith; Lynn Hartmann; David Fishman; Andrew Berchuck; Rosemarie Schmandt; Regina Whitaker; David M Gershenson; Gordon B Mills; Robert C Bast
Journal:  Clin Cancer Res       Date:  2004-05-15       Impact factor: 12.531

View more
  13 in total

1.  Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.

Authors:  Michelle Carlsen; Guifang Fu; Shaun Bushman; Christopher Corcoran
Journal:  Genetics       Date:  2015-12-12       Impact factor: 4.562

2.  Combined Plasma and Cerebrospinal Fluid Signature for the Prediction of Midterm Progression From Mild Cognitive Impairment to Alzheimer Disease.

Authors:  Benoit Lehallier; Laurent Essioux; Javier Gayan; Roxana Alexandridis; Tania Nikolcheva; Tony Wyss-Coray; Markus Britschgi
Journal:  JAMA Neurol       Date:  2015-12-14       Impact factor: 18.302

3.  Effect of training-sample size and classification difficulty on the accuracy of genomic predictors.

Authors:  Vlad Popovici; Weijie Chen; Brandon G Gallas; Christos Hatzis; Weiwei Shi; Frank W Samuelson; Yuri Nikolsky; Marina Tsyganova; Alex Ishkin; Tatiana Nikolskaya; Kenneth R Hess; Vicente Valero; Daniel Booser; Mauro Delorenzi; Gabriel N Hortobagyi; Leming Shi; W Fraser Symmans; Lajos Pusztai
Journal:  Breast Cancer Res       Date:  2010-01-11       Impact factor: 6.466

4.  Challenges in Biomarker Discovery: Combining Expert Insights with Statistical Analysis of Complex Omics Data.

Authors:  Jason E McDermott; Jing Wang; Hugh Mitchell; Bobbie-Jo Webb-Robertson; Ryan Hafen; John Ramey; Karin D Rodland
Journal:  Expert Opin Med Diagn       Date:  2013-01

5.  Systems medicine: the future of medical genomics and healthcare.

Authors:  Charles Auffray; Zhu Chen; Leroy Hood
Journal:  Genome Med       Date:  2009-01-20       Impact factor: 11.117

6.  Balancing the robustness and predictive performance of biomarkers.

Authors:  Paul Kirk; Aviva Witkover; Charles R M Bangham; Sylvia Richardson; Alexandra M Lewin; Michael P H Stumpf
Journal:  J Comput Biol       Date:  2013-08-02       Impact factor: 1.479

7.  Significance testing in ridge regression for genetic data.

Authors:  Erika Cule; Paolo Vineis; Maria De Iorio
Journal:  BMC Bioinformatics       Date:  2011-09-19       Impact factor: 3.169

8.  Effect of size and heterogeneity of samples on biomarker discovery: synthetic and real data assessment.

Authors:  Barbara Di Camillo; Tiziana Sanavia; Matteo Martini; Giuseppe Jurman; Francesco Sambo; Annalisa Barla; Margherita Squillario; Cesare Furlanello; Gianna Toffolo; Claudio Cobelli
Journal:  PLoS One       Date:  2012-03-05       Impact factor: 3.240

9.  A null model for Pearson coexpression networks.

Authors:  Andrea Gobbi; Giuseppe Jurman
Journal:  PLoS One       Date:  2015-06-01       Impact factor: 3.240

10.  Gene expression profiles for predicting metastasis in breast cancer: a cross-study comparison of classification methods.

Authors:  Mark Burton; Mads Thomassen; Qihua Tan; Torben A Kruse
Journal:  ScientificWorldJournal       Date:  2012-11-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.