Literature DB >> 20339085

Exploring the within- and between-class correlation distributions for tumor classification.

Xuelian Wei1, Ker-Chau Li.   

Abstract

To many biomedical researchers, effective tumor classification methods such as the support vector machine often appear like a black box not only because the procedures are complex but also because the required specifications, such as the choice of a kernel function, suffer from a clear guidance either mathematically or biologically. As commonly observed, samples within the same tumor class tend to be more similar in gene expression than samples from different tumor classes. But can this well-received observation lead to a useful procedure of classification and prediction? To address this issue, we first conceived a statistical framework and derived general conditions to serve as the theoretical foundation that supported the aforementioned empirical observation. Then we constructed a classification procedure that fully utilized the information obtained by comparing the distributions of within-class correlations with between-class correlations via Kullback-Leibler divergence. We compared our approach with many machine-learning techniques by applying to 22 binary- and multiclass gene-expression datasets involving human cancers. The results showed that our method performed as efficiently as support vector machine and Naïve Bayesian and outperformed other learning methods (decision trees, linear discriminate analysis, and k-nearest neighbor). In addition, we conducted a simulation study and showed that our method would be more effective if the arriving new samples are subject to the often-encountered baseline shift or increased noise level problems. Our method can be extended for general classification problems when only the similarity scores between samples are available.

Entities:  

Mesh:

Year:  2010        PMID: 20339085      PMCID: PMC2872377          DOI: 10.1073/pnas.0910140107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  23 in total

1.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

2.  Prediction of central nervous system embryonal tumour outcome based on gene expression.

Authors:  Scott L Pomeroy; Pablo Tamayo; Michelle Gaasenbeek; Lisa M Sturla; Michael Angelo; Margaret E McLaughlin; John Y H Kim; Liliana C Goumnerova; Peter M Black; Ching Lau; Jeffrey C Allen; David Zagzag; James M Olson; Tom Curran; Cynthia Wetmore; Jaclyn A Biegel; Tomaso Poggio; Shayan Mukherjee; Ryan Rifkin; Andrea Califano; Gustavo Stolovitzky; David N Louis; Jill P Mesirov; Eric S Lander; Todd R Golub
Journal:  Nature       Date:  2002-01-24       Impact factor: 49.962

3.  Predisposing gene for early-onset prostate cancer, localized on chromosome 1q42.2-43.

Authors:  P Berthon; A Valeri; A Cohen-Akenine; E Drelon; T Paiss; G Wöhr; A Latil; P Millasseau; I Mellah; N Cohen; H Blanché; C Bellané-Chantelot; F Demenais; P Teillac; A Le Duc; R de Petriconi; R Hautmann; I Chumakov; L Bachner; N J Maitland; R Lidereau; W Vogel; G Fournier; P Mangin; O Cussenot
Journal:  Am J Hum Genet       Date:  1998-06       Impact factor: 11.025

4.  Molecular profiling of non-small cell lung cancer and correlation with disease-free survival.

Authors:  Dennis A Wigle; Igor Jurisica; Niki Radulovich; Melania Pintilie; Janet Rossant; Ni Liu; Chao Lu; James Woodgett; Isolde Seiden; Michael Johnston; Shaf Keshavjee; Gail Darling; Timothy Winton; Bobby-Joe Breitkreutz; Paul Jorgenson; Mike Tyers; Frances A Shepherd; Ming Sound Tsao
Journal:  Cancer Res       Date:  2002-06-01       Impact factor: 12.701

5.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

6.  Diversity of gene expression in adenocarcinoma of the lung.

Authors:  M E Garber; O G Troyanskaya; K Schluens; S Petersen; Z Thaesler; M Pacyna-Gengelbach; M van de Rijn; G D Rosen; C M Perou; R I Whyte; R B Altman; P O Brown; D Botstein; I Petersen
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

7.  KAI1, a metastasis suppressor gene for prostate cancer on human chromosome 11p11.2.

Authors:  J T Dong; P W Lamb; C W Rinker-Schaeffer; J Vukanovic; T Ichikawa; J T Isaacs; J C Barrett
Journal:  Science       Date:  1995-05-12       Impact factor: 47.728

8.  In silico dissection of cell-type-associated patterns of gene expression in prostate cancer.

Authors:  Robert O Stuart; William Wachsman; Charles C Berry; Jessica Wang-Rodriguez; Linda Wasserman; Igor Klacansky; Dan Masys; Karen Arden; Steven Goodison; Michael McClelland; Yipeng Wang; Anne Sawyers; Iveta Kalcheva; David Tarin; Dan Mercola
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-13       Impact factor: 11.205

9.  Gene expression profiling identifies clinically relevant subtypes of prostate cancer.

Authors:  Jacques Lapointe; Chunde Li; John P Higgins; Matt van de Rijn; Eric Bair; Kelli Montgomery; Michelle Ferrari; Lars Egevad; Walter Rayford; Ulf Bergerheim; Peter Ekman; Angelo M DeMarzo; Robert Tibshirani; David Botstein; Patrick O Brown; James D Brooks; Jonathan R Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-07       Impact factor: 11.205

10.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.

Authors:  Eng-Juh Yeoh; Mary E Ross; Sheila A Shurtleff; W Kent Williams; Divyen Patel; Rami Mahfouz; Fred G Behm; Susana C Raimondi; Mary V Relling; Anami Patel; Cheng Cheng; Dario Campana; Dawn Wilkins; Xiaodong Zhou; Jinyan Li; Huiqing Liu; Ching-Hon Pui; William E Evans; Clayton Naeve; Limsoon Wong; James R Downing
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

View more
  6 in total

1.  Knowledge discovery by accuracy maximization.

Authors:  Stefano Cacciatore; Claudio Luchinat; Leonardo Tenori
Journal:  Proc Natl Acad Sci U S A       Date:  2014-03-24       Impact factor: 11.205

2.  TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection.

Authors:  Haiyan Wang; Hongyan Zhang; Zhijun Dai; Ming-shun Chen; Zheming Yuan
Journal:  BMC Med Genomics       Date:  2013-01-23       Impact factor: 3.063

3.  Metagenomic biomarker discovery and explanation.

Authors:  Nicola Segata; Jacques Izard; Levi Waldron; Dirk Gevers; Larisa Miropolsky; Wendy S Garrett; Curtis Huttenhower
Journal:  Genome Biol       Date:  2011-06-24       Impact factor: 13.583

4.  Investigating microRNA-target interaction-supported tissues in human cancer tissues based on miRNA and target gene expression profiling.

Authors:  Wan J Hsieh; Feng-Mao Lin; Hsien-Da Huang; Hsiuying Wang
Journal:  PLoS One       Date:  2014-04-22       Impact factor: 3.240

5.  100% classification accuracy considered harmful: the normalized information transfer factor explains the accuracy paradox.

Authors:  Francisco J Valverde-Albacete; Carmen Peláez-Moreno
Journal:  PLoS One       Date:  2014-01-10       Impact factor: 3.240

6.  Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients.

Authors:  Johannes Smolander; Alexey Stupnikov; Galina Glazko; Matthias Dehmer; Frank Emmert-Streib
Journal:  BMC Cancer       Date:  2019-12-03       Impact factor: 4.430

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.