Literature DB >> 19580522

Simultaneous class discovery and classification of microarray data using spectral analysis.

Peng Qiu1, Sylvia K Plevritis.   

Abstract

Classification methods are commonly divided into two categories: unsupervised and supervised. Unsupervised methods have the ability to discover new classes by grouping data into clusters or tree structures without using the class labels, but they carry the risk of producing noninterpretable results. On the other hand, supervised methods always find decision rules that discriminate samples with different class labels. However, the class label information plays such an important role that it confines supervised methods by defining the possible classes. Consequently, supervised methods do not have the ability to discover new classes. To overcome the limitations of unsupervised and supervised methods, we propose a new method, which utilizes the class labels to a less important role so as to perform class discovery and classification simultaneously. The proposed method is called SPACC (SPectral Analysis for Class discovery and Classification). In SPACC, the training samples are nodes of an undirected weighted network. Using spectral analysis, SPACC iteratively partitions the network into a top-down binary tree. Each partitioning step is unsupervised, and the class labels are only used to define the stopping criterion. When the partitioning ends, the training samples have been divided into several subsets, each corresponding to one class label. Because multiple subsets can correspond to the same class label, SPACC may identify biologically meaningful subclasses, and minimize the impact of outliers and mislabeled data. We demonstrate the effectiveness of SPACC for class discovery and classification on microarray data of lymphomas and leukemias. SPACC software is available at http://icbp.stanford.edu/software/SPACC/.

Entities:  

Mesh:

Year:  2009        PMID: 19580522      PMCID: PMC3148134          DOI: 10.1089/cmb.2008.0227

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  13 in total

1.  Support vector machine classification and validation of cancer tissue samples using microarray expression data.

Authors:  T S Furey; N Cristianini; N Duffy; D W Bednarski; M Schummer; D Haussler
Journal:  Bioinformatics       Date:  2000-10       Impact factor: 6.937

2.  Systematic determination of genetic network architecture.

Authors:  S Tavazoie; J D Hughes; M J Campbell; R J Cho; G M Church
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

Review 3.  Genomics, gene expression and DNA arrays.

Authors:  D J Lockhart; E A Winzeler
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

4.  Classification of a large microarray data set: algorithm comparison and analysis of drug signatures.

Authors:  Georges Natsoulis; Laurent El Ghaoui; Gert R G Lanckriet; Alexander M Tolley; Fabrice Leroy; Shane Dunlea; Barrett P Eynon; Cecelia I Pearson; Stuart Tugendreich; Kurt Jarnagin
Journal:  Genome Res       Date:  2005-05       Impact factor: 9.043

5.  Improved prediction of treatment response using microarrays and existing biological knowledge.

Authors:  Simon M Lin; Jyothi Devakumar; Warren A Kibbe
Journal:  Pharmacogenomics       Date:  2006-04       Impact factor: 2.533

6.  A general framework for weighted gene co-expression network analysis.

Authors:  Bin Zhang; Steve Horvath
Journal:  Stat Appl Genet Mol Biol       Date:  2005-08-12

7.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.

Authors:  A A Alizadeh; M B Eisen; R E Davis; C Ma; I S Lossos; A Rosenwald; J C Boldrick; H Sabet; T Tran; X Yu; J I Powell; L Yang; G E Marti; T Moore; J Hudson; L Lu; D B Lewis; R Tibshirani; G Sherlock; W C Chan; T C Greiner; D D Weisenburger; J O Armitage; R Warnke; R Levy; W Wilson; M R Grever; J C Byrd; D Botstein; P O Brown; L M Staudt
Journal:  Nature       Date:  2000-02-03       Impact factor: 49.962

8.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  Portraits of breast cancer progression.

Authors:  Gul S Dalgin; Gabriela Alexe; Daniel Scanfeld; Pablo Tamayo; Jill P Mesirov; Shridar Ganesan; Charles DeLisi; Gyan Bhanot
Journal:  BMC Bioinformatics       Date:  2007-08-06       Impact factor: 3.169

10.  Neural network analysis of lymphoma microarray data: prognosis and diagnosis near-perfect.

Authors:  Michael C O'Neill; Li Song
Journal:  BMC Bioinformatics       Date:  2003-04-10       Impact factor: 3.169

View more
  4 in total

Review 1.  Systems analysis of high-throughput data.

Authors:  Rosemary Braun
Journal:  Adv Exp Med Biol       Date:  2014       Impact factor: 2.622

2.  Discovering biological progression underlying microarray samples.

Authors:  Peng Qiu; Andrew J Gentles; Sylvia K Plevritis
Journal:  PLoS Comput Biol       Date:  2011-04-14       Impact factor: 4.475

3.  Partition decoupling for multi-gene analysis of gene expression profiling data.

Authors:  Rosemary Braun; Gregory Leibon; Scott Pauls; Daniel Rockmore
Journal:  BMC Bioinformatics       Date:  2011-12-30       Impact factor: 3.169

4.  New bandwidth selection criterion for Kernel PCA: approach to dimensionality reduction and classification problems.

Authors:  Minta Thomas; Kris De Brabanter; Bart De Moor
Journal:  BMC Bioinformatics       Date:  2014-05-10       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.