Literature DB >> 15790388

An entropy-based gene selection method for cancer classification using microarray data.

Xiaoxing Liu1, Arun Krishnan, Adrian Mondry.   

Abstract

BACKGROUND: Accurate diagnosis of cancer subtypes remains a challenging problem. Building classifiers based on gene expression data is a promising approach; yet the selection of non-redundant but relevant genes is difficult. The selected gene set should be small enough to allow diagnosis even in regular clinical laboratories and ideally identify genes involved in cancer-specific regulatory pathways. Here an entropy-based method is proposed that selects genes related to the different cancer classes while at the same time reducing the redundancy among the genes.
RESULTS: The present study identifies a subset of features by maximizing the relevance and minimizing the redundancy of the selected genes. A merit called normalized mutual information is employed to measure the relevance and the redundancy of the genes. In order to find a more representative subset of features, an iterative procedure is adopted that incorporates an initial clustering followed by data partitioning and the application of the algorithm to each of the partitions. A leave-one-out approach then selects the most commonly selected genes across all the different runs and the gene selection algorithm is applied again to pare down the list of selected genes until a minimal subset is obtained that gives a satisfactory accuracy of classification. The algorithm was applied to three different data sets and the results obtained were compared to work done by others using the same data sets.
CONCLUSION: This study presents an entropy-based iterative algorithm for selecting genes from microarray data that are able to classify various cancer sub-types with high accuracy. In addition, the feature set obtained is very compact, that is, the redundancy between genes is reduced to a large extent. This implies that classifiers can be built with a smaller subset of genes.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15790388      PMCID: PMC1087831          DOI: 10.1186/1471-2105-6-76

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Support vector machine classification and validation of cancer tissue samples using microarray expression data.

Authors:  T S Furey; N Cristianini; N Duffy; D W Bednarski; M Schummer; D Haussler
Journal:  Bioinformatics       Date:  2000-10       Impact factor: 6.937

2.  Systematic determination of genetic network architecture.

Authors:  S Tavazoie; J D Hughes; M J Campbell; R J Cho; G M Church
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

3.  Using mutual information for selecting features in supervised neural net learning.

Authors:  R Battiti
Journal:  IEEE Trans Neural Netw       Date:  1994

4.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

5.  Molecular classification of cutaneous malignant melanoma by gene expression profiling.

Authors:  M Bittner; P Meltzer; Y Chen; Y Jiang; E Seftor; M Hendrix; M Radmacher; R Simon; Z Yakhini; A Ben-Dor; N Sampas; E Dougherty; E Wang; F Marincola; C Gooden; J Lueders; A Glatfelter; P Pollock; J Carpten; E Gillanders; D Leja; K Dietrich; C Beaudry; M Berens; D Alberts; V Sondak
Journal:  Nature       Date:  2000-08-03       Impact factor: 49.962

6.  Predicting the clinical status of human breast cancer by using gene expression profiles.

Authors:  M West; C Blanchette; H Dressman; E Huang; S Ishida; R Spang; H Zuzan; J A Olson; J R Marks; J R Nevins
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-18       Impact factor: 11.205

7.  An information-intensive approach to the molecular pharmacology of cancer.

Authors:  J N Weinstein; T G Myers; P M O'Connor; S H Friend; A J Fornace; K W Kohn; T Fojo; S E Bates; L V Rubinstein; N L Anderson; J K Buolamwini; W W van Osdol; A P Monks; D A Scudiero; E A Sausville; D W Zaharevitz; B Bunow; V N Viswanadhan; G S Johnson; R E Wittes; K D Paull
Journal:  Science       Date:  1997-01-17       Impact factor: 47.728

8.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

9.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

Review 10.  Insulin-like growth factor (IGF)-I, IGF binding protein-3, and cancer risk: systematic review and meta-regression analysis.

Authors:  Andrew G Renehan; Marcel Zwahlen; Christoph Minder; Sarah T O'Dwyer; Stephen M Shalet; Matthias Egger
Journal:  Lancet       Date:  2004-04-24       Impact factor: 79.321

View more
  20 in total

1.  A hybrid BPSO-CGA approach for gene selection and classification of microarray data.

Authors:  Li-Yeh Chuang; Cheng-Huei Yang; Jung-Chike Li; Cheng-Hong Yang
Journal:  J Comput Biol       Date:  2011-01-06       Impact factor: 1.479

2.  The Role of microRNA Expression in Cortical Development During Conversion to Psychosis.

Authors:  Amanda B Zheutlin; Clark D Jeffries; Diana O Perkins; Yoonho Chung; Adam M Chekroud; Jean Addington; Carrie E Bearden; Kristin S Cadenhead; Barbara A Cornblatt; Daniel H Mathalon; Thomas H McGlashan; Larry J Seidman; Elaine F Walker; Scott W Woods; Ming Tsuang; Tyrone D Cannon
Journal:  Neuropsychopharmacology       Date:  2017-02-10       Impact factor: 7.853

3.  Severity of thought disorder predicts psychosis in persons at clinical high-risk.

Authors:  Diana O Perkins; Clark D Jeffries; Barbara A Cornblatt; Scott W Woods; Jean Addington; Carrie E Bearden; Kristin S Cadenhead; Tyrone D Cannon; Robert Heinssen; Daniel H Mathalon; Larry J Seidman; Ming T Tsuang; Elaine F Walker; Thomas H McGlashan
Journal:  Schizophr Res       Date:  2015-10-04       Impact factor: 4.939

4.  Large-scale analysis of Arabidopsis transcription reveals a basal co-regulation network.

Authors:  Osnat Atias; Benny Chor; Daniel A Chamovitz
Journal:  BMC Syst Biol       Date:  2009-09-03

5.  Identification of single- and multiple-class specific signature genes from gene expression profiles by group marker index.

Authors:  Yu-Shuen Tsai; Kripamoy Aguan; Nikhil R Pal; I-Fang Chung
Journal:  PLoS One       Date:  2011-09-01       Impact factor: 3.240

6.  A Population Proportion approach for ranking differentially expressed genes.

Authors:  Mugdha Gadgil
Journal:  BMC Bioinformatics       Date:  2008-09-18       Impact factor: 3.169

7.  Gene selection algorithms for microarray data based on least squares support vector machine.

Authors:  E Ke Tang; P N Suganthan; Xin Yao
Journal:  BMC Bioinformatics       Date:  2006-02-27       Impact factor: 3.169

8.  Data perturbation independent diagnosis and validation of breast cancer subtypes using clustering and patterns.

Authors:  G Alexe; G S Dalgin; R Ramaswamy; C Delisi; G Bhanot
Journal:  Cancer Inform       Date:  2007-02-19

9.  A new regularized least squares support vector regression for gene selection.

Authors:  Pei-Chun Chen; Su-Yun Huang; Wei J Chen; Chuhsing K Hsiao
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

10.  Constructing disease-specific gene networks using pair-wise relevance metric: application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements.

Authors:  Wei Jiang; Xia Li; Shaoqi Rao; Lihong Wang; Lei Du; Chuanxing Li; Chao Wu; Hongzhi Wang; Yadong Wang; Baofeng Yang
Journal:  BMC Syst Biol       Date:  2008-08-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.