Literature DB >> 15852500

Minimum redundancy feature selection from microarray gene expression data.

Chris Ding1, Hanchuan Peng.   

Abstract

How to selecting a small subset out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. We propose a minimum redundancy - maximum relevance (MRMR) feature selection framework. Genes selected via MRMR provide a more balanced coverage of the space and capture broader characteristics of phenotypes. They lead to significantly improved class predictions in extensive experiments on 6 gene expression data sets: NCI, Lymphoma, Lung, Child Leukemia, Leukemia, and Colon. Improvements are observed consistently among 4 classification methods: Naive Bayes, Linear discriminant analysis, Logistic regression, and Support vector machines. SUPPLIMENTARY: The top 60 MRMR genes for each of the datasets are listed in http://crd.lbl.gov/~cding/MRMR/. More information related to MRMR methods can be found at http://www.hpeng.net/.

Entities:  

Mesh:

Year:  2005        PMID: 15852500     DOI: 10.1142/s0219720005001004

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  325 in total

1.  Network-based Prediction of Cancer under Genetic Storm.

Authors:  Ahmet Ay; Dihong Gong; Tamer Kahveci
Journal:  Cancer Inform       Date:  2014-10-15

2.  Information theory filters for wavelet packet coefficient selection with application to corrosion type identification from acoustic emission signals.

Authors:  Gert Van Dijck; Marc M Van Hulle
Journal:  Sensors (Basel)       Date:  2011-05-27       Impact factor: 3.576

3.  System approaches reveal the molecular networks involved in neural stem cell differentiation.

Authors:  Kai Wang; Haifeng Wang; Jiao Wang; Yuqiong Xie; Jun Chen; Huang Yan; Zengrong Liu; Tieqiao Wen
Journal:  Protein Cell       Date:  2012-04-10       Impact factor: 14.870

4.  Translational bioinformatics and healthcare informatics: computational and ethical challenges.

Authors:  Prerna Sethi; Kimberly Theodos
Journal:  Perspect Health Inf Manag       Date:  2009-09-16

5.  Information-theoretic inference of large transcriptional regulatory networks.

Authors:  Patrick E Meyer; Kevin Kontos; Frederic Lafitte; Gianluca Bontempi
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

6.  Diagnostic value of blood gene expression signatures in active tuberculosis in Thais: a pilot study.

Authors:  N Satproedprai; N Wichukchinda; S Suphankong; W Inunchot; T Kuntima; S Kumpeerasart; S Wattanapokayakit; S Nedsuwan; H Yanai; K Higuchi; N Harada; S Mahasirimongkol
Journal:  Genes Immun       Date:  2015-03-12       Impact factor: 2.676

7.  minet: A R/Bioconductor package for inferring large transcriptional networks using mutual information.

Authors:  Patrick E Meyer; Frédéric Lafitte; Gianluca Bontempi
Journal:  BMC Bioinformatics       Date:  2008-10-29       Impact factor: 3.169

8.  Specific gut microbiome signature predicts the early-stage lung cancer.

Authors:  Yajuan Zheng; Zhaoyuan Fang; Yun Xue; Jian Zhang; Junjie Zhu; Renyuan Gao; Shun Yao; Yi Ye; Shihui Wang; Changdong Lin; Shiyang Chen; Hsinyi Huang; Liang Hu; Ge-Ning Jiang; Huanlong Qin; Peng Zhang; Jianfeng Chen; Hongbin Ji
Journal:  Gut Microbes       Date:  2020-04-02

9.  Removing batch effects from histopathological images for enhanced cancer diagnosis.

Authors:  Sonal Kothari; John H Phan; Todd H Stokes; Adeboye O Osunkoya; Andrew N Young; May D Wang
Journal:  IEEE J Biomed Health Inform       Date:  2014-05       Impact factor: 5.772

10.  Feature selection by optimizing a lower bound of conditional mutual information.

Authors:  Hanyang Peng; Yong Fan
Journal:  Inf Sci (N Y)       Date:  2017-08-09       Impact factor: 6.795

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.