A nonparametric scoring algorithm for identifying informative genes from microarray data.

P J Park, M Pagano, M Bonetti.

Abstract

Microarray data routinely contain expression levels for thousands of genes. In the context of medical diagnostics, an important problem is to find the genes that are correlated with given phenotypes. These genes may offer insight into biological processes and may be used to predict the phenotypes of new samples. In most cases, although expression levels are available for a large number of genes, only a small fraction of them may be informative for classification at a statistically significant level. We introduce a nonparametric scoring algorithm that assigns a score to each gene based on samples with known classes. Based on these scores, we can find a small set of genes that are informative about class membership, and subsequent analysis can be carried out with this set. This procedure is robust to outliers and to different normalization schemes, and it immediately reduces the size of the data with little loss of information. We study the properties of this algorithm and apply it to a data set from cancer patients. We quantify the information in a given set of genes by comparing the distribution of its score statistics to a set of distributions generated by permutations that preserve the correlation structure among the genes.
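
The abstract does not spell out the score itself, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes two classes labelled 0 and 1, and it assumes a rank-based score that counts sample pairs whose expression order disagrees with their class order (a quantity closely related to the Mann-Whitney U statistic). The names `gene_score` and `permutation_null` are hypothetical. Because the score depends only on the ordering of the values, it is unaffected by outliers and by any monotone rescaling. The permutation step shuffles the sample labels once per permutation and rescores every gene with the same shuffled labels, which is one standard way to leave the correlation structure among genes intact.

```python
import random


def gene_score(expression, labels):
    """Assumed rank-based score for one gene (lower = more informative).

    Counts pairs (class-0 sample, class-1 sample) in which the class-0
    value is strictly larger, then folds the count so that a gene that
    separates the classes in either direction scores 0.
    Tied values are simply not counted in this sketch.
    """
    class0 = [x for x, c in zip(expression, labels) if c == 0]
    class1 = [x for x, c in zip(expression, labels) if c == 1]
    misordered = sum(1 for a in class0 for b in class1 if a > b)
    total_pairs = len(class0) * len(class1)
    return min(misordered, total_pairs - misordered)


def permutation_null(data, labels, n_perm=200, seed=0):
    """Null distributions of the sorted gene scores.

    Each permutation shuffles the sample labels once and applies the
    same shuffled labels to every gene (row of `data`), so gene-gene
    correlations in the expression matrix are untouched.
    """
    rng = random.Random(seed)
    null = []
    for _ in range(n_perm):
        shuffled = list(labels)
        rng.shuffle(shuffled)
        null.append(sorted(gene_score(row, shuffled) for row in data))
    return null


if __name__ == "__main__":
    # Toy data: rows are genes, columns are samples (made up for illustration).
    labels = [0, 0, 0, 1, 1, 1]
    data = [
        [1, 2, 3, 10, 11, 12],  # separates the classes perfectly
        [5, 1, 6, 2, 7, 3],     # classes are interleaved
    ]
    observed = sorted(gene_score(row, labels) for row in data)
    null = permutation_null(data, labels, n_perm=100)
    print("observed scores:", observed)
    print("first permuted scores:", null[0])
```

Comparing the observed sorted scores with the permuted ones indicates whether the gene set carries more class information than would be expected by chance; how that comparison is summarized is left open here.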

Year:  2001        PMID: 11262969     DOI: 10.1142/9789814447362_0006

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  18 in total

1.  ESPD: a pattern detection model underlying gene expression profiles.

Authors:  Chun Tang; Aidong Zhang; Murali Ramanathan
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

2.  Ranking analysis of microarray data: a powerful method for identifying differentially expressed genes.

Authors:  Yuan-De Tan; Myriam Fornage; Yun-Xin Fu
Journal:  Genomics       Date:  2006-09-18       Impact factor: 5.736

3.  Identification of disease-causing genes using microarray data mining and Gene Ontology.

Authors:  Azadeh Mohammadi; Mohammad H Saraee; Mansoor Salehi
Journal:  BMC Med Genomics       Date:  2011-01-26       Impact factor: 3.063

4.  Stability of ranked gene lists in large microarray analysis studies.

Authors:  Gregor Stiglic; Peter Kokol
Journal:  J Biomed Biotechnol       Date:  2010-06-27

5.  Novel methods to identify biologically relevant genes for leukemia and prostate cancer from gene expression profiles.

Authors:  Austin H Chen; Yin-Wu Tsau; Ching-Heng Lin
Journal:  BMC Genomics       Date:  2010-04-30       Impact factor: 3.969

6.  Association of genes with physiological functions by comparative analysis of pooled expression microarray data.

Authors:  Iuan-bor D Chen; Vinay K Rathi; Diana S DeAndrade; Patrick Y Jay
Journal:  Physiol Genomics       Date:  2012-11-20       Impact factor: 3.107

7.  Nonparametric tests for differential gene expression and interaction effects in multi-factorial microarray experiments.

Authors:  Xin Gao; Peter X K Song
Journal:  BMC Bioinformatics       Date:  2005-07-21       Impact factor: 3.169

8.  ANMM4CBR: a case-based reasoning method for gene expression data classification.

Authors:  Bangpeng Yao; Shao Li
Journal:  Algorithms Mol Biol       Date:  2010-01-06       Impact factor: 1.405

9.  A robust hybrid approach based on estimation of distribution algorithm and support vector machine for hunting candidate disease genes.

Authors:  Li Li; Hongmei Chen; Chang Liu; Fang Wang; Fangfang Zhang; Lihua Bai; Yihan Chen; Luying Peng
Journal:  ScientificWorldJournal       Date:  2013-02-07

10.  Classification of tumor samples from expression data using decision trunks.

Authors:  Benjamin Ulfenborg; Karin Klinga-Levan; Björn Olsson
Journal:  Cancer Inform       Date:  2013-02-13