Identifying marker genes in transcription profiling data using a mixture of feature relevance experts.

Abstract

Transcription profiling experiments permit the expression levels of many genes to be measured simultaneously. Given profiling data from two types of samples, genes that most distinguish the samples (marker genes) are good candidates for subsequent in-depth experimental studies and developing decision support systems for diagnosis, prognosis, and monitoring. This work proposes a mixture of feature relevance experts as a method for identifying marker genes and illustrates the idea using published data from samples labeled as acute lymphoblastic and myeloid leukemia (ALL, AML). A feature relevance expert implements an algorithm that calculates how well a gene distinguishes samples, reorders genes according to this relevance measure, and uses a supervised learning method [here, support vector machines (SVMs)] to determine the generalization performances of different nested gene subsets. The mixture of three feature relevance experts examined implement two existing and one novel feature relevance measures. For each expert, a gene subset consisting of the top 50 genes distinguished ALL from AML samples as completely as all 7,070 genes. The 125 genes at the union of the top 50s are plausible markers for a prototype decision support system. Chromosomal aberration and other data support the prediction that the three genes at the intersection of the top 50s, cystatin C, azurocidin, and adipsin, are good targets for investigating the basic biology of ALL/AML. The same data were employed to identify markers that distinguish samples based on their labels of T cell/B cell, peripheral blood/bone marrow, and male/female. Selenoprotein W may discriminate T cells from B cells. Results from analysis of transcription profiling data from tumor/nontumor colon adenocarcinoma samples support the general utility of the aforementioned approach. 
Theoretical issues such as choosing SVM kernels and their parameters, training and evaluating feature relevance experts, and the impact of potentially mislabeled samples on marker identification (feature selection) are discussed.
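The workflow described in the abstract (score each gene, rank genes, evaluate nested top-k subsets, then combine the experts' top lists by union and intersection) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name the three relevance measures, so `signal_to_noise` and `abs_mean_difference` are hypothetical stand-ins, the toy matrix `X` is invented, and a leave-one-out nearest-centroid classifier replaces the SVMs used in the paper to keep the example dependency-free.

```python
from statistics import mean, pstdev

def split_by_class(values, labels):
    """Split one gene's expression values into the two sample classes (0 and 1)."""
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    return a, b

def signal_to_noise(values, labels):
    # Assumed measure: |difference of class means| / (sum of class std devs).
    a, b = split_by_class(values, labels)
    denom = pstdev(a) + pstdev(b)
    return abs(mean(a) - mean(b)) / denom if denom > 0 else 0.0

def abs_mean_difference(values, labels):
    # Assumed measure: absolute difference of class means.
    a, b = split_by_class(values, labels)
    return abs(mean(a) - mean(b))

def rank_genes(X, labels, relevance):
    """Return gene indices ordered from most to least relevant."""
    n_genes = len(X[0])
    scores = [relevance([row[g] for row in X], labels) for g in range(n_genes)]
    return sorted(range(n_genes), key=lambda g: scores[g], reverse=True)

def loo_accuracy(X, labels, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier (SVM stand-in)."""
    correct = 0
    for i, sample in enumerate(X):
        dist = {}
        for cls in set(labels):
            rows = [X[j] for j in range(len(X)) if j != i and labels[j] == cls]
            centroid = [mean(r[g] for r in rows) for g in genes]
            dist[cls] = sum((sample[g] - c) ** 2 for g, c in zip(genes, centroid))
        if min(dist, key=dist.get) == labels[i]:
            correct += 1
    return correct / len(X)

# Invented toy data: 6 samples (rows) x 4 genes (columns), two classes.
X = [
    [5.0, 1.0, 3.0, 0.2],
    [5.2, 1.1, 2.0, 0.9],
    [4.8, 0.9, 3.5, 0.1],
    [1.0, 1.0, 3.1, 0.8],
    [1.2, 1.2, 2.2, 0.3],
    [0.9, 0.8, 3.4, 0.7],
]
labels = [0, 0, 0, 1, 1, 1]

# Each "expert" ranks all genes with its own relevance measure.
experts = {"signal_to_noise": signal_to_noise,
           "abs_mean_difference": abs_mean_difference}
rankings = {name: rank_genes(X, labels, fn) for name, fn in experts.items()}

# Generalization estimate for nested subsets (top 1, top 2, all genes).
# Note: ranking on all samples before cross-validation gives optimistic estimates.
nested = {k: loo_accuracy(X, labels, rankings["signal_to_noise"][:k])
          for k in (1, 2, 4)}

# Combine the experts' top-k lists, as the abstract does with the top 50s.
top_sets = [set(r[:2]) for r in rankings.values()]
union = set.union(*top_sets)
intersection = set.intersection(*top_sets)

print(rankings, nested, union, intersection)
```

In this sketch the union plays the role of the broad candidate list and the intersection the role of the small consensus set. With real data, the ranking step would normally be repeated inside each cross-validation fold to avoid selection bias.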

Year: 2001 PMID: 11242594 DOI: 10.1152/physiolgenomics.2001.5.2.99

Source DB: PubMed Journal: Physiol Genomics ISSN: 1094-8341 Impact factor: 3.107

Cited

23 in total

1. Biomarker identification by feature wrappers.

Authors: M Xiong; X Fang; J Zhao
Journal: Genome Res Date: 2001-11 Impact factor: 9.043

2. Contribution of bioinformatics prediction in microRNA-based cancer therapeutics. (Review)

Authors: Jasjit K Banwait; Dhundy R Bastola
Journal: Adv Drug Deliv Rev Date: 2014-11-06 Impact factor: 15.470

3. Prognostic significance of serum progranulin level in de novo adult acute lymphoblastic leukemia patients.

Authors: Amro M S El-Ghammaz; Mohamed O Azzazi; Nevine Mostafa; Hany M Hegab; Amir A Mahmoud
Journal: Clin Exp Med Date: 2020-01-31 Impact factor: 3.984

4. Selection bias in gene extraction on the basis of microarray gene-expression data.

Authors: Christophe Ambroise; Geoffrey J McLachlan
Journal: Proc Natl Acad Sci U S A Date: 2002-04-30 Impact factor: 11.205

5. Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling.

Authors: Xia Li; Shaoqi Rao; Yadong Wang; Binsheng Gong
Journal: Nucleic Acids Res Date: 2004-05-17 Impact factor: 16.971

6. Fusing Gene Interaction to Improve Disease Discrimination on Classification Analysis.

Authors: Ji-Gang Zhang; Jian Li; Wenlong Tang; Hong-Wen Deng
Journal: Adv Genet Eng Date: 2012-02-09

7. Progranulin (granulin-epithelin precursor, PC-cell-derived growth factor, acrogranin) mediates tissue repair and tumorigenesis. (Review)

Authors: Zhiheng He; Andrew Bateman
Journal: J Mol Med (Berl) Date: 2003-08-19 Impact factor: 4.599

8. Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data.

Authors: Amalia Annest; Roger E Bumgarner; Adrian E Raftery; Ka Yee Yeung
Journal: BMC Bioinformatics Date: 2009-02-26 Impact factor: 3.169

9. AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology.

Authors: Fiona Achcar; Jean-Michel Camadro; Denis Mestivier
Journal: Nucleic Acids Res Date: 2009-05-27 Impact factor: 16.971

10. Merging microarray data, robust feature selection, and predicting prognosis in prostate cancer.

Authors: Jing Wang; Kim Anh Do; Sijin Wen; Spyros Tsavachidis; Timothy J McDonnell; Christopher J Logothetis; Kevin R Coombes
Journal: Cancer Inform Date: 2007-02-14