Knowledge-based gene expression classification via matrix factorization.

R Schachtner, D Lutter, P Knollmüller, A M Tomé, F J Theis, G Schmitz, M Stetter, P Gómez Vilda, E W Lang.

Abstract

MOTIVATION: Modern machine learning methods based on matrix decomposition techniques, such as independent component analysis (ICA) or non-negative matrix factorization (NMF), provide new and efficient analysis tools that are currently being explored for the analysis of gene expression profiles. These exploratory feature extraction techniques yield expression modes (ICA) or metagenes (NMF), and the extracted features are considered indicative of underlying regulatory processes. They can also be applied to the classification of gene expression datasets, either by grouping samples into different categories for diagnostic purposes or by grouping genes into functional categories for further investigation of related metabolic pathways and regulatory networks.
RESULTS: In this study we focus on unsupervised matrix factorization techniques and apply ICA and sparse NMF to microarray datasets that monitor the gene expression levels of human peripheral blood cells during differentiation from monocytes to macrophages. We show that these tools are able to identify relevant signatures in the deduced component matrices and to extract informative sets of marker genes from these gene expression profiles. The methods rely on the joint discriminative power of a set of marker genes rather than on single marker genes. With these sets of marker genes, corroborated by leave-one-out or random forest cross-validation, the datasets could easily be classified into related diagnostic categories: either monocytes versus macrophages, or healthy subjects versus Niemann-Pick C disease patients.
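
ILLUSTRATION (not part of the published abstract): The workflow summarized above (factorize the expression matrix, read marker genes off a component, then check the marker set by leave-one-out cross-validation) can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: it uses plain NMF with multiplicative updates rather than the sparse NMF or ICA variants named in the abstract, a nearest-centroid classifier chosen only for brevity, and synthetic data in place of the microarray datasets. All function names and parameters (`nmf`, `marker_genes`, `loo_accuracy`, `k`, `n_top`) are hypothetical.

```python
import numpy as np


def nmf(X, k, n_iter=500, seed=0, eps=1e-9):
    """Approximate non-negative X (samples x genes) as W @ H.

    Plain multiplicative-update NMF (an assumption; the abstract
    refers to sparse NMF, which adds a sparsity constraint).
    Rows of H play the role of metagenes.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


def marker_genes(H, component, n_top):
    """Indices of the n_top genes with the largest weight in one metagene."""
    return np.argsort(H[component])[::-1][:n_top]


def loo_accuracy(X, y):
    """Leave-one-out accuracy of a nearest-centroid classifier."""
    hits = 0
    for i in range(len(y)):
        train = np.arange(len(y)) != i  # hold out sample i
        classes = np.unique(y[train])
        centroids = np.array(
            [X[train & (y == c)].mean(axis=0) for c in classes]
        )
        dists = np.linalg.norm(centroids - X[i], axis=1)
        hits += classes[np.argmin(dists)] == y[i]
    return hits / len(y)


# Synthetic stand-in data: 12 samples x 200 genes, two classes.
# Genes 0..19 are strongly up-regulated in class 1 (an assumption
# made purely so the sketch has a known answer).
rng = np.random.default_rng(42)
y = np.array([0] * 6 + [1] * 6)
X = rng.random((12, 200))
X[y == 1, :20] += 5.0

W, H = nmf(X, k=2)

# Pick the metagene whose sample weights are most elevated in class 1.
diff = W[y == 1].mean(axis=0) - W[y == 0].mean(axis=0)
component = int(np.argmax(diff))

markers = marker_genes(H, component, n_top=20)
acc = loo_accuracy(X[:, markers], y)
print(sorted(markers.tolist()), acc)
```

Note that the classifier here sees all marker genes at once, which mirrors the abstract's point that the joint discriminative power of a gene set is used rather than any single marker. In a real analysis the marker selection would need to be repeated inside each cross-validation fold to avoid selection bias; the sketch skips that step for brevity.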

Year:  2008        PMID: 18535085      PMCID: PMC2638868          DOI: 10.1093/bioinformatics/btn245

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Linear modes of gene expression determined by independent component analysis.

Authors:  Wolfram Liebermeister
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

Review 2.  Computational analysis of microarray data.

Authors:  J Quackenbush
Journal:  Nat Rev Genet       Date:  2001-06       Impact factor: 53.242

3.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias.

Authors:  B M Bolstad; R A Irizarry; M Astrand; T P Speed
Journal:  Bioinformatics       Date:  2003-01-22       Impact factor: 6.937

Review 4.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

5.  A distribution free summarization method for Affymetrix GeneChip arrays.

Authors:  Zhongxue Chen; Monnie McGee; Qingzhong Liu; Richard H Scheuermann
Journal:  Bioinformatics       Date:  2006-12-05       Impact factor: 6.937

6.  I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data.

Authors:  Willem Talloen; Djork-Arné Clevert; Sepp Hochreiter; Dhammika Amaratunga; Luc Bijnens; Stefan Kass; Hinrich W H Göhlmann
Journal:  Bioinformatics       Date:  2007-10-05       Impact factor: 6.937

7.  Prediction and uncertainty in the analysis of gene expression profiles.

Authors:  Rainer Spang; Harry Zuzan; Mike West; Joseph Nevins; Carrie Blanchette; Jeffrey R Marks
Journal:  In Silico Biol       Date:  2002

8.  Gene expression data classification with Kernel principal component analysis.

Authors:  Zhenqiu Liu; Dechang Chen; Halima Bensmail
Journal:  J Biomed Biotechnol       Date:  2005-06-30

9.  Analyzing M-CSF dependent monocyte/macrophage differentiation: expression modes and meta-modes derived from an independent component analysis.

Authors:  Dominik Lutter; Peter Ugocsai; Margot Grandl; Evelyn Orso; Fabian Theis; Elmar W Lang; Gerd Schmitz
Journal:  BMC Bioinformatics       Date:  2008-02-17       Impact factor: 3.169

10.  Application of independent component analysis to microarrays.

Authors:  Su-In Lee; Serafim Batzoglou
Journal:  Genome Biol       Date:  2003-10-24       Impact factor: 13.583

  10 in total

1.  SITC cancer immunotherapy resource document: a compass in the land of biomarker discovery.

Authors:  Siwen Hu-Lieskovan; Srabani Bhaumik; Kavita Dhodapkar; Jean-Charles J B Grivel; Sumati Gupta; Brent A Hanks; Sylvia Janetzki; Thomas O Kleen; Yoshinobu Koguchi; Amanda W Lund; Cristina Maccalli; Yolanda D Mahnke; Ruslan D Novosiadly; Senthamil R Selvan; Tasha Sims; Yingdong Zhao; Holden T Maecker
Journal:  J Immunother Cancer       Date:  2020-12       Impact factor: 13.751

2.  K1 and K15 of Kaposi's Sarcoma-Associated Herpesvirus Are Partial Functional Homologues of Latent Membrane Protein 2A of Epstein-Barr Virus.

Authors:  Lisa Steinbrück; Montse Gustems; Stephanie Medele; Thomas F Schulz; Dominik Lutter; Wolfgang Hammerschmidt
Journal:  J Virol       Date:  2015-05-06       Impact factor: 5.103

3.  Co-clustering phenome-genome for phenotype classification and disease gene discovery.

Authors:  TaeHyun Hwang; Gowtham Atluri; MaoQiang Xie; Sanjoy Dey; Changjin Hong; Vipin Kumar; Rui Kuang
Journal:  Nucleic Acids Res       Date:  2012-06-26       Impact factor: 16.971

4.  Configurable pattern-based evolutionary biclustering of gene expression data.

Authors:  Beatriz Pontes; Raúl Giráldez; Jesús S Aguilar-Ruiz
Journal:  Algorithms Mol Biol       Date:  2013-02-23       Impact factor: 1.405

5.  Knowledge-based matrix factorization temporally resolves the cellular responses to IL-6 stimulation.

Authors:  Andreas Kowarsch; Florian Blöchl; Sebastian Bohl; Maria Saile; Norbert Gretz; Ursula Klingmüller; Fabian J Theis
Journal:  BMC Bioinformatics       Date:  2010-11-30       Impact factor: 3.169

6.  A mixture model with a reference-based automatic selection of components for disease classification from protein and/or gene expression levels.

Authors:  Ivica Kopriva; Marko Filipović
Journal:  BMC Bioinformatics       Date:  2011-12-30       Impact factor: 3.169

7.  Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data.

Authors:  Mi Hyeon Kim; Hwa Jeong Seo; Je-Gun Joung; Ju Han Kim
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

Review 8.  Statistical methods for the analysis of high-throughput metabolomics data.

Authors:  Jörg Bartel; Jan Krumsiek; Fabian J Theis
Journal:  Comput Struct Biotechnol J       Date:  2013-03-22       Impact factor: 7.271

9.  iPcc: a novel feature extraction method for accurate disease class discovery and prediction.

Authors:  Xianwen Ren; Yong Wang; Xiang-Sun Zhang; Qi Jin
Journal:  Nucleic Acids Res       Date:  2013-06-12       Impact factor: 16.971

10.  Discovering subgroups of patients from DNA copy number data using NMF on compacted matrices.

Authors:  Cassio P de Campos; Paola M V Rancoita; Ivo Kwee; Emanuele Zucca; Marco Zaffalon; Francesco Bertoni
Journal:  PLoS One       Date:  2013-11-20       Impact factor: 3.240
