
Variable selection for discriminant analysis with Markov random field priors for the analysis of microarray data.

Francesco C Stingo, Marina Vannucci.

Abstract

MOTIVATION: Discriminant analysis is an effective tool for the classification of experimental units into groups. Here, we consider the typical problem of classifying subjects according to phenotypes via gene expression data and propose a method that incorporates variable selection into the inferential procedure, for the identification of the important biomarkers. To achieve this goal, we build upon a conjugate normal discriminant model, both linear and quadratic, and include a stochastic search variable selection procedure via an MCMC algorithm. Furthermore, we incorporate into the model prior information on the relationships among the genes as described by a gene-gene network. We use a Markov random field (MRF) prior to map the network connections among genes. Our prior model assumes that neighboring genes in the network are more likely to have a joint effect on the relevant biological processes.
RESULTS: We use simulated data to assess the performance of our method. In particular, we compare the MRF prior to a situation where independent Bernoulli priors are chosen for the individual predictors. We also illustrate the method on benchmark datasets for gene expression. Our simulation studies show that employing the MRF prior improves selection accuracy. In real data applications, in addition to identifying markers and improving prediction accuracy, we show how integrating existing biological knowledge into the prior model increases the ability to identify genes with strong discriminatory power and aids the interpretation of the results.
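
To make the contrast between the MRF prior and independent Bernoulli priors concrete, the following is a minimal sketch that assumes a generic Ising-type MRF on binary inclusion indicators. The abstract does not give the prior's functional form, so the log-odds expression, the parameter names `mu` and `eta`, the function name, and the toy network are all illustrative assumptions and not the authors' specification.

```python
import math

def mrf_conditional_inclusion_prob(j, gamma, neighbors, mu, eta):
    """Prior probability that gene j is selected, given its neighbours' states.

    Assumed (illustrative) joint prior: P(gamma) proportional to
    exp(mu * sum_j gamma_j + eta * sum over edges (i, j) of gamma_i * gamma_j).
    The conditional log-odds of gamma_j = 1 is then
    mu + eta * (number of selected neighbours of j).
    """
    n_selected = sum(gamma[i] for i in neighbors[j])
    log_odds = mu + eta * n_selected
    return 1.0 / (1.0 + math.exp(-log_odds))

# Hypothetical 4-gene network: gene 0 is linked to genes 1 and 2; gene 3 is isolated.
neighbors = {0: [1, 2], 1: [0], 2: [0], 3: []}
gamma = [0, 1, 1, 0]   # current inclusion indicators (1 = gene selected)
mu, eta = -2.0, 1.0    # assumed sparsity and smoothness parameters

p_connected = mrf_conditional_inclusion_prob(0, gamma, neighbors, mu, eta)
p_isolated = mrf_conditional_inclusion_prob(3, gamma, neighbors, mu, eta)

# With eta = 0 every gene gets the same probability, which is the
# independent Bernoulli prior used as the comparison case.
p_bernoulli = mrf_conditional_inclusion_prob(0, gamma, neighbors, mu, 0.0)
```

Under these assumed values, a gene whose two neighbours are both selected has a higher prior inclusion probability than an isolated gene, which is the behaviour the abstract attributes to the MRF prior. Setting `eta` to zero removes the network effect entirely.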

Year:  2010        PMID: 21159623      PMCID: PMC3105481          DOI: 10.1093/bioinformatics/btq690

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  A hidden Markov random field-based Bayesian method for the detection of long-range chromosomal interactions in Hi-C data.

Authors:  Zheng Xu; Guosheng Zhang; Fulai Jin; Mengjie Chen; Terrence S Furey; Patrick F Sullivan; Zhaohui Qin; Ming Hu; Yun Li
Journal:  Bioinformatics       Date:  2015-11-04       Impact factor: 6.937

2.  Scalable Bayesian variable selection for structured high-dimensional data.

Authors:  Changgee Chang; Suprateek Kundu; Qi Long
Journal:  Biometrics       Date:  2018-05-08       Impact factor: 2.571

Review 3.  Principles and methods of integrative genomic analyses in cancer.

Authors:  Vessela N Kristensen; Ole Christian Lingjærde; Hege G Russnes; Hans Kristian M Vollan; Arnoldo Frigessi; Anne-Lise Børresen-Dale
Journal:  Nat Rev Cancer       Date:  2014-05       Impact factor: 60.716

4.  Joint Bayesian variable and graph selection for regression models with network-structured predictors.

Authors:  Christine B Peterson; Francesco C Stingo; Marina Vannucci
Journal:  Stat Med       Date:  2015-10-29       Impact factor: 2.373

5.  Bayesian integrative analysis of epigenomic and transcriptomic data identifies Alzheimer's disease candidate genes and networks.

Authors:  Hans-Ulrich Klein; Martin Schäfer; David A Bennett; Holger Schwender; Philip L De Jager
Journal:  PLoS Comput Biol       Date:  2020-04-07       Impact factor: 4.475

6.  Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features.

Authors:  Wenli Sun; Changgee Chang; Qi Long
Journal:  Proc IEEE Int Conf Big Data       Date:  2020-02-24

7.  Bayesian Inference of Multiple Gaussian Graphical Models.

Authors:  Christine B Peterson; Francesco C Stingo; Marina Vannucci
Journal:  J Am Stat Assoc       Date:  2015-03-01       Impact factor: 5.033

8.  A Bayesian hidden Potts mixture model for analyzing lung cancer pathology images.

Authors:  Qiwei Li; Xinlei Wang; Faming Liang; Faliu Yi; Yang Xie; Adi Gazdar; Guanghua Xiao
Journal:  Biostatistics       Date:  2019-10-01       Impact factor: 5.899

9.  A Bayesian Approach for Learning Gene Networks Underlying Disease Severity in COPD.

Authors:  Elin Shaddox; Francesco C Stingo; Christine B Peterson; Sean Jacobson; Charmion Cruickshank-Quinn; Katerina Kechris; Russell Bowler; Marina Vannucci
Journal:  Stat Biosci       Date:  2016-10-28

10.  INCORPORATING BIOLOGICAL INFORMATION INTO LINEAR MODELS: A BAYESIAN APPROACH TO THE SELECTION OF PATHWAYS AND GENES.

Authors:  Francesco C Stingo; Yian A Chen; Mahlet G Tadesse; Marina Vannucci
Journal:  Ann Appl Stat       Date:  2011-09-01       Impact factor: 2.083

