Literature DB >> 15579233

A Bayesian network classification methodology for gene expression data.

Paul Helman1, Robert Veroff, Susan R Atlas, Cheryl Willman.   

Abstract

We present new techniques for the application of a Bayesian network learning framework to the problem of classifying gene expression data. The focus on classification permits us to develop techniques that address in several ways the complexities of learning Bayesian nets. Our classification model reduces the Bayesian network learning problem to the problem of learning multiple subnetworks, each consisting of a class label node and its set of parent genes. We argue that this classification model is more appropriate for the gene expression domain than are other structurally similar Bayesian network classification models, such as Naive Bayes and Tree Augmented Naive Bayes (TAN), because our model is consistent with prior domain experience suggesting that a relatively small number of genes, taken in different combinations, is required to predict most clinical classes of interest. Within this framework, we consider two different approaches to identifying parent sets which are supported by the gene expression observations and any other currently available evidence. One approach employs a simple greedy algorithm to search the universe of all genes; the second approach develops and applies a gene selection algorithm whose results are incorporated as a prior to enable an exhaustive search for parent sets over a restricted universe of genes. Two other significant contributions are the construction of classifiers from multiple, competing Bayesian network hypotheses and algorithmic methods for normalizing and binning gene expression data in the absence of prior expert knowledge. Our classifiers are developed under a cross validation regimen and then validated on corresponding out-of-sample test sets. The classifiers attain a classification rate in excess of 90% on out-of-sample test sets for two publicly available datasets. We present an extensive compilation of results reported in the literature for other classification methods run against these same two datasets. Our results are comparable to, or better than, any we have found reported for these two sets, when a train-test protocol as stringent as ours is followed.

Mesh:

Year:  2004        PMID: 15579233     DOI: 10.1089/cmb.2004.11.581

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  15 in total

1.  Gene expression signatures predictive of early response and outcome in high-risk childhood acute lymphoblastic leukemia: A Children's Oncology Group Study [corrected].

Authors:  Deepa Bhojwani; Huining Kang; Renee X Menezes; Wenjian Yang; Harland Sather; Naomi P Moskowitz; Dong-Joon Min; Jeffrey W Potter; Richard Harvey; Stephen P Hunger; Nita Seibel; Elizabeth A Raetz; Rob Pieters; Martin A Horstmann; Mary V Relling; Monique L den Boer; Cheryl L Willman; William L Carroll
Journal:  J Clin Oncol       Date:  2008-09-20       Impact factor: 44.544

2.  Prognostic Significance of the BIRC2-BIRC3 Gene Signature in Head and Neck Squamous Cell Carcinoma.

Authors:  Min Kyeong Lee; Joo Kyung Noh; Seon Rang Woo; Moonkyoo Kong; Young Chan Lee; Jung Woo Lee; Seong-Gyu Ko; Young-Gyu Eun
Journal:  Cancer Genomics Proteomics       Date:  2022 Sep-Oct       Impact factor: 3.395

3.  MicroRNA-integrated and network-embedded gene selection with diffusion distance.

Authors:  Di Huang; Xiaobo Zhou; Christopher J Lyon; Willa A Hsueh; Stephen T C Wong
Journal:  PLoS One       Date:  2010-10-29       Impact factor: 3.240

4.  Identification of altered biological processes in heterogeneous RNA-sequencing data by discretization of expression profiles.

Authors:  Andrea Lauria; Serena Peirone; Marco Del Giudice; Francesca Priante; Prabhakar Rajan; Michele Caselle; Salvatore Oliviero; Matteo Cereda
Journal:  Nucleic Acids Res       Date:  2020-02-28       Impact factor: 16.971

Review 5.  Microarray analysis: basic strategies for successful experiments.

Authors:  Scott A Ness
Journal:  Mol Biotechnol       Date:  2007-07       Impact factor: 2.860

6.  Topology-based cancer classification and related pathway mining using microarray data.

Authors:  Chun-Chi Liu; Wen-Shyen E Chen; Chin-Chung Lin; Hsiang-Chuan Liu; Hsuan-Yu Chen; Pan-Chyr Yang; Pei-Chun Chang; Jeremy J W Chen
Journal:  Nucleic Acids Res       Date:  2006-08-16       Impact factor: 16.971

7.  A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.

Authors:  Francesca Demichelis; Paolo Magni; Paolo Piergiorgi; Mark A Rubin; Riccardo Bellazzi
Journal:  BMC Bioinformatics       Date:  2006-11-24       Impact factor: 3.169

8.  A categorical network approach for discovering differentially expressed regulations in cancer.

Authors:  Nikolay Balov
Journal:  BMC Med Genomics       Date:  2013-11-11       Impact factor: 3.063

9.  Microbial forensics: predicting phenotypic characteristics and environmental conditions from large-scale gene expression profiles.

Authors:  Minseung Kim; Violeta Zorraquino; Ilias Tagkopoulos
Journal:  PLoS Comput Biol       Date:  2015-03-16       Impact factor: 4.475

10.  Comparison of classification methods for detecting associations between SNPs and chick mortality.

Authors:  Nanye Long; Daniel Gianola; Guilherme J M Rosa; Kent A Weigel; Santiago Avendaño
Journal:  Genet Sel Evol       Date:  2009-01-23       Impact factor: 4.297

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.