Literature DB >> 26279589

Classifier Design Given an Uncertainty Class of Feature Distributions via Regularized Maximum Likelihood and the Incorporation of Biological Pathway Knowledge in Steady-State Phenotype Classification.

Mohammad Shahrokh Esfahani1, Jason Knight1, Amin Zollanvari1, Byung-Jun Yoon1, Edward R Dougherty2.   

Abstract

Contemporary high-throughput technologies provide measurements of very large numbers of variables but often with very small sample sizes. This paper proposes an optimization-based paradigm for utilizing prior knowledge to design better performing classifiers when sample sizes are limited. We derive approximate expressions for the first and second moments of the true error rate of the proposed classifier under the assumption of two widely-used models for the uncertainty classes; ε-contamination and p-point classes. The applicability of the approximate expressions is discussed by defining the problem of finding optimal regularization parameters through minimizing the expected true error. Simulation results using the Zipf model show that the proposed paradigm yields improved classifiers that outperform traditional classifiers that use only training data. Our application of interest involves discrete gene regulatory networks possessing labeled steady-state distributions. Given prior operational knowledge of the process, our goal is to build a classifier that can accurately label future observations obtained in the steady state by utilizing both the available prior knowledge and the training data. We examine the proposed paradigm on networks containing NF-κB pathways, where it shows significant improvement in classifier performance over the classical data-only approach to classifier design. Companion website: http://gsp.tamu.edu/Publications/supplementary/shahrokh12a.

Entities:  

Keywords:  Steady-state classifier; biological-pathway knowledge; regularized; uncertainty class

Year:  2013        PMID: 26279589      PMCID: PMC4535735          DOI: 10.1016/j.patcog.2013.02.017

Source DB:  PubMed          Journal:  Pattern Recognit        ISSN: 0031-3203            Impact factor:   7.740


  8 in total

1.  Probabilistic Boolean Networks: a rule-based uncertainty model for gene regulatory networks.

Authors:  Ilya Shmulevich; Edward R Dougherty; Seungchan Kim; Wei Zhang
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

2.  Generating stochastic gene regulatory networks consistent with pathway information and steady-state behavior.

Authors:  Jason M Knight; Aniruddha Datta; Edward R Dougherty
Journal:  IEEE Trans Biomed Eng       Date:  2012-04-03       Impact factor: 4.538

3.  From biological pathways to regulatory networks.

Authors:  Ritwik K Layek; Aniruddha Datta; Edward R Dougherty
Journal:  Mol Biosyst       Date:  2010-12-15

Review 4.  NF-kappaB as a critical link between inflammation and cancer.

Authors:  Michael Karin
Journal:  Cold Spring Harb Perspect Biol       Date:  2009-11       Impact factor: 10.005

5.  Identification of diagnostic subnetwork markers for cancer in human protein-protein interaction network.

Authors:  Junjie Su; Byung-Jun Yoon; Edward R Dougherty
Journal:  BMC Bioinformatics       Date:  2010-10-07       Impact factor: 3.169

6.  Towards precise classification of cancers based on robust gene functional expression profiles.

Authors:  Zheng Guo; Tianwen Zhang; Xia Li; Qi Wang; Jianzhen Xu; Hui Yu; Jing Zhu; Haiyun Wang; Chenguang Wang; Eric J Topol; Qing Wang; Shaoqi Rao
Journal:  BMC Bioinformatics       Date:  2005-03-17       Impact factor: 3.169

7.  Pathway level analysis of gene expression using singular value decomposition.

Authors:  John Tomfohr; Jun Lu; Thomas B Kepler
Journal:  BMC Bioinformatics       Date:  2005-09-12       Impact factor: 3.169

8.  Network-based classification of breast cancer metastasis.

Authors:  Han-Yu Chuang; Eunjung Lee; Yu-Tsueng Liu; Doheon Lee; Trey Ideker
Journal:  Mol Syst Biol       Date:  2007-10-16       Impact factor: 11.429

  8 in total
  3 in total

1.  Optimal cancer prognosis under network uncertainty.

Authors:  Mohammadmahdi R Yousefi; Lori A Dalton
Journal:  EURASIP J Bioinform Syst Biol       Date:  2015-01-27

Review 2.  A Nonmathematical Review of Optimal Operator and Experimental Design for Uncertain Scientific Models with Application to Genomics.

Authors:  Edward R Dougherty
Journal:  Curr Genomics       Date:  2019-01       Impact factor: 2.236

3.  Data Requirements for Model-Based Cancer Prognosis Prediction.

Authors:  Lori A Dalton; Mohammadmahdi R Yousefi
Journal:  Cancer Inform       Date:  2016-04-21
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.