Literature DB >> 20150674

A trade-off between sample complexity and computational complexity in learning Boolean networks from time-series data.

Theodore J Perkins1, Michael T Hallett.   

Abstract

A key problem in molecular biology is to infer regulatory relationships between genes from expression data. This paper studies a simplified model of such inference problems in which one or more Boolean variables, modeling, for example, the expression levels of genes, each depend deterministically on a small but unknown subset of a large number of Boolean input variables. Our model assumes that the expression data comprises a time series, in which successive samples may be correlated. We provide bounds on the expected amount of data needed to infer the correct relationships between output and input variables. These bounds improve and generalize previous results for Boolean network inference and continuous-time switching network inference. Although the computational problem is intractable in general, we describe a fixed-parameter tractable algorithm that is guaranteed to provide at least a partial solution to the problem. Most interestingly, both the sample complexity and computational complexity of the problem depend on the strength of correlations between successive samples in the time series but in opposing ways. Uncorrelated samples minimize the total number of samples needed while maximizing computational complexity; a strong correlation between successive samples has the opposite effect. This observation has implications for the design of experiments for measuring gene expression.

Mesh:

Substances:

Year:  2010        PMID: 20150674     DOI: 10.1109/TCBB.2008.38

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  5 in total

1.  Computational and statistical tradeoffs via convex relaxation.

Authors:  Venkat Chandrasekaran; Michael I Jordan
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-11       Impact factor: 11.205

2.  Robust dynamics in minimal hybrid models of genetic networks.

Authors:  Theodore J Perkins; Roy Wilds; Leon Glass
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2010-11-13       Impact factor: 4.226

3.  On the Sample Complexity of Cancer Pathways Identification.

Authors:  Fabio Vandin; Benjamin J Raphael; Eli Upfal
Journal:  J Comput Biol       Date:  2015-12-08       Impact factor: 1.479

4.  Process-driven inference of biological network structure: feasibility, minimality, and multiplicity.

Authors:  Guanyu Wang; Yongwu Rong; Hao Chen; Carl Pearson; Chenghang Du; Rahul Simha; Chen Zeng
Journal:  PLoS One       Date:  2012-07-18       Impact factor: 3.240

5.  Pooled screening for synergistic interactions subject to blocking and noise.

Authors:  Kyle Li; Doina Precup; Theodore J Perkins
Journal:  PLoS One       Date:  2014-01-16       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.