Literature DB >> 23368951

A negative selection heuristic to predict new transcriptional targets.

Luigi Cerulo1, Vincenzo Paduano, Pietro Zoppoli, Michele Ceccarelli.   

Abstract

BACKGROUND: Supervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside traditional unsupervised techniques, a supervised classifier learns, from known examples, a function that is able to recognize new relationships for new data. In the context of gene regulatory inference a supervised classifier is coerced to learn from positive and unlabeled examples, as the counter negative examples are unavailable or hard to collect. Such a condition could limit the performance of the classifier especially when the amount of training examples is low.
RESULTS: In this paper we improve the supervised identification of transcriptional targets by selecting reliable counter negative examples from the unlabeled set. We introduce an heuristic based on the known topology of transcriptional networks that in fact restores the conventional positive/negative training condition and shows a significant improvement of the classification performance. We empirically evaluate the proposed heuristic with the experimental datasets of Escherichia coli and show an example of application in the prediction of BCL6 direct core targets in normal germinal center human B cells obtaining a precision of 60%.
CONCLUSIONS: The availability of only positive examples in learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that the selection of reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers opening new perspectives in the identification of new transcriptional targets.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23368951      PMCID: PMC3548675          DOI: 10.1186/1471-2105-14-S1-S3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  23 in total

1.  Predicting protein--protein interactions from primary structure.

Authors:  J R Bock; D A Gough
Journal:  Bioinformatics       Date:  2001-05       Impact factor: 6.937

2.  Genome-wide dissection of posttranscriptional and posttranslational interactions.

Authors:  Mukesh Bansal; Andrea Califano
Journal:  Methods Mol Biol       Date:  2012

3.  Systematic DNA-binding domain classification of transcription factors.

Authors:  Philip Stegmaier; Alexander E Kel; Edgar Wingender
Journal:  Genome Inform       Date:  2004

Review 4.  Network motifs: theory and experimental approaches.

Authors:  Uri Alon
Journal:  Nat Rev Genet       Date:  2007-06       Impact factor: 53.242

5.  SIRENE: supervised inference of regulatory networks.

Authors:  Fantine Mordelet; Jean-Philippe Vert
Journal:  Bioinformatics       Date:  2008-08-15       Impact factor: 6.937

6.  The BCL6 transcriptional program features repression of multiple oncogenes in primary B cells and is deregulated in DLBCL.

Authors:  Weimin Ci; Jose M Polo; Leandro Cerchietti; Rita Shaknovich; Ling Wang; Shao Ning Yang; Kenny Ye; Pedro Farinha; Douglas E Horsman; Randy D Gascoyne; Olivier Elemento; Ari Melnick
Journal:  Blood       Date:  2009-03-23       Impact factor: 22.113

7.  Learning gene regulatory networks from only positive and unlabeled data.

Authors:  Luigi Cerulo; Charles Elkan; Michele Ceccarelli
Journal:  BMC Bioinformatics       Date:  2010-05-05       Impact factor: 3.169

8.  TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes.

Authors:  V Matys; O V Kel-Margoulis; E Fricke; I Liebich; S Land; A Barre-Dirrie; I Reuter; D Chekmenev; M Krull; K Hornischer; N Voss; P Stegmaier; B Lewicki-Potapov; H Saxel; A E Kel; E Wingender
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles.

Authors:  Jeremiah J Faith; Boris Hayete; Joshua T Thaden; Ilaria Mogno; Jamey Wierzbowski; Guillaume Cottarel; Simon Kasif; James J Collins; Timothy S Gardner
Journal:  PLoS Biol       Date:  2007-01       Impact factor: 8.029

10.  How to infer gene networks from expression profiles.

Authors:  Mukesh Bansal; Vincenzo Belcastro; Alberto Ambesi-Impiombato; Diego di Bernardo
Journal:  Mol Syst Biol       Date:  2007-02-13       Impact factor: 11.429

View more
  1 in total

1.  Learning a Markov Logic network for supervised gene regulatory network inference.

Authors:  Céline Brouard; Christel Vrain; Julie Dubois; David Castel; Marie-Anne Debily; Florence d'Alché-Buc
Journal:  BMC Bioinformatics       Date:  2013-09-12       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.