| Literature DB >> 21356087 |
Evgeny Shmelkov1, Zuojian Tang, Iannis Aifantis, Alexander Statnikov.
Abstract
BACKGROUND: Pathway databases are becoming increasingly important and almost omnipresent in most types of biological and translational research. However, little is known about the quality and completeness of pathways stored in these databases. The present study conducts a comprehensive assessment of transcriptional regulatory pathways in humans for seven well-studied transcription factors: MYC, NOTCH1, BCL6, TP53, AR, STAT1, and RELA. The employed benchmarking methodology first involves integrating genome-wide binding with functional gene expression data to derive direct targets of transcription factors. Then the lists of experimentally obtained direct targets are compared with relevant lists of transcriptional targets from 10 commonly used pathway databases.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21356087 PMCID: PMC3055855 DOI: 10.1186/1745-6150-6-15
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Number of genes in common between MYC transcriptional targets derived from ten different pathway databases. Cells are colored according to their values from white (low values) to red (high values).
Pathway databases.
| Pathway Database | Access | Primary data source | Vendor/Developer | Web site |
|---|---|---|---|---|
| Ingenuity Pathway Analyzer (IPA) | Commercial | Expert curation/data mining | Ingenuity Systems, Inc. | |
| BKL TRANSPATH | Commercial | Expert curation | BIOBASE | |
| BKL TRANSFAC | Commercial | Expert curation | BIOBASE | |
| BioCarta | Free | Expert curation/scientific community contribution | BioCarta, LLC | |
| KEGG | Free | Expert curation | Kanehisa Laboratories | |
| WikiPathways | Free | Expert curation/scientific community contribution | BiGCaT Bioinformatics (University of Maastricht) and Conklin Lab (Gladstone Institutes, UCSF) | |
| Cell Signaling Technology pathways (CST) | Free | Expert curation/scientific community contribution | Cell Signaling Technology, Inc. | |
| GeneSpring | Commercial | Expert curation/data mining | Agilent Technologies, Inc. | |
| Pathway Studio | Commercial | Data mining | Ariadne Genomics, Inc. | |
| MetaCore | Commercial | Expert curation | GeneGo, Inc. |
Figure 2Illustration of statistical methodology for comparison between a gold-standard and a pathway database.
Gold standards for each transcription factor (TF).
| TF | Gold Standard ID# | Genes bound by the TF are obtained from | Genes that are downstream of the TF are obtained from | |
|---|---|---|---|---|
| Study | Analyzed to find genes that are | |||
| AR | I | • differentially expressed between DHT treated (for 4 hr.) and control samples | ||
| II | Wang et al. [ | Wang et al. [ | • differentially expressed between DHT treated (for 16 hr.) and control samples | |
| III | • differentially expressed between DHT treated (for 4 hr. and 16 hr.) and control samples | |||
| BCL6 | I | Basso et al. [ | • differentially expressed between siRNA treated and control samples | |
| II | Ci et al [ | Basso et al. [ | • differentially expressed between siRNA treated and control samples | |
| III | Basso et al. [ | • up-regulated in siRNA treated samples | ||
| IV | Ci et al [ | • up-regulated in siRNA treated samples | ||
| MYC | I | Cappellen et al. [ | • differentially expressed between siRNA treated and control samples | |
| II | Margolin et al. [ | • down-regulated in siRNA treated samples | ||
| III | Bild et al. [ | • differentially expressed between MYCexpressing and control samples | ||
| IV | • up-regulated in MYC-expressing samples | |||
| NOTCH1 | I | Margolin et al. [ | • differentially expressed between GSI treated and control samples | |
| II | • down-regulated in GSI treated samples | |||
| III | Margolin et al. [ | Palomero et al. [ | • differentially expressed between GSI treated and control samples | |
| IV | • down-regulated in GSI treated samples | |||
| V | Sanda et al. [ | • differentially expressed between GSI treated and control samples | ||
| VI | • down-regulated in GSI treated samples | |||
| RELA | I | Espinosa et al. [ | • differentially expressed between NBD treated and control samples | |
| II | Kasowski et al. [ | • down-regulated in NBD treated samples | ||
| III | Kasowski et al. [ | • differentially expressed between TNF-α treated and control samples | ||
| IV | • up-regulated in TNF-α treated samples | |||
| STAT1 | I | Robertson et al. [ | Pitroda et al. [ | • differentially expressed between shRNA treated and control samples |
| II | • down-regulated in shRNA treated samples | |||
| TP53 | I | Wei et al. [ | Chau et al. [ | • differentially expressed between shRNA and control samples |
| II | • down-regulated in shRNA treated samples | |||
Figure 3Comparison between different pathway databases and experimentally derived gold-standards for all considered transcription factors. Value in a given cell is a number of overlapping genes between a gold-standard and a pathway-derived gene set. Cells are colored according to their values from white (low values) to red (high values). Underlined values in red represent statistically significant intersections.
Figure 4Summary of the pathway databases assessment. Green cells represent statistically significant intersections between experimentally derived gold-standards and transcriptional regulatory pathways. White cells denote results that are not statistically significant. Numbers are the enrichment fold change ratios (EFC) calculated for each intersection.