| Literature DB >> 23794736 |
Huilei Xu1, Caroline Baroukh, Ruth Dannenfelser, Edward Y Chen, Christopher M Tan, Yan Kou, Yujin E Kim, Ihor R Lemischka, Avi Ma'ayan.
Abstract
High content studies that profile mouse and human embryonic stem cells (m/hESCs) using various genome-wide technologies such as transcriptomics and proteomics are constantly being published. However, efforts to integrate such data to obtain a global view of the molecular circuitry in m/hESCs are lagging behind. Here, we present an m/hESC-centered database called Embryonic Stem Cell Atlas from Pluripotency Evidence integrating data from many recent diverse high-throughput studies including chromatin immunoprecipitation followed by deep sequencing, genome-wide inhibitory RNA screens, gene expression microarrays or RNA-seq after knockdown (KD) or overexpression of critical factors, immunoprecipitation followed by mass spectrometry proteomics and phosphoproteomics. The database provides web-based interactive search and visualization tools that can be used to build subnetworks and to identify known and novel regulatory interactions across various regulatory layers. The web-interface also includes tools to predict the effects of combinatorial KDs by additive effects controlled by sliders, or through simulation software implemented in MATLAB. Overall, the Embryonic Stem Cell Atlas from Pluripotency Evidence database is a comprehensive resource for the stem cell systems biology community. Database URL: http://www.maayanlab.net/ESCAPEEntities:
Mesh:
Substances:
Year: 2013 PMID: 23794736 PMCID: PMC3689438 DOI: 10.1093/database/bat045
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Statistics of the current version of the ESCAPE database
| Data type | Description | Interactions | Source Proteins | Studies |
|---|---|---|---|---|
| Protein–gene promoter binding | Protein–gene promoter binding interactions extracted from ChIP-chip/seq studies | 206 521 (mESC); | 61 (mESC) | 38 |
| Protein–gene transcriptional regulation | Protein–mRNA regulatory interactions extracted from studies of LOF/GOF and GOF followed by microarray profiling | 150 251 (mESC); 3669(hESC) | 73 (mESC); 3 (hESC) | 25(mESC); 3(hESC) |
| Protein–protein interactions | Protein–protein interactions from both high-throughput mass-spectrometry proteomics studies using key TF regulators as bait and manually curated from low-throughput studies | 912 (mESC,HT); 118 (mESC,LT); 7 (hESC,LT) | 7 (mESC,HT); 77 (mESC,LT); 4 (hESC,LT) | |
| miRNA-predicted target interactions | miRNA-predicted target interactions in mice integrated from published databases: TarBase, TargetScan, miRander and miRBase | 693 552 | ||
| Histone modifications | Histone modifications profiles of genes | 107 types | 6 | |
| Pluripotency genes | Potential pluripotency associated genes from multiple large-scale RNAi screen studies | 640 (mESC); 173 (hESC) | 9 (mESC); 1 (hESC) | |
| ESC-specific proteins | Undifferentiated and differentiating ESC specific proteins | 5547 (mESC); 6764 (hESC); 457 (diff-mESC);7033 (diff-hESC) | 2 (mESC); 4 (hESC) | |
| ESC-specific phosphoproteins-phosphosites | Undifferentiated and differentiating ESC specific phosphoproteins with specified phosphosites | 3697 (hESC) 4626 (diff-hESC) | 4 (hESC) | |
| Gene expression | Time-course gene expression of ESCs under undirected differentiation | 1 (3 datasets) | ||
| miRNA expression | miRNA expression in ESC/MEF/iPSC | 1 (1 dataset) |
aIncluding one unpublished data set for Esrrb.
ESCAPE literature references
| Interaction types | PMIDs | Interaction types | PMID | Interaction types | PMIDs |
|---|---|---|---|---|---|
| ChIP-chip/seq | 19251738 | LOF/GOF | 22210892 | Protein–protein interactions | 19036726 |
| 19339689 | 22196727 | 19056487 | |||
| 16625203 | 20720539 | 19117947 | |||
| 17442700 | 22327834 | 19158397 | |||
| 18264089 | 22020125 | 19172185 | |||
| 18358816 | Protein–protein interactions | 20362541 | 19349578 | ||
| 19079543 | 20362542 | 19421146 | |||
| 16518401 | 17093407 | 19440552 | |||
| 18347094 | 22083510 | 19489080 | |||
| 18692474 | 20946988 | 19536159 | |||
| 18700969 | 1769609 | 19544440 | |||
| 18959480 | 8939963 | 19564334 | |||
| 19796622 | 10849651 | 19571885 | |||
| 19030024 | 11791180 | 19625610 | |||
| 18555785 | 11934987 | 19650037 | |||
| 18804426 | 12145208 | 19703396 | |||
| 19587682 | 12646244 | 19740739 | |||
| 19884257 | 12774123 | 19798101 | |||
| 19884255 | 14551209 | 19816951 | |||
| 18467660 | 15103331 | 19821493 | |||
| 20064375 | 15861132 | 20075857 | |||
| 20075857 | 15863505 | 20110566 | |||
| 20946988 | 16129412 | 20508149 | |||
| 18974828 | 16253997 | 20736927 | |||
| 20872845 | 16325584 | 18568018 | |||
| 20139965 | 16382133 | 9748258 | |||
| 21062744 | 16395332 | 21062744 | |||
| 21170310 | 16631596 | 21159818 | |||
| 21183938 | 16702210 | 21589869 | |||
| 20123909 | 16702404 | 21884934 | |||
| 21448134 | 16763566 | 22334693 | |||
| 20581084 | 16790473 | Pluripotency genes from genome-wide RNAi screen | 16767105 | ||
| 20144788 | 16801560 | 19345177 | |||
| 21632747 | 16840789 | 19339689 | |||
| 22325148 | 16978048 | 18614019 | |||
| 20720539 | 16999741 | 20720539 | |||
| 23239880 | 17030610 | 20953172 | |||
| LOF/GOF | 16518401 | 17324942 | 22143885 | ||
| 16767105 | 17339329 | 22899353 | |||
| 17339329 | 17372190 | 22327834 | |||
| 17448993 | 17520687 | 21874018 | |||
| 17515932 | 17543867 | Proteomics | 16600995 | ||
| 18264089 | 17687327 | 19664995 | |||
| 18757296 | 17892859 | 19151416 | |||
| 19060217 | 17938196 | 21406692 | |||
| 19136965 | 17994007 | 21149613 | |||
| 19530134 | 18055446 | Histone modifications | 19884255 | ||
| 19618472 | 18055449 | 17603471 | |||
| 19884255 | 18223644 | 18692474 | |||
| 20075857 | 18454139 | 18600261 | |||
| 20139965 | 18454140 | 20682450 | |||
| 21915945 | 18454141 | 20944595 | |||
| 20875108 | 18462200 | Phosphoproteomics | 19664994 | ||
| 20526341 | 18467660 | 19664995 | |||
| 19796622 | 18585351 | 19151416 | |||
| 20953172 | 18680430 | 21406692 | |||
| 20123909 | 18687992 | Gene expression | 17394647 | ||
| 21589869 | 18818694 | miRNA expression | 18692474 | ||
| 21632747 | 18957414 | ||||
| 21924763 | 18983969 |
Figure 1.Entity relation diagram of the ESCAPE database. PK- primary key, FK- foreign key.
Figure 2.Screenshot of the main menu of ESCAPE.
Figure 3.An example from the ‘NETWORKS’ tab for linking seed nodes through other nodes based on the interactions within the database.
Figure 4.(A) General example from the ‘ENRICHMENT ANALYSIS’ tab for identifying overlap between gene lists from the database and other user inputted lists. (B) Enrichment results for a list of gene from MGI-MP with a phenotype designation ‘embryonic growth arrest’. (C) Enrichment results for a list of gene from MGI-MP with a phenotype designation ‘abnormal kidney morphology’. Enriched terms are highlighted in circles and some terms are annotated. Brighter circles represent more significant overlap.
Figure 5.An example from the ‘LINEAGE PREDICTION’ tab showing that LOF of Oct4/Pou5f1, Nanog and Sox2 results mostly in differentiating toward the trophectoderm lineage.
Figure 6.Heatmaps of 15 pluripotency components and 15 lineage marker genes. (A–G) Numeric values in the color bars represent similarity distance scores (1 – Similarity). Similarity is calculated using the Jaccard-coefficient for: (A) TF/target-gene promoter binding set overlap determined by ChIP-chip/seq publications; (B) Transcription-factor/mRNA regulatory interactions determined by LOF/GOF followed by expression; (C) Histone modifications determined by ChIP-chip/seq; (D) Protein–protein interactions from IP/MS studies; (E) miRNA-target gene interactions. (F) Similarity of pair-wise genes from mRNA co-expression was calculated using the Pearson’s correlation coefficient. (G) Multi-layer heatmap was constructed by normalizing the distances across all six layers.