Literature DB >> 25701763

Web resources for stem cell research.

Ting Wei1, Xing Peng2, Lili Ye1, Jiajia Wang2, Fuhai Song2, Zhouxian Bai2, Guangchun Han3, Fengmin Ji4, Hongxing Lei5.   

Abstract

In this short review, we have presented a brief overview on major web resources relevant to stem cell research. To facilitate more efficient use of these resources, we have provided a preliminary rating based on our own user experience of the overall quality for each resource. We plan to update the information on an annual basis.
Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Direct conversion; Network; Physical interaction; Regulatory interaction; Reprogramming

Mesh:

Year:  2015        PMID: 25701763      PMCID: PMC4411488          DOI: 10.1016/j.gpb.2015.01.001

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

Stem cell research is at the frontier of regenerative medicine [1], [2], [3]. To avoid the ethical issues related to the use of embryonic stem cell (ESC) or somatic cell nuclear transfer (SCNT) technology, induced pluripotent stem cell (iPSC) technology has been developed and matured in recent years [4], [5]. Fibroblast and other types of terminally-differentiated cells can be reprogrammed into iPSCs using defined factors. iPSCs can be further differentiated into various tissues using tissue-specific inducing factors [6]. Differentiated cells can also be directly converted to other types of differentiated cells (also termed “trans-differentiation”) [7]. To foster the fast development in this field, several databases and web servers have been established in the past few years (Figure 1). Relevant literature and high-throughput experimental data have been curated. Available data analyses range from identification of physical interactions and regulatory partners to enrichment analysis and network construction. Here, we provide a brief overview of these web resources. Based on our own user experience of the overall quality of the resources, we have provided a preliminary rating for those resources (Table 1). The rating is mainly based on: (1) how many types of data have been included? (2) how many samples or high-throughput experiments have been included? (3) what kind of online data analysis is available? (4) is the web interface user friendly? and most importantly, (5) can we gain any novel insight by using the web tool?
Figure 1

Integration of high-throughput data in stem cell research

In order to achieve better understanding on reprogramming, direct conversion, self-renewal, and other processes in stem cell biology, genome-wide profiling have been conducted at the end points and sometimes during those processes. Various types of high-throughput data have been collected and integrated in over a dozen of specialized web resources. The relationship among critical genes can be visualized in a variety of ways. Shown on the background is a network generated by StemCellNet.

Table 1

Major web resources for stem cell research

NameLinkMain featuresRatingRefs.
CellNethttp://cellnet.hms.harvard.edu/Cell type classification; gene regulatory networks; refinement of factors for cell engineering★★★★★[8]
LifeMaphttp://discovery.lifemapsc.com/Differentiation, development, and regenerative medicine; graphical display of embryonic development ontology tree★★★★★[9]
ESCAPEhttp://www.maayanlab.net/ESCAPE/Multiple data types for human and mouse ESCs; network construction, enrichment analysis, lineage prediction★★★★★[10]
StemCellNethttp://stemcellnet.sysbiolab.eu/Network with physical interaction and regulation; interactive visualization of the network online★★★★★[11]
HSC-explorerhttp://mips.helmholtz-muenchen.de/HSC/Early stage of hematopoiesis; interactive graphical display with many functionalities★★★★★[12]
SyStemCellhttp://lifecenter.sgst.cn/SyStemCell/Clear indication of up or down regulation; co-localization analysis for discovery of novel correlation★★★★☆[13]
CORTECONhttp://cortecon.neuralsci.org/NGS data from in vitro cortical development; gene, cluster, disease, KEGG pathway, and GO term★★★★☆[14]
SCDEhttp://discovery.hsci.harvard.edu/Tissue and cancer stem cells; curation on experiments; enrichment analysis; code sharing★★★★☆[15]
StemBasehttp://www.stembase.ca/?path=/Detailed curation of experiment information; correlation and mutual information analysis★★★★☆[16]
CODEXhttp://codex.stemcells.cam.ac.uk/NGS data for ESCs and haematopoietic cells★★★☆☆[17]
ESCDhttp://biit.cs.ut.ee/escd/ESCs, embryonic carcinoma cells; search by GO terms★★☆☆☆[18]

Note: Our rating is mainly based on the number of data types included; the number of samples or high-throughput experiments included, the kinds of online data analysis available, whether the web interface is user-friendly, and most importantly, whether users can gain any novel insight by using the web tool.

Integration of high-throughput data in stem cell research In order to achieve better understanding on reprogramming, direct conversion, self-renewal, and other processes in stem cell biology, genome-wide profiling have been conducted at the end points and sometimes during those processes. Various types of high-throughput data have been collected and integrated in over a dozen of specialized web resources. The relationship among critical genes can be visualized in a variety of ways. Shown on the background is a network generated by StemCellNet. Major web resources for stem cell research Note: Our rating is mainly based on the number of data types included; the number of samples or high-throughput experiments included, the kinds of online data analysis available, whether the web interface is user-friendly, and most importantly, whether users can gain any novel insight by using the web tool.

CellNet

Among the available web resources, CellNet is the most practical tool for somatic cell reprogramming and direct conversion [8]. Analyses on the gene regulatory network (GRN) have been conducted on 20 mouse cell lines or tissue types and 16 human cell lines or tissue types, and several characteristic GRN modules have been identified for each cell line or tissue type. The main aim of CellNet is to facilitate cell engineering, not limited to stem cell biology. User-uploaded gene expression profiles are compared with the benchmark profiles, and three types of analysis results can be obtained. The first is cell and tissue type classification, basically indicating how close the engineered cell is to any of the benchmark cells or tissues. The second is the GRN status, i.e., the evaluation of the establishment of the characteristic GRN modules for intended target cell or tissue. The third is the network influence score. For each of the critical transcriptional regulators of the intended target cell or tissue, the distance to the expected expression level will be calculated and the top 50 down-regulated regulators will be highlighted. Overall, CellNet provides a practical guide to fill the gap between the engineered cell and the intended target. Although CellNet is not specifically designed for stem cell research, this unique application on cell engineering is the main reason we gave it a 5-star rating.

LifeMap

LifeMap contains a large collection of the literature and gene expression data relevant to stem cell differentiation, embryonic development and regenerative medicine [9]. Information is available for cell types including ESCs, iPSCs, embryonic progenitor cells, adult stem cells, primary cells, and fully-differentiated somatic cells from human and mouse. Retrievable information include gene expression, signaling pathways, cell types, developmental stages, anatomical compartments, differentiation protocols, diseases, cell therapies, and literature references. Illustrative and interactive images are provided for better user experience. LifeMap is more like an encyclopedia for embryonic development and regenerative medicine. The main highlights include comprehensive curation of both literature and gene expression information, interactive graphical interface of the full development tree, and unique information on regenerative medicine. Registration is required for the access of the full features.

ESCAPE

The Embryonic Stem Cell Atlas from Pluripotency Evidence (ESCAPE) database is developed based on gene sets from published experiments on human and mouse ESCs [10]. The curated data types include chromatin immunoprecipitation (ChIP) data for protein−DNA interaction, regulatory information from loss-of-function and gain-of-function (Logof) experiments, protein–protein interaction (PPI) using key factors as baits, miRNA−target interactions from popular miRNA websites, potential key regulators from RNAi experiments, ESC- or differentiating ESC- specific proteins, histone modifications, miRNA expression, and time-course expression. In addition to the retrieval of the collected information, these gene sets can also be used to construct interaction and regulatory networks, conduct enrichment analysis for user-supplied gene lists, and predict one of the four lineages during ESC differentiation, the latter being a unique feature among the available web tools described in this article. The network is built upon the input gene list, curated ChIP, PPI, and Logof data.

StemCellNet

StemCellNet is mainly a network tool for stem cell biology [11]. The datasets supporting the network construction include physical protein interactions with key regulators, transcriptional regulatory interactions from ChIP binding experiments, generic physical and regulatory interactions from public resources, and stemness gene sets from the literature. The constructed network can be visualized online or downloaded (as exemplified in Figure 1). The online network display can be refined according to several options. The node size can be adjusted based on the number of appearances of the specific gene in the stemness datasets. Users can also evaluate the importance of the nodes based on the number of key stemness neighbors. In addition, analysis on the significance of enrichment can be performed on the network for each of the stemness gene sets. The network can also be annotated by incorporating user-uploaded gene expression profiles. Trimming of the network can be achieved by applying one or several of the filters. The network functionality in StemCellNet is the best among the web tools reviewed in this article.

HSC-explorer

HSC-explorer is a curated database for hematopoietic stem cells (HSCs) [12]. This database is focused on the early stage of hematopoiesis. At the time of the writing of this manuscript, over 7000 experimentally-validated interactions have been collected from 217 publications. Detailed data statistics is shown on the homepage. Search results can be displayed as both tables and graphical networks. The interactions are carefully curated with links to the original publications when necessary. The graphical network is user-friendly with a variety of functionalities. The heterogeneous network nodes include gene/protein, SNP, CpG site, drug, pathway, disease, organism, and environment, among others. The types of directional interactions include increasing, decreasing and affecting the expression, quantity, activity, etc. of one entity by the other. Detailed information can be displayed on mouseover at the nodes or edges. In addition to the retrieval of directly-collected information, several topics with special interest in hematopoiesis have been curated. Overall, this database is a good resource for researchers interested in hematopoiesis.

SyStemCell

SyStemCell collected 285 stem cell related publications at the initial release [13]. The majority of the data is on human and mouse, although a small amount of data is on rat and rhesus macaque. The data types include mRNA expression, protein expression, DNA methylation and hydromethylation, histone modification, miRNA information, and transcription factor (TF) regulation. The search results are displayed as increase, detected, and decrease with different colors. Annotations include information from gene ontology (GO), BioCarta, the NCBI BioSystems database, and the database of Differentially Expressed Proteins in human Cancer (dbDEPC). Other functionalities include data browsing and co-localization analysis. The co-localization analysis can be used to discover novel correlation among the selected features. The last release of SyStemCell was on Feb 10, 2012. Therefore, data in the past three years may not be available at this website.

CORTECON

CORTECON is a neural stem cell (NSC)-specific resource and a repository for gene expression in the in vitro developing human cortex [14]. The web tool is mainly based on one high-throughput sequencing study by the authors themselves. The temporal expression data can be retrieved by gene, disease, KEGG pathway, or GO term. Every gene belongs to one of the clusters according to the temporal expression profile. But a gene may be associated with several diseases or multiple stages of cortical development. In general, the relationship among gene cluster, disease, KEGG pathway, GO term, and development stage seems to be many-to-many. Since this is a single study-based web tool, interpretation of the search results shall be cautioned.

SCDE

The Stem Cell Discovery Engine (SCDE) is mainly focused on resources for cancer stem cells [15]. Over 53 relevant datasets (1098 assays) have been curated in the database, including samples from blood, intestine, and brain, almost all from human and mouse. User-specified gene lists can be compared against the curated datasets. They can also be compared against molecular signatures in GeneSigDB, MSigDB, and WikiPathway. SCDE has recently evolved into two components, Stem Cell Commons and Galaxy, although both appear to be in the process of further development. The Galaxy is mainly devoted to data analysis mentioned above. The Stem Cell Commons (http://stemcellcommons.org/) is being developed into an integrated platform, including browse, search, analysis, visualization, and code sharing. Users can also upload data to the Stem Cell Commons. The main goals are to promote discovery and reproducibility in stem cell research.

StemBase

StemBase has curated 62 experiments and 217 samples from mouse, human, and rat [16]. The database can be searched in simple and advanced modes. A portion of the expression information can be retrieved by specifying several fields. The retrieved information can be annotated by GO terms and relevant publications. An additional feature in StemBase is the correlation and mutual information of expression among the specified genes or probes. The expression of each probe can also be viewed on the UCSC genome browser, which seems to be a unique feature. StemBase was originally designed in 2007 without any major update. Therefore, most of the data collected are not so up-to-date.

CODEX

CODEX is devoted to next-generation sequencing (NGS) experiments including ChIP-seq, RNA-seq, and DNase-seq [17]. The datasets are divided by species (human and mouse data). The regulatory information derived from the datasets can also be retrieved. The CODEX server consists of three sections, i.e., HAEMCODE for haematopoietic cells, ESCODE for embryonic stem cells, and CODEX for all cell types. Due to the limited NGS data available for stem cell-related experiments, CODEX is of limited use at the present time.

ESCD

The Embryonic Stem Cell Database (ESCD) has mainly collected datasets on key transcription factor binding, RNAi knockdown, and protein overexpression experiments [18]. Data from both human and mouse samples have been included. In addition to ESCs, data for embryonic carcinoma cells have also been included. ESCD can be queried by gene IDs and GO terms. The major weakness of ESCD is the limited data types and datasets covered.

Other resources

Several other resources are available on the web. StemCellDB (http://stemcells.nih.gov/research/nihresearch/scunit/Pages/Default.aspx) is established by the NIH Stem Cell Unit with an aim for direct comparison of human ESC lines, adult stem cells, and iPSCs [19]. PluriNetWork (http://www.ibima.med.uni-rostock.de/IBIMA/PluriNetWork/) has curated 274 pluripotency genes in mouse with 574 interactions (the current data statistics) [20]. The network can be downloaded for further exploration. FunGenES was originally designed for mouse ESC differentiation [21]. However, the web server is no longer active. Additionally, large amount of data is available from some worldwide collaboration projects with broad scope, including ENCODE (http://genome.ucsc.edu/ENCODE/), TCGA (https://icgc.org/), and Roadmap Epigenomics (http://www.roadmapepigenomics.org/). However, a portion of the data from these projects has already been curated in some of the web tools described above.

Concluding remarks

It is an ongoing effort to develop efficient tools for the better understanding of reprogramming, differentiation, and trans-differentiation. Some of the web resources are continuously updated or upgraded. We shall point out that a good portion of the web resources have not been well maintained since the initial publication. New tools will surely emerge in the future. The continuous effort on web maintenance should be carefully considered when developing new web tools. We ourselves are also in the process of developing an integrated web server for stem cell research. Mere collection of public data will be far from sufficient in the future. A major effort should be focused on enhancing our fundamental understanding of the mechanism regarding the maintenance of pluripotency and gaining precise control of the reprogramming, differentiation, and direct conversion.

Competing interests

The authors declare that there are no conflicts of interest.
  21 in total

1.  StemBase: a resource for the analysis of stem cell gene expression data.

Authors:  Christopher J Porter; Gareth A Palidwor; Reatha Sandie; Paul M Krzyzanowski; Enrique M Muro; Carolina Perez-Iratxeta; Miguel A Andrade-Navarro
Journal:  Methods Mol Biol       Date:  2007

Review 2.  Control of the embryonic stem cell state.

Authors:  Richard A Young
Journal:  Cell       Date:  2011-03-18       Impact factor: 41.582

3.  Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors.

Authors:  Kazutoshi Takahashi; Shinya Yamanaka
Journal:  Cell       Date:  2006-08-10       Impact factor: 41.582

4.  A data integration approach to mapping OCT4 gene regulatory networks operative in embryonic stem cells and embryonal carcinoma cells.

Authors:  Marc Jung; Hedi Peterson; Lukas Chavez; Pascal Kahlem; Hans Lehrach; Jaak Vilo; James Adjaye
Journal:  PLoS One       Date:  2010-05-21       Impact factor: 3.240

5.  StemCellDB: the human pluripotent stem cell database at the National Institutes of Health.

Authors:  Barbara S Mallon; Josh G Chenoweth; Kory R Johnson; Rebecca S Hamilton; Paul J Tesar; Amarendra S Yavatkar; Leonard J Tyson; Kyeyoon Park; Kevin G Chen; Yang C Fann; Ronald D G McKay
Journal:  Stem Cell Res       Date:  2012-09-26       Impact factor: 2.020

6.  The PluriNetWork: an electronic representation of the network underlying pluripotency in mouse, and its applications.

Authors:  Anup Som; Clemens Harder; Boris Greber; Marcin Siatkowski; Yogesh Paudel; Gregor Warsow; Clemens Cap; Hans Schöler; Georg Fuellen
Journal:  PLoS One       Date:  2010-12-10       Impact factor: 3.240

7.  The Stem Cell Discovery Engine: an integrated repository and analysis system for cancer stem cell comparisons.

Authors:  Shannan J Ho Sui; Kimberly Begley; Dorothy Reilly; Brad Chapman; Ray McGovern; Philippe Rocca-Sera; Eamonn Maguire; Gabriel M Altschuler; Terah A A Hansen; Ramakrishna Sompallae; Andrei Krivtsov; Ramesh A Shivdasani; Scott A Armstrong; Aedín C Culhane; Mick Correll; Susanna-Assunta Sansone; Oliver Hofmann; Winston Hide
Journal:  Nucleic Acids Res       Date:  2011-11-24       Impact factor: 16.971

8.  SyStemCell: a database populated with multiple levels of experimental data from stem cell differentiation research.

Authors:  Jian Yu; Xiaobin Xing; Lingyao Zeng; Jiehuan Sun; Wei Li; Han Sun; Ying He; Jing Li; Guoqing Zhang; Chuan Wang; Yixue Li; Lu Xie
Journal:  PLoS One       Date:  2012-07-13       Impact factor: 3.240

9.  CODEX: a next-generation sequencing experiment database for the haematopoietic and embryonic stem cell communities.

Authors:  Manuel Sánchez-Castillo; David Ruau; Adam C Wilkinson; Felicia S L Ng; Rebecca Hannah; Evangelia Diamanti; Patrick Lombard; Nicola K Wilson; Berthold Gottgens
Journal:  Nucleic Acids Res       Date:  2014-09-30       Impact factor: 19.160

10.  The FunGenES database: a genomics resource for mouse embryonic stem cell differentiation.

Authors:  Herbert Schulz; Raivo Kolde; Priit Adler; Irène Aksoy; Konstantinos Anastassiadis; Michael Bader; Nathalie Billon; Hélène Boeuf; Pierre-Yves Bourillot; Frank Buchholz; Christian Dani; Michael Xavier Doss; Lesley Forrester; Murielle Gitton; Domingos Henrique; Jürgen Hescheler; Heinz Himmelbauer; Norbert Hübner; Efthimia Karantzali; Androniki Kretsovali; Sandra Lubitz; Laurent Pradier; Meena Rai; Jüri Reimand; Alexandra Rolletschek; Agapios Sachinidis; Pierre Savatier; Francis Stewart; Mike P Storm; Marina Trouillas; Jaak Vilo; Melanie J Welham; Johannes Winkler; Anna M Wobus; Antonis K Hatzopoulos
Journal:  PLoS One       Date:  2009-09-03       Impact factor: 3.240

View more
  3 in total

1.  On bioinformatic resources.

Authors:  Runsheng Chen
Journal:  Genomics Proteomics Bioinformatics       Date:  2015-03-02       Impact factor: 7.691

Review 2.  Neural Stem Cells (NSCs) and Proteomics.

Authors:  Lorelei D Shoemaker; Harley I Kornblum
Journal:  Mol Cell Proteomics       Date:  2015-10-22       Impact factor: 5.911

Review 3.  Informatics Approaches for Harmonized Intelligent Integration of Stem Cell Research.

Authors:  Joseph Finkelstein; Irena Parvanova; Frederick Zhang
Journal:  Stem Cells Cloning       Date:  2020-01-28
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.