| Literature DB >> 22121217 |
Shannan J Ho Sui1, Kimberly Begley, Dorothy Reilly, Brad Chapman, Ray McGovern, Philippe Rocca-Sera, Eamonn Maguire, Gabriel M Altschuler, Terah A A Hansen, Ramakrishna Sompallae, Andrei Krivtsov, Ramesh A Shivdasani, Scott A Armstrong, Aedín C Culhane, Mick Correll, Susanna-Assunta Sansone, Oliver Hofmann, Winston Hide.
Abstract
Mounting evidence suggests that malignant tumors are initiated and maintained by a subpopulation of cancerous cells with biological properties similar to those of normal stem cells. However, descriptions of stem-like gene and pathway signatures in cancers are inconsistent across experimental systems. Driven by a need to improve our understanding of molecular processes that are common and unique across cancer stem cells (CSCs), we have developed the Stem Cell Discovery Engine (SCDE)-an online database of curated CSC experiments coupled to the Galaxy analytical framework. The SCDE allows users to consistently describe, share and compare CSC data at the gene and pathway level. Our initial focus has been on carefully curating tissue and cancer stem cell-related experiments from blood, intestine and brain to create a high quality resource containing 53 public studies and 1098 assays. The experimental information is captured and stored in the multi-omics Investigation/Study/Assay (ISA-Tab) format and can be queried in the data repository. A linked Galaxy framework provides a comprehensive, flexible environment populated with novel tools for gene list comparisons against molecular signatures in GeneSigDB and MSigDB, curated experiments in the SCDE and pathways in WikiPathways. The SCDE is available at http://discovery.hsci.harvard.edu.Entities:
Mesh:
Year: 2011 PMID: 22121217 PMCID: PMC3245064 DOI: 10.1093/nar/gkr1051
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.System architecture diagram showing integration of data into the SCDE BioInvestigation Index (BII) and Galaxy instances. CSC-related experiments are submitted by stem cell researchers or selected from public repositories. After curation using the ISA tools and conversion to ISA-Tab format, the associated metadata, raw data files and processed gene lists are stored in the BII. The stem cell-specific gene lists are transformed into standardized gene identifiers to facilitate integration and comparison against similarly formatted reference lists (GeneSigDB, MSigDB, WikiPathways and other SCDE experiments) within Galaxy.
Curated metadata
| Field | Ontologies (in order of preference) |
|---|---|
| Organism | NEWT UniProt Taxonomy Database (Newt), NCBI Taxonomy (NCBITaxon) |
| Strain | Experimental Factor Ontology (EFO) |
| Developmental stage | EFO |
| Disease state | ICD-9, NCI Thesaurus, Disease Ontology |
| Organism part (tissue type) | Foundational Model of Anatomy, Mouse Gross Anatomy, BRENDA tissue/enzyme source (BTO), EFO |
| Cell type | Cell Type Ontology (CL), EFO |
| Cell line | EFO, NCI Thesaurus |
| Genotype | Ontology for Biomedical Investigations (OBI; depending on species) |
| Cell surface marker | Currently annotated as ±, high/lo. An appropriate standard needs to be developed. |
| Immunoprecipitation antibody | Protein Name (specify manufacturer where available) |
| Binding site | SO (sequence ontology) for methylation sites |
| Phenotypic quality | PATO: Phenotypic qualities (properties) |
| Treatment (perturbation) | PATO, CHEBI: Chemical Entities of Biological Interest OBI (to describe perturbations such as genetic modification, transient expression) |
SCDE data
| Studies | Assays | |
|---|---|---|
| Organism | ||
| Mouse | 45 (4) | 846 (54) |
| Human | 13 (1) | 255 (30) |
| Rat | 1 | 18 |
| Tissue type | ||
| Blood/bone marrow | 20 (3) | 374 (48) |
| Muscle | 8 | 125 |
| Brain/neural | 6 (1) | 68 (6) |
| Intestine | 4 | 39 |
| Mammary | 2 | 34 |
| Skin | 2 (1) | 135 (30) |
| Measurement | ||
| Transcription profiling | 57 (5) | 1161 (84) |
| Histone modification profiling | 2 | 42 |
| Transcription factor binding site identification | 1 | 21 |
| Tissue histology | 1 (1) | 6 (6) |
aTotal number of studies and assays; the number of private studies and assays are shown in brackets
bFurther information and details of technology platforms are available online at http://discovery.hsci.harvard.edu.
Figure 2.Screenshots showing elements of the BioInvestigation Index browse view. (A) The results of a free text search using the term ‘intestine’ that retrieves four studies—two human and two murine—that include transcription profiling using DNA microarrays, and ChIP-seq for transcription factor binding and histone modifications; (B) One matching record, SHIVDASANI-S-1, with descriptive text; (C) Annotated experimental factor metadata including time, antibodies used in the chromatin immunoprecipitation (ChIP) experiments, and cell differentiation status; (D) Annotated sample metadata including the tissue and cell types, using the Foundational Model of Anatomy (FMA) and Brenda Tissue (BTO) ontologies, respectively; (E) Download panel for the raw and processed data.
Figure 3.Composite figure showing the results of a ListMatch query using the set of intestinal differentiation genes that are reduced upon Cdx2 depletion from the SHIVDASANI-S-2 study. (A) SCDE ListMatch input page with options to compare against WikiPathways, GeneSigDB, MSigDB and the SCDE repository. (B) Results of the query against WikiPathways projected onto the canonical pathway representation with matching genes highlighted in red (partial screenshot shown). (C) Querying against GeneSigDB results in a top match to genes related to liver cancer.