| Literature DB >> 24304896 |
Harald Stachelscheid1, Stefanie Seltmann, Fritz Lekschas, Jean-Fred Fontaine, Nancy Mah, Mariana Neves, Miguel A Andrade-Navarro, Ulf Leser, Andreas Kurtz.
Abstract
CellFinder (http://www.cellfinder.org) is a comprehensive one-stop resource for molecular data characterizing mammalian cells in different tissues and in different development stages. It is built from carefully selected data sets stemming from other curated databases and the biomedical literature. To date, CellFinder describes 3394 cell types and 50 951 cell lines. The database currently contains 3055 microscopic and anatomical images, 205 whole-genome expression profiles of 194 cell/tissue types from RNA-seq and microarrays and 553 905 protein expressions for 535 cells/tissues. Text mining of a corpus of >2000 publications followed by manual curation confirmed expression information on ∼900 proteins and genes. CellFinder's data model is capable to seamlessly represent entities from single cells to the organ level, to incorporate mappings between homologous entities in different species and to describe processes of cell development and differentiation. Its ontological backbone currently consists of 204 741 ontology terms incorporated from 10 different ontologies unified under the novel CELDA ontology. CellFinder's web portal allows searching, browsing and comparing the stored data, interactive construction of developmental trees and navigating the partonomic hierarchy of cells and tissues through a unique body browser designed for life scientists and clinicians.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24304896 PMCID: PMC3965082 DOI: 10.1093/nar/gkt1264
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Information types, data sources and content of CellFinder and CELDA
| Information type | Ontologies (used in CELDA) | Data source (in addition to data from the ontology) | Current number of data entities |
|---|---|---|---|
| Cell type | CL, FMA, EFO | Manual curation | 3394 |
| Cell line | CL, CLO, EFO | hESCreg ( | 50 951 |
| Cellosaurus | |||
| Manual curation | |||
| Cell line origin/derivation | CL, CLO | Manual curation | NA |
| Cells/tissue | UBERON, FMA, MA | Manual curation | 535 |
| Developmental stages | EHDAA, EFO | Manual curation | 2578 |
| Gene expression | GO | GEO ( | 205 expression profiles of 194 cell/tissue types |
| Array express ( | |||
| RNA-seq atlas ( | |||
| StemBase ( | |||
| Text mining ( | |||
| Characterization tool ( | |||
| Protein expression | GO | Human protein atlas ( | 553 905 protein expressions for 535 cells/tissues |
| Text mining ( | |||
| Characterization tool ( | |||
| Histology/cytology | NA | Wellcome images | 3055 of 1790 cells/tissues |
| Cell image library ( | |||
| Manual curation | |||
| Cellular components | GO, EFO | Human protein atlas ( | 3021 |
ahttp://wellcomeimages.org.
Not listed are data provided by individual researchers. The CELDA ontology contains currently in total 196 777 terms and 4 899 810 relations.
CL, Cell Ontology; CLO, Cell Line Ontology; EFO, Experimental Factor Ontology; EHDAA, Human Developmental Anatomy ontology; FMA, Foundational Model of Anatomy ontology; MA, Adult Mouse Anatomical Dictionary; GO, Gene Ontology; NA, not available.
Figure 1.Overview of the backend, integration and access components of CellFinder. The Web site is implemented using PHP and JavaScript. The CELDA ontology is implemented with the JENA framework and translated into a MySQL database in which also the molecular and text mining data are stored. Images are stored on an OMERO server. Access to the data is provided via web services implemented in Java.
Figure 3.Text-based search application. The text-based search application was used to identify genes that are expressed in podocytes and compared in their expression with other tissues. (A) By searching for podocytes, a glomerular visceral epithelial cell of the kidney, a list of results is provided together with pictograms that identify the types of information available for each term. (B) Detailed information can be accessed for each term, including definition, relations and present/absent information on gene expression as well as relevant links to other databases. (C) Precalculated relative expression levels in different tissues (here podocalyxin, PODXL) can be displayed for each gene in the gene list as a bar plot (left) or visualized as a heatmap in the SBB (right).
Figure 2.Cooperation of the different web interface components for data retrieval. Information about specific cell types can currently be retrieved by (i) direct search for organs, tissues, cell types or cell lines, (ii) browsing in the human body using the SBB and (iii) browsing the developmental tree. (iv) Comparisons of gene and protein expression data can be made in the analysis component.
Figure 4.Semantic body browser (SBB) and Developmental Tree. (A) The SBB allows graphically explored tissues at different developmental stages (Carnegy Stage: CS) as shown for the nephron in the kidney. (B) The adult stage (CS23) allows zooming into the cell level by clicking on the relevant structure (red: podocytes). At all stages, relevant information is provided for the selected tissues and cells if available in CellFinder. (C) The Developmental Tree is linked to the cells in the SBB and provides a cell- and tissue-based hierarchical relationship tree, a section of which is shown here for the development of a metanephric glomerular visceral epithelial cell (podocyte) from a metanephric mesenchymal stem cell. The box on the upper right provides an overview of the whole tree from which the section is displayed. General information is provided for a selected cell in the tree in the lower right if available in CellFinder.