| Literature DB >> 29045725 |
José P Pinto1,2, Rui S R Machado1,2, Ramiro Magno2,3, Daniel V Oliveira4, Susana Machado2,3, Raquel P Andrade2,3,5, José Bragança2,3,5, Isabel Duarte2,3, Matthias E Futschik1,2,6,7.
Abstract
Transcriptomic data have become a fundamental resource for stem cell (SC) biologists as well as for a wider research audience studying SC-related processes such as aging, embryonic development and prevalent diseases including cancer, diabetes and neurodegenerative diseases. Access and analysis of the growing amount of freely available transcriptomics datasets for SCs, however, are not trivial tasks. Here, we present StemMapper, a manually curated gene expression database and comprehensive resource for SC research, built on integrated data for different lineages of human and mouse SCs. It is based on careful selection, standardized processing and stringent quality control of relevant transcriptomics datasets to minimize artefacts, and includes currently over 960 transcriptomes covering a broad range of SC types. Each of the integrated datasets was individually inspected and manually curated. StemMapper's user-friendly interface enables fast querying, comparison, and interactive visualization of quality-controlled SC gene expression data in a comprehensive manner. A proof-of-principle analysis discovering novel putative astrocyte/neural SC lineage markers exemplifies the utility of the integrated data resource. We believe that StemMapper can open the way for new insights and advances in SC research by greatly simplifying the access and analysis of SC transcriptomic data. StemMapper is freely accessible at http://stemmapper.sysbiolab.eu.Entities:
Mesh:
Year: 2018 PMID: 29045725 PMCID: PMC5753294 DOI: 10.1093/nar/gkx921
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.StemMapper workflow using a set of genes associated with the differentiation of neural precursor cells into astrocytes. Query panel (steps 1–4): After selecting all transcriptomic profiles for mouse (step 1), the 18 genes of interest were used as input (step 3) without the optional selection of pre-compiled marker genes (step 2), and the query was executed (step 4). Analysis panel (steps 5–7): Inspection of the produced heatmap (step 5) shows that Gfap is expressed specifically in samples of the neural lineages (highlighted with a green box). Calculation of the correlation with Gfap leads to the identification of 40 genes with strongly correlated or anti-correlated expression patterns (step 6). These genes were then used as new input together with Gfap (step 7). In the newly generated heatmap (step 8) seven genes (highlighted with a red box) displayed high expression in samples of the neural lineage and weak or no expression in most other samples similar to Gfap (indicated by a red star). The PCA plot based on the 40 input genes shows a clear separation of samples of the neural lineage including olfactory (bulb neural stem) cells (step 9). Researchers can then download the processed data to perform follow-up analyses.