| Literature DB >> 31455220 |
Di Feng1, Charles E Whitehurst2, Dechao Shan3, Jon D Hill3, Yong G Yue3,4.
Abstract
BACKGROUND: Single cell transcriptome sequencing has become an increasingly valuable technology for dissecting complex biology at a resolution impossible with bulk sequencing. However, the gap between the technical expertise required to effectively work with the resultant high dimensional data and the biological expertise required to interpret the results in their biological context remains incompletely addressed by the currently available tools.Entities:
Keywords: D3; Django; Pipeline; Python; RNA-seq; Single cell; Transcriptomics; Visualization
Year: 2019 PMID: 31455220 PMCID: PMC6712711 DOI: 10.1186/s12864-019-6053-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Single Cell Explorer workflow architecture process and component view. a Overview of the data process workflow steps for Single Cell Explorer. Step #1: Run pipeline to process FASTQ files using Python wrapper through Jupyter Notebook. Step #2: Quality control of data, generation of 2d representation, and database upload. Step #3: Interactive data analyses and annotation of cell types. Step #4: Recording of annotated results in MongoDB for sharing with all users. Step #5: All results from MongoDB can be accessed directly or via API. b A screenshot for Single Cell Explorer data navigator page and a t-SNE map for one dataset
Fig. 2Interactive FeaturePlot. a A t-SNE and UMAP representation from first-trimester placentas with matched maternal blood and decidual cells. Individual pre-labeled cell types are painted in different colors. The function of painting two genes (CD8A and CD3D) highlights the location of CD8 T cell clusters. A 2D plot of circles indicates the proportion of the single positive and double positive cells. b To query a list of genes, a heatmap can be generated after freehand selection of cells of interest. ILC3 cells can be identified using markers including KIT and DLL1
Fig. 3Understanding single cell clustering results. a UMAP of human normal PBMC with various clustering results using different resolution parameters by the leiden algorithm (scanpy.api.tl.leiden function). b Feature plot of cells which are positive for each individual marker gene. c A heatmap of marker gene expression within each cluster defined by leiden algorithm
Fig. 4Cell type and feature discovery. Step #1: Load 2D embedding map. Step #2 Use a freehand tool to select the cells of interest. Step #3: Compare the differentially expressed genes of selected cells with all unselected cells. Step #4: Interactively visualize gene expression levels using the resulting table. Step #5: Record cell types and marker genes for future reference. Step #6: Position the newly-labelled cells on the map and compare with other specific cell types
API function to retrieve data from database
| Name | Function |
|---|---|
| getAllClstrsByClstrsType | retrieve a table of cell barcodes and annotated cell types in a specific map |
| getNormalizedGeneExpr | get normalized counts matrix for genes of interest from specific cell types in a specific map |
| getAllNormalizedGeneExpr | get full normalized gene counts matrix from specific cell types in specific map |
| getMarkGenesByMapidAndClusterType | get annotated marker genes |
| getMaps | get meta data from a specific map |
| exportAllClstrsByClstrsType | export the cell barcodes and annotated cell types of the cells from a specific map into a csv file |