| Literature DB >> 32334526 |
Rui Dong1,2,3, Guo-Cheng Yuan4,5,6.
Abstract
BACKGROUND: With the rapid development of single-cell RNA sequencing technology, it is possible to dissect cell-type composition at high resolution. A number of methods have been developed with the purpose to identify rare cell types. However, existing methods are still not scalable to large datasets, limiting their utility. To overcome this limitation, we present a new software package, called GiniClust3, which is an extension of GiniClust2 and significantly faster and memory-efficient than previous versions.Entities:
Keywords: Gini index; Rare cell identification; Scalability; Single cell RNA-seq
Mesh:
Year: 2020 PMID: 32334526 PMCID: PMC7183612 DOI: 10.1186/s12859-020-3482-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Analysis of mouse brain dataset with more than one million cells. a An overview of the GiniClust3 pipeline. Input single-cell expression matrix is clustered based on features selected by Gini index (GiniIndexClust) and by Fano factor (FanoFactorClust), respectively. The results are then integrated using a cluster-aware, weighted consensus clustering algorithm (ConsensusClust). b UMAP visualization of the gene expression patterns based on Fano-factor (top) and Gini index (bottom) selected features, respectively. Consensus clustering results are indicated by different colors. c The proportion of rare cell cluster in entire population. d Heatmap of cell type mapping of common and rare clusters from scMCA analysis. Bar plot in the top indicates the cell number for each cluster