Literature DB >> 28657835

Identifying Cell Subpopulations and Their Genetic Drivers from Single-Cell RNA-Seq Data Using a Biclustering Approach.

Funan Shi1, Haiyan Huang1.   

Abstract

Single-cell RNA-Seq (scRNA-Seq) has attracted much attention recently because it allows unprecedented resolution into cellular activity; the technology, therefore, has been widely applied in studying cell heterogeneity such as the heterogeneity among embryonic cells at varied developmental stages or cells of different cancer types or subtypes. A pertinent question in such analyses is to identify cell subpopulations as well as their associated genetic drivers. Consequently, a multitude of approaches have been developed for clustering or biclustering analysis of scRNA-Seq data. In this article, we present a fast and simple iterative biclustering approach called "BiSNN-Walk" based on the existing SNN-Cliq algorithm. One of BiSNN-Walk's differentiating features is that it returns a ranked list of clusters, which may serve as an indicator of a cluster's reliability. Another important feature is that BiSNN-Walk ranks genes in a gene cluster according to their level of affiliation to the associated cell cluster, making the result more biologically interpretable. We also introduce an entropy-based measure for choosing a highly clusterable similarity matrix as our starting point among a wide selection to facilitate the efficient operation of our algorithm. We applied BiSNN-Walk to three large scRNA-Seq studies, where we demonstrated that BiSNN-Walk was able to retain and sometimes improve the cell clustering ability of SNN-Cliq. We were able to obtain biologically sensible gene clusters in terms of GO term enrichment. In addition, we saw that there was significant overlap in top characteristic genes for clusters corresponding to similar cell states, further demonstrating the fidelity of our gene clusters.

Entities:  

Keywords:  BiSNN-Walk; RNA-Seq; biclustering; single cell

Mesh:

Substances:

Year:  2017        PMID: 28657835      PMCID: PMC5510693          DOI: 10.1089/cmb.2017.0049

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  16 in total

1.  A Study of the Comparability of External Criteria for Hierarchical Cluster Analysis.

Authors:  G W Milligan; M C Cooper
Journal:  Multivariate Behav Res       Date:  1986-10-01       Impact factor: 5.923

2.  Biclustering algorithms for biological data analysis: a survey.

Authors:  Sara C Madeira; Arlindo L Oliveira
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2004 Jan-Mar       Impact factor: 3.710

3.  Identification of cell types from single-cell transcriptomes using a novel clustering method.

Authors:  Chen Xu; Zhengchang Su
Journal:  Bioinformatics       Date:  2015-02-11       Impact factor: 6.937

4.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

5.  Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells.

Authors:  Qiaolin Deng; Daniel Ramsköld; Björn Reinius; Rickard Sandberg
Journal:  Science       Date:  2014-01-10       Impact factor: 47.728

6.  Human housekeeping genes, revisited.

Authors:  Eli Eisenberg; Erez Y Levanon
Journal:  Trends Genet       Date:  2013-06-27       Impact factor: 11.639

7.  Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells.

Authors:  Liying Yan; Mingyu Yang; Hongshan Guo; Lu Yang; Jun Wu; Rong Li; Ping Liu; Ying Lian; Xiaoying Zheng; Jie Yan; Jin Huang; Ming Li; Xinglong Wu; Lu Wen; Kaiqin Lao; Ruiqiang Li; Jie Qiao; Fuchou Tang
Journal:  Nat Struct Mol Biol       Date:  2013-08-11       Impact factor: 15.369

8.  Quantitative assessment of single-cell RNA-sequencing methods.

Authors:  Angela R Wu; Norma F Neff; Tomer Kalisky; Piero Dalerba; Barbara Treutlein; Michael E Rothenberg; Francis M Mburu; Gary L Mantalas; Sopheak Sim; Michael F Clarke; Stephen R Quake
Journal:  Nat Methods       Date:  2013-10-20       Impact factor: 28.547

9.  Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells.

Authors:  Daniel Ramsköld; Shujun Luo; Yu-Chieh Wang; Robin Li; Qiaolin Deng; Omid R Faridani; Gregory A Daniels; Irina Khrebtukova; Jeanne F Loring; Louise C Laurent; Gary P Schroth; Rickard Sandberg
Journal:  Nat Biotechnol       Date:  2012-08       Impact factor: 54.908

10.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

View more
  4 in total

1.  Clustering and classification methods for single-cell RNA-sequencing data.

Authors:  Ren Qi; Anjun Ma; Qin Ma; Quan Zou
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

Review 2.  It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data.

Authors:  Juan Xie; Anjun Ma; Anne Fennell; Qin Ma; Jing Zhao
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

3.  Network Modeling in Biology: Statistical Methods for Gene and Brain Networks.

Authors:  Y X Rachel Wang; Lexin Li; Jingyi Jessica Li; Haiyan Huang
Journal:  Stat Sci       Date:  2021-02       Impact factor: 2.901

4.  IRIS-FGM: an integrative single-cell RNA-Seq interpretation system for functional gene module analysis.

Authors:  Yuzhou Chang; Carter Allen; Changlin Wan; Dongjun Chung; Chi Zhang; Zihai Li; Qin Ma
Journal:  Bioinformatics       Date:  2021-02-17       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.