Literature DB >> 33611426

Accurate feature selection improves single-cell RNA-seq cell clustering.

Kenong Su1, Tianwei Yu2, Hao Wu3.   

Abstract

Cell clustering is one of the most important and commonly performed tasks in single-cell RNA sequencing (scRNA-seq) data analysis. An important step in cell clustering is to select a subset of genes (referred to as 'features'), whose expression patterns will then be used for downstream clustering. A good set of features should include the ones that distinguish different cell types, and the quality of such set could have a significant impact on the clustering accuracy. All existing scRNA-seq clustering tools include a feature selection step relying on some simple unsupervised feature selection methods, mostly based on the statistical moments of gene-wise expression distributions. In this work, we carefully evaluate the impact of feature selection on cell clustering accuracy. In addition, we develop a feature selection algorithm named FEAture SelecTion (FEAST), which provides more representative features. We apply the method on 12 public scRNA-seq datasets and demonstrate that using features selected by FEAST with existing clustering tools significantly improve the clustering accuracy.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  cell clustering; feature selection; single-cell RNA sequencing

Year:  2021        PMID: 33611426     DOI: 10.1093/bib/bbab034

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  6 in total

1.  scREMOTE: Using multimodal single cell data to predict regulatory gene relationships and to build a computational cell reprogramming model.

Authors:  Andy Tran; Pengyi Yang; Jean Y H Yang; John T Ormerod
Journal:  NAR Genom Bioinform       Date:  2022-03-15

2.  EDClust: an EM-MM hybrid method for cell clustering in multiple-subject single-cell RNA sequencing.

Authors:  Xin Wei; Ziyi Li; Hongkai Ji; Hao Wu
Journal:  Bioinformatics       Date:  2022-05-13       Impact factor: 6.931

3.  Triku: a feature selection method based on nearest neighbors for single-cell data.

Authors:  Alex M Ascensión; Olga Ibáñez-Solé; Iñaki Inza; Ander Izeta; Marcos J Araúzo-Bravo
Journal:  Gigascience       Date:  2022-03-12       Impact factor: 6.524

Review 4.  Feature selection revisited in the single-cell era.

Authors:  Pengyi Yang; Hao Huang; Chunlei Liu
Journal:  Genome Biol       Date:  2021-12-01       Impact factor: 13.583

Review 5.  Selecting gene features for unsupervised analysis of single-cell gene expression data.

Authors:  Jie Sheng; Wei Vivian Li
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

6.  Evaluation of some aspects in supervised cell type identification for single-cell RNA-seq: classifier, feature selection, and reference construction.

Authors:  Wenjing Ma; Kenong Su; Hao Wu
Journal:  Genome Biol       Date:  2021-09-09       Impact factor: 13.583

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.