Literature DB >> 33285568

FEATS: feature selection-based clustering of single-cell RNA-seq data.

Edwin Vans1, Ashwini Patil2, Alok Sharma3.   

Abstract

MOTIVATION: Advances in next-generation sequencing have made it possible to carry out transcriptomic studies at single-cell resolution and generate vast amounts of single-cell RNA sequencing (RNA-seq) data rapidly. Thus, tools to analyze this data need to evolve as well as to improve accuracy and efficiency.
RESULTS: We present FEATS, a Python software package, that performs clustering on single-cell RNA-seq data. FEATS is capable of performing multiple tasks such as estimating the number of clusters, conducting outlier detection and integrating data from various experiments. We develop a univariate feature selection-based approach for clustering, which involves the selection of top informative features to improve clustering performance. This is motivated by the fact that cell types are often manually determined using the expression of only a few known marker genes. On a variety of single-cell RNA-seq datasets, FEATS gives superior performance compared with the current tools, in terms of adjusted Rand index and estimating the number of clusters. It achieves a 22% improvement in clustering and more accurately estimates the number of clusters when compared with other tools. In addition to cluster estimation, FEATS also performs outlier detection and data integration while giving an excellent computational performance. Thus, FEATS is a comprehensive clustering tool capable of addressing the challenges during the clustering of single-cell RNA-seq data. AVAILABILITY: The installation instructions and documentation of FEATS is available at https://edwinv87.github.io/feats/. SUPPLEMENTARY DATA: Supplementary data are available online at https://academic.oup.com/bib.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  feature selection; hierarchical clustering; single cell RNA-sequencing

Year:  2021        PMID: 33285568     DOI: 10.1093/bib/bbaa306

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  LSH-GAN enables in-silico generation of cells for small sample high dimensional scRNA-seq data.

Authors:  Snehalika Lall; Sumanta Ray; Sanghamitra Bandyopadhyay
Journal:  Commun Biol       Date:  2022-06-10

2.  DeepFeature: feature selection in nonimage data using convolutional neural network.

Authors:  Alok Sharma; Artem Lysenko; Keith A Boroevich; Edwin Vans; Tatsuhiko Tsunoda
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

Review 3.  Feature selection revisited in the single-cell era.

Authors:  Pengyi Yang; Hao Huang; Chunlei Liu
Journal:  Genome Biol       Date:  2021-12-01       Impact factor: 13.583

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.