Literature DB >> 35277963

Triku: a feature selection method based on nearest neighbors for single-cell data.

Alex M Ascensión1,2, Olga Ibáñez-Solé1,2, Iñaki Inza3, Ander Izeta2, Marcos J Araúzo-Bravo1,4,5,6.   

Abstract

BACKGROUND: Feature selection is a relevant step in the analysis of single-cell RNA sequencing datasets. Most of the current feature selection methods are based on general univariate descriptors of the data such as the dispersion or the percentage of zeros. Despite the use of correction methods, the generality of these feature selection methods biases the genes selected towards highly expressed genes, instead of the genes defining the cell populations of the dataset.
RESULTS: Triku is a feature selection method that favors genes defining the main cell populations. It does so by selecting genes expressed by groups of cells that are close in the k-nearest neighbor graph. The expression of these genes is higher than the expected expression if the k-cells were chosen at random. Triku efficiently recovers cell populations present in artificial and biological benchmarking datasets, based on adjusted Rand index, normalized mutual information, supervised classification, and silhouette coefficient measurements. Additionally, gene sets selected by triku are more likely to be related to relevant Gene Ontology terms and contain fewer ribosomal and mitochondrial genes.
CONCLUSION: Triku is developed in Python 3 and is available at https://github.com/alexmascension/triku.
© The Author(s) 2022. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  Python; bioinformatics; feature selection; machine learning; sc-RNAseq

Mesh:

Year:  2022        PMID: 35277963      PMCID: PMC8917514          DOI: 10.1093/gigascience/giac017

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  41 in total

1.  Benchmarking single-cell RNA-sequencing protocols for cell atlas projects.

Authors:  Elisabetta Mereu; Atefeh Lafzi; Catia Moutinho; Christoph Ziegenhain; Davis J McCarthy; Adrián Álvarez-Varela; Eduard Batlle; Dominic Grün; Julia K Lau; Stéphane C Boutet; Chad Sanada; Aik Ooi; Robert C Jones; Kelly Kaihara; Chris Brampton; Yasha Talaga; Yohei Sasagawa; Kaori Tanaka; Tetsutaro Hayashi; Caroline Braeuning; Cornelius Fischer; Sascha Sauer; Timo Trefzer; Christian Conrad; Xian Adiconis; Lan T Nguyen; Aviv Regev; Joshua Z Levin; Swati Parekh; Aleksandar Janjic; Lucas E Wange; Johannes W Bagnoli; Wolfgang Enard; Marta Gut; Rickard Sandberg; Itoshi Nikaido; Ivo Gut; Oliver Stegle; Holger Heyn
Journal:  Nat Biotechnol       Date:  2020-04-06       Impact factor: 54.908

2.  sc-REnF: An entropy guided robust feature selection for single-cell RNA-seq data.

Authors:  Snehalika Lall; Abhik Ghosh; Sumanta Ray; Sanghamitra Bandyopadhyay
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  Statistics versus machine learning.

Authors:  Danilo Bzdok; Naomi Altman; Martin Krzywinski
Journal:  Nat Methods       Date:  2018-04-03       Impact factor: 28.547

4.  powsimR: power analysis for bulk and single cell RNA-seq experiments.

Authors:  Beate Vieth; Christoph Ziegenhain; Swati Parekh; Wolfgang Enard; Ines Hellmann
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

5.  M3Drop: dropout-based feature selection for scRNASeq.

Authors:  Tallulah S Andrews; Martin Hemberg
Journal:  Bioinformatics       Date:  2019-08-15       Impact factor: 6.937

Review 6.  Challenges and Opportunities for Consistent Classification of Human B Cell and Plasma Cell Populations.

Authors:  Ignacio Sanz; Chungwen Wei; Scott A Jenks; Kevin S Cashman; Christopher Tipton; Matthew C Woodruff; Jennifer Hom; F Eun-Hyung Lee
Journal:  Front Immunol       Date:  2019-10-18       Impact factor: 7.561

7.  Integrated analysis of multimodal single-cell data.

Authors:  Yuhan Hao; Stephanie Hao; Erica Andersen-Nissen; William M Mauck; Shiwei Zheng; Andrew Butler; Maddie J Lee; Aaron J Wilk; Charlotte Darby; Michael Zager; Paul Hoffman; Marlon Stoeckius; Efthymia Papalexi; Eleni P Mimitou; Jaison Jain; Avi Srivastava; Tim Stuart; Lamar M Fleming; Bertrand Yeung; Angela J Rogers; Juliana M McElrath; Catherine A Blish; Raphael Gottardo; Peter Smibert; Rahul Satija
Journal:  Cell       Date:  2021-05-31       Impact factor: 41.582

8.  Single-Cell Sequencing of Peripheral Mononuclear Cells Reveals Distinct Immune Response Landscapes of COVID-19 and Influenza Patients.

Authors:  Linnan Zhu; Penghui Yang; Yingze Zhao; Zhenkun Zhuang; Zhifeng Wang; Rui Song; Jie Zhang; Chuanyu Liu; Qianqian Gao; Qumiao Xu; Xiaoyu Wei; Hai-Xi Sun; Beiwei Ye; Yanan Wu; Ning Zhang; Guanglin Lei; Lingxiang Yu; Jin Yan; Guanghao Diao; Fanping Meng; Changqing Bai; Panyong Mao; Yeya Yu; Mingyue Wang; Yue Yuan; Qiuting Deng; Ziyi Li; Yunting Huang; Guohai Hu; Yang Liu; Xiaoqian Wang; Ziqian Xu; Peipei Liu; Yuhai Bi; Yi Shi; Shaogeng Zhang; Zhihai Chen; Jian Wang; Xun Xu; Guizhen Wu; Fu-Sheng Wang; George F Gao; Longqi Liu; William J Liu
Journal:  Immunity       Date:  2020-07-19       Impact factor: 31.745

9.  PBMC fixation and processing for Chromium single-cell RNA sequencing.

Authors:  Jinguo Chen; Foo Cheung; Rongye Shi; Huizhi Zhou; Wenrui Lu
Journal:  J Transl Med       Date:  2018-07-17       Impact factor: 5.531

10.  A benchmark of batch-effect correction methods for single-cell RNA sequencing data.

Authors:  Hoa Thi Nhu Tran; Kok Siong Ang; Marion Chevrier; Xiaomeng Zhang; Nicole Yee Shin Lee; Michelle Goh; Jinmiao Chen
Journal:  Genome Biol       Date:  2020-01-16       Impact factor: 13.583

View more
  2 in total

1.  The need to reassess single-cell RNA sequencing datasets: the importance of biological sample processing.

Authors:  Alex M Ascensión; Marcos J Araúzo-Bravo; Ander Izeta
Journal:  F1000Res       Date:  2021-08-06

2.  Triku: a feature selection method based on nearest neighbors for single-cell data.

Authors:  Alex M Ascensión; Olga Ibáñez-Solé; Iñaki Inza; Ander Izeta; Marcos J Araúzo-Bravo
Journal:  Gigascience       Date:  2022-03-12       Impact factor: 6.524

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.