Literature DB >> 33411784

Optimal tuning of weighted kNN- and diffusion-based methods for denoising single cell genomics data.

Andreas Tjärnberg1,2,3, Omar Mahmood4, Christopher A Jackson2,3, Giuseppe-Antonio Saldi2, Kyunghyun Cho5,6, Lionel A Christiaen1,3, Richard A Bonneau2,3,4,5,6.   

Abstract

The analysis of single-cell genomics data presents several statistical challenges, and extensive efforts have been made to produce methods for the analysis of this data that impute missing values, address sampling issues and quantify and correct for noise. In spite of such efforts, no consensus on best practices has been established and all current approaches vary substantially based on the available data and empirical tests. The k-Nearest Neighbor Graph (kNN-G) is often used to infer the identities of, and relationships between, cells and is the basis of many widely used dimensionality-reduction and projection methods. The kNN-G has also been the basis for imputation methods using, e.g., neighbor averaging and graph diffusion. However, due to the lack of an agreed-upon optimal objective function for choosing hyperparameters, these methods tend to oversmooth data, thereby resulting in a loss of information with regard to cell identity and the specific gene-to-gene patterns underlying regulatory mechanisms. In this paper, we investigate the tuning of kNN- and diffusion-based denoising methods with a novel non-stochastic method for optimally preserving biologically relevant informative variance in single-cell data. The framework, Denoising Expression data with a Weighted Affinity Kernel and Self-Supervision (DEWÄKSS), uses a self-supervised technique to tune its parameters. We demonstrate that denoising with optimal parameters selected by our objective function (i) is robust to preprocessing methods using data from established benchmarks, (ii) disentangles cellular identity and maintains robust clusters over dimension-reduction methods, (iii) maintains variance along several expression dimensions, unlike previous heuristic-based methods that tend to oversmooth data variance, and (iv) rarely involves diffusion but rather uses a fixed weighted kNN graph for denoising. Together, these findings provide a new understanding of kNN- and diffusion-based denoising methods. Code and example data for DEWÄKSS is available at https://gitlab.com/Xparx/dewakss/-/tree/Tjarnberg2020branch.

Entities:  

Mesh:

Year:  2021        PMID: 33411784      PMCID: PMC7817019          DOI: 10.1371/journal.pcbi.1008569

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  29 in total

1.  Diffusion maps for high-dimensional single-cell analysis of differentiation data.

Authors:  Laleh Haghverdi; Florian Buettner; Fabian J Theis
Journal:  Bioinformatics       Date:  2015-05-21       Impact factor: 6.937

2.  Comparative Analysis of Single-Cell RNA Sequencing Methods.

Authors:  Christoph Ziegenhain; Beate Vieth; Swati Parekh; Björn Reinius; Amy Guillaumet-Adkins; Martha Smets; Heinrich Leonhardt; Holger Heyn; Ines Hellmann; Wolfgang Enard
Journal:  Mol Cell       Date:  2017-02-16       Impact factor: 17.970

3.  Diffusion pseudotime robustly reconstructs lineage branching.

Authors:  Laleh Haghverdi; Maren Büttner; F Alexander Wolf; Florian Buettner; Fabian J Theis
Journal:  Nat Methods       Date:  2016-08-29       Impact factor: 28.547

Review 4.  Challenges in unsupervised clustering of single-cell RNA-seq data.

Authors:  Vladimir Yu Kiselev; Tallulah S Andrews; Martin Hemberg
Journal:  Nat Rev Genet       Date:  2019-05       Impact factor: 53.242

Review 5.  Design and computational analysis of single-cell RNA-sequencing experiments.

Authors:  Rhonda Bacher; Christina Kendziorski
Journal:  Genome Biol       Date:  2016-04-07       Impact factor: 13.583

6.  SAVER: gene expression recovery for single-cell RNA sequencing.

Authors:  Mo Huang; Jingshu Wang; Eduardo Torre; Hannah Dueck; Sydney Shaffer; Roberto Bonasio; John I Murray; Arjun Raj; Mingyao Li; Nancy R Zhang
Journal:  Nat Methods       Date:  2018-06-25       Impact factor: 28.547

7.  Single-cell RNA-seq denoising using a deep count autoencoder.

Authors:  Gökcen Eraslan; Lukas M Simon; Maria Mircea; Nikola S Mueller; Fabian J Theis
Journal:  Nat Commun       Date:  2019-01-23       Impact factor: 14.919

8.  CEL-Seq2: sensitive highly-multiplexed single-cell RNA-Seq.

Authors:  Tamar Hashimshony; Naftalie Senderovich; Gal Avital; Agnes Klochendler; Yaron de Leeuw; Leon Anavy; Dave Gennert; Shuqiang Li; Kenneth J Livak; Orit Rozenblatt-Rosen; Yuval Dor; Aviv Regev; Itai Yanai
Journal:  Genome Biol       Date:  2016-04-28       Impact factor: 13.583

9.  A Single-Cell Transcriptome Atlas of the Human Pancreas.

Authors:  Mauro J Muraro; Gitanjali Dharmadhikari; Dominic Grün; Nathalie Groen; Tim Dielen; Erik Jansen; Leon van Gurp; Marten A Engelse; Francoise Carlotti; Eelco J P de Koning; Alexander van Oudenaarden
Journal:  Cell Syst       Date:  2016-09-29       Impact factor: 10.304

10.  RNA velocity of single cells.

Authors:  Gioele La Manno; Ruslan Soldatov; Amit Zeisel; Emelie Braun; Hannah Hochgerner; Viktor Petukhov; Katja Lidschreiber; Maria E Kastriti; Peter Lönnerberg; Alessandro Furlan; Jean Fan; Lars E Borm; Zehua Liu; David van Bruggen; Jimin Guo; Xiaoling He; Roger Barker; Erik Sundström; Gonçalo Castelo-Branco; Patrick Cramer; Igor Adameyko; Sten Linnarsson; Peter V Kharchenko
Journal:  Nature       Date:  2018-08-08       Impact factor: 49.962

View more
  4 in total

1.  G2S3: A gene graph-based imputation method for single-cell RNA sequencing data.

Authors:  Weimiao Wu; Yunqing Liu; Qile Dai; Xiting Yan; Zuoheng Wang
Journal:  PLoS Comput Biol       Date:  2021-05-18       Impact factor: 4.475

2.  GABA-receptive microglia selectively sculpt developing inhibitory circuits.

Authors:  Emilia Favuzzi; Shuhan Huang; Giuseppe A Saldi; Loïc Binan; Leena A Ibrahim; Marian Fernández-Otero; Yuqing Cao; Ayman Zeine; Adwoa Sefah; Karen Zheng; Qing Xu; Elizaveta Khlestova; Samouil L Farhi; Richard Bonneau; Sandeep Robert Datta; Beth Stevens; Gord Fishell
Journal:  Cell       Date:  2021-07-06       Impact factor: 66.850

3.  High performance single-cell gene regulatory network inference at scale: The Inferelator 3.0.

Authors:  Claudia Skok Gibbs; Christopher A Jackson; Giuseppe-Antonio Saldi; Andreas Tjärnberg; Aashna Shah; Aaron Watters; Nicholas De Veaux; Konstantine Tchourine; Ren Yi; Tymor Hamamsy; Dayanne M Castro; Nicholas Carriero; Bram L Gorissen; David Gresham; Emily R Miraldi; Richard Bonneau
Journal:  Bioinformatics       Date:  2022-02-21       Impact factor: 6.931

4.  Effective data filtering is prerequisite for robust microbial association network construction.

Authors:  Mengqi Wang; Qichao Tu
Journal:  Front Microbiol       Date:  2022-10-04       Impact factor: 6.064

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.