Literature DB >> 35793527

Shared Differential Expression-Based Distance Reflects Global Cell Type Relationships in Single-Cell RNA Sequencing Data.

Aidan Mcloughlin1, Haiyan Huang2.   

Abstract

Unsupervised cell clustering on the basis of meaningful biological variation in single-cell RNA sequencing (scRNA seq) data has received significant attention, as it assists with ontological subpopulation identification among the data. A key step in the clustering process is to compute distances between the cells under a specified distance measure. Although particular distance measures may successfully separate cells into biologically relevant clusters, they may fail to retain global structure of the data, such as relative similarity between the cell clusters. In this article, we modify a biologically motivated distance measure, SIDEseq, for use of aggregate comparisons of cell types in large single-cell assays, and demonstrate that, across simulated and real scRNA seq data, the distance matrix more consistently retains global cell type relationships than commonly used distance measures for scRNA seq clustering. We call the modified distance measure "SIDEREF." We explore spectral dimension reduction of the SIDEREF distance matrix as a means of noise filtering, similar to principal components analysis applied directly to expression data. We utilize a summary measure of relative cell type distances to better display the cell group relationships. SIDEREF visualizations more consistently reflect global structures in the data than other commonly considered distance measures. We utilize relative cell type distances and the SIDEREF distance measure to uncover compositional differences between annotated leukocyte cell groups in a compendium of Mus musculus scRNA seq assays comprising 12 tissues. SIDEREF and associated analysis is openly available on GitHub.

Entities:  

Keywords:  clustering; differential expression; distance; global structure; scRNA seq

Mesh:

Year:  2022        PMID: 35793527      PMCID: PMC9419948          DOI: 10.1089/cmb.2021.0652

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.549


  28 in total

1.  Accounting for technical noise in single-cell RNA-seq experiments.

Authors:  Philip Brennecke; Simon Anders; Jong Kyoung Kim; Aleksandra A Kołodziejczyk; Xiuwei Zhang; Valentina Proserpio; Bianka Baying; Vladimir Benes; Sarah A Teichmann; John C Marioni; Marcus G Heisler
Journal:  Nat Methods       Date:  2013-09-22       Impact factor: 28.547

2.  Identification of cell types from single-cell transcriptomes using a novel clustering method.

Authors:  Chen Xu; Zhengchang Su
Journal:  Bioinformatics       Date:  2015-02-11       Impact factor: 6.937

3.  SIDEseq: A Cell Similarity Measure Defined by Shared Identified Differentially Expressed Genes for Single-Cell RNA sequencing Data.

Authors:  Courtney Schiffman; Christina Lin; Funan Shi; Luonan Chen; Lydia Sohn; Haiyan Huang
Journal:  Stat Biosci       Date:  2017-05-17

4.  ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering.

Authors:  Junyi Li; Wei Jiang; Henry Han; Jing Liu; Bo Liu; Yadong Wang
Journal:  Comput Biol Chem       Date:  2020-11-18       Impact factor: 2.877

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders.

Authors:  Gregory P Way; Casey S Greene
Journal:  Pac Symp Biocomput       Date:  2018

Review 7.  The single-cell sequencing: new developments and medical applications.

Authors:  Xiaoning Tang; Yongmei Huang; Jinli Lei; Hui Luo; Xiao Zhu
Journal:  Cell Biosci       Date:  2019-06-26       Impact factor: 7.133

8.  GeneFishing to reconstruct context specific portraits of biological processes.

Authors:  Ke Liu; Elizabeth Theusch; Yun Zhou; Tal Ashuach; Andrea C Dose; Peter J Bickel; Marisa W Medina; Haiyan Huang
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-04       Impact factor: 11.205

9.  Integrated analysis of multimodal single-cell data.

Authors:  Yuhan Hao; Stephanie Hao; Erica Andersen-Nissen; William M Mauck; Shiwei Zheng; Andrew Butler; Maddie J Lee; Aaron J Wilk; Charlotte Darby; Michael Zager; Paul Hoffman; Marlon Stoeckius; Efthymia Papalexi; Eleni P Mimitou; Jaison Jain; Avi Srivastava; Tim Stuart; Lamar M Fleming; Bertrand Yeung; Angela J Rogers; Juliana M McElrath; Catherine A Blish; Raphael Gottardo; Peter Smibert; Rahul Satija
Journal:  Cell       Date:  2021-05-31       Impact factor: 41.582

10.  Splatter: simulation of single-cell RNA sequencing data.

Authors:  Luke Zappia; Belinda Phipson; Alicia Oshlack
Journal:  Genome Biol       Date:  2017-09-12       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.