Literature DB >> 33179041

D-EE: Distributed software for visualizing intrinsic structure of large-scale single-cell data.

Shaokun An1,2, Jizu Huang1,2, Lin Wan1,2.   

Abstract

BACKGROUND: Dimensionality reduction and visualization play vital roles in single-cell RNA sequencing (scRNA-seq) data analysis. While they have been extensively studied, state-of-the-art dimensionality reduction algorithms are often unable to preserve the global structures underlying data. Elastic embedding (EE), a nonlinear dimensionality reduction method, has shown promise in revealing low-dimensional intrinsic local and global data structure. However, the current implementation of the EE algorithm lacks scalability to large-scale scRNA-seq data.
RESULTS: We present a distributed optimization implementation of the EE algorithm, termed distributed elastic embedding (D-EE). D-EE reveals the low-dimensional intrinsic structures of data with accuracy equal to that of elastic embedding, and it is scalable to large-scale scRNA-seq data. It leverages distributed storage and distributed computation, achieving memory efficiency and high-performance computing simultaneously. In addition, an extended version of D-EE, termed distributed optimization implementation of time-series elastic embedding (D-TSEE), enables the user to visualize large-scale time-series scRNA-seq data by incorporating experimentally temporal information. Results with large-scale scRNA-seq data indicate that D-TSEE can uncover oscillatory gene expression patterns by using experimentally temporal information.
CONCLUSIONS: D-EE is a distributed dimensionality reduction and visualization tool. Its distributed storage and distributed computation technique allow us to efficiently analyze large-scale single-cell data at the cost of constant time speedup. The source code for D-EE algorithm based on C and MPI tailored to a high-performance computing cluster is available at https://github.com/ShaokunAn/D-EE.
© The Author(s) 2020. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  dimensionality reduction; distributed computation; distributed storage; large-scale data; single-cell sequencing

Year:  2020        PMID: 33179041      PMCID: PMC7657844          DOI: 10.1093/gigascience/giaa126

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  17 in total

1.  Geometric Sketching Compactly Summarizes the Single-Cell Transcriptomic Landscape.

Authors:  Brian Hie; Hyunghoon Cho; Benjamin DeMeo; Bryan Bryson; Bonnie Berger
Journal:  Cell Syst       Date:  2019-06-05       Impact factor: 10.304

2.  SOX2 regulates YAP1 to maintain stemness and determine cell fate in the osteo-adipo lineage.

Authors:  Eunjeong Seo; Upal Basu-Roy; Preethi H Gunaratne; Cristian Coarfa; Dae-Sik Lim; Claudio Basilico; Alka Mansukhani
Journal:  Cell Rep       Date:  2013-06-20       Impact factor: 9.423

3.  Generalizable and Scalable Visualization of Single-Cell Data Using Neural Networks.

Authors:  Hyunghoon Cho; Bonnie Berger; Jian Peng
Journal:  Cell Syst       Date:  2018-06-20       Impact factor: 10.304

4.  TSEE: an elastic embedding method to visualize the dynamic gene expression patterns of time series single-cell RNA sequencing data.

Authors:  Shaokun An; Liang Ma; Lin Wan
Journal:  BMC Genomics       Date:  2019-04-04       Impact factor: 3.969

5.  Ten quick tips for effective dimensionality reduction.

Authors:  Lan Huong Nguyen; Susan Holmes
Journal:  PLoS Comput Biol       Date:  2019-06-20       Impact factor: 4.475

6.  Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression.

Authors:  Christoph Hafemeister; Rahul Satija
Journal:  Genome Biol       Date:  2019-12-23       Impact factor: 13.583

7.  Visualizing structure and transitions in high-dimensional biological data.

Authors:  Kevin R Moon; David van Dijk; Zheng Wang; Scott Gigante; Daniel B Burkhardt; William S Chen; Kristina Yim; Antonia van den Elzen; Matthew J Hirn; Ronald R Coifman; Natalia B Ivanova; Guy Wolf; Smita Krishnaswamy
Journal:  Nat Biotechnol       Date:  2019-12-03       Impact factor: 54.908

8.  Wishbone identifies bifurcating developmental trajectories from single-cell data.

Authors:  Manu Setty; Michelle D Tadmor; Shlomit Reich-Zeliger; Omer Angel; Tomer Meir Salame; Pooja Kathail; Kristy Choi; Sean Bendall; Nir Friedman; Dana Pe'er
Journal:  Nat Biotechnol       Date:  2016-05-02       Impact factor: 54.908

9.  Nanog induced intermediate state in regulating stem cell differentiation and reprogramming.

Authors:  Peijia Yu; Qing Nie; Chao Tang; Lei Zhang
Journal:  BMC Syst Biol       Date:  2018-02-27

10.  The art of using t-SNE for single-cell transcriptomics.

Authors:  Dmitry Kobak; Philipp Berens
Journal:  Nat Commun       Date:  2019-11-28       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.