Literature DB >> 34585247

A versatile and scalable single-cell data integration algorithm based on domain-adversarial and variational approximation.

Jialu Hu1, Yuanke Zhong1, Xuequn Shang1.   

Abstract

Single-cell technologies provide us new ways to profile transcriptomic landscape, chromatin accessibility, spatial expression patterns in heterogeneous tissues at the resolution of single cell. With enormous generated single-cell datasets, a key analytic challenge is to integrate these datasets to gain biological insights into cellular compositions. Here, we developed a domain-adversarial and variational approximation, DAVAE, which can integrate multiple single-cell datasets across samples, technologies and modalities with a single strategy. Besides, DAVAE can also integrate paired data of ATAC profile and transcriptome profile that are simultaneously measured from a same cell. With a mini-batch stochastic gradient descent strategy, it is scalable for large-scale data and can be accelerated by GPUs. Results on seven real data integration applications demonstrated the effectiveness and scalability of DAVAE in batch-effect removing, transfer learning and cell-type predictions for multiple single-cell datasets across samples, technologies and modalities. Availability: DAVAE has been implemented in a toolkit package "scbean" in the pypi repository, and the source code can be also freely accessible at https://github.com/jhu99/scbean. All our data and source code for reproducing the results of this paper can be accessible at https://github.com/jhu99/davae_paper.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  data integration; domain-adversarial learning; multimodal data; regularized regression; single cell analysis; variational approximation

Mesh:

Substances:

Year:  2022        PMID: 34585247     DOI: 10.1093/bib/bbab400

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

1.  scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model.

Authors:  Hongyu Duan; Feng Li; Junliang Shang; Jinxing Liu; Yan Li; Xikui Liu
Journal:  Interdiscip Sci       Date:  2022-08-08       Impact factor: 3.492

2.  Revealing the Key MSCs Niches and Pathogenic Genes in Influencing CEP Homeostasis: A Conjoint Analysis of Single-Cell and WGCNA.

Authors:  Weihang Li; Shilei Zhang; Yingjing Zhao; Dong Wang; Quan Shi; Ziyi Ding; Yongchun Wang; Bo Gao; Ming Yan
Journal:  Front Immunol       Date:  2022-06-27       Impact factor: 8.786

3.  Effective and scalable single-cell data alignment with non-linear canonical correlation analysis.

Authors:  Jialu Hu; Mengjie Chen; Xiang Zhou
Journal:  Nucleic Acids Res       Date:  2022-02-28       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.