Literature DB >> 35939233

scVAEBGM: Clustering Analysis of Single-Cell ATAC-seq Data Using a Deep Generative Model.

Hongyu Duan1, Feng Li2, Junliang Shang1, Jinxing Liu1, Yan Li3, Xikui Liu3.   

Abstract

A surge in research has occurred because of current developments in single-cell technologies. Above all, single-cell Assay for Transposase-Accessible Chromatin with high throughput sequencing (scATAC-seq) is a popular approach of analyzing chromatin accessibility differences at the level of single cell, either within or between groups. As a result, it is critical to examine cell heterogeneity at a previously unseen level and to identify both recognized and unknown cell types. However, with the ever-increasing number of cells engendered by technological development and the characteristics of the data, such as high noise, sparsity and dimension, challenges in distinguishing cell types have emerged. We propose scVAEBGM, which integrates a Variational Autoencoder (VAE) with a Bayesian Gaussian-mixture model (BGM) to process and analyze scATAC-seq data. This method combines and takes benefits of a Bayesian Gaussian mixture model to estimate the number of cell types without determining the cluster number in a beforehand. In other words, the size of the clusters is inferred from the data, thus avoiding biases introduced by subjective assessments when manually determining the size of the clusters. Additionally, the method is more robust to noise and can better represent single-cell data in lower dimensions. We also create a further clustering strategy. It is indicated by experiments that further clustering based on the already completed clustering can improve the clustering accuracy again. We test on six public datasets, and scVAEBGM outperforms various dimension reduction baselines. In downstream applications, scVAEBGM can reveal biological cell types.
© 2022. International Association of Scientists in the Interdisciplinary Areas.

Entities:  

Keywords:  Bayesian Gaussian-mixture model; Clustering; Deep learning; Variational autoencoder; scATAC-seq

Mesh:

Substances:

Year:  2022        PMID: 35939233     DOI: 10.1007/s12539-022-00536-w

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   3.492


  16 in total

1.  Dynamic programming.

Authors:  R Bellman
Journal:  Science       Date:  1966-07-01       Impact factor: 47.728

2.  netAE: semi-supervised dimensionality reduction of single-cell RNA sequencing to facilitate cell labeling.

Authors:  Zhengyang Dong; Gil Alterovitz
Journal:  Bioinformatics       Date:  2021-04-09       Impact factor: 6.937

3.  An incremental DPMM-based method for trajectory clustering, modeling, and retrieval.

Authors:  Weiming Hu; Xi Li; Guodong Tian; Stephen Maybank; Zhongfei Zhang
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-05       Impact factor: 6.226

4.  Simultaneous deep generative modeling and clustering of single cell genomic data.

Authors:  Qiao Liu; Shengquan Chen; Rui Jiang; Wing Hung Wong
Journal:  Nat Mach Intell       Date:  2021-05-10

5.  Improving Single-Cell RNA-seq Clustering by Integrating Pathways.

Authors:  Chenxing Zhang; Lin Gao; Bingbo Wang; Yong Gao
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

6.  A versatile and scalable single-cell data integration algorithm based on domain-adversarial and variational approximation.

Authors:  Jialu Hu; Yuanke Zhong; Xuequn Shang
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

7.  SC3: consensus clustering of single-cell RNA-seq data.

Authors:  Vladimir Yu Kiselev; Kristina Kirschner; Michael T Schaub; Tallulah Andrews; Andrew Yiu; Tamir Chandra; Kedar N Natarajan; Wolf Reik; Mauricio Barahona; Anthony R Green; Martin Hemberg
Journal:  Nat Methods       Date:  2017-03-27       Impact factor: 28.547

8.  Deep generative modeling for single-cell transcriptomics.

Authors:  Romain Lopez; Jeffrey Regier; Michael B Cole; Michael I Jordan; Nir Yosef
Journal:  Nat Methods       Date:  2018-11-30       Impact factor: 28.547

9.  Deep learning shapes single-cell data analysis.

Authors:  Qin Ma; Dong Xu
Journal:  Nat Rev Mol Cell Biol       Date:  2022-05       Impact factor: 113.915

10.  The Human Cell Atlas.

Authors:  Aviv Regev; Sarah A Teichmann; Eric S Lander; Ido Amit; Christophe Benoist; Ewan Birney; Bernd Bodenmiller; Peter Campbell; Piero Carninci; Menna Clatworthy; Hans Clevers; Bart Deplancke; Ian Dunham; James Eberwine; Roland Eils; Wolfgang Enard; Andrew Farmer; Lars Fugger; Berthold Göttgens; Nir Hacohen; Muzlifah Haniffa; Martin Hemberg; Seung Kim; Paul Klenerman; Arnold Kriegstein; Ed Lein; Sten Linnarsson; Emma Lundberg; Joakim Lundeberg; Partha Majumder; John C Marioni; Miriam Merad; Musa Mhlanga; Martijn Nawijn; Mihai Netea; Garry Nolan; Dana Pe'er; Anthony Phillipakis; Chris P Ponting; Stephen Quake; Wolf Reik; Orit Rozenblatt-Rosen; Joshua Sanes; Rahul Satija; Ton N Schumacher; Alex Shalek; Ehud Shapiro; Padmanee Sharma; Jay W Shin; Oliver Stegle; Michael Stratton; Michael J T Stubbington; Fabian J Theis; Matthias Uhlen; Alexander van Oudenaarden; Allon Wagner; Fiona Watt; Jonathan Weissman; Barbara Wold; Ramnik Xavier; Nir Yosef
Journal:  Elife       Date:  2017-12-05       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.