Literature DB >> 30768159

Dhaka: Variational Autoencoder for Unmasking Tumor Heterogeneity from Single Cell Genomic Data.

Sabrina Rashid1, Sohrab Shah2,3, Ziv Bar-Joseph4, Ravi Pandya5.   

Abstract

MOTIVATION: Intra-tumor heterogeneity is one of the key confounding factors in deciphering tumor evolution. Malignant cells exhibit variations in their gene expression, copy numbers, and mutation even when originating from a single progenitor cell. Single cell sequencing of tumor cells has recently emerged as a viable option for unmasking the underlying tumor heterogeneity. However, extracting features from single cell genomic data in order to infer their evolutionary trajectory remains computationally challenging due to the extremely noisy and sparse nature of the data.
RESULTS: Here we describe 'Dhaka', a variational autoencoder method which transforms single cell genomic data to a reduced dimension feature space that is more efficient in differentiating between (hidden) tumor subpopulations. Our method is general and can be applied to several different types of genomic data including copy number variation from scDNA-Seq and gene expression from scRNA-Seq experiments. We tested the method on synthetic and 6 single cell cancer datasets where the number of cells ranges from 250 to 6000 for each sample. Analysis of the resulting feature space revealed subpopulations of cells and their marker genes. The features are also able to infer the lineage and/or differentiation trajectory between cells greatly improving upon prior methods suggested for feature extraction and dimensionality reduction of such data.
AVAILABILITY AND IMPLEMENTATION: All the datasets used in the paper are publicly available and developed software package and supporting info is available on Github https://github.com/MicrosoftGenomics/Dhaka.
© The Author(s) (2019). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2019        PMID: 30768159     DOI: 10.1093/bioinformatics/btz095

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

Review 1.  Applications of machine learning in drug discovery and development.

Authors:  Jessica Vamathevan; Dominic Clark; Paul Czodrowski; Ian Dunham; Edgardo Ferran; George Lee; Bin Li; Anant Madabhushi; Parantu Shah; Michaela Spitzer; Shanrong Zhao
Journal:  Nat Rev Drug Discov       Date:  2019-06       Impact factor: 84.694

2.  Deep learning tackles single-cell analysis-a survey of deep learning for scRNA-seq analysis.

Authors:  Mario Flores; Zhentao Liu; Tinghe Zhang; Md Musaddaqui Hasib; Yu-Chiao Chiu; Zhenqing Ye; Karla Paniagua; Sumin Jo; Jianqiu Zhang; Shou-Jiang Gao; Yu-Fang Jin; Yidong Chen; Yufei Huang
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

3.  Machine Learning in Drug Discovery: A Review.

Authors:  Suresh Dara; Swetha Dhamercherla; Surender Singh Jadav; Ch Madhu Babu; Mohamed Jawed Ahsan
Journal:  Artif Intell Rev       Date:  2021-08-11       Impact factor: 9.588

Review 4.  Machine Learning Approaches on High Throughput NGS Data to Unveil Mechanisms of Function in Biology and Disease.

Authors:  Vasileios C Pezoulas; Orsalia Hazapis; Nefeli Lagopati; Themis P Exarchos; Andreas V Goules; Athanasios G Tzioufas; Dimitrios I Fotiadis; Ioannis G Stratis; Athanasios N Yannacopoulos; Vassilis G Gorgoulis
Journal:  Cancer Genomics Proteomics       Date:  2021 Sep-Oct       Impact factor: 4.069

5.  Unsupervised generative and graph representation learning for modelling cell differentiation.

Authors:  Ioana Bica; Helena Andrés-Terré; Ana Cvejic; Pietro Liò
Journal:  Sci Rep       Date:  2020-06-17       Impact factor: 4.379

6.  Prediction of Potential miRNA-Disease Associations Through a Novel Unsupervised Deep Learning Framework with Variational Autoencoder.

Authors:  Li Zhang; Xing Chen; Jun Yin
Journal:  Cells       Date:  2019-09-06       Impact factor: 6.600

Review 7.  Recent applications of quantitative systems pharmacology and machine learning models across diseases.

Authors:  Sara Sadat Aghamiri; Rada Amin; Tomáš Helikar
Journal:  J Pharmacokinet Pharmacodyn       Date:  2021-10-20       Impact factor: 2.410

8.  Knowledge-primed neural networks enable biologically interpretable deep learning on single-cell sequencing data.

Authors:  Nikolaus Fortelny; Christoph Bock
Journal:  Genome Biol       Date:  2020-08-03       Impact factor: 13.583

9.  Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis.

Authors:  Shiquan Sun; Jiaqiang Zhu; Ying Ma; Xiang Zhou
Journal:  Genome Biol       Date:  2019-12-10       Impact factor: 13.583

Review 10.  Enhancing scientific discoveries in molecular biology with deep generative models.

Authors:  Romain Lopez; Adam Gayoso; Nir Yosef
Journal:  Mol Syst Biol       Date:  2020-09       Impact factor: 11.429

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.