Valentine Svensson1, Adam Gayoso2, Nir Yosef2,3,4, Lior Pachter1,5. 1. Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA. 2. Center for Computational Biology. 3. Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 91125, USA. 4. Chan Zuckerberg Biohub, San Francisco, CA 94158, USA. 5. Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
Abstract
MOTIVATION: Single-cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable. RESULTS: We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be useful for other applications. AVAILABILITY AND IMPLEMENTATION: The factor model is available in the scVI package hosted at https://github.com/YosefLab/scVI/. CONTACT: v@nxn.se. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Single-cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable. RESULTS: We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be useful for other applications. AVAILABILITY AND IMPLEMENTATION: The factor model is available in the scVI package hosted at https://github.com/YosefLab/scVI/. CONTACT: v@nxn.se. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Zizhen Yao; Cindy T J van Velthoven; Thuc Nghi Nguyen; Jeff Goldy; Adriana E Sedeno-Cortes; Fahimeh Baftizadeh; Darren Bertagnolli; Tamara Casper; Megan Chiang; Kirsten Crichton; Song-Lin Ding; Olivia Fong; Emma Garren; Alexandra Glandon; Nathan W Gouwens; James Gray; Lucas T Graybuck; Michael J Hawrylycz; Daniel Hirschstein; Matthew Kroll; Kanan Lathia; Changkyu Lee; Boaz Levi; Delissa McMillen; Stephanie Mok; Thanh Pham; Qingzhong Ren; Christine Rimorin; Nadiya Shapovalova; Josef Sulc; Susan M Sunkin; Michael Tieu; Amy Torkelson; Herman Tung; Katelyn Ward; Nick Dee; Kimberly A Smith; Bosiljka Tasic; Hongkui Zeng Journal: Cell Date: 2021-05-17 Impact factor: 66.850
Authors: Anna S E Cuomo; Giordano Alvari; Christina B Azodi; Davis J McCarthy; Marc Jan Bonder Journal: Genome Biol Date: 2021-06-24 Impact factor: 13.583